Active slices for sliced Stein discrepancy

Jan 1, 2021 · Wenbo Gong, K Zhang, Y Li, JM Hernandez-Lobato
Abstract
Sliced Stein discrepancy (SSD) and its kernelized variants have demonstrated promising success in goodness-of-fit tests and model learning in high dimensions. Despite their theoretical elegance, their empirical performance depends crucially on the search for the optimal slicing directions to discriminate between two distributions. Unfortunately, the previous gradient-based optimisation approach returns sub-optimal slicing directions: it is computationally expensive, sensitive to initialization, and lacks theoretical guarantees for convergence. We address these issues in two steps. First, we show in theory that the requirement of using optimal slicing directions in the kernelized version of SSD can be relaxed, validating the resulting discrepancy with finitely many random slicing directions. Second, given that good slicing directions are crucial for practical performance, we propose a fast algorithm for finding good slicing directions based on ideas of active sub-space construction and spectral decomposition. Experiments in goodness-of-fit tests and model learning show that our approach achieves both the best performance and the fastest convergence. In particular, we demonstrate a 14-80x speed-up in goodness-of-fit tests when compared with the gradient-based approach.
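To illustrate the active sub-space idea mentioned in the abstract, here is a minimal NumPy sketch: eigendecompose the (uncentred) second moment of score evaluations and keep the top eigenvectors as candidate slicing directions. The names `active_slices` and `score_fn` are illustrative, not the paper's actual API, and the toy Gaussian setup is an assumption for demonstration only.

```python
import numpy as np

def active_slices(score_fn, samples, k):
    """Sketch of active-subspace slicing: spectrally decompose the
    second moment of score evaluations and return the top-k
    eigenvectors as slicing directions (illustrative, not the
    paper's exact construction)."""
    G = np.stack([score_fn(x) for x in samples])  # (n, d) scores
    M = G.T @ G / len(samples)                    # (d, d) second moment
    eigvals, eigvecs = np.linalg.eigh(M)          # spectral decomposition
    order = np.argsort(eigvals)[::-1]             # descending eigenvalues
    return eigvecs[:, order[:k]].T                # top-k directions as rows

# Toy example: a Gaussian score with one exaggerated coordinate,
# so the leading slicing direction should align with axis 0.
rng = np.random.default_rng(0)
X = rng.normal(size=(500, 5))
scale = np.array([5.0, 1.0, 1.0, 1.0, 1.0])
slices = active_slices(lambda x: -scale * x, X, k=2)
print(slices.shape)  # (2, 5)
```

The spectral step replaces the per-direction gradient optimisation described in the abstract with a single eigendecomposition, which is where the reported speed-up intuitively comes from.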
Publication
ICML 2021
Authors
Wenbo Gong
Senior Researcher at Microsoft Research Cambridge working on learning dynamics and optimization for foundation models, with prior work on causality and approximate inference.