Comparing distributions: Kernels estimate good representations, l1 distances give good tests
Note
Given two sets of observations, are they drawn from the same distribution? Our paper at the NeurIPS 2019 conference, Comparing distributions: l1 geometry improves kernel two-sample testing, revisits this classic statistical problem, known as “two-sample testing”.
This post explains the context and the paper with a bit of hand …