Minimizing ƒ-Divergences by Interpolating Velocity Fields

Song Liu, Jiahao Yu, Jack I Simons, Mingxuan Yi, Mark A Beaumont

Research output: Chapter in Book/Report/Conference proceedingConference Contribution (Conference Proceeding)

Abstract

Many machine learning problems can be seen as approximating a target distribution using a particle distribution by minimizing their statistical discrepancy. Wasserstein Gradient Flow can move particles along a path that minimizes the ƒ-divergence between the target and particle distributions. To move particles, we need to calculate the corresponding velocity fields derived from a density ratio function between these two distributions. Previous works estimated such density ratio functions and then differentiated the estimated ratios. These approaches may suffer from overfitting, leading to a less accurate estimate of the velocity fields. Inspired by non-parametric curve fitting, we directly estimate these velocity fields using interpolation techniques. We prove that our estimators are consistent under mild conditions. We validate their effectiveness using novel applications on domain adaptation and missing data imputation. The code for reproducing our results can be found at https://github.com/anewgithubname/gradest2.
Original languageEnglish
Title of host publicationProceedings of the 41st International Conference on Machine Learning
PublisherProceedings of Machine Learning Research
Pages32308-32331
Number of pages24
Publication statusPublished - 27 Jul 2024
EventThe 41st International Conference on Machine Learning - Messe Wien Exhibition Congress Center, Vienna, Austria
Duration: 21 Jul 202427 Jul 2024
https://icml.cc/Conferences/2024

Publication series

NameProceedings of Machine Learning Research
Volume235
ISSN (Electronic)2640-3498

Conference

ConferenceThe 41st International Conference on Machine Learning
Abbreviated titleICML 2024
Country/TerritoryAustria
CityVienna
Period21/07/2427/07/24
Internet address

Fingerprint

Dive into the research topics of 'Minimizing ƒ-Divergences by Interpolating Velocity Fields'. Together they form a unique fingerprint.

Cite this