Empirical Optimal Transport under Estimated Costs: Distributional Limits and Statistical Applications

2023 | preprint. A publication with affiliation to the University of Göttingen.

Jump to: Cite & Linked | Documents & Media | Details | Version history

Cite this publication

​Empirical Optimal Transport under Estimated Costs: Distributional Limits and Statistical Applications​
Hundrieser, S.; Mordant, G.; Weitkamp, C. A.& Munk, A. ​ (2023). DOI: https://doi.org/10.48550/ARXIV.2301.01287 

Documents & Media

License

GRO License GRO License

Details

Authors
Hundrieser, Shayan; Mordant, Gilles; Weitkamp, Christoph Alexander; Munk, Axel 
Abstract
Optimal transport (OT) based data analysis is often faced with the issue that the underlying cost function is (partially) unknown. This paper is concerned with the derivation of distributional limits for the empirical OT value when the cost function and the measures are estimated from data. For statistical inference purposes, but also from the viewpoint of a stability analysis, understanding the fluctuation of such quantities is paramount. Our results find direct application in the problem of goodness-of-fit testing for group families, in machine learning applications where invariant transport costs arise, in the problem of estimating the distance between mixtures of distributions, and for the analysis of empirical sliced OT quantities. The established distributional limits assume either weak convergence of the cost process in uniform norm or that the cost is determined by an optimization problem of the OT value over a fixed parameter space. For the first setting we rely on careful lower and upper bounds for the OT value in terms of the measures and the cost in conjunction with a Skorokhod representation. The second setting is based on a functional delta method for the OT value process over the parameter space. The proof techniques might be of independent interest.
Issue Date
2023
Project
EXC 2067: Multiscale Bioimaging 
SFB 1456: Mathematik des Experiments: Die Herausforderung indirekter Messungen in den Naturwissenschaften 
SFB 1456 | Cluster A: Data with Geometric Nonlinearities 
SFB 1456 | Cluster A | A04: Dynamics of cytoskeletal networks: From geometric structure to cell mechanics 
Organization
Institut für Mathematische Stochastik 
Working Group
RG Munk 
Extent
62
Language
English

Reference

Citations


Social Media