Multimodal data is a precious asset enabling a variety of downstream tasks in machine learning. However, real-world data collected across different modalities is often not paired, which is a significant challenge to learn a joint distribution. A prominent approach to address the modality coupling problem is Minimum Entropy Coupling (MEC), which seeks to minimize the joint Entropy, while satisfying constraints on the marginals. Existing approaches to the MEC problem focus on finite, discrete distributions, limiting their application for cases involving continuous data. In this work, we propose a novel method to solve the continuous MEC problem, using well-known generative diffusion models that learn to approximate and minimize the joint Entropy through a cooperative scheme, while satisfying a relaxed version of the marginal constraints. We empirically demonstrate that our method, DDMEC, is general and can be easily used to address challenging tasks, including unsupervised single-cell multi-omics data alignment and unpaired image translation, outperforming specialized methods.
Learning to match unpaired data with minimum entropy coupling
ICML 2025, 42nd International Conference on Machine Learning, 13-19 July 2025, Vancouver, Canada
      
  Type:
        Conférence
      City:
        Vancouver
      Date:
        2025-07-13
      Department:
        Data Science
      Eurecom Ref:
        8146
      Copyright:
        © EURECOM. Personal use of this material is permitted. The definitive version of this paper was published in ICML 2025, 42nd International Conference on Machine Learning, 13-19 July 2025, Vancouver, Canada and is available at : 
      See also:
        
      PERMALINK : https://www.eurecom.fr/publication/8146
 
     
                       
                      