Optimal transportation and its application to fairness
Translator | Li Rui
Reviewer | Sun Shujuan
Optimal transportation originated from economics and is now developed as a tool for how to best allocate resources. The origins of optimal transportation theory can be traced back to 1781, when French scientist Gaspard Monge studied a method of purportedly "moving the earth" and building fortifications for Napoleon's army. Overall, optimal transportation is the problem of how to move all resources (such as iron ore) from a set of starting points (mines) to a set of end points (steel plants) while minimizing the total distance the resources must travel. Mathematically, the researchers wanted to find a function that maps each origin to a destination while minimizing the total distance between the origin and its corresponding destination. Despite its innocuous description, progress on the original conception of the problem, known as Munger's conception, stalled for nearly 200 years.
In the 1940s, Soviet mathematician Leonid Kantorovich adapted the formulation of the problem into a modern version, now known as Monge Kantorov's theory, which was the first step toward a solution. The novelty here is allowing some iron ore from the same mine to be supplied to different steel plants. For example, 60% of the iron ore from a mine can be provided to a steel plant, while the remaining 40% of the iron ore from the mine can be provided to another steel plant. Mathematically, this is no longer a function, as the same origin now maps to potentially multiple destinations. In contrast, this is known as the coupling between the origin distribution and the destination distribution, as shown in the figure below; selecting a mine from the blue distribution (origin) and moving vertically along the figure shows where the iron ore is sent Distribution of steel plants (destination).
As part of this new development, Kantorivich introduced an important concept called the Wasserstein distance. Similar to the distance between two points on a map, the Wasserstein distance (also known as the bulldozer distance inspired by its original scenario) measures the distance between two distributions, such as the blue and magenta distributions in this case. If all iron mines are far from all iron plants, then the Wasserstein distance between the distribution (location) of mines and the distribution of steel plants will be large. Even with these new improvements, it's still unclear whether there really is a best way to transport iron ore resources, let alone which method. Finally, in the 1990s, the theory began to develop rapidly as improvements in mathematical analysis and optimization led to partial solutions to the problem. In the 21st century, optimal transportation began to spread to other fields, such as particle physics, fluid dynamics, and even statistics and machine learning.
Optimal Transportation in Modern Times
With the explosion of new theories, optimal transportation has become the center of many new statistical and artificial intelligence algorithms over the past two decades. In almost every statistical algorithm, data are modeled, explicitly or implicitly, as having some underlying probability distribution. For example, if data on individual income are collected in different countries, there will be a probability distribution of that population's income in each country. If you wish to compare two countries based on the income distribution of their population, you need a way to measure the gap between the two distributions. This is exactly why optimizing transportation (especially Wasserstein distance) becomes so useful in data science. However, Wasserstein distance is not the only measure of the distance between two probability distributions. In fact, due to their connection to physics and information theory, the two options L-2 distance and Kullback-Leibler (KL) divergence have historically been more common. The main advantage of Wasserstein distance over these alternatives is that it takes into account both the values and their probabilities when calculating the distance, whereas L-2 distance and KL divergence only take into account probabilities. The image below shows an example of an artificial dataset on income for three fictional countries.
In this case, since the distributions do not overlap, the L-2 distance (or KL divergence) between the blue and magenta distributions will be the same as the blue and magenta distributions The L-2 distance between green distributions is roughly the same. On the other hand, the Wasserstein distance between the blue and magenta distributions will be much smaller than the Wasserstein distance between the blue and green distributions because there is a significant difference between the values (horizontal separation). This property of Wasserstein distance makes it ideal for quantifying differences between distributions, especially differences between data sets.
Achieving Fairness with Optimal Transport
With massive amounts of data being collected every day and machine learning becoming more common in many industries, data scientists must be increasingly careful not to let them Analytics and algorithms perpetuate existing biases and biases in the data. For example, if a home mortgage approval data set contains information about the race of applicants, but minorities were discriminated against in the collection process due to the methods used or unconscious bias, then a model trained on that data will reflect the underlying deviation.
Optimizing transportation can help mitigate this bias and improve fairness in two ways. The first and simplest method is to use Wasserstein distance to determine whether there is potential bias in the data set. For example, one can estimate the Wasserstein distance between the distribution of loan amounts approved for women and the distribution of loan amounts approved for men. If the Wasserstein distance is very large, that is, statistically significant, then a potential bias may be suspected. This idea of testing whether there is a difference between two groups is known in statistics as a two-sample hypothesis test.
Alternatively, optimal shipping can even be used to enforce fairness in the model when the underlying dataset itself is biased. This is useful from a practical perspective, as many real-world datasets exhibit some degree of bias, and collecting unbiased data can be very expensive, time-consuming, or unfeasible. Therefore, it is more practical to use existing data, no matter how imperfect, and try to ensure that the model mitigates this bias. This is accomplished by enforcing a constraint in the model called strong demographic parity, which forces model predictions to be statistically independent of any sensitive attributes. One approach is to map the distribution of model predictions to the distribution of adjusted predictions that do not depend on sensitive attributes. However, adjusting predictions also changes the performance and accuracy of the model, so there is a trade-off between model performance and the degree to which the model relies on sensitive attributes (i.e., fairness).
Achieve optimal shipping by changing predictions as little as possible to ensure optimal model performance, while still ensuring that new predictions are independent of sensitive attributes. The new distribution predicted by this adjusted model is called the Wasserstein centroid and has been the subject of much research over the past decade. The Wasserstein center of gravity is similar to the mean of a probability distribution in that it minimizes the total distance from itself to all other distributions. The image below shows three distributions (green, blue and magenta) along with their Wasserstein centers of gravity (red).
In the above example, suppose a model is built to predict someone's age and income based on a dataset that contains a sensitive attribute, such as marital status. There are three possible values: single (blue), married (green), and widowed/divorced (magenta). The scatter plot shows the distribution of model predictions for each different value. But wishing to adjust these values so that the predictions of the new model are blind to a person's marital status, each of these distributions can be mapped to the center of gravity in red using optimal transport. Because all values map to the same distribution, one can no longer judge a person's marital status based on income and age, or vice versa. The center of gravity preserves the fidelity of the model as much as possible.
The increasing ubiquity of data and machine learning models used in business and government decision-making has led to the emergence of new social and ethical questions about how to ensure the fair application of these models. Many datasets contain some kind of bias due to the nature of how they are collected, so it is important that models trained on them do not exacerbate this bias or any historical discrimination. Optimal transportation is just one way to solve this problem, which has been growing in recent years. Nowadays, there are fast and efficient ways to calculate optimal transportation maps and distances, making this approach suitable for modern large data sets. As people increasingly rely on data-based models and insights, fairness has and will continue to be a core issue in data science, and optimal transportation will play a key role in achieving this goal.
Original title: Optimal Transport and its Applications to Fairness, author: Terrence Alsup
The above is the detailed content of Optimal transportation and its application to fairness. For more information, please follow other related articles on the PHP Chinese website!
![Can't use ChatGPT! Explaining the causes and solutions that can be tested immediately [Latest 2025]](https://img.php.cn/upload/article/001/242/473/174717025174979.jpg?x-oss-process=image/resize,p_40)
ChatGPT is not accessible? This article provides a variety of practical solutions! Many users may encounter problems such as inaccessibility or slow response when using ChatGPT on a daily basis. This article will guide you to solve these problems step by step based on different situations. Causes of ChatGPT's inaccessibility and preliminary troubleshooting First, we need to determine whether the problem lies in the OpenAI server side, or the user's own network or device problems. Please follow the steps below to troubleshoot: Step 1: Check the official status of OpenAI Visit the OpenAI Status page (status.openai.com) to see if the ChatGPT service is running normally. If a red or yellow alarm is displayed, it means Open

On 10 May 2025, MIT physicist Max Tegmark told The Guardian that AI labs should emulate Oppenheimer’s Trinity-test calculus before releasing Artificial Super-Intelligence. “My assessment is that the 'Compton constant', the probability that a race to

AI music creation technology is changing with each passing day. This article will use AI models such as ChatGPT as an example to explain in detail how to use AI to assist music creation, and explain it with actual cases. We will introduce how to create music through SunoAI, AI jukebox on Hugging Face, and Python's Music21 library. Through these technologies, everyone can easily create original music. However, it should be noted that the copyright issue of AI-generated content cannot be ignored, and you must be cautious when using it. Let’s explore the infinite possibilities of AI in the music field together! OpenAI's latest AI agent "OpenAI Deep Research" introduces: [ChatGPT]Ope

The emergence of ChatGPT-4 has greatly expanded the possibility of AI applications. Compared with GPT-3.5, ChatGPT-4 has significantly improved. It has powerful context comprehension capabilities and can also recognize and generate images. It is a universal AI assistant. It has shown great potential in many fields such as improving business efficiency and assisting creation. However, at the same time, we must also pay attention to the precautions in its use. This article will explain the characteristics of ChatGPT-4 in detail and introduce effective usage methods for different scenarios. The article contains skills to make full use of the latest AI technologies, please refer to it. OpenAI's latest AI agent, please click the link below for details of "OpenAI Deep Research"

ChatGPT App: Unleash your creativity with the AI assistant! Beginner's Guide The ChatGPT app is an innovative AI assistant that handles a wide range of tasks, including writing, translation, and question answering. It is a tool with endless possibilities that is useful for creative activities and information gathering. In this article, we will explain in an easy-to-understand way for beginners, from how to install the ChatGPT smartphone app, to the features unique to apps such as voice input functions and plugins, as well as the points to keep in mind when using the app. We'll also be taking a closer look at plugin restrictions and device-to-device configuration synchronization

ChatGPT Chinese version: Unlock new experience of Chinese AI dialogue ChatGPT is popular all over the world, did you know it also offers a Chinese version? This powerful AI tool not only supports daily conversations, but also handles professional content and is compatible with Simplified and Traditional Chinese. Whether it is a user in China or a friend who is learning Chinese, you can benefit from it. This article will introduce in detail how to use ChatGPT Chinese version, including account settings, Chinese prompt word input, filter use, and selection of different packages, and analyze potential risks and response strategies. In addition, we will also compare ChatGPT Chinese version with other Chinese AI tools to help you better understand its advantages and application scenarios. OpenAI's latest AI intelligence

These can be thought of as the next leap forward in the field of generative AI, which gave us ChatGPT and other large-language-model chatbots. Rather than simply answering questions or generating information, they can take action on our behalf, inter

Efficient multiple account management techniques using ChatGPT | A thorough explanation of how to use business and private life! ChatGPT is used in a variety of situations, but some people may be worried about managing multiple accounts. This article will explain in detail how to create multiple accounts for ChatGPT, what to do when using it, and how to operate it safely and efficiently. We also cover important points such as the difference in business and private use, and complying with OpenAI's terms of use, and provide a guide to help you safely utilize multiple accounts. OpenAI


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

MinGW - Minimalist GNU for Windows
This project is in the process of being migrated to osdn.net/projects/mingw, you can continue to follow us there. MinGW: A native Windows port of the GNU Compiler Collection (GCC), freely distributable import libraries and header files for building native Windows applications; includes extensions to the MSVC runtime to support C99 functionality. All MinGW software can run on 64-bit Windows platforms.

SublimeText3 Chinese version
Chinese version, very easy to use

DVWA
Damn Vulnerable Web App (DVWA) is a PHP/MySQL web application that is very vulnerable. Its main goals are to be an aid for security professionals to test their skills and tools in a legal environment, to help web developers better understand the process of securing web applications, and to help teachers/students teach/learn in a classroom environment Web application security. The goal of DVWA is to practice some of the most common web vulnerabilities through a simple and straightforward interface, with varying degrees of difficulty. Please note that this software

Zend Studio 13.0.1
Powerful PHP integrated development environment

PhpStorm Mac version
The latest (2018.2.1) professional PHP integrated development tool
