Schelling’s Segregation Model
This model, proposed by the economist Thomas Schelling in 1971, at the time addressed a very important, and current, political issue: how to explain racial segregation in American cities. Specifically, Schelling wondered whether segregation was necessarily the result of racism—the prejudice of individuals of one race against those of another. Indeed, a frequently adopted mental model is that the overall state of segregation merely translates, by aggregation, the racism of individuals. If people actively wanted to live with others of a different color, or did not care at all about their race, then we should find that, globally, neighborhoods mix the different colors. Is this implicit model correct? In other words, does a global segregated state necessarily imply an individual preference for segregation?
To test this hypothesis, Schelling proposed the following model: take a chessboard, whose squares represent dwellings. Each house can be occupied by a green or a red person, or be empty. As is usual in economics, we translate the preferences of people to their neighborhoods by a utility function, which can be arbitrarily chosen. In this example, I will use a function that reflects the fact that people have a strong preference (maximum value taken as one) for a mixed neighborhood—that is to say in which as many reds as greens live. If there are more people of the same color, the value decreases to a value equal to 1/2 (half the maximum). If they find themselves in a neighborhood inhabited by a majority of people of another color, their utility becomes zero (reflecting strong displeasure). It is necessary to specify a last ingredient to determine fully the model, one that will give it its dynamics. We will assume that at every turn, say every day that passes, a resident is chosen at random, and we suggest that he moves into an empty place also chosen at random. This inhabitant then calculates, using his satisfaction function, if this move increases his satisfaction. If this is the case, he will agree to move; otherwise, he refuses and stays in place. Once the choice is made, another resident and another empty box are chosen, always randomly, and the procedure starts again.
Our intuition tells us that, as everyone moves to improve their satisfaction, and this is maximal when neighborhoods are mixed, the city should become increasingly racially mixed. The interest of Schelling’s model is to contradict this implicit modeling. Indeed, starting from a city in which people are randomly distributed and rolling out the simulation, we see that gradually the city becomes increasingly segregated, and this result is robust. With other physicists, we have managed to show mathematically that after a number of moves, for the satisfaction function presented here and for many other similar functions, it is certain that the city, in the end, will be segregated. To simplify, the explanation of the paradox is that when a person chooses to move, she considers only the variation of her own satisfaction, and does not consider that of her neighbors. Do the other residents she leaves or joins see their satisfaction increase or decrease? We show that, overall, average satisfaction decreases, because the loss of many agents affected by the move of one person does not offset the gain experienced by the person who moved. Yet, in this model governed by the individualism that is the norm in economics, the individual gain that dominates the dynamics inevitably leads to a disadvantageous social situation for everyone.
What is the relevance of this simple model to real urban segregation—which is a far more complex phenomenon than the model implies, of course, not least because it results from the preferences of individuals, but also because many other factors come into play, such as house prices, locality of schools, additional characteristics of people such as age or wealth, etc.?
To answer this question, we must examine modeling in general. There are two essential steps for modeling. The first, which is the most difficult and the most important, is to adopt a perspective, a simplification of reality that allows you to tame it and turn it into a formal model that can then (and this is the second step) be solved with digital tools (the computer) or mathematics. The first step is always more or less arbitrary. Which entities are important, and with what characteristics, and, therefore, which variables are considered negligible?
For example, Thomas Schelling has chosen to keep only the individual characteristics (the satisfaction of each person) that are supposed to depend solely on the ethnic composition of the area, assuming the function of satisfaction is immutable, and leaving aside many other factors, as seen above.
We should realize that this step toward simplification is not a stopgap: the models are of interest only by way of the accuracy with which they choose the less important elements. We must forget the fantasy of the perfect model, which completely reproduces reality. It would be as useless as the famous map that is as great in size as the territory it depicts, described by Argentine writer Jorge Luis Borges in “Del rigor en la ciencia.” This is the price we pay in order for the model to become formally manipulable, and allow for predictions (realistic models) or to clarify our ideas (simplistic models). This is true of social models as well as for models of natural systems.
What, then, is the interest of modeling? For realistic models, the answer is well known. It is about building a model, a prototype system, which is under study in order to manipulate it easily and make predictions. For social models, it is often helpful for policymakers to base their decisions on robust models that provide reliable predictions. Pushed to the limit, the idea would be to reach a rational policy at the last, freed from endless discussions. This idea is not new. We could go back to Plato and his Republic ruled by surveyors.
However, what is the point of simple models that do not claim the ability to predict anything? There are two major advantages to this type of modeling: the obligation to explain the models that would otherwise remain implicit, and then their conceptual clarity, which enables us to understand what we do. To understand these two points, I shall go back to Schelling’s segregation model. You might have in mind an implicit model that links the segregation observed at neighborhood level to individual preference for segregation. To put it simply: intuitively, the observation of racial segregation involves individual racism. By modeling formally, we have to specify this fuzzy idea, studying the effect of individual preferences on the choice of residence. We learn that this intuition is not reliable, that the overall effects of individual decisions are not always intuitive. More generally, this model shows rigorously that it is not enough to let economic agents pursue their own interests selfishly, to achieve, inevitably, by the grace of the free market, an optimal situation.
Of course, the model says nothing about what is going on in the real economy. It just says (but it is interesting nonetheless) that, logically, individual profit does not necessarily mean social welfare. The educational interest of simple models is that, thanks to the small number of ingredients they contain, we can understand their results in detail, the causalities at work, without feeling overwhelmed by the complexity of realistic models. Thus, in Schelling’s model, we understand that if the result of accumulated selfishness is bad socially, it is because the movements of individuals are made without considering their effect on their neighbors. We can study these effects in detail and show why they are stronger than the additional satisfaction obtained by the person who moves, leading to an overall negative effect. This ability to understand the causalities at work contrasts with the “black-box” character of realistic models, where the complexity of the processes often prevents us from understanding the origin of specific results.