In this research paper we propose a measure of inequality with a multidimensional approach. For that purpose, we use a Principal Component Analysis for a set of variables that characterize the households of an economy. Specifically, the proposed methodology is tested using data from Mexico’s National Household Income and Expenses Survey. The results are consistent with those of conventional measures of inequality when we analyze them between two periods of time. However, when the inequality among the States of Mexico is analyzed, the proposed index identifies greater inequality in those where there are gaps in services and poor housing conditions. The methodology that we propose is innovative to analyze multidimensional inequality and its implementation is easier to handle with than the one of the recent methods proposed by the literature. Based on our results, we suggest that if policymakers try to implement an income redistribution, this must be accompanied by an improvement in services that government offers, with an especial attention in education.
En este trabajo de investigación proponemos una medida de desigualdad con un enfoque multidimensional. Para tal fin, usamos un Análisis de Componentes Principales para un conjunto de variables que caracterizan a los hogares de una economía. Específicamente, la metodología propuesta se prueba con los datos de la Encuesta Nacional de Ingresos y Gastos de los Hogares de México. Los resultados son consistentes con los de las medidas convencionales de desigualdad cuando los analizamos entre dos periodos de tiempo. Sin embargo, cuando se analiza la desigualdad entre los Estados de México, el índice propuesto identifica una mayor desigualdad en aquellos donde existen brechas en los servicios y condiciones de vivienda deficientes. La metodología que proponemos es innovadora para analizar la desigualdad multidimensional y su implementación es más fácil de manejar que la de los métodos recientes propuestos por la literatura. Con base en nuestros resultados, sugerimos que si los diseñadores de políticas intentan implementar una redistribución del ingreso, esta debe ir acompañada de una mejora en los servicios que ofrece el gobierno, con especial atención en la educación.
The existing income gap between different sectors of society has intensified among the last years all around the world, while the social status and political power have involved a divided humanity, thereby undermining social and economic development of nations. Indeed, according to
On the words of
Through the evolution of economic science, the study of economic development and poverty has become increasingly important, especially in developing countries. Classical and recent contributions (for example
The study of multidimensional inequality was pioneered by
Recently,
It is unbelievable that, given the broad indices proposed, there is still not an official way to measure the inequality with a multidimensional approach. Hence, in this research we propose a straightforward method to measure multidimensional inequality. However, we do not formally asses the standard set of axioms that an inequality index should satisfy given the econometric treatment of data. In any case, we leave this task to further research.
The natural antecedent of our index, by its multidimensional character, is the Human Development Index (HDI) proposed by the United Nations Development Program (UNDP) in 1990, which opened a new door to think about the welfare of people. However, after receiving strong criticism (for example:
In an attempt to consider a more accurate measure of human development,
Currently, the UNDP has five measures of human development: the HDI, the inequality-adjusted HDI, the Gender Development Index (GDI), the Gender Inequality Index (GII), and the Multidimensional Poverty Index (MPI), however there is still not a measure if inequality which considers the dimensions of human development.
The idea of using the PCA to construct an index has been widely adopted given that the components obtained have the desirable characteristic of being orthogonal (i.e., not correlated). Recently this method has been used in studies of development economics such as the one carried by
Our research aims to develop a new methodology for the creation of an index which measures the inequality with a multidimensional approach. Our proposed method could be used with any data from household surveys that contains information of at least the following variables: income, education, health and some measure of access to services. We show how this method works using data from Mexico. Hence, we measure the degree of inequality using our Multidimensional Inequality Index (MII) among the Mexican states and we compare it with the results derived from the Gini index. We conclude that our index identifies greater inequality in those states where there are gaps in services and poor housing conditions. We also analyze areas; urban and rural, and gender inequality. We find that among the females there are less inequality than among males, with statistically significant differences. On the other hand, we find that rural areas are more unequal than urban ones.
The Principal Component Analysis (PCA) is a statistical technique used for data reduction. This technique helps to reduce the number of variables of any dataset by describing a number of uncorrelated linear combinations of the variables that contain most of the variance. The PCA was originated with the work of
The aim of the PCA is to find the vector of linear combinations of the variables with the greatest variance. The first principal component has the highest overall variance. The second major component has maximum variance among all linear combinations of the vector that are not correlated with the first principal component, etc. The last major component has the smallest variance among all linear combinations of vector variables.
For purposes of our research, let define
Given the PCA, we obtain
Although the selection of the eigenvector using the Kaiser criterion is standard and straightforward, it is important to include just the ones with enough variance with the purpose of have differences among the individuals of the household surveys.
Suppose that we consider just
The interpretation of our MII is analogous to the one of the Gini Index; namely, the values closest to 0 mean perfect equality, while the values closest to the unity represent perfect inequality.
To perform the computation and measurement of the Multidimensional Inequality Index (MII), we use as source of information the ENIGH, for the year 2014, which is published by the National Institute of Statistics and Geography (INEGI). The ENIGH data provide a statistical overview of the performance of revenues and expenditures of Mexican households in terms of amount, origin and distribution. This survey is statistically representative at national level, and for the States of Puebla and Tabasco. Particularly we use the Socioeconomic Conditions Module (MCS), which is an annex to the ENIGH and which provides information on occupational and socio-demographic characteristics of household members, and the characteristics of the infrastructure of housing and household equipment, the latter was especially useful to include multidimensional characteristics to the MII. The MCS is statistically representative at national level and at State level, so do our results.
The multidimensional characteristics require, as Anand and
For the case of Mexico, the MII proposed is composed of 10 variables that characterize the socioeconomic status of household members, namely: institution that provides health service as the survival component, years of schooling as the education component, and total current income per-capita as the income component. Additionally, we consider overcrowding, fuel for cooking, drainage, access to potable water, construction material of walls, ceilings and floors of the house, as measures of the provision of public services in addition to the income component. The values of each variable were sorted in ascending order, giving the lowest value to the lower status and increasing with better conditions, except for the variable “overcrowding”.
The treatment of schooling was special, because in order to control for the agents’ age, we propose to consider the ratio of the exponential of years of schooling,
Where the years of schooling were calculated with the variables: approved level, degree approved and, in the case of Technical Career level, the school history variable. We only consider the population of 6 years and older. The lowest school year (1) corresponds to the first passing grade of primary, while the last school year (25) corresponds to the sixth year of doctoral studies. The value of 0 is assigned to those who have no years of study.
The reason to express the schooling in this way is the weight that is given to the years of schooling to adults, this means that it is normal for children and youth to be studying, so there are not a situation of inequality, while adults will be punished or rewarded for years of study according to their age.
Source: Data from household's members interviewed in the ENIGH 2014.
Variable
Mean
Standard
Minimum
Maximum
Total Current Income Per Capita
3,459.2
8,346.6
0.0
1,858,807.0
Institution that Provides Health Service
2.4
1.1
1.0
7.0
Years of Study
8.1
4.6
0.0
25.0
Drain Destination
4.5
1.0
1.0
5.0
Water Origin
6.4
1.5
1.0
7.0
Walls Material
7.8
0.8
1.0
8.0
Roofs Material
8.4
2.8
1.0
10.0
Floors Material
2.4
0.6
1.0
3.0
Cooking Fuel
3.5
1.2
1.0
5.0
Crowding
1.4
1.0
0.1
21.0
Observations
190,296
190,296
190,296
190,296
Regarding the rest of the variables that make up the MII, we note that individuals of an average household of Mexico have a monthly income of $3,459.23 pesos with a standard deviation of $8,346.59. In the case of health, on average, the population is served by the Seguro Popular or by the Mexican Institute of Social Security (IMSS).
On the other hand, regarding the characteristics of dwelling, on average, households are connected to a septic tank or public network, while the material of construction of the houses may be of brick, block, stone, quarry, cement or concrete walls and; cement or some other type of coating as wood or tile for floors. The fuel used for cooking in homes is mostly electricity and gas tank. The overcrowding indicator, on average, is 1.40 people. Finally, on average, members of Mexican families have 8.1 years of schooling equivalent to the second year of secondary school.
Before applying the PCA we conducted a correlation analysis in order to assess the relevance of including the variables to be analyzed;
Note: The correlation shown corresponds to the Spearman Correlation Coefficient.
Variable
Income per Capita
Health Service
Drain Destination
Water Origin
Walls Material
Roofs Material
Floors Material
Cooking Fuel
Crowding
Years of Study
Income Per Capita
1.00
Health Service
0.16
1.00
Drain Destination
0.09
0.15
1.00
Water Origin
0.09
0.15
0.39
1.00
Walls Material
0.05
0.09
0.25
0.20
1.00
Roofs Material
0.10
0.17
0.34
0.32
0.38
1.00
Floors Material
0.16
0.23
0.31
0.30
0.25
0.41
1.00
Cooking Fuel
0.13
0.21
0.47
0.38
0.25
0.40
0.39
1.00
Crowding
-0.14
-0.17
-0.25
-0.23
-0.24
-0.28
-0.33
-0.30
1.00
Years of Study
0.04
0.03
0.01
0.01
0.01
0.01
0.02
0.01
-0.02
1.00
The 10 selected variables,
Where
Note:
0.15
0.66
-0.25
0.39
0.22
0.49
-0.24
0.53
0.38
-0.19
0.06
0.52
0.35
-0.15
0.03
0.60
0.30
-0.25
0.14
0.64
0.40
-0.16
0.07
0.49
0.39
0.10
-0.04
0.53
0.41
-0.07
0.00
0.48
-0.32
-0.11
0.04
0.67
0.02
0.39
0.92
0.00
According to the scores on the variables we can see that the
To ensure the selection of the eigenvectors we plot the eigenvalues with a confidence interval of 99%.
We also apply the analysis of Kaiser-Meyer-Olkin (KMO) to compare the correlations and partial correlations between the 10 variables. If the partial correlations are relatively higher compared to the total ones, the extent of KMO is small implying that it is not possible to generate a low-dimensional representation with data. On average, the variables have a rating of 0.84 (
0.817
0.860
0.830
0.870
0.835
0.837
0.859
0.838
0.884
0.730
Because the interpretation of loadings makes more sense without rotating the eigenvectors than rotating them, we proceed to use it for the construction of the MII. Note that for some agents, the eigenvectors have corresponding negative elements, so we transform them in order to have positive numbers using its minimum values,
Using equation (1), we have that the MII for Mexico is obtained with the following formula:
We use this formula to calculate the MII nationally, by state, by areas: rural and urban, and by gender. In addition, to contrast the MII obtained, we calculate other indices of inequality: Coefficient of Variation, Gini index, Theil index, Atkinson index, among others.
The application of the methodology that we propose suggests that Mexico has a MII equal to 0.85, i.e., it is a country with lots of multidimensional inequality. But how serious is this? Suppose there is a country with exactly the same characteristics as Mexico, but with a more equal income distribution. To do this we generated an income variable with a normal distribution, taking the average income of Mexico ($3,459.23 pesos) and assuming a lower standard deviation ($800). Given these assumptions, we get a MII of 0.50. This means that even with an improvement in income distribution in Mexico, our index suggests that there is a gap in living conditions and opportunities for Mexicans, however, the MII improves substantially. In contrast, if we taste the Gini index for this imaginary country, we get a Gini of 0.13 which suggest an egalitarian population. This clearly omits the inequality among the many human development indicators.
Furthermore, let consider the income distribution in Iceland, which implies approximately a Coefficient of Variation (CV) of 0.40 (the CV in Mexico was 2.40 in 2014). We use the Iceland data since it is the country with less inequality worldwide, according to the European Commission (2016), with a Gini index of 0.227. Again we take the other household characteristics equal to the Mexican ones. If Mexico could redistribute the wealth, as well as Iceland does, but preserving the same access to opportunities as in the present, we get a MII of 0.48. This means that our index still suggest the existence of inequality although the income is redistributed. That is why we propose that an income redistribution must be accompanied by an improvement in services that the government offers (both in quality and access), mainly the education system because of the loadings received in the PCA from this particular component.
Note: The corresponding measures in column 2014* coincide to the ones with the income corrected by multidimensionality using the elements of the MII by agent.
Inequality Measures
2014
2008
Difference
Change
2014*
Difference
Change
Multidimensional Inequality Index
0.853
0.701
0.152
21.69
Relative Mean Deviation
0.367
0.361
0.005
1.48
0.396
0.029
7.97
Coefficient of Variation
2.403
1.624
0.779
47.99
6.853
4.450
185.22
Gini Coefficient
0.505
0.499
0.006
1.28
0.543
0.038
7.51
Theil Entropy Measure
0.551
0.501
0.050
10.02
0.755
0.204
36.97
Atkinson Inequality Measure
0.368
0.365
0.004
0.98
0.418
0.050
13.59
On the other hand, as a reference point, we compere the MII in 2014 with the one in 2008, both for the Mexican data. The financial crisis of 2008 in the United States of America provoked a slowdown in Mexican economic activity since the four quarter of 2008. For this year the MII was 0.70, and six years later was 0.85. Hence, the multidimensional inequality in Mexico raised for this period, meaning worse conditions for the distribution of wealth. In contrast, the GDP increased by a mean rate of 2.1 percent (see
Note: We omitted the population from zero to five years old given that their no-schooling affected the index, despite the number of household members is considered in the variable "crowding". ***p<0.01, **p<0.05, *p<0.1
Variable
MII
Difference
Urban
Rura
0.8491
0.8656
-0.0165***
Population
69,400,000
20,900,000
Male
Female
Gender
0.8530
0.8527
0.0004***
Population
43,700,000
46,600,000
The conventional inequality measures also predict an increase of inequality between 2008 and 2014 but in different magnitudes. If we consider the percentage change in each index we could assess how intense was this increase. The results in
Ranking
State
Gini
State
MII
1
Puebla
0.578
Chiapas
0.867
2
Chiapas
0.513
Guerrero
0.867
3
Coahuila
0.510
Oaxaca
0.866
4
Oaxaca
0.508
Veracruz
0.860
5
Distrito Federal
0.507
Tabasco
0.860
6
Yucatán
0.507
Campeche
0.859
7
Hidalgo
0.506
Puebla
0.856
8
Zacatecas
0.506
Michoacán
0.856
9
Quintana Roo
0.500
San Luis Potosí
0.855
10
Campeche
0.498
Yucatán
0.855
11
Aguascalientes
0.490
Hidalgo
0.855
12
Veracruz
0.490
Morelos
0.855
13
Guerrero
0.489
Durango
0.853
14
Querétaro
0.488
Zacatecas
0.852
15
Sinaloa
0.487
Tlaxcala
0.852
16
San Luis Potosí
0.478
Nayarit
0.852
17
Tamaulipas
0.478
Baja California
0.851
18
Sonora
0.475
Quintana Roo
0.851
19
Jalisco
0.471
México
0.851
20
Morelos
0.468
Guanajuato
0.851
21
Nayarit
0.467
Sinaloa
0.851
22
México
0.464
Baja California
0.851
23
Baja California Sur
0.455
Tamaulipas
0.851
24
Chihuahua
0.455
Chihuahua
0.850
25
Nuevo León
0.455
Querétaro
0.850
26
Tabasco
0.454
Colima
0.850
27
Guanajuato
0.453
Sonora
0.849
28
Colima
0.452
Coahuila
0.847
29
Michoacán
0.450
Jalisco
0.847
30
Durango
0.448
Aguascalientes
0.846
31
Baja California
0.434
Nuevo León
0.845
32
Tlaxcala
0.414
Distrito Federal
0.845
We also tasted the MII to analyze the inequality between urban areas and rural ones, as well as by gender. Note that in all cases the MII is calculated considering the total population, thus, the differences corresponds to the inherent characteristics of the both aspects considered by our index. For example, we find that the rural areas in Mexico are more unequal than urban ones, maybe this is because in the former there are gaps in services and poor housing conditions, meanwhile in the later most of the populations have access to public services. We also find that among the males there are more inequality than among the females; given that the household characteristics does not change between male and female we conclude that the differences must be due to the income, health and education. In the area as well as in the gender analysis we obtained statistically significant differences (see
Finally, we compare the MII with the Gini index to assess the changes in the rankings when the inequality is measured among the states of the Mexican Republic (this kind of analysis could be done when comparing the rankings among countries). Although the Gini index considers that the most unequal state of Mexico is Puebla, the MII situate it in the seventh place.
The most radical change is in the Distrito Federal (DF) which passes from the fifth place to the last one, i.e., the MII consider that this state is the most egalitarian. This result must be related to the fact that most of people have access to basic services at the DF. That is why we conclude that our index identifies greater inequality in those states where there are gaps in services and poor housing conditions. Analogously the Gini index classifies the Tlaxcala state as the less unequal whilst the MII classifies it in the middle of the distribution (see
Given that most of the variables included in the MII are related to those used for CONEVAL in the Multidimensional Poverty Index (MPI) calculus, we conducted an analysis of both indices (see
After consider the ordering of Mexico states for both indices we conclude that, although there are similarities in the extremes of the distribution, the MII offers a more flexible view of services access. The reason is because the MPI classifies people according to binaries criterions, which yields a discrete distribution; by the other hand, the PCA deals the agents characteristics in the linear combination of the multidimensional variables, hence, the resulting MII has a quasi-continuous distribution. This characteristic provides a much rich information about the differences among agents.
In this research paper we propose a new methodology to analyze the inequality using a multidimensional perspective. This new methodology is based on a PCA to use the eigenvectors as the elements of a Multidimensional Inequality Index (MII). The eigenvectors are normalized and are included in the index as an arithmetic mean. The interpretation of the MII is analogous to the one of the Gini index.
In a simulation conducted we used an income distribution similar to the less unequal country in the world (Iceland), but unchanging the other characteristics for Mexico. We find that, if Mexico could redistribute the wealth as well as Iceland does, but preserving the same access to opportunities as in the present, the MII still suggest the existence of inequality. That is why we propose that an income redistribution must be accompanied by an improvement in services that the government offers, with an especial attention to education.
We also find that our results are consistent with those of conventional inequality measures when we analyze it between two periods of time; specifically we use the beginning of the financial crisis (2008) and the most recent period available (2014). Likewise, we propose to use the MII’s elements to correct the conventional measures to incorporate a multidimensional approach. In all cases the corrected inequality measures predicts a harsher inequality.
Finally, we compared the inequality among the states of Mexico using the Gini index and the MII, the later identifies greater inequality in those states where there are gaps in services and poor housing conditions. Furthermore, the inequality in Mexico is analyzed considering the gender and the region of the agents studied. In the first case, we find that among the females there are less inequality than among males, with statistically significant differences, while in the second case, we find that rural areas are more unequal than urban ones.
In advance, we suggests that the normalization property cannot be satisfied given that our index do not only consider income. The symmetry property and the principle of Dalton are indeed fulfilled. The principle of Pigou-Dalton and the property of independence of scale are achieved and both are proven in the section 4. Finally, the continuity and the differentiability properties are satisfied.
The main criterion is the variance of the eigenvector in itself, however this will be discussed later.
In the case of the institution that provides health service we consider the absence of health service with a zero value and the private service with a value of seven. All the other kind of health services provided are in between.
In 2009 CONEVAL unveiled a new methodology for measuring multidimensional poverty in Mexico. The multidimensional measurement of poverty takes into account two dimensions: economic welfare and social rights. The dimension of economic well-being is only an indicator of current income, while social rights dimension has six indicators: educational gap, access to health care, social security, space and housing materials, and access to services and to food.
Ranking
State
MPI
State
MII
1
Chiapas
0.752
Chiapas
0.867
2
Oaxaca
0.659
Guerrero
0.867
3
Guerrero
0.642
Oaxaca
0.866
4
Puebla
0.633
Veracruz
0.860
5
Michoacán
0.583
Tabasco
0.860
6
Tlaxcala
0.582
Campeche
0.859
7
Veracruz
0.572
Puebla
0.856
8
Hidalgo
0.535
Michoacán
0.856
9
Zacatecas
0.515
San Luis Potosí
0.855
10
Morelos
0.510
Yucatán
0.855
11
San Luis Potosí
0.485
Hidalgo
0.855
12
Tabasco
0.483
Morelos
0.855
13
México
0.479
Durango
0.853
14
Guanajuato
0.460
Zacatecas
0.852
15
Yucatán
0.450
Tlaxcala
0.852
16
Durango
0.429
Nayarit
0.852
17
Campeche
0.423
Baja California
0.851
18
Nayarit
0.393
Quintana Roo
0.851
19
Sinaloa
0.382
México
0.851
20
Tamaulipas
0.368
Guanajuato
0.851
21
Quintana Roo
0.352
Sinaloa
0.851
22
Jalisco
0.347
Baja California
0.851
23
Aguascalientes
0.337
Tamaulipas
0.851
24
Colima
0.333
Chihuahua
0.850
25
Chihuahua
0.333
Querétaro
0.850
26
Querétaro
0.331
Colima
0.850
27
Coahuila
0.293
Sonora
0.849
28
Baja California Sur
0.293
Coahuila
0.847
29
Sonora
0.286
Jalisco
0.847
30
Baja California
0.276
Aguascalientes
0.846
31
Distrito Federal
0.274
Nuevo León
0.845
32
Nuevo León
0.197
Distrito Federal
0.845
A person is considered multidimensional poor when is deprived in both dimensions (economic welfare and social rights). If a person shows deprivation in any of the six indicators are considered vulnerable by social deprivation. Likewise, showing income deprivation (income vulnerable) and at least any three of the social rights, then is considered to be in extreme multidimensional poverty.
That said, we call Multidimensional Poverty Index (MPI) to the proportion of people in situation of multidimensional poverty with respect to the total population; this apply to nations, regions, states, etc. Since the MPI and the MII use similar variables, it is natural to think that both could yield similar results.
For 2014, the state with more people classified as multidimensional poor (as a proportion of total population) in Mexico was Chiapas, while the state with less multidimensional poor people was Nuevo León (see
If we compare the MPI with the MII among states according to their ranking, it is clear that both indices are similar but just in the extremes of the classification, i.e., the most unequal states agree to be the multidimensional poorest, however, in the middle of the distribution there are significant differences, for example, states like Campeche, Yucatán, Tabasco, and San Luis Potosí are classified by the MPI in the second third while the MII classifies them in the first one (i.e., from a middle classification to an extreme one). The other way is also true for states like Sinaloa and Tamaulipas, i.e., the MPI classifies them in a middle category while the MII does it in a less extreme.
Finally, it is important to say that whether both indices may be similar, the MII offers a more flexible view of services access. The reason is because the MPI classifies people according to binaries criterions, which yields a discrete distribution; on the other hand, the PCA deals with the agents characteristics in the linear combination of the multidimensional variables, hence, the resulting MII has a quasi-continuous distribution. This characteristic provides a much rich information about the differences among agents, and this is one important reason to changes in the states ranking, which gives an idea of the MII precision.


