ESTIMATING INCOME STATISTICS FROM GROUPED DATA: MEAN-CONSTRAINED INTEGRATION OVER BRACKETS
Researchers studying income inequality, economic segregation, and other subjects must often rely on grouped data—that is, data in which thousands or millions of observations have been reduced to counts of units by specified income bracks. The distribution of households within the brackets is unknown...
Saved in:
| Published in | Sociological methodology Vol. 48; no. 1; pp. 337 - 374 |
|---|---|
| Main Authors | , |
| Format | Journal Article |
| Language | English |
| Published |
Los Angeles, CA
SAGE Publishing
01.08.2018
SAGE Publications American Sociological Association |
| Subjects | |
| Online Access | Get full text |
| ISSN | 0081-1750 1467-9531 |
| DOI | 10.1177/0081175018782579 |
Cover
| Summary: | Researchers studying income inequality, economic segregation, and other subjects must often rely on grouped data—that is, data in which thousands or millions of observations have been reduced to counts of units by specified income bracks. The distribution of households within the brackets is unknown, and highest incomes are often included in an open-ended top bracket, such as "$200,000 and above." Common approaches to this estimation problem include calculating midpoint estimators with an assumed Pareto distribution in the top bracket and fitting a flexible multipleparameter distribution to the data. The authors describe a new method, mean-constrained integration over bracfats (MCIB), that is far more accurate than those methods using only the bracket counts and the overall mean of the data. On the basis of an analysis of 297 metropolitan areas, MCIB produces estimates of the standard deviation, Gini coefficient, and Theil index that are correlated at 0.997, 0.998, and 0.991, respectively, with the parameters calculated from the underlying individual record data. Similar levels of accuracy are obtained for percentiles of the distribution and the shares of income by quintiles of the distribution. The technique can easily be extended to other distributional parameters and inequality statistics. |
|---|---|
| Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
| ISSN: | 0081-1750 1467-9531 |
| DOI: | 10.1177/0081175018782579 |