with the results of the initial match. The dissatisfaction was primarily with estimates of numbers of recipients and aggregate amounts for several income types in the AF. The presence of several income types was given a larger role in the rematch in order to improve these estimates.
Finally, Radner (1983:137) notes: “Because the EM sample…contains few high-income records…it was decided to add more AF records at $30,000 AGI (adjusted gross income) and above.” For the EM records, this match replaced the previous match.
This statistical match was followed by several instances of controlling to various totals from a variety of sources using a variety of techniques, including the addition of recipients for certain income types, such as transfer income. Other changes were the additional inflating of amounts and audit corrections. The usual CPS weights were not used; instead, the weights were adjusted for the census undercount and for consistency with administrative control totals.
The model that used the statistical match for input was not strictly a microsimulation model, since no program changes were simulated with this data set, just changes to income estimates for various subgroups. However, the adjustments used are typical of those used on microsimulation data sets and so are relevant to consider.
The federal individual income tax form lacks certain types of information, including sources of income and types of expenditures not subject to tax under current law, links between taxpayers and families, and information on nonfilers. These limitations motivated the creation of the merge file in the Office of Tax Analysis of the Internal Revenue Service (IRS) (see Bristol [1988] and Cilke, Nelson, and Wyscarver [1988]; creation of the most recent merge file is described in detail in Chapter 8 of Volume I). The merge file represents a statistical match of about 60,000 sample households from the March CPS with about 90,000 tax returns obtained from the Statistics of Income (SOI) sample of IRS records. Prior to the match, several variables are imputed to the SOI records, such as itemized deductions for nonitemizers and the share of earnings attributable to husbands and wives. The CPS data are also modified in several ways, including the correction of certain types of income for nonreporting and underreporting.
Before the statistical match is performed, the two files are “aligned” so that they represent the same universe. This process, a complicated form of reweighting, is not very successful because the CPS file has many low-income units and the SOI file is relatively rich in high-income units. For example (Bristol, 1988:118):
In the 1981 merge, 300 “returns” in the highest income class of the CPS
Sign in to access your saved publications, downloads, and email preferences.
Former MyNAP users: You'll need to reset your password on your first login to MyAcademies. Click "Forgot password" below to receive a reset link via email. Having trouble? Visit our FAQ page to contact support.
Members of the National Academy of Sciences, National Academy of Engineering, or National Academy of Medicine should log in through their respective Academy portals.
Thank you for creating a MyAcademies account!
Enjoy free access to thousands of National Academies' publications, a 10% discount off every purchase, and build your personal library.
Enter the email address for your MyAcademies (formerly MyNAP) account to receive password reset instructions.
We sent password reset instructions to your email . Follow the link in that email to create a new password. Didn't receive it? Check your spam folder or contact us for assistance.
Your password has been reset.
Verify Your Email Address
We sent a verification link to your email. Please check your inbox (and spam folder) and follow the link to verify your email address. If you did not receive the email, you can request a new verification link below