record with the larger weight and it was included as another potential record for matching. (For the precise algorithm used, see Springs and Beebout [1976].)
The greatest advantage of statistical matching in comparison with other techniques (mentioned below) is probably the flexibility it provides to data users. Just as imputation provides data users with a rectangular data file that can be input directly into most statistical software packages, statistical matching creates a file on which a variety of analyses, often unanticipated, can be performed. Thus, if one would otherwise use iterative proportional fitting for some purposes, covariance matrices for others, and so on, it does seem easier simply to create a statistically matched file, especially when the analyses cannot be anticipated. If the conditional independence assumption is warranted, or is roughly valid, creating a statistically matched file is a very convenient approach for most data users and one that should provide reasonable results. Statistical matching also allows a considerable reduction in respondent burden and reduces the opportunity for data disclosure.
As pointed out by Sims (1972), statistical matching assumes that Y and Z, given X, are independent. Records from the two files are matched or not matched on the basis of the values of X(A) and X(B). Therefore, there is no additional information in the matched file about the relationship between Y and Z that is not explained by the relationships between X and Y and between X and Z. That is, the approach assumes that if one were to regress a Yi on X(A) and Z, and then regress Yi on X(A), the multiple correlations in the two regressions would be identical.
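The consequence described above can be illustrated with a small simulation. The following is only a sketch under stated assumptions, not the procedure used in any of the cited studies: the data are synthetic, X is a single normal variable, Y and Z are built with a residual correlation `rho` that X does not explain, and matching is a simple nearest-neighbor rule on X. The names `partial_corr`, `nearest_match`, and `simulate` are hypothetical helpers introduced for this illustration.

```python
import numpy as np


def partial_corr(y, z, x):
    """Correlation of y and z after removing a linear effect of x from each."""
    design = np.column_stack([np.ones_like(x), x])
    res_y = y - design @ np.linalg.lstsq(design, y, rcond=None)[0]
    res_z = z - design @ np.linalg.lstsq(design, z, rcond=None)[0]
    return float(np.corrcoef(res_y, res_z)[0, 1])


def nearest_match(x_a, x_b):
    """For each value in x_a, return the index of the closest value in x_b."""
    order = np.argsort(x_b)
    sorted_b = x_b[order]
    # Position of each x_a among the sorted x_b values, kept away from the ends
    pos = np.clip(np.searchsorted(sorted_b, x_a), 1, len(sorted_b) - 1)
    left, right = sorted_b[pos - 1], sorted_b[pos]
    choose_left = (x_a - left) <= (right - x_a)
    return order[np.where(choose_left, pos - 1, pos)]


def simulate(n=2000, rho=0.6, seed=0):
    """Compare the Y-Z partial correlation in a complete file and a matched file.

    Assumed model (illustrative only): Y = X + e1, Z = X + e2, corr(e1, e2) = rho.
    """
    rng = np.random.default_rng(seed)
    cov = [[1.0, rho], [rho, 1.0]]

    def draw():
        x = rng.normal(size=n)
        e = rng.multivariate_normal([0.0, 0.0], cov, size=n)
        return x, x + e[:, 0], x + e[:, 1]

    # File A: in practice only (X, Y) is observed; z_a is kept for comparison.
    x_a, y_a, z_a = draw()
    # File B: an independent sample in which only (X, Z) is observed.
    x_b, _, z_b = draw()

    donor = nearest_match(x_a, x_b)
    true_pc = partial_corr(y_a, z_a, x_a)            # complete-data benchmark
    matched_pc = partial_corr(y_a, z_b[donor], x_a)  # statistically matched file
    return true_pc, matched_pc


if __name__ == "__main__":
    true_pc, matched_pc = simulate()
    print(f"partial correlation, complete file: {true_pc:.3f}")
    print(f"partial correlation, matched file:  {matched_pc:.3f}")
```

Under these assumptions, the complete-file figure should sit near the chosen `rho`, while the matched-file figure should be close to zero: the matched file carries no information about the Y-Z relationship beyond what X explains, whatever the true relationship is.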
Technically speaking, the procedure assumes that Y conditioned on X and Z conditioned on X are independent, or that the partial correlation between a Yi given X(A) and a Zj given X(B) is equal to 0 (which are equivalent notions if one assumes multivariate normality). It is important at this point to consider the mathematical definition of conditional independence. The partial correlation between Yi and Zj conditioned on X is equal to
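For the simplest case of a single matching variable X, the expression can be sketched in its standard textbook form. The notation is an assumption made here and may differ from the report's own symbols: \rho_{UV} denotes the ordinary correlation between U and V.

```latex
\rho_{Y_i Z_j \cdot X}
  = \frac{\rho_{Y_i Z_j} - \rho_{Y_i X}\,\rho_{Z_j X}}
         {\sqrt{\left(1 - \rho_{Y_i X}^{2}\right)\left(1 - \rho_{Z_j X}^{2}\right)}}
```

Setting this quantity to 0 gives \rho_{Y_i Z_j} = \rho_{Y_i X}\,\rho_{Z_j X}. In other words, under the assumption the correlation between Y_i and Z_j is fully determined by each variable's correlation with X.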