The analysis of record-linked data using multiple imputation
Goldstein, Harvey and Harron, Katie and Wade, Angie (2012) The analysis of record-linked data using multiple imputation. Statistics in Medicine, n/a (n/a). n/a. ISSN 0277-6715 (In Press)
Abstract
Probabilistic record linkage techniques assign match weights to one or more potential matches for those individual records that cannot be assigned ‘unequivocal matches’ across data files. Existing methods select the single record having the maximum weight provided this weight is higher than an assigned threshold. We argue that this procedure, which ignores all information from matches with lower weights, and for some individuals assigns no match, is inefficient and may also lead to biases in subsequent analysis of the linked data. It is proposed that a multiple imputation framework is utilised for data that belong to records that cannot be matched unequivocally. In this way the information from all potential matches is transferred through to the analysis stage. This procedure allows for the propagation of matching uncertainty through a full modelling process that preserves the data structure. For purposes of statistical modelling, results from a simulation example suggest that a full probabilistic record linkage is unnecessary and that standard multiple imputation will provide unbiased and efficient parameter estimates.
Item Type: | Article |
---|---|
Subjects: | 5. Quantitative Data Handling and Data Analysis > 5.4 Microdata Methods > 5.4.1 Data linkage |
Depositing User: | LEMMA user |
Date Deposited: | 14 May 2012 17:33 |
Last Modified: | 14 Jul 2021 13:55 |
URI: | https://eprints.ncrm.ac.uk/id/eprint/2300 |
Available Versions of this Item
-
The analysis of record-linked data using multiple imputation. (deposited 18 Mar 2012 13:11)
- The analysis of record-linked data using multiple imputation. (deposited 14 May 2012 17:33) [Currently Displayed]