Institutional Affiliation: Duke University
|A Framework for Sharing Confidential Research Data, Applied to Investigating Differential Pay by Race in the U. S. Government|
with , , , , , , , : w23534
Data stewards seeking to provide access to large-scale social science data face a difficult challenge. They have to share data in ways that protect privacy and confidentiality, are informative for many analyses and purposes, and are relatively straightforward to use by data analysts. We present a framework for addressing this challenge. The framework uses an integrated system that includes fully synthetic data intended for wide access, coupled with means for approved users to access the confidential data via secure remote access solutions, glued together by verification servers that allow users to assess the quality of their analyses with the synthetic data. We apply this framework to data on the careers of employees of the U. S. federal government, studying differentials in pay by race. ...
Published: Barrientos, Andres F., Alexander Bolton, Tom Balmat, Jerome P. Reiter, John M. de Figueiredo, Ashwin Machanavajjhala, Yan Chen, Charley Kneifel, and Mark DeLong (2018). “A Framework for Sharing Confidential Research Data, Applied to Investigating Differential Pay by Race in the U.S. Government,” Annals of Applied Statistics 12(2): 1124-1156.