Highlights
- •Government administrative data is valuable for social sciences research and policy insights.
- •Administrative data is sensitive and must be anonymized to maintain privacy and confidentiality.
- •SIRAD is a tool for anonymizing administrative data in a consistent way for research and insights.
- •To date, SIRAD has anonymized data for nine research studies and 75 policy memos.
Abstract
Keywords
Current code version | 0.3.2 |
Permanent link to code/repository used for this code version | https://github.com/SoftwareImpacts/SIMPAC-2022-4 |
Permanent link to reproducible capsule | https://codeocean.com/capsule/0283272/tree/v1 |
Legal code license | Modified 3-clause BSD |
Code versioning system used | Git |
Software code languages, tools and services used | Python, YAML |
Compilation requirements, operating environments and dependencies | Python |
If available, link to developer documentation/manual | https://github.com/ripl-org/sirad/wiki |
Support email for questions | [email protected] |
Current software version | 0.3.2 |
Permanent link to executables of this version | https://github.com/ripl-org/sirad |
Software License | Modified 3-clause BSD |
Computing platform/Operating System | Linux, Windows, OS X |
Installation requirements & dependencies | Python |
Link to user manual | https://github.com/ripl-org/sirad/wiki |
Support email for questions | [email protected] |
1. Background
J.S. Hastings, Fact-based policy: How do state and local governments accomplish it?, The Hamilton Project (Brookings) Policy Proposal 2019-01 (2019). https://www.brookings.edu/wp-content/uploads/2019/01/Hastings_PP_web_20190128.pdf. (Accessed 28 December 2021).
- Kholod I.I.
- Efimova M.S.
- Kulikov S.Y.
- Biplob M.B.
- Sheraji G.A.
- Khan S.I.
- Biswas N.
- Sarkar A.
- Mondal K.C.
- Patel M.
- Patel D.B.
2. Functionality
- Dayal U.
- Castellanos M.
- Simitsis A.
- Wilkinson K.
3. Example usage
4. Impact
J.S. Hastings, Fact-based policy: How do state and local governments accomplish it?, The Hamilton Project (Brookings) Policy Proposal 2019-01 (2019). https://www.brookings.edu/wp-content/uploads/2019/01/Hastings_PP_web_20190128.pdf. (Accessed 28 December 2021).
- •Labor training, wage roll, and unemployment insurance (UI) records, to estimate the value-added to wages following enrollment in workforce training programs [[11]].
- •Supplemental Nutrition Assistance Program (SNAP) records, UI records, and transaction records from a large US grocery retailer, to demonstrate how a mental accounting model can explain SNAP benefit spending [[12]] and to identify the effects of SNAP participation on the nutritional content of purchased food [[13]].
- •Electricity billing data, to understand the importance of bill timing for low-income and aged households who rely on SNAP and Social Security benefits [[14]].
- Barrage L.
- Chin I.
- Chyn E.
- Hastings J.S.
The impact of bill receipt timing among low-income and aged households: New evidence from administrative electricity bill data.NBER Bull. Retire. Disabil. 2020; (https://www.nber.org/brd/how-bill-timing-affects-low-income-and-aged-households. (Accessed 5 January 2022)) - •Medicaid claims, social benefit program, wage roll, and incarceration records, to predict high-cost use of emergency departments that could be diverted to more appropriate care [[15]].
- •Birth, Medicaid claims, social benefit program, and education records, to understand the impact of early-life interventions for very low birth weight children on later-life health and educational outcomes and social program expenditures [[16]].
- Chyn E.
- Gold S.
- Hasting J.S.
The returns to early-life interventions for very low birth weight children.J. Health Econ. 2021; 75102400https://doi.org/10.1016/j.jhealeco.2020.102400 - •Medicaid claims, social benefit program, wage roll, incarceration, and criminal history records, to predict adverse outcomes that could result from opioid therapy before the initial opioid prescription is written [[17]].
- •Voter registration, voting history, social benefit program, wage roll, and driver’s license records, to examine the impact of a state photo ID law on voter turnout and registration [[18]].
F.M. Esposito, D. Focanti, J.S. Hastings, Effects of Photo ID Laws on Registration and Turnout: Evidence from Rhode Island, NBER Working Paper No. 25503, 2019, https://www.nber.org/papers/w25503 (Accessed 5 January 2022).
- •Child protective services and education records, to measure the impact of removing children from abusive and neglectful homes on educational outcomes [[19]].
A. Bald, E. Chyn, J.S. Hasting, M. Machelett, The Causal Impact of Removing Children from Abusive and Neglectful Homes, NBER Working Paper No. 25419, 2019, https://www.nber.org/papers/w25419 (Accessed 5 January 2022).
5. Discussion
Declaration of Competing Interest
Acknowledgments
Funding
Appendix A. Supplementary tables
- MMC S1
References
- The role of administrative data in the big data revolution in social science research.Soc. Sci. Res. 2016; 59: 1-12https://doi.org/10.1016/j.ssresearch.2016.04.015
J.S. Hastings, Fact-based policy: How do state and local governments accomplish it?, The Hamilton Project (Brookings) Policy Proposal 2019-01 (2019). https://www.brookings.edu/wp-content/uploads/2019/01/Hastings_PP_web_20190128.pdf. (Accessed 28 December 2021).
- Unlocking data to improve public policy.Commun. ACM. 2019; 62: 48-53https://doi.org/10.1145/3335150
- Data Hashing Application, ADRF User Guide.2021 (https://web.archive.org/web/20210414102424/https://coleridgeinitiative.org/adrf/documentation/adrf-overview/data-hashing-application/. (Accessed 14 April 2021))
- Using ETL tools for developing a virtual data warehouse.in: 2016 XIX IEEE International Conference on Soft Computing and Measurements (SCM). IEEE, St. Petersburg, Russia2016: 351-354https://doi.org/10.1109/SCM.2016.7519778
- Comparison of different extraction transformation and loading tools for data warehousing.in: Proceedings of the 2018 International Conference on Innovations in Science Engineering and Technology. IEEE, Chittagong, Bangladesh2018: 262-267https://doi.org/10.1109/ICISET.2018.8745574
- Empirical analysis of programmable ETL tools.in: Computational Intelligence, Communications, and Business Analytics, CICBA 2018Communications in Computer and Information Science. Vol. 1031. Springer, Singapore2019: 267-277https://doi.org/10.1007/978-981-13-8581-0_22
- Progressive growth of ETL tools: A literature review of past to equip future.in: Rising Threats in Expert Applications and Solutions. Advances in Intelligent Systems and Computing. Vol. 1187. Springer, Singapore2021: 389-398https://doi.org/10.1007/978-981-15-6014-9_45
C.R. Robert, The Soundex coding system. US Patent No. US1261167.
- Data integration flows for business intelligence.in: Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology. ACM, New York, NY, USA2009https://doi.org/10.1145/1516360.1516362
- Estimating value-added returns to labor training programs with causal machine learning, OSF Preprints.2021https://doi.org/10.31219/osf.io/thg23
- How are SNAP benefits spent? Evidence from a retail panel.Amer. Econ. Rev. 2018; 108: 3493-3540https://doi.org/10.1257/aer.20170866
- The effect of SNAP on the composition of purchased foods: Evidence and implications.Am. Econ. J. Econ. Policy. 2019; 13: 277-315https://doi.org/10.1257/pol.20190350
- The impact of bill receipt timing among low-income and aged households: New evidence from administrative electricity bill data.NBER Bull. Retire. Disabil. 2020; (https://www.nber.org/brd/how-bill-timing-affects-low-income-and-aged-households. (Accessed 5 January 2022))
- Predicting Divertible Medicaid Emergency Department Costs, OSF Preprints.2021https://doi.org/10.31219/osf.io/q36es
- The returns to early-life interventions for very low birth weight children.J. Health Econ. 2021; 75102400https://doi.org/10.1016/j.jhealeco.2020.102400
- Predicting high-risk opioid prescriptions before they are given.Proc. Natl. Acad. Sci. 2020; 117: 1917-1923https://doi.org/10.1073/pnas.1905355117
F.M. Esposito, D. Focanti, J.S. Hastings, Effects of Photo ID Laws on Registration and Turnout: Evidence from Rhode Island, NBER Working Paper No. 25503, 2019, https://www.nber.org/papers/w25503 (Accessed 5 January 2022).
A. Bald, E. Chyn, J.S. Hasting, M. Machelett, The Causal Impact of Removing Children from Abusive and Neglectful Homes, NBER Working Paper No. 25419, 2019, https://www.nber.org/papers/w25419 (Accessed 5 January 2022).
Article info
Publication history
Footnotes
The code (and data) in this article has been certified as Reproducible by Code Ocean: (https://codeocean.com/). More information on the Reproducibility Badge Initiative is available at https://www.elsevier.com/physical-sciences-and-engineering/computer-science/journals.
Identification
Copyright
User license
Creative Commons Attribution (CC BY 4.0) |
Permitted
- Read, print & download
- Redistribute or republish the final article
- Text & data mine
- Translate the article
- Reuse portions or extracts from the article in other works
- Sell or re-use for commercial purposes
Elsevier's open access license policy