To apply for any of these restricted-use data files:
- New users will need to apply for restricted-use data. Please download and complete the restricted-use data contract using the CPC Data Portal.
- Current Add Health users can log in to their CPC Data Portal application to request additional data.
For more information, please visit the Data Portal’s Frequently Asked Questions page.
Data Released (September 26, 2024)
Contextual Heterosexism Database, Phase 1
Contextual Heterosexism Database-Phase 1 (CHD1) further expands the collection of contextual data available to users of The National Longitudinal Study of Adolescent to Adult Health (Add Health) through the provision of state, county, and tract level measures from the Decennial Census of Population and Housing, American Community Survey (ACS), the Movement Advancement Project (MAP), Lax and Phillips (2009), Public Religion Research Institute (PRRI), Cooperative Election Study (CES), U.S. Religion Census, and Massachusetts Institute of Technology (MIT) Election Lab. These data include indicators of social policies, social climate, and confounding factors related to the study/measurement of structural heterosexism that correspond to Waves 3, 4, and 5. Some of these indicators are new to the Add Health contextual database and others were previously not available at all three of these waves. N=18,352
Mortality Outcomes Surveillance Part I: Ascertaining Decedents User Guide (2022 Update)
Individual Vital Status and Underlying Cause of Death File, 2022
This file contains one record for each of the 20,745 Add Health sample members from Wave I. It provides the vital status of each sample member through 2022 as well as the National Death Index-provided underlying cause of death code in ICD-10 format for each decedent. The month and year of the most recent Add Health interview are provided for living sample members, while the month and year of death are provided for decedents. N=20,745
Ordered Cause of Death File, 2022
This file contains entity- and record-axis codes reported by the National Death Index (NDI) for each decedent in the Add Health sample through 2022. The file is arranged hierarchically, by axis code; therefore, each decedent may have multiple records depending on the maximum number of entity- and record-axis codes recorded by NDI. The sequence of the decedent’s records reflects the order in which the entity- and record-axis codes were reported in the NDI record. N=2,377
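Because each decedent may have several records in this file, analyses usually start by grouping the rows per decedent while keeping the reported order. The following is a minimal sketch of that step in Python, not the file's actual layout: the field names (`decedent_id`, `axis_type`, `sequence`, `code`) and all values are hypothetical placeholders.

```python
from collections import defaultdict

# Hypothetical rows: (decedent_id, axis_type, sequence, code).
# These names and values are illustrative only; check the user guide
# for the real variable names before adapting this.
records = [
    ("A001", "entity", 2, "E119"),
    ("B002", "entity", 1, "X42"),
    ("A001", "record", 1, "I219"),
    ("A001", "entity", 1, "I219"),
]


def codes_by_decedent(rows):
    """Group codes per decedent and axis type, ordered by sequence."""
    grouped = defaultdict(lambda: defaultdict(list))
    # Sort so that codes are appended in their reported sequence.
    for decedent_id, axis_type, _sequence, code in sorted(rows):
        grouped[decedent_id][axis_type].append(code)
    # Convert nested defaultdicts to plain dicts for easier inspection.
    return {key: dict(value) for key, value in grouped.items()}


result = codes_by_decedent(records)
```

Here `result["A001"]["entity"]` holds that decedent's entity-axis codes in sequence order, which is the property the hierarchical arrangement is meant to preserve.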
All Coded Causes of Death File, Including Entity-Axis Codes, 2022
This file contains all underlying cause of death and entity-axis codes appearing in the National Death Index (NDI) source file through 2022. Functioning as dummy variables, zero represents the absence of a code on the decedent’s death certificate, while one denotes the presence of one. N=706
All Coded Causes of Death File, Including Record-Axis Codes, 2022
This file contains all underlying cause of death and record-axis codes appearing in the National Death Index (NDI) source file through 2022. Functioning as dummy variables, zero represents the absence of a code on the decedent’s death certificate, while one denotes the presence of one. N=706
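In the All Coded Causes files, each code acts as a 0/1 indicator, so recovering the set of codes for one decedent amounts to selecting the indicators equal to one. A minimal Python sketch follows, assuming one column per code plus an identifier column; the column names and values are hypothetical, not the released variable names.

```python
# Hypothetical row: an identifier plus one 0/1 indicator per code.
# Column names are placeholders chosen for illustration.
row = {"id": "A001", "I219": 1, "E119": 1, "X42": 0}


def codes_present(record, id_field="id"):
    """Return the codes flagged as present (indicator equal to 1)."""
    return sorted(
        code
        for code, flag in record.items()
        if code != id_field and flag == 1
    )


present = codes_present(row)
```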
FAA Noise Data
Day-Night Level (DNL) Noise Exposures from 90 Major Airports
This file contains estimates of aircraft noise measures around ninety major airports and aircraft noise proxies for approximately 900 additional airports, merged with geopositioned/geocoded Add Health respondent locations over Waves I-VI. It also documents how the aircraft noise source data were acquired, as well as the protocol for quality controlling their assignment across waves. N=83,357
Equivalent Sound Level for a 15-Hour Day (LAEQD) Noise Exposures from 90 Major Airports
This file contains estimates of aircraft noise measures around ninety major airports and aircraft noise proxies for approximately 900 additional airports, merged with geopositioned/geocoded Add Health respondent locations over Waves I-VI. It also documents how the aircraft noise source data were acquired, as well as the protocol for quality controlling their assignment across waves. N=73,174
Equivalent Sound Level for a 9-Hour Night (LAEQN) Noise Exposures from 90 Major Airports
This file contains estimates of aircraft noise measures around ninety major airports and aircraft noise proxies for approximately 900 additional airports, merged with geopositioned/geocoded Add Health respondent locations over Waves I-VI. It also documents how the aircraft noise source data were acquired, as well as the protocol for quality controlling their assignment across waves. N=57,886
Proxies for Aircraft Noise from Other Airports: Airport Counts
This file contains estimates of aircraft noise measures around ninety major airports and aircraft noise proxies for approximately 900 additional airports, merged with geopositioned/geocoded Add Health respondent locations over Waves I-VI. It also documents how the aircraft noise source data were acquired, as well as the protocol for quality controlling their assignment across waves. N=166,195
Proxies for Aircraft Noise from Other Airports: Mean Distances
This file contains estimates of aircraft noise measures around ninety major airports and aircraft noise proxies for approximately 900 additional airports, merged with geopositioned/geocoded Add Health respondent locations over Waves I-VI. It also documents how the aircraft noise source data were acquired, as well as the protocol for quality controlling their assignment across waves. N=256,318
Proxies for Aircraft Noise from Other Airports: Mean Total Enplanements
This file contains estimates of aircraft noise measures around ninety major airports and aircraft noise proxies for approximately 900 additional airports, merged with geopositioned/geocoded Add Health respondent locations over Waves I-VI. It also documents how the aircraft noise source data were acquired, as well as the protocol for quality controlling their assignment across waves. N=250,740
Rural-Urban Commuting Area (RUCA) Codes
Rural-urban commuting area (RUCA) codes classify U.S. census tracts using measures of population density, urbanization, and daily commuting. The data file is based on RUCA codes for census years 1990, 2000, and 2010. It also documents the rationale for and utility of acquiring RUCA codes and assigning them to the census geographies in which Add Health respondents have resided over three decades. N=97,700
Add Health Sample Member Birth Records Database
Birth record data were collected from participating states for Add Health sample member (AHSM) birth years 1974-83. When these states provided birth data for all recorded births occurring during that interval, an AHSM-specific subset was created using Link Plus, record linkage software developed by the U.S. Centers for Disease Control and Prevention (CDC), Cancer Division. One participating state performed its own AHSM linkages and provided Add Health with the linked subset of births. Add Health then performed transformations on all of the original data from the participating states to create the categorical variables present in this release. N=2,750
Data Released (June 12, 2024)
Wave V Contextual Despair
This contextual data set focuses on the social, political, and resource environments of Add Health respondents at the tract, county, and state levels that are relevant to the prevailing causes of death in midlife, namely alcohol-related diseases, drug overdoses and accidental poisonings, and suicide and self-inflicted harm. Most measures are specific to Wave V residential location, though several measures span multiple waves. Measures include the sociodemographic and segregation context, proximity to firearms distributors and alcohol outlets, opioid dispensing, and policies related to alcohol, drugs, and firearms. N=20,745
Data Released (May 7, 2024)
Baroreflex Sensitivity and Hemodynamic Recovery
This file contains constructed measures for baroreflex sensitivity, heart rate recovery, and systolic blood pressure recovery for the Wave V respondents. N=5,381
Measures of Inflammation and Immune Function
This file contains additional measures of inflammation and immune function based on venous blood collected via phlebotomy at the Wave V home exam and then assayed for several cytokines (IL-1β; IL-6; IL-8; IL-10; TNF-α) and anti-cytomegalovirus (CMV) IgG. N=5,381
Neurodegeneration
This file contains two measures of neurodegeneration based on venous blood collected via phlebotomy at the Wave V home exam and then assayed for neurofilament light (NfL) and tau. N=5,381
Data Released (August 11, 2023)
Individual Vital Status and Underlying Cause of Death File, 2021
This file contains one record for each of the 20,745 Add Health sample members from Wave I. It provides the vital status of each sample member through 2021 as well as the National Death Index-provided underlying cause of death code in ICD-10 format for each decedent. The month and year of the most recent Add Health interview are provided for living sample members, while the month and year of death are provided for decedents. N=20,745
Ordered Cause of Death File, 2021
This file contains entity- and record-axis codes reported by the National Death Index (NDI) for each decedent in the Add Health sample through 2021. The file is arranged hierarchically, by axis code; therefore, each decedent may have multiple records depending on the maximum number of entity- and record-axis codes recorded by NDI. The sequence of the decedent’s records reflects the order in which the entity- and record-axis codes were reported in the NDI record. N=2,123
All Coded Causes of Death File, Including Entity-Axis Codes, 2021
This file contains all underlying cause of death and entity-axis codes appearing in the National Death Index (NDI) source file through 2021. Functioning as dummy variables, zero represents the absence of a code on the decedent’s death certificate, while one denotes the presence of one. N=647
All Coded Causes of Death File, Including Record-Axis Codes, 2021
This file contains all underlying cause of death and record-axis codes appearing in the National Death Index (NDI) source file through 2021. Functioning as dummy variables, zero represents the absence of a code on the decedent’s death certificate, while one denotes the presence of one. N=647
Data Released (May 31, 2023)
Polygenic Index Inventories – Release 2
This data file is a 2022 update of the polygenic scores computed by the Social Science Genetic Association Consortium (SSGAC) for anthropometric traits, cognition/education, fertility/sexual development, health/health behaviors, and personality/well-being. N=5,689
Data Released (May 12, 2023)
Sexual Orientation/Gender Identity, Socioeconomic Status, and Health across the Life Course (SOGI-SES)
This file contains new survey data to support exploration of the relationships among sexual orientation, gender identity, gender expression, romantic and sexual behaviors, socioeconomic status, and health. It contains social, demographic, behavioral, and health data collected in 2020-2021 on a sample of Add Health Wave V participants. N=2,614
Sexual Orientation/Gender Identity, Socioeconomic Status, and Health across the Life Course (SOGI-SES) – Sensitive
This file contains the SOGI-SES study’s sensitive data variables related to gender identity, in vitro fertilization, and HIV status. N=2,614
Data Released (March 20, 2023)
Wave I & II School District Grouping Data
To facilitate clustering by school district, the school district identifiers comprising this file are based on the Local Education Agency identification numbers (LEAID) of the school districts in which the Wave I school, Wave I residence, and Wave II residence were situated. The first two characters of this LEAID represent state and reflect state codes assigned by Add Health in other disseminated data similarly intended for clustering at different geographic areas. N=84,166
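Since the first two characters of each identifier carry the state code, a state-level grouping can be derived directly from the identifier. The sketch below assumes the identifiers are stored as strings (so leading zeros are preserved); the sample values are made up, and the prefixes are Add Health-assigned state codes rather than any external coding scheme.

```python
from collections import defaultdict


def state_prefix(leaid):
    """Return the two-character state portion of a district identifier."""
    if not isinstance(leaid, str):
        # Numeric storage would drop leading zeros and shift the prefix.
        raise TypeError("identifier must be a string")
    leaid = leaid.strip()
    if len(leaid) < 2:
        raise ValueError("identifier is too short to contain a state code")
    return leaid[:2]


def cluster_by_state(leaids):
    """Group district identifiers by their state prefix."""
    clusters = defaultdict(list)
    for leaid in leaids:
        clusters[state_prefix(leaid)].append(leaid)
    return dict(clusters)


# Made-up identifiers used only to illustrate the grouping.
example_ids = ["0100005", "0100006", "3700011"]
clusters = cluster_by_state(example_ids)
```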
Data Released (February 17, 2023)
Historical Neighborhood Redlining
This contextual database allows researchers to identify potential long-term consequences of redlining for contemporary inequities in neighborhood environments, and individual health and socioeconomic attainment over the life course. N=20,706
Waves III-V Multi-year Air Pollution Exposure Estimates
The air pollution data described here provide longer-term estimates of air pollution exposure that can be used to address a broad range of research questions related to how air pollution exposure over time may relate to a variety of health outcomes. N=20,745
Wave I & II School Desegregation Disparities
This file contains data on the levels of school racial segregation experienced by Add Health respondents during their school-age years, related school district characteristics, and measures of tract-level residential segregation present in adulthood (Waves III-V). N=84,166
Data Released (November 3, 2022)
Wave V Hepatic Injury
This file contains constructed measures designed to facilitate analysis and interpretation of hepatic injury based on venous blood collected via phlebotomy at the Wave V home exam. Assay results for aspartate aminotransferase (AST) and alanine aminotransferase (ALT) are available, as well as three semi-quantitative serum index assays – lipemia, hemolysis and icterus – to evaluate the possibility of interference with the AST or ALT assays. Moreover, two constructed measures are available – the AST/ALT ratio and its classification. N=5,381
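The AST/ALT ratio is simply the AST result divided by the ALT result. A minimal Python sketch of that arithmetic follows; the function name and the handling of missing or non-positive values are assumptions made for illustration, and the file's own classification of the ratio is not reproduced here.

```python
from typing import Optional


def ast_alt_ratio(ast: Optional[float], alt: Optional[float]) -> Optional[float]:
    """Return AST divided by ALT, or None when the ratio is undefined.

    Treating missing or non-positive values as undefined is an
    assumption of this sketch, not a documented Add Health rule.
    """
    if ast is None or alt is None:
        return None
    if ast < 0 or alt <= 0:
        return None
    return ast / alt


ratio = ast_alt_ratio(30.0, 20.0)
```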
Wave V Renal Function Addendum
This file contains constructed measures designed to facilitate analysis and interpretation of renal function based on venous blood collected via phlebotomy at the Wave V home exam. Updated estimates of the glomerular filtration rate (GFR), calculated according to the 2021 NIDDK CKD-EPI guidelines, are available using either the creatinine concentration alone or both the creatinine and cystatin C concentrations. Classifications of the new estimates according to both clinical and KDIGO guidelines are available as well. N=5,381
Data Released (September 9, 2022)
Wave IV dbGaP GWAS Sample Weight
A weight component for the dbGaP GWAS Sample. N=9,975
Data Released (June 21, 2022)
Individual Vital Status and Underlying Cause of Death File, 2019
This file contains one record for each of the 20,745 Add Health sample members from Wave I. It provides the vital status of each sample member as well as the National Death Index-provided underlying cause of death code in ICD-10 format for each decedent. The month and year of the most recent Add Health interview are provided for living sample members, while the month and year of death are provided for decedents. N=20,745
Ordered Cause of Death File, 2019
This file contains entity- and record-axis codes reported by the National Death Index (NDI) for each decedent in the Add Health sample. The file is arranged hierarchically, by axis code; therefore, each decedent may have multiple records depending on the maximum number of entity- and record-axis codes recorded by NDI. The sequence of the decedent’s records reflects the order in which the entity- and record-axis codes were reported in the NDI record. N=1,745
All Coded Causes of Death File, Including Entity-Axis Codes, 2019
This file contains all underlying cause of death and entity-axis codes appearing in the National Death Index (NDI) source file. Functioning as dummy variables, zero represents the absence of a code on the decedent’s death certificate, while one denotes the presence of one. N=540
All Coded Causes of Death File, Including Record-Axis Codes, 2019
This file contains all underlying cause of death and record-axis codes appearing in the National Death Index (NDI) source file. Functioning as dummy variables, zero represents the absence of a code on the decedent’s death certificate, while one denotes the presence of one. N=540