Skip to main content

METHODS article

Front. Public Health, 05 March 2021
Sec. Digital Public Health
This article is part of the Research Topic Responsible Big Data Solutions for Public Health View all 5 articles

The National Children's Study Archive Model: A 3-Tier Framework for Dissemination of Data and Specimens for General Use and Secondary Analysis

\nPeter K. GilbertsonPeter K. Gilbertson1Susan ForresterSusan Forrester1Linda AndrewsLinda Andrews1Kathleen McCannKathleen McCann1Lydia RogersLydia Rogers1Christina ParkChristina Park2Jack Moye
Jack Moye3*
  • 1Social & Scientific Systems Inc., Silver Spring, MD, United States
  • 2National Institutes of Health, Office of the Director, Bethesda, MD, United States
  • 3Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, MD, United States

The National Children's Study (NCS) Archive was created as a repository of samples, data, and information from the NCS Vanguard Study—a longitudinal pregnancy and birth cohort evaluating approaches to study influence of environmental exposures on child health and development—to provide qualified researchers with access to NCS materials for use in secondary research. The National Children's Study Archive (NCSA) model is a 3-tiered access model designed to make the wealth of information and materials gathered during the NCS Vanguard Study available at a user appropriate level. The NCSA model was developed as a 3-tier framework, for users of varying access levels, providing intuitive data exploration and visualization tools, an end-to-end data and sample request management system, and a restricted portal for participant-level data access with a team of experts available to assist users. This platform provides a model to accelerate transformation of information and materials from existing studies into new scientific discoveries.

Trial Registration: ClinicalTrials.gov Identifier: NCT00852904 (first posted February 27, 2009).

Introduction and Background

The National Children's Study (NCS) Vanguard Study was a pilot for a nationally representative cohort of 100,000 U.S. children to be followed beginning before birth to adulthood (along with their parents) in a national longitudinal study of environmental (including physical, chemical, biological, and psychosocial) influences on child health and development, to be conducted by the Eunice Kennedy Shriver National Institute of Child Health Human Development (NICHD) in collaboration with the U.S. Centers for Disease Control and Prevention and the U.S. Environmental Protection Agency (1).

The Vanguard Study tested different recruitment strategies between 2009 and 2014. The Initial Vanguard Study (IVS) opened to accrual in 2009 in seven locations across the U.S. The IVS focused on recruitment through household enumeration, and used comprehensive data and sample collection procedures that were designed for the Main NCS (2). In November 2010, the IVS was succeeded by the Alternate Recruitment Substudy (ARS). The ARS was designed to evaluate three recruitment strategies: Direct Outreach, Enhanced Household Enumeration, and Provider-Based Recruitment (PBR). Each strategy was conducted in 10 study locations (35). Recruitment under the three ARS strategies ended in February 2012. The ARS was followed in June 2012 by another substudy, the Provider-Based Sampling (PBS) Substudy, which was conducted in three study locations (6). Overall, the PBS strategy proved most effective and cost efficient (7). In December of 2014, the Vanguard Study was discontinued by the National Institutes of Health (NIH) Director following the recommendations of a National Children's Study Working Group (8, 9).

To recruit Vanguard Study participants, over 74,000 women were screened to determine eligibility (Figure 1). At the close of the NCS Vanguard Study, more than 5,400 birth families had been enrolled and followed (Figure 1) in 43 counties within 31 states across the country (Figure 2). Over the course of the study, 5,608 children and their primary caregivers participated in at least one protocol-specified study visit, from pre-conception through pregnancy and birth up to 42 months of age (Figure 1).

FIGURE 1
www.frontiersin.org

Figure 1. NCS recruitment, enrollment, and study status flow by participant and activity.

FIGURE 2
www.frontiersin.org

Figure 2. NCS participating counties by recruitment strategy across the United States.

Data on a wide variety of health-related topics was collected by interview, self-administered questionnaires, and data collector generated case report forms. In total, 161 data collection forms, excluding form versioning (i.e., v1, v2, etc.), were utilized by the study for the IVS and ARS phases. These forms and the data they collected included pre-, peri-, and post-partum interviews, biological sample collection, environmental sample collection, physical examination, and neuropsychosocial and cognitive assessments. During active data collection, study centers submitted up to 700 datasets biweekly, totaling 4.1 terabytes of unique data by study end.

Nearly 19,000 unique biological and 4,000 unique environmental samples were collected from participants (women, fathers, and children) and their households across all study visit stages. From these a sample repository of more than 250,000 items was derived. Biological material collected includes blood, urine, saliva, breast milk, mucosal swabs, meconium, hair, nails, and placental tissue. Various air, dust, and water collections comprise the environmental samples. Serial sets of biospecimens were collected from women over several timepoints in pregnancy. Various combined sets of biological samples from mother/child dyads and some mother/child/father triads are represented.

The Vanguard Study, in addition to collecting data and samples, produced thousands of documents to support and facilitate study implementation. To effectively coordinate 40 locations across the country, each site worked from centrally created and maintained procedures and guidance documents. These documents were a combined effort of different teams of subject matter experts (SMEs) and defined the functions and implementation of the NCS. Each study task, such as data collection, had corresponding standard operating procedures (SOP), in-home study-visit and site workflows, training documentation, equipment maintenance guides, and adverse outcome documentation requirements. This documentation supporting the implementation of the NCS captured the many goals and challenges of executing any large, multicenter study.

With the Vanguard study's closure in 2014, the wealth of potential future research use represented by the existing data, samples, documentation, and knowledge of the NCS was recognized. The potential for data sharing was specifically noted in a 12/11/2014 NIH Director's statement indicating that “Data from the Vanguard Study should be archived and available upon request by investigators for secondary analyses” (10). Studies have shown that collaboration among researchers and the sharing of data not only promotes more robust research but also leads to faster translation of findings into clinical practice (11). The NIH therefore deemed that the NCS should continue as a scientific resource and sought to archive and make available the data and samples as quickly and completely as possible in accordance with the February 26, 2003 Final NIH Statement on Sharing Research Data (12).

The National Children's Study Archive (NCSA) core mission was to curate and maintain the information, data, and sample materials generated by the NCS Vanguard Study, and to make those products available for continued de novo analysis (9). The NCSA supported this mission from November 2015 to April 2020, after which the NCS data, following removal of HIPAA identifiers (13), is made available through the NICHD Data and Specimen Hub (DASH) (13) for continued research use beginning in early or mid-2020.

To support the NCSA mission, the NCS team developed the NCSA model to share the wide array of study protocols, procedures, instruments, and other products from the NCS, and to make available for scientific research the data and samples collected from women, fathers, and children for the NCS Vanguard Study. The objective of this report is to:

1. Describe the 3-tier NCSA model the NCS Archive developed to promote user access at a user appropriate level.

2. Describe the user-centric tools and resources developed for the NCSA model to facilitate the understanding of complex longitudinal study design and the resulting products.

3. Describe the evaluation framework the NCS archive used to support secondary analysis within the NCSA model.

Materials and Methods

NCS leadership consulted experts within and outside the agency on various aspects of data archive design. The team reviewed existing data repository and data resource archives, including the Data and Specimen Hub (DASH, NIH/NICHD) (14), the Biospecimen Repository Access and Data Sharing program (BRAD, NIH/NICHD), the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial (PLCO, NIH/NCI) and the Biologic Specimen and Data Repository Information Coordinating Center (BioLINCC, NIH/NHLBI) (15) to determine best archive and repository solutions for dissemination of NCS Vanguard Study data. The NCS reviewed data sharing practices and procedures of the National Health and Nutrition Examination Survey (NHANES; CDC/NCHS), National Database for Autism Research (NDAR; NIH/NIMH), and the NICHD Population Dynamics Branch. Some of the criteria evaluated included data specifications, data documentation and discovery tools, remote analysis functionality, and privacy protection measures.

Because of the complex and unique nature of the NCS Vanguard Study's design and conduct, and to meet the need for further data curation, the NCS Archive was created as a study-specific resource rather than a component of an existing data repository. As a separate resource, the NCS Archive could support carry-over research from the NCS Vanguard Study, release study data and information on a rolling basis, provide immediate support for secondary research projects, and share experience upon which other national cohort studies like the Environmental influences on Child Health Outcomes (ECHO) (16) and All of Us studies (17) could build.

While NCSA planning, development, and implementation predated establishment and publication of the FAIR (Findable, Accessible, Interoperable, Reuseable) Guiding Principles for scientific data management and stewardship (18), these principles were anticipated in the functional requirements that were established for the system.

Data security and privacy requirements were developed to ensure establishment of adequate controls to protect the privacy of the participants and ensure the security of the data as required by the Department of Health and Human Services (http://www.hhs.gov/ocio/policy/index.html#Security) and by the Federal Information Security Modernization Act of 2014 and Federal Information Security Management Act of 2002 (FISMA) (19).

System design requirements were established to achieve specific desirable characteristics, including:

• Flexible—quickly and efficiently accommodate evolving requirements

• Comprehensive—support an extensive range of data analysis functions

• Integrated—provide interoperability across data sources

• Reliable—assure function under all conditions

• Secure—employ stringent security controls to prevent inappropriate information disclosure and possible data loss, while ensuring that the right information is provided to the right people

• Private—ensure data privacy and integrity despite technological evolution

• Maximum access to data by researchers while maintaining FISMA standards

• User-focused and User-friendly—tailored to a variety of data users

• Reduction of long-term risks and lifecycle costs

Definition of functional requirements incorporated input, review, and comment from data users as part of the process. Key functional requirements of the NCSA are summarized below for various elements of the Archive.

Data curation, preparation, and documentation for easy reuse:

• Create analytic data files and all documentation including user guides, data dictionary, and code books, to be made available for the general research community and in accordance with Section 508 of the Rehabilitation Act of 1973 (20).

• Harmonize all analytic data files across multiple protocol versions to the extent possible in terms of file structures, variable naming, labeling, coding, and formatting.

• Produce final data files that meet Data Documentation Initiative (DDI) standards.

• Produce a comprehensive list of all biological and environmental samples in inventory with documentation of informed consent available for each NCS universal participant identifier.

• Readily update the inventory based on NCS sample repository use reports.

• The sample inventory and result data should be linkable to other analytic data at the participant level.

Restricted Use Data (RUD) file construction:

• Remove sensitive variables to comply with the NIH data privacy protection requirements

• Perform other data manipulation techniques to reduce disclosure risks

• Develop and implement specifications to minimize participant disclosure risk

• Produce data documentation needed by data users, including analytic limitations imposed by removal of personal identifiers.

Support tools development:

• Create new data discovery and cohort discovery tools based on existing applications such as the NCS Workbench and NCS Knowledgebase

• Provide links to questionnaires and data collection instruments and materials

• Develop and construct data item level crosswalks to facilitate data use

• Incorporate search capabilities into these data user tools

• Provide web-based analysis tools for end data users

• Produce data tables and charts that summarize participant characteristics or key findings from analytic files

Archive user support services:

• Support requests for data access (microdata, specimens)

• Produce other custom data files

• Development and maintenance of common linking identifiers to integrate NCS data with external sources while maintaining participant confidentiality

• Support linkages to laboratory data

• Disclosure review of researcher produced analysis outputs and tables

• Support for publication submission

Requirements for multiple levels of data access:

• Provide a limited set of carefully de-identified data files containing key variables which can be downloaded by users to be analyzed on their own equipment, as an inexpensive method to access key variables from the NCS.

• Provide a secure enclave wherein users can access more sensitive, partially de-identified data and conduct analyses within the enclave, with no ability to download any microdata (individual-level data).

Results

The NCSA model shares information using a 3-tier approach built on the “principle of least privilege” and augmented with user driven content tools (Figure 3). Each tier is intended to allow users to identify the Study information and the available data at their level of need. Need-based access has three goals—first, to streamline the process of data absorption; second, to allow the user to drive meaningful discovery; and third, to allow targeted use of limited analytic and support resources. Key to the NCSA model is the leveraging of user self-service opportunities. Through either documentation or system tools information, information access is democratized. Users drive access and understanding of a study on their own, thereby reducing the resource requirements of Archive support staff and SMEs.

FIGURE 3
www.frontiersin.org

Figure 3. NCSA Model Access Tiers.

Tier 1: Public Access

The Public Access tier is intended for the general population to allow persons interested in the research to learn about the content without unnecessary technical detail. For the NCS, public access meant access to the public portion of the NCS Archive website and a NICHD hosted web page. Users had access to a study description (21), a history of the study, descriptions of the data and samples available, and information about the ongoing archive activities. Users were provided a mechanism to gain registered access and a mechanism to contact the NCS Archive with questions or inquiries.

Public access for the NCS additionally meant highlighting a selection of scientific works for both informational and aspirational purposes. Publications based on NCS data presented in journals and at conferences were made available together with a series of data briefs, which are concise summaries of selected data from the NCS Archive, with comparisons to other widely recognized sources of similar data (Table 1).

TABLE 1
www.frontiersin.org

Table 1. Published NCS data briefs.

This allowed sharing of previous work completed using NCS data and highlighted opportunities for new or continued research. As a federally funded program, it was important for NCS to make tangible results available to the general population. As a value-added component, the NCS Archive created and freely provided teaching databases comprised of child-level and health topic-related modules (22). The teaching databases were derived from real data collected during the NCS Vanguard Study. They were created for academic use only and not intended to be suitable for use for publication. The teaching databases offer a ready-made database system with examples for various statistical models, including repeated measures analyses, to supplement classroom instruction and for use in student projects.

Tier 2: Registered Access

The registered access tier of the NCSA model provides the next level of data access privilege to users interested in a study. Registered access makes available resources to promote and cultivate new ideas for further exploration of existing data or samples, as well as providing detailed documentation of the study and the study's experiences. In exchange for the access to new information, a user must register and attest to responsible stewardship of the information provided.

For the NCS Archive, restricted access to the site required completion of a demographic registration form (Table 2), consent to adhere to the NCS Vanguard data user agreement (Table 3), and registering a login service account. Registered users gained access to all publicly available Study information and data files, the site's protocol and data self-service tools, and the proposal management system for requests to access restricted study data and other stored materials in Tier 3.

TABLE 2
www.frontiersin.org

Table 2. Registration collected demographic elements and requirements.

TABLE 3
www.frontiersin.org

Table 3. NCS data use agreement terms.

The NCS Archive developed several self-service tools to explore the study operational information and sample size data for participants, biospecimen, and environmental samples (Table 4), to facilitate meaningful evaluation and consumption of the large amount of available NCS data. These tools transfer the ability to identify and confirm potential research opportunities from the Archive support staff to the interested user, preemptively answering potential inquiries.

TABLE 4
www.frontiersin.org

Table 4. NCS archive self-service tools and domains of coverage.

One challenge to understanding the NCS Vanguard is a study design that involves multiple sequential study protocols—the 2009–2010 IVS Protocol and the 2011–2014 ARS Protocol. Each protocol included participant visits across multiple life stages (pre-conception, prenatal, perinatal, postnatal). The Protocol Browser tool was developed to allow users easily to assess the content and timing of study evaluations undertaken by NCS participants in either protocol. The tool is a visual representation of study events experienced by participants within each protocol, displayed by life stage, study visit, and data collection instruments administered (Figure 4). The tool can display one or both protocols simultaneously to allow comparisons.

FIGURE 4
www.frontiersin.org

Figure 4. NCS Protocol Browser.

With numerous Study visits for each protocol across multiple life stages, IVS with 14 visits and ARS with 16, the NCS ended up fielding 161 different instruments (IVS 97, ARS 64). To manage the dissemination of these instruments, the Instrument and Dataset Inventories tool (Figure 5) was developed to display a list of available instruments by protocol and subject domain. Instruments are made available in PDF format, unless copy restrictions apply, and interested users can view the in-line variables notation. Additionally, the number of variables and records in each corresponding dataset is identified.

FIGURE 5
www.frontiersin.org

Figure 5. NCS Archive Instrument Inventory.

From these instrument datasets and other constructed analytic files, the NCS Archive generated nearly 14,000 variables. To quickly evaluate what variables are available with how many data points or responses for a given area, the Variable Locator tool (Figure 6) was created. This tool provides for free text variable search of all data in the NCS Archive. A user who might enter the term “sleep” into the Variable Locator would be provided a list of all sleep-related variables and questions, together with frequency counts of respondents, valid responses, legitimate skip responses, and records in the dataset. The Variable Locator allows users to filter results further by using Boolean search terms (“AND,” “NOT,” and “OR”). Researchers can use the Variable Locator tool to identify readily what potential the NCS Archive data may hold for their intended research.

FIGURE 6
www.frontiersin.org

Figure 6. NCS Archive Variable Locator.

Once researchers have identified operational components of interest—datasets, instruments, or variables—knowing the potential sample size is crucial to evaluate the feasibility of their planned research. The Participant Explorer tool allows users to interrogate metadata-level NCS participant and study participation information. Users of the Participant Explorer tool can determine participant counts by various categories separately or in combination, such as type of participant (woman, child, father), data collection timepoint (prenatal, perinatal, postnatal), demographic variables (race, ethnicity, education level, marital status), etc. Data for fathers, for example, can be categorized by age, education, race/ethnicity, other demographics, and study activity (Figure 7). Each variable in the Participant Explorer allows a researcher to narrow the target population to reflect a researcher's specific needs and interest.

FIGURE 7
www.frontiersin.org

Figure 7. Participant Explorer for father participants in the IVS phase.

In similar fashion, the Sample Explorer allows users to develop sample size estimates based on the inventory of biological and environmental samples collected from women, children, and fathers who participated in the NCS (Table 5). The Archive initially held over 250,000 individual biospecimens and 4,600 individual environmental samples. This extensive sample collection made the Archive a significant resource of materials for laboratory research, while the Sample Explorer provided a method to explore those samples in context and detail. Like the Participant Explorer, the Sample Explorer allows researchers to determine the number and type of primary or derivative biological and environmental sample material available by various demographic categories, type of participant, study visit, etc. Researchers can use the Sample Explorer to determine counts of samples available from the specific type of participant (e.g., infants, pregnant women, etc.) of most interest to them. Once the available sample types, sample demographics, and sample size information are known, researchers can develop specific proposals to request those data and samples for their research.

TABLE 5
www.frontiersin.org

Table 5. NCS Biospecimen and Environmental Samples available by Collection Period.

Within the NCSA model, the user driven content framework and tools enable researchers to target specific populations, topic areas, and types of biological or environmental samples to determine quickly the feasibility of a research proposal. The research tools empower researchers to explore study information and to develop and refine their research ideas independent of SMEs. The tools help to decrease the knowledge gap between Archive staff and general users and allow Archive staff to provide expert knowledge and experience on narrowly targeted questions about potential research.

Once researchers determined the feasibility of using NCS resources, the Archive leveraged a user-driven electronic proposal submission system to submit, process, and review researcher requests for data alone or for combined data and biological and/or environmental samples to pursue their scientific objectives. Electronic forms collect information such as title, requesting investigator, institution, other users for whom approved access is requested, and a detailed plan of research (rationale, main hypothesis, and proposed research aims) (23). Additionally, the sample request form collects data about shipping accounts, laboratory contact information, specific sample information, and testing parameters such as target analytes and assay platforms.

Along with the completed request form, potential researchers can submit documentation of institutional review board (IRB) determination or review and approval, documentation of funding availability, and letters of institutional support. Other documents such as curriculum vitae (CV) or biosketch are encouraged but not required. When a research request is submitted, the researcher receives notification via email of receipt and processing. The NCSA model framework thus allows user-driven discovery, ideation, and data request submission.

Submitted requests next pass through a stringent multi-step review. Requests are first screened by Archive staff for completeness and clarity. Analysts then assess the practical feasibility of request fulfillment. Preliminary data frequencies are run and availability of suitable samples in sufficient quantities and amounts is reviewed, according to information submitted in the request form. In parallel, a review is conducted to assure compliance with regulatory and ethical requirements, evaluate appropriateness for the intended purpose of the specific data and any samples requested, and determine whether the research plan is consistent with the NCS informed consent. Discussions between the researcher, Archive staff, and NICHD staff may be needed to determine how best to fulfill a request. The potential impact on the repository inventory also is taken into consideration when samples potentially suitable for a request are identified, and an assessment is made whether the selected samples (including quantity, volume, concentration, etc.) and proposed assay methods are suitable for the planned research. If the requested data or samples are unavailable, or research plans are deemed unacceptable, immediate notification of investigators is provided, with further processing of the request suspended.

After the initial review, the request and the preliminary feasibility analysis enter a queue for formal review by an NCS Data Access Committee (DAC) composed of NICHD scientists and ad-hoc subject matter experts. Requests can be reviewed by individual DAC members immediately and are not dependent on a formal meeting. Members are given an initial 10-day review window and have access to review all submitted request materials within the electronic request system. Members use standard evaluation criteria for review (Table 6) and can share thoughts amongst themselves in a restricted discussion board.

TABLE 6
www.frontiersin.org

Table 6. NCS archive request review criteria.

If the DAC has questions, needs clarification, or needs additional information, the Archive staff work with the researcher to provide the requested information. Once the review is completed, the requester is notified of the review outcome by Archive staff. Accepted requests continue along the data access process, while unapproved research requests are returned to the requester with explanation.

The next step in the data access process is completion of a Research Materials Distribution Agreement (RMDA). The RMDA is a contract between the Principal Investigator of the proposed research, their institution, and the NICHD. It ensures that all parties are signatories and aware of the agreed terms for accessing participant-level NCS data, and it confirms their commitment to the policies and procedures put in place to protect the rights of NCS participants and institutions. Once the RMDA is executed, researchers move to the third tier of access.

Tier 3: Restricted Access

The restricted access tier of the NCSA model is the most highly restricted level. It provides access and interaction with participant-level data collected during the NCS Vanguard Study and is the tier at which Archive staff interact with identifiable data. For the NCSA, investigators enter the restricted access tier through a secure virtual environment, known as the Researcher Portal.

For the Researcher Portal, Archive staff create a personalized virtual machine with statistical software that the researcher accesses through security authentication requirements (two-factor token, username, and password). Staff remain in communication with the researcher to ensure successful access to the Portal. The Portal allows for a spoke-hub distribution model whereby archive staff can centrally coordinate the flow of information between geographically disparate researchers.

Archive staff populate the researcher's folder with the requested data and coordinate all aspects of sample distribution, including sample selection based on proposal-defined characteristics. As an incentive to return laboratory test result data to the Archive and thereby expand available Archive data resources, participant-sample linkages are withheld until results are returned. With the NCSA model the limited technical support on tier 1 allows staff to provide concierge support on tier 3 for researchers. Staff work closely with the researcher to ensure that the Archive can provide for successful completion of a project. This includes assisting with any questions about how the data were processed, providing additional data analysis, transferring output, programs, and summary files in and out of the Portal, linking supplemental files such as geographic characteristics or other data to NCS data, and assisting with writing and reviewing articles for publication, including adherence to NCSA data publication guidelines (Table 7). Prior to release of information once the research project is complete, Archive staff performs a disclosure review of any information to be transferred out of the Researcher Portal to the researcher, to assure that the information meets disclosure standards.

TABLE 7
www.frontiersin.org

Table 7. NCS archive data publication guidelines.

From its opening in November of 2015 through December 2019, there were over 20,000 visits to the NCS Archive (Table 8) by users from more than 50 different countries. Those visitors spent on average 6 min and 27 s on the site browsing the available resources. This highlights the strength of the NCSA model, where the resources for user self-service are both available and utilized. Of those visits, 476 moved from Tier 1 access to Tier 2 by registering with the site. Using the exploratory tools and accessing study documents, those users submitted 59 research requests to gain access to NCS data and specimens, 44 of which started or completed research within the NCSA Tier 3 framework. These researchers represent 43 academic, non-profit, commercial, and government organizations/institutions. The researchers, 60 percent of whom have more than 10 years of scientific experience, range from undergraduate and graduate students to research scientists, professors and medical doctors. Over fifty peer reviewed publications have resulted from use of NCS Vanguard data and specimens (17, 2469) as of January 2021.

TABLE 8
www.frontiersin.org

Table 8. NCS archive site visit metrics.

Discussion

Data sharing is becoming an imperative for U.S. scientific research funding agencies. Recent calls for mandated data sharing (70) were followed by the October 29, 2020 announcement of the Final NIH Policy for Data Management and Sharing (71). The new policy establishes the requirements of submission of Data Management and Sharing Plans, applies to research funded or conducted by NIH that results in the generation of scientific data, and links compliance to funding actions. Related notices provide considerations for selecting a data repository, desirable characteristics for all data repositories, and additional considerations for repositories storing human data (72). NIH promotes the use of established data repositories because deposit in a quality data repository generally improves the FAIRness (Findable, Accessible, Interoperable, and Re-usable) of the data. The National Library of Medicine provides a useful listing of NIH Data Repositories at https://www.nlm.nih.gov/NIHbmic/nih_data_sharing_repositories.html.

The widely acknowledged benefits of data sharing must be balanced against the paramount need to protect privacy and maintain confidentiality of research participants (73). The human scientific research enterprise relies absolutely upon trust between research participants and researchers. Public trust in research is built over decades but can be destroyed in seconds and take lifetimes to repair, as exemplified by episodes such as the U.S. Public Health Service Syphilis Study at Tuskegee (74). Specific challenges for information security and privacy protection are presented by datasets like those of the NCS that incorporate chronological or geographic information that can be linked to other datasets with potential to reidentify individual research participants. Creative solutions to these challenges are illustrated by the secure data enclave models of the NCHS Research Data Centers (75), of which the Tier 3 restricted access level of the NCSA model represents a virtual equivalent.

The three-tier NCSA framework developed by the NCS Archive provides a model that allows researchers to evaluate information and data at a level appropriate for them, protects the rights of research subjects, and provides interactive support to researchers for specific research goals and outcomes.

The NCSA example represents a new model for adding value to large cohort studies after they close or even during their conduct and can serve as a model for effective and interactive data platform development, with tools and processes that empower users, protect research participants, and reduce the distance between data and knowledge.

Peer Review

External peer review of the National Children's Study was carried out by the National Academy of Sciences in 20081, in 20132, and in 20143.

Data Availability Statement

The datasets and other materials supporting the conclusions of this article are available in the National Children's Study Archive (https://ncsarchive.s-3.net) through the spring of 2020. In 2020, the NCS Archive data files will be available in the NICHD Data and Specimen Hub (DASH) (https://dash.nichd.nih.gov/study/228954) after removal of HIPAA identifiers.

Ethics Statement

Written informed consent was obtained from all participants and the study protocol was approved by the Institutional Review Boards (IRBs) of NICHD and each Vanguard Study institution. All procedures involving human subjects were approved by the Institutional Review Board of the Eunice Kennedy Shriver National Institute of Child Health and Human Development (Office for Human Research Protections Database IRB00000008 - National Insts Hlth - NICHD - IRB #8, FWA00005897).

Author Contributions

PG and JM prepared the final manuscript. All authors participated in the design and conduct of the project, analysis and interpretation of the data, manuscript preparation and review, and read and approved the final manuscript.

Funding

The National Children's Study was created by Congressional mandate through the Children's Health Act of 2000 (Public Law 106-310 Sec. 1004). It was funded as a direct line item in the Congressional appropriation for the National Institutes of Health in FY2010, FY2011, FY2012, FY2013, FY2014, FY2015, FY2016, and FY2017. Funding for development and operation of the NCS Data and Sample Archive and Access System was provided through NICHD Contract HHSN275201500050U.

Disclaimer

The findings and conclusions in this report are those of the authors and do not necessarily represent the views of the National Institutes of Health or U.S. Department of Health and Human Services.

Conflict of Interest

PG, SF, LA, KM, and LR were employed by the company Social and Scientific Systems Inc.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

We gratefully acknowledge first and foremost the contribution of the families who participated in the National Children's Study and we honor the memory of the late Dr. Ruth Brenner for her tireless, wide ranging, and longstanding contributions to the National Children's Study.

We thank the following current and former team members at Social & Scientific Systems, Inc. for their contributions to the NCS Archive: Cora McPherson for project direction; Anita Johnson, Mat Weiss, and Vitya Blay for technical direction; Layla Sian, Thomas Otterson, Jeff Schinckle, and Carol Brimah for technical contributions; Natalie Madero, Renuka Thyagaraja, Zarinah Dyson, and Robert Bauserman for content development and testing.

We acknowledge the team members at IMS Inc., John Cleveland, Sean Brennan, Jacob Tomlinson, and Joseph Wrobleski, for their technical assistance. We also acknowledge the National Heart, Lung, and Blood Institute's BioLINCC Program on which framework the NCS Archive was based, and the generous support of the NIH Environmental influences on Child Health Outcomes (ECHO) Program.

We thank Dr. Jennifer Kwan for distinguished and continuing support for the NCS Archive, Dr. David Murray for overseeing development of the NCS Archive within the NIH, and Dr. John Gohagan and former NCHS Director Dr. Edward Sondik for review and assistance with the initial launch of the NCS Archive.

Abbreviations

ARS, Alternate Recruitment Substudy; BioLINCC, Biologic Specimen and Data Repository Information Coordinating Center; BRAD, Biospecimen Repository Access and Data Sharing program; CDC, Centers for Disease Control and Prevention; CV, Curriculum Vitae; DAC, Data Access Committee; DASH, Data and Specimen Hub; ECHO, Environmental influences on Child Health Outcomes; FISMA, Federal Information Security Management Act; HIPAA, Health Insurance Portability and Accountability Act; IRB, Institutional Review Board; IVS, Initial Vanguard Study; NCI, National Cancer Institute; NCHS, National Center for Health Statistics; NCS, National Children's Study; NCSA, National Children's Study Archive; NDAR, National Database for Autism Research; NHANES, National Health and Nutrition Examination Survey; NHLBI, National Heart, Lung, and Blood Institute; NICHD, National Institute of Child Health and Human Development; NIH, National Institutes of Health; NIMH, National Institute of Mental Health; PBR, Provider-Based Recruitment; PBS, Provider-Based Sampling; PDF, Portable Document Format; PLCO, Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial; RMDA, Research Materials Distribution Agreement; RUD, Restricted Use Data; SME, Subject Matter Expert; SOP, Standard Operating Procedures.

Footnotes

1. ^National Research Council and Institute of Medicine. (2008). The National Children's Study Research Plan: A Review. Panel to Review the National Children's Study Research Plan. Committee on National Statistics, Division of Behavioral and Social Sciences and Education. Board on Children, Youth, and Families and Board on Population Health and Public Health Practice, Institute of Medicine. Washington, DC: The National Academies Press.

2. ^National Research Council and Institute of Medicine. (2013). Design of the National Children's Study: A Workshop Summary. Nancy Kirkendall, Rapporteur. Committee on National Statistics, Division of Behavioral and Social Sciences and Education. Board on Children, Youth, and Families, Institute of Medicine. Washington, DC: The National Academies Press.

3. ^National Research Council and Institute of Medicine. (2014). The National Children's Study 2014: An Assessment. Panel on the Design of the National Children's Study and Implications for the Generalizability of Results, G.J. Duncan, N.J. Kirkendall, and C.F. Citro (Eds). Committee on National Statistics, Division of Behavioral and Social Sciences and Education, and Board on Children, Youth, and Families, Institute of Medicine. Washington, DC: The National Academies Press.

References

1. Hudak ML, Park CH, Annett RD, Hale DE, McGovern PM, McLaughlin TJ, et al. The national children's study: an introduction and historical overview. Pediatrics. (2016) 137(Suppl. 4):S213–8. doi: 10.1542/peds.2015-4410B

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Baker D, Park C, Sweeney C, McCormack L, Durkin M, Brenner R, et al. Recruitment of women in the national children's study initial vanguard study. Am J Epidemiol. (2014) 179:1366–74. doi: 10.1093/aje/kwu062

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Blaisdell LL, Zellner JA, King AA, Faustman E, Wilhelm M, Hudak ML, et al. The national children's study: recruitment outcomes using an enhanced household-based approach. Pediatrics. (2016) 137(Suppl. 4):S219–30. doi: 10.1542/peds.2015-4410C

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Hale DE, Wyatt SB, Buka S, Cherry D, Cislo KK, Dudley DJ, et al. The national children's study: recruitment outcomes using the provider-based recruitment approach. Pediatrics. (2016) 137(Suppl. 4):S239–47. doi: 10.1542/peds.2015-4410E

PubMed Abstract | CrossRef Full Text | Google Scholar

5. McGovern PM, Nachreiner NM, Holl JL, Halfon N, Dabelea D, Caulfield L, et al. The national children's study: early recruitment outcomes using the direct outreach approach. Pediatrics. (2016) 137(Suppl. 4):S231–8. doi: 10.1542/peds.2015-4410D

PubMed Abstract | CrossRef Full Text | Google Scholar

6. McLaughlin TJ, Aupont O, Kozinetz CA, Hubble D, Moore-Simas TA, Davis D, et al. Multilevel provider-based sampling for recruitment of pregnant women and mother-newborn dyads. Pediatrics. (2016) 137(Suppl. 4):S248–57. doi: 10.1542/peds.2015-4410F

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Park CH, Winglee M, Kwan J, Andrews L, Hudak ML. Comparison of recruitment strategy outcomes in the national children's study. Pediatrics. (2017) 140:e20162822. doi: 10.1542/peds.2016-2822

PubMed Abstract | CrossRef Full Text | Google Scholar

8. National Institutes of Health, Advisory Committee to the Director. National Children's Study (NCS) Working Group. Retrieved from: https://acd.od.nih.gov/documents/reports/NCS_WG_FINAL_REPORT.pdf (accessed January 12, 2020).

9. National Research Council Institute of Medicine. The national children's study 2014: an assessment, panel on the design of the national children's study implications for the generalizability of results. In: Duncan GJ, Kirkendall NJ, Citro CF, editors. Committee on National Statistics, Division of Behavioral and Social Sciences and Education, and Board on Children, Youth and Families, Institute of Medicine. Washington, DC: The National Academies Press (2014). p. 1–183.

Google Scholar

10. NIH Director. Statement on the National Children's Study. Retrieved from: https://www.nih.gov/about-nih/who-we-are/nih-director/statements/statement-national-childrens-study (accessed January 12, 2020).

11. Federer LM, Lu Y-L, Joubert DJ, Welsh J, Brandys B. Biomedical data sharing and reuse: attitudes and practices of clinical and scientific research staff. PLoS ONE. (2015) 10:e0129506. doi: 10.1371/journal.pone.0129506

PubMed Abstract | CrossRef Full Text | Google Scholar

12. US Department of Health Human Services NIH. Final NIH Statement on Sharing Research Data. (2003). Retrieved from: https://grants.nih.gov/grants/guide/notice-files/NOT-OD-03-032.html (accessed January 12, 2020).

13. U.S. Department of Health and Human Services. Guidance Regarding Methods for De-identification of Protected Health Information in Accordance with the Health Insurance Portability and Accountability Act (HIPAA) Privacy Rule. Retrieved from: https://www.hhs.gov/hipaa/for-professionals/privacy/special-topics/de-identification/index.html (accessed January 12, 2020).

Google Scholar

14. Hazra R, Tenney S, Shlionskaya A, Samavedam R, Baxter K, Ilekis J, et al. DASH, the data and specimen hub of the national institute of child health and human development. Scientific Data. (2018) 5:180046. doi: 10.1038/sdata.2018.46

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Coady SA, Mensah GA, Wagner EL, Goldfarb ME, Hitchcock DM, Giffen CA. Use of the national heart, lung, and blood institute data repository. N Engl J Med. (2017) 376:1849–58. doi: 10.1056/NEJMsa1603542

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Gillman MW, Blaisdell CJ. Environmental influences on child health outcomes, a research program of the national institutes of health. Curr Opin Pediatrics. (2018) 30:260–2. doi: 10.1097/MOP.0000000000000600

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Denny JC, Rutter JL, Goldstein DB, Philippakis A, Smoller JW, Jenkins G, et al. The “All of us” research program. N Engl J Med. (2019) 381:668–76. doi: 10.1056/NEJMsr1809937

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Wilkinson MD, Dumontier M, Aalbersberg IJ, Appleton G, Axton M, Baak A, et al. The FAIR guiding principles for scientific data management and stewardship. Sci Data. (2016) 3:160018. doi: 10.1038/sdata.2016.18

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Federal Information Security Modernization Act of 2014. Public Law 113-283; December 18, 2014) and Federal Information Security Management Act of. (2002). (Public Law 107-347 (Title III); December 17, 2002).

20. Section 508 of the Rehabilitation Act of 1973 as amended 29 U.S.C. §794(d). Retrieved from: https://www.govinfo.gov/content/pkg/USCODE-2011-title29/html/USCODE-2011-title29-chap16-subchapV-sec794d.htm (accessed December 8, 2020).

21. National Children's Study Archive Study Description and Guide February. (2019). Retrieved from: https://dash.nichd.nih.gov/study/228954, Descriptive Documents, Project Summaries, NCS_Archive_Study_Description_February2019.pdf (accessed December 8, 2020). (requires registration).

22. National Children's Study Teaching Datasets. Retrieved from: https://dash.nichd.nih.gov/study/228954, Descriptive Documents, Teaching Datasets, Teaching Datasets.zip (accessed December 8, 2020). (requires registration).

23. National Children's Study Archive Data Request Form. National Children's Study Archive Study Description and Guide February. (2019). p. 51. Retrieved from: https://dash.nichd.nih.gov/study/228954, Descriptive Documents, Project Summaries, NCS_Archive_Study_Description_February2019.pdf (accessed December 8, 2020). (requires registration).

24. Shih YH, Chen HY, Christensen K, Handler A, Turyk ME, Argos M. Prenatal exposure to multiple metals and birth outcomes: an observational study within the National Children's Study cohort. Environ Int. (2021) 147:106373. doi: 10.1016/j.envint.2020.106373

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Francis EC, Charron E, Li M, Chen L, Mayo R, Butler LS, et al. Third trimester maternal vitamin D and early childhood socioemotional development. Paediatr Perinat Epidemiol. (2020) doi: 10.1111/ppe.12736. [Epub ahead of print].

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Caldwell J, Matson A, Mosha M, Hagadorn JI, Moore J, Brownell E. Maternal H-antigen secretor status is an early biomarker for potential preterm delivery. J Perinatol. (2020) doi: 10.1038/s41372-020-00870-1. [Epub ahead of print].

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Gardener H, Sun Q, Grandjean P. PFAS concentration during pregnancy in relation to cardiometabolic health and birth outcomes. Environ Res. (2020) 192:110287. doi: 10.1016/j.envres.2020.110287

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Nandadasa S, Szafron JM, Pathak V, Murtada SI, Kraft CM, O'Donnell A, et al. Vascular dimorphism ensured by regulated proteoglycan dynamics favors rapid umbilical artery closure at birth. Elife. (2020) 9:e60683. doi: 10.7554/eLife.60683

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Richards M, Weigel M, Li M, Rosenberg M, Ludema C. Food insecurity, gestational weight gain and gestational diabetes in the national children's study, 2009-2014. J Public Health. (2020) 3:fdaa093. doi: 10.1093/pubmed/fdaa093

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Heffler KF, Sienko DM, Subedi K, McCann KA, Bennett DS. Association of early-life social and digital media experiences with development of autism spectrum disorder-like symptoms. JAMA Pediatr. (2020) 174:690–6. doi: 10.1001/jamapediatrics.2020.0230

PubMed Abstract | CrossRef Full Text | Google Scholar

31. deChristopher LR, Tucker KL. Excess free fructose, apple juice, high fructose corn syrup and childhood asthma risk - the national children's study. Nutr J. (2020) 19:60. doi: 10.1186/s12937-020-00578-0

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Richards M, Weigel M, Li M, Rosenberg M, Ludema C. Household food insecurity and antepartum depression in the national children's study. Ann Epidemiol. (2020) 44:38–44.e1. doi: 10.1016/j.annepidem.2020.01.010

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Shih YH, Scannell Bryan M, Argos M. Association between prenatal arsenic exposure, birth outcomes, and pregnancy complications: an observational study within the national children's study cohort. Environ Res. (2020) 183:109182. doi: 10.1016/j.envres.2020.109182

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Zhu H, Kannan K. Occurrence of melamine and its derivatives in breast milk from the united states and its implications for exposure in infants. Environ Sci Technol. (2019) 53:7859–65. doi: 10.1021/acs.est.9b02040

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Ma J, Zhu H, Kannan K. Organophosphorus flame retardants and plasticizers in breast milk from the united states. Environ Sci Technol Lett. (2019) 6:525–31. doi: 10.1021/acs.estlett.9b00394

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Hughes SC, Hogue CJ, Clark MA, Graber JE, Eaker ED, Herring AH. Screening for pregnancy status in a population-based sample: characteristics associated with item nonresponse. Matern Child Health J. (2019) 23:316–24. doi: 10.1007/s10995-018-2665-0

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Zhu Y, Hernandez LM, Dong Y, Himes JH, Caulfield LE, Kerver JM, et al. Weight estimation among multi-racial/ethnic infants and children aged 0-5·9 years in the USA: simple tools for a critical measure. Public Health Nutr. (2018) 22:147–56. doi: 10.1017/S1368980018002549

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Park BY, Misra DP, Moye J, Miller RK, Croen L, Fallin MD, et al. Placental gross shape differences in a high autism risk cohort and the general population. PLoS ONE. (2018) 13:e0191276. doi: 10.1371/journal.pone.0191276

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Miller ES, Grobman WA, Culhane J, Adam E, Buss C, Entringer S, et al. Antenatal depression, psychotropic medication use, and inflammation among pregnant women. Arch Womens Ment Health. (2018) 21:785–90. doi: 10.1007/s00737-018-0855-9

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Hirschfeld S. Introduction and goals for the national children's study. Front Pediatr. (2018). 5:240. doi: 10.3389/fped.2017.00240

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Chang J-M, Zeng H, Han R, Chang Y-M, Shah R, Salafia CM, et al. Autism risk classification using placental chorionic surface vascular network features. BMC Med Inform Decision Making. (2017). 17:162. doi: 10.1186/s12911-017-0564-8

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Tigges B, Kaar JL, Erbstein N, Silberman P, Winseck K, Lopez-Class M, et al. Effectiveness of community outreach and engagement in recruitment success for a prebirth cohort. J Clin Transl Sci. (2017) 1:184–91. doi: 10.1017/cts.2017.7

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Burke-Garcia A, Winseck K, Jouvenal LC, Hubble D, Kulbicki KM. A review of social media methods and lessons learned from the national children's study. J Clin Transl Sci. (2017) 1:260–4. doi: 10.1017/cts.2017.19

PubMed Abstract | CrossRef Full Text | Google Scholar

44. Burke-Garcia A, Mathew S. Leveraging social and digital media for participant recruitment: a review of methods from the Bayley short form formative study. J Clin Transl Sci. (2017) 1:205–7. doi: 10.1017/cts.2017.9

PubMed Abstract | CrossRef Full Text | Google Scholar

45. Luderer U, Li T, Fine JP, Hamman RF, Stanford JB, Baker D. Transitions in pregnancy planning in women recruited for a large prospective cohort study. Hum Reprod. (2017) 32:1325–33. doi: 10.1093/humrep/dex065

PubMed Abstract | CrossRef Full Text | Google Scholar

46. Zimowski MF, Moye J, Heim Viox M, Cohen H, Dugoni B, Winfrey K. Use of replicates to improve the quality of physical measurements: lessons learned from a psychometric evaluation of an adaptive protocol in the national children's study. J Surv Stat Methodol. (2016) 4:52–4. doi: 10.1093/jssam/smw029

CrossRef Full Text | Google Scholar

47. Zimowski M, Moye J, Dugoni B, Heim Viox M, Cohen H, Winfrey K. Home-based anthropometric, blood pressure and pulse measurements in young children by trained data collectors in the National Children's Study. Public Health Nutr. (2016) 15:1–10. doi: 10.1017/S1368980016002378

PubMed Abstract | CrossRef Full Text | Google Scholar

48. Price SM, Park CH, Brenner RA, Lo A, Adams S, Baetz RA, et al. Participant retention in a longitudinal study: do motivations and experiences matter? Survey Pract. (2016) 9:1–10. doi: 10.29115/SP-2016-0022

CrossRef Full Text | Google Scholar

49. Cubbins LA, Sepavich D, Killpack S, Hill CV. Assessing determinants of father's involvement with his partner's pregnancy and his child's well-being: the father resources survey instrument. Men Masculinities. (2018) 21:3–34. doi: 10.1177/1097184X16652653

CrossRef Full Text | Google Scholar

50. Mortensen ME, Birch R, Wong L-Y, Valentin-Blasini L, Boyle EB, Caldwell KL, et al. Thyroid antagonists thyroid indicators in US pregnant women in the vanguard study of the national children's study. Environ Res. (2016) 149:179–88. doi: 10.1016/j.envres.2016.05.017

PubMed Abstract | CrossRef Full Text | Google Scholar

51. Dietzen DJ, Bennett MJ, Lo SF, Grey VL, Jones PM. Dried blood spot reference intervals for steroids and amino acids in a neonatal cohort of the national children's study. Clin Chem. (2016) 62:1658–67. doi: 10.1373/clinchem.2016.263434

PubMed Abstract | CrossRef Full Text | Google Scholar

52. Heikkinen MS, Khalaf A, Beard B, Viet SM, Dellarco M. Evaluation of environmental sample analysis methods and results reporting in the NCS vanguard study. Anal Chem. (2016) 88:4692–700. doi: 10.1021/acs.analchem.5b04599

PubMed Abstract | CrossRef Full Text | Google Scholar

53. Kappil MA, Li Q, Li A, Dassanayake PS, Xia Y, Nanes JA, et al. In utero exposures to environmental organic pollutants disrupt epigenetic marks linked to fetoplacental development. Environ Epigenet. (2016) 2:dvv013. doi: 10.1093/eep/dvv013

PubMed Abstract | CrossRef Full Text | Google Scholar

54. Boyle E, Viet S, Wright D, Merril Ll, Udeni Alwis K, Blount B, et al. Assessment of exposure to VOCs among pregnant women in the national children's study. Int J Environ Res Public Health. (2016) 13:376. doi: 10.3390/ijerph13040376

PubMed Abstract | CrossRef Full Text | Google Scholar

55. Kozinetz CA, Royse K, Graham SC, Yu X, Moye J, Selwyn BJ, et al. Consenting postpartum women for use of routinely collected biospecimens and/or future biospecimen collection. J Community Genet. (2016) 72:153–8. doi: 10.1007/s12687-016-0261-9

CrossRef Full Text | Google Scholar

56. Abdul-Rahman OA, Rodriguez B, Wadlinger SR, Slutsman J, Boyle EB, Merrill LS, et al. Success rates for consent and collection of prenatal biological specimens in an epidemiologic survey of child health. Birth Defects Res A Clin Mol Teratol. (2016) 106:47–5. doi: 10.1002/bdra.23455

PubMed Abstract | CrossRef Full Text | Google Scholar

57. Robbins JM, Bridges MD, Childers EM, Harris RM, McElfish PA. Lessons from prenatal care provider-based recruitment into the national children's study. Pediatr Rep. (2015) 7:6056. doi: 10.4081/pr.2015.6056

PubMed Abstract | CrossRef Full Text | Google Scholar

58. Stanford JB, Brenner R, Fetterer D, Palmer L, Schoendorf KC. Impact of preconception enrollment on birth enrollment timing of exposure assessment in the initial vanguard cohort of the US national children's study. BMC Med Res Methodol. (2015) 15:75. doi: 10.1186/s12874-015-0067-1

PubMed Abstract | CrossRef Full Text

59. Bakulski KM, Lee H, Feinberg JI, Wells EM, Brown S, Herbstman JB, et al. Prenatal mercury concentration is associated with changes in DNA methylation at TCEANC2 in newborns. Int J Epidemiol. (2015) 44:1249–6. doi: 10.1093/ije/dyv032

PubMed Abstract | CrossRef Full Text | Google Scholar

60. Li Q, Kappil MA, Li A, Dassanayake PS, Darrah TH, Friedman AE, et al. Exploring the associations between microRNA expression profiles and environmental pollutants in human placenta from the national children's study (NCS). Epigenetics. (2015) 10:793–802. doi: 10.1080/15592294.2015.1066960

PubMed Abstract | CrossRef Full Text | Google Scholar

61. Barksdale Boyle E, Deziel NC, Specker BL, Collingwood S, Weisel CP, Wright DJ, et al. Feasibility and informative value of environmental sample collection in the national children's vanguard study. Environ Res. (2015) 140:345–53. doi: 10.1016/j.envres.2015.04.006

PubMed Abstract | CrossRef Full Text | Google Scholar

62. Viet SM, Dellarco M, Dearborn DG, Neitzel R. Assessment of noise exposure to children: considerations for the national children's study. J Pregnancy Child Health. (2014) 1:105. doi: 10.4172/2376-127X.1000105

PubMed Abstract | CrossRef Full Text | Google Scholar

63. Nanes JA, Xia Y, Dassanayake RM, Jones RM, Li A, Stodgell CJ, et al. Selected persistent organic pollutants in human placental tissue from the United States. Chemosphere. (2014) 106:20–7. doi: 10.1016/j.chemosphere.2013.12.080

PubMed Abstract | CrossRef Full Text | Google Scholar

64. Ghantous A, Saffery R, Cros MP, Ponsonby AL, Hirschfeld S, Kasten C, et al. Optimized DNA extraction from neonatal dried blood spots: application in methylome profiling. BMC Biotechnol. (2014) 14:60. doi: 10.1186/1472-6750-14-60

PubMed Abstract | CrossRef Full Text | Google Scholar

65. Mortensen ME, Calafat AM, Ye X, Wong LY, Wright DJ, Pirkle JL, et al. Urinary concentrations of environmental phenols in pregnant women in a pilot study of the national children's study. Environ Res. (2014) 129:32–8. doi: 10.1016/j.envres.2013.12.004

PubMed Abstract | CrossRef Full Text | Google Scholar

66. Medina J, Morrell R, Pickett D, Lumpkin J, McCain T, Drankus Pekelnicky D, et al. Considerations for ensuring security of research data in a federally regulated environment. ISACA J. (2013) 3:26–33. Available online at: https://www.isaca.org/-/media/files/isacadp/project/isaca/articles/journal/archives/journal-volume-3-2014.pdf

67. Mechanic L, Mendez A, Merrill L, Rogers J, Layton M, Todd D, et al. Planned variation in preanalytical conditions to evaluate biospecimen stability in the National Children's Study (NCS). Clin Chem Lab Med. (2013) 51:2287–94. doi: 10.1515/cclm-2013-0454

PubMed Abstract | CrossRef Full Text | Google Scholar

68. Caldwell KL, Pan Y, Mortensen ME, Makhmudov A, Merrill L, Moye J. Iodine status in pregnant women in the national children's study in US women (15-44 years), national health and nutrition examination survey 2005-2010. Thyroid. (2013) 23:927–37. doi: 10.1089/thy.2013.0012

PubMed Abstract | CrossRef Full Text | Google Scholar

69. Kasten CH. The national children's study. abstracts of the national children's study research day 2011. Am J Med Genet A. (2012) 158A:2385–90. doi: 10.1002/ajmg.a.35432

CrossRef Full Text | Google Scholar

70. Sim I, Stebbins M, Bierer BE, Butte AJ, Drazen J, Dzau V, et al. Time for NIH to lead on data sharing. Science. (2020) 367:1308–9. doi: 10.1126/science.aba4456

PubMed Abstract | CrossRef Full Text | Google Scholar

71. NOT-OD-21-013. Final NIH Policy for Data Management and Sharing. NIH Guide to Grants and Contracts, release date October 29, 2020. Retrieved from: https://grants.nih.gov/grants/guide/notice-files/NOT-OD-21-013.html (accessed December 8, 2020).

72. NOT-OD-21-016. Supplemental Information to the NIH Policy for Data Management and Sharing: Selecting a Repository for Data Resulting from NIH-Supported Research. NIH Guide to Grants and Contracts, release date October 29, 2020. Retrieved from: https://grants.nih.gov/grants/guide/notice-files/NOT-OD-21-016.html (accessed December 8, 2020).

73. National Academies of Sciences, Engineering, and Medicine. Principles and Obstacles for Sharing Data from Environmental Health Research: Workshop Summary. Washington, DC: The National Academies Press (2016).

Google Scholar

74. Mays VM. The legacy of the US public health services study of untreated syphilis in african american men at tuskegee on the affordable care act and health care reform fifteen years after president clinton's apology. Ethics Behav. (2012) 22:411–8. doi: 10.1080/10508422.2012.730808

PubMed Abstract | CrossRef Full Text | Google Scholar

75. US Centers for Disease Control and Prevention, National Center for Health Statistics. Research Data Center (RDC). Retrieved from: https://www.cdc.gov/rdc/index.htm (accessed December 8, 2020).

Keywords: biological specimen bank, child health, data archive, data exploration, data sharing, environmental sample bank, information dissemination, longitudinal studies

Citation: Gilbertson PK, Forrester S, Andrews L, McCann K, Rogers L, Park C and Moye J (2021) The National Children's Study Archive Model: A 3-Tier Framework for Dissemination of Data and Specimens for General Use and Secondary Analysis. Front. Public Health 9:526286. doi: 10.3389/fpubh.2021.526286

Received: 24 January 2020; Accepted: 26 January 2021;
Published: 05 March 2021.

Edited by:

Yelena Mejova, Institute for Scientific Interchange, Italy

Reviewed by:

Elizabeth Barksdale Boyle, National Academies of Sciences, Engineering, and Medicine, United States
Mike Conway, The University of Utah, United States

Copyright © 2021 Gilbertson, Forrester, Andrews, McCann, Rogers, Park and Moye. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jack Moye, bW95ZWomI3gwMDA0MDtleGNoYW5nZS5uaWguZ292

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.