NIH Data Sharing Repositories
This table lists NIH-supported data repositories that make data accessible for reuse. Most accept submissions of appropriate data from NIH-funded investigators (and others), but some restrict data submission to only those researchers involved in a specific research network. Also included are resources that serve as a portal for information about biomedical data and information sharing systems. The table can be sorted by repository name and by NIH Institute or Center and may be searched using keywords so that you can find repositories more relevant to your data. Links are provided to information about submitting data to and accessing data from the listed repositories. Also, we have asked all repositories to indicate whether or not they have— current NIH funding support; open data submission; open data access; open time frame for data submission; and sustained support. Their Yes or No responses are included in the five columns to the right of the table. The information in the BMIC list is also available in a downloadable Excel version.
In our query to the repositories, we acknowledged that some repositories may have restrictions or limitations on data submission and requirements for data access. Additional information about the repositories and points-of-contact for further information or inquiries can be found on the websites or links shown for each. Are we missing a data sharing repository that should be included in this list? Do you have comments or feedback on this list or the website? Contact us.
|ICO||Repository Name||Repository Description||Data Submission Policy||Access to Data||Current NIH funding support||Open data submission||Open data access||Open time frame for data deposit||Sustained support|
|Common Fund||Epigenomics||Epigenomic, 6 histone modification marks, DNAse I, DNA methylation, transcriptome for wide variety of cell types and tissues.||Not applicable||How to access Epigenomics data||No||No||Yes||No||Yes|
|Common Fund||exRNA Atlas||Includes exRNA profiles derived from various biofluids and conditions and currently stores data profiled from small RNA sequencing assays.||Not applicable||How to access exRNA Atlas data||Yes||No||Yes||No||Yes|
|Common Fund||GTEx||The Genotype-Tissue Expression (GTEx) project aims to provide to the scientific community a resource with which to study human gene expression and regulation and its relationship to genetic variation. This project will collect and analyze multiple human tissues from donors who are also densely genotyped, to assess genetic variation within their genomes. By analyzing global RNA expression within individual tissues and treating the expression levels of genes as quantitative traits, variations in gene expression that are highly correlated with genetic variation can be identified as expression quantitative trait loci, or eQTLs.||Project data only||How to access GTEx data||Yes||No||Yes||No||Yes|
|Common Fund||HMP DACC||The HMP DACC is a common repository for diverse human microbiome datsets and minimum reporting standards for the Common Fund Human Microbiome Project (HMP).||Project data only||How to access HMP DACC data||No||No||Yes||No||Yes|
|Common Fund||Illuminating Druggable Genome||The goal of the Illuminating the Druggable Genome (IDG) program is to improve our understanding of the properties and functions of proteins that are currently unannotated within the three most commonly drug-targeted protein families: G-protein coupled receptors, ion channels, and protein kinases.||Project data only||How to access IDG data||Yes||No||Yes||Yes||Yes|
|Common Fund||International Mouse Phenotyping Consortium (IMPC)||Phenotype data on knockout mouse lines.||How to submit data to IMPC||How to access IMPC data||Yes||No||Yes||Yes||Yes|
|Common Fund||Kids First||The Common Fund’s Gabriella Miller Kids First Pediatric Research Program (Kids First) is developing a large-scale data resource to help researchers uncover new insights into the biology of childhood cancer and structural birth defects, including the discovery of shared genetic pathways between these disorders. Childhood cancers and structural birth defects have profound, lifelong effects on patients and their families. The Kids First program will stimulate research in these areas and work towards providing a better understanding of how genetics plays a role in these conditions that will lead to early detection, therapeutic interventions, and toward more effective prevention.||Project data only||How to access Kids First data||Yes||No||Yes||No||Yes|
|Common Fund||LINCS||The Library of Integrated Network-Based Cellular Signatures (LINCS) Program aims to create a network-based understanding of biology by cataloging changes in gene expression and other cellular processes that occur when cells are exposed to a variety of perturbing agents.||Project data only||How to access LINCS data||Yes||No||Yes||Yes||Yes|
|Common Fund||Metabolomics Workbench (MetWB)||The Metabolomics Program's Data Repository and Coordinating Center (DRCC), housed at the San Diego Supercomputer Center (SDSC), University of California, San Diego, has developed the Metabolomics Workbench. MetWB will serve as a national and international repository for metabolomics data and metadata and will provide analysis tools and access to metabolite standards, protocols, tutorials, training, and more.||How to submit data to MetWB||How to access MetWB data||Yes||Yes||Yes||Yes||Yes|
|Common Fund||SOBC Measures Repository||The SOBC measures repository is the source for behavioral science measures that have been validated (or are in the process of being validated) in accordance with the SOBC Experimental Medicine Approach.||How to submit data to SOBC||How to access SOBC data||Yes||No||Yes||No||No|
|Common Fund||Stimulating Peripheral Activity to Relieve Conditions Portal (SPARC)||The SPARC Portal provides interactive access to a growing collection of data, maps, and computational studies that focus on the role of the autonomic nervous system in controlling organ function. These resources are made available to the public with the intent of advancing bioelectronic medicine towards more precise treatment of diseases and conditions.||Project data only||How to access SPARC data||Yes||No||Yes||Yes||Yes|
|Common Fund (NCCIH, NCI, NHLBI, NIA, NIAMS, NIDA, NIDCD, NIDDK, NIMH, NINDS, NINR, & OD)||PROMIS||The Patient-Reported Outcomes Measurment Informmation System (PROMIS) is a set of person-centered measures that evaluates and monitors physical, mental, and social health in adults and children.||Project data only||How to access PROMIS data||Yes||No||Yes||No||No|
|NCI||Cancer Nanotechnology Laboratory (caNanoLab)||caNanoLab is a data sharing portal designed to facilitate information sharing in the biomedical nanotechnology research community to expedite and validate the use of nanotechnology in biomedicine. caNanoLab provides support for the annotation of nanomaterials with characterizations resulting from physico-chemical, in vitro, and in vivo assays and the sharing of these characterizations and associated nanotechnology protocols in a secure fashion.||How to submit data to caNanoLab||How to access caNanoLab data||Yes||Yes||Yes||Yes||Yes|
|NCI||Genomic Data Commons (GDC)||The mission of the GDC is to provide the cancer research community with a unified data repository that enables data sharing across cancer genomic studies in support of precision medicine. The GDC contains clinical, biospecimen, and molecular data from several cancer research programs.||How to submit data to GDC||How to access GDC data||Yes||Yes||Yes||Yes||Yes|
|NCI||The Cancer Imaging Archive (TCIA)||TCIA is a service which de-identifies and hosts a large archive of medical images of cancer accessible for public download. The data are organized as “Collections”, typically patients related by a common disease (e.g. lung cancer), image modality (MRI, CT, etc) or research focus. DICOM is the primary file format used by TCIA for image storage. Supporting data related to the images such as patient outcomes, treatment details, genomics, pathology, and expert analyses are also provided when available.||How to submit data to TCIA||How to access TCIA data||Yes||Yes||Yes||Yes||Yes|
|NCI||The Network Data Exchange (NDEx)||NDEx is an online commons where scientists can upload, share, and publicly distribute biological networks and pathway models. The NDEx Project maintains a web-accessible public server, a documentation website, provides seamless connectivity to Cytoscape as well as programmatic access using a variety of languages including Python and Java. NDEx users can easily create accounts or sign in using their Google credentials thanks to the supported open authentication (OAUTH2) method and mint DOIs for their networks to use in publications or include in other resources for long term access.||How to submit data to NDEx||How to access NDEx data||Yes||Yes||Yes||Yes||Yes|
|NCI||NCI CSSI Data Coordinating Center (CSSI DCC)||The CSSI DCC stores and manages access to diverse NCI-supported cancer reserarch data, such as advanced imaging, pre-analytical variable studies, and cell characterizations. This data is in the standard Investigation-Study-Assay tab-delimited format (ISA-TAB) format, which describes a scientific investigation, its study or studies, and each study's assay(s).||How to submit data to CSSI DCC||How to access CSSI DCC data||Yes||No||Yes||Yes||Yes|
|NCI (NHGRI, NIGMS)||PeptideAtlas||PeptideAtlas is a multi-organism, publicly accessible compendium of peptides identified in a large set of tandem mass spectrometry proteomics experiments. Mass spectrometer output files are collected for human, mouse, yeast, and several other organisms, and searched using the latest search engines and protein sequences.||How to submit data to PeptideAtlas||How to access Peptide Atlas data||No||yes||Yes||Yes||No|
|NEI||EyeGENE®||The eyeGENE® Biorepository and corresponding Database contain family history and clinical eye exam data from subjects enrolled in eyeGENE® Program coupled to clinical grade DNA samples. This data and samples are submitted by collaborators throughout the US and Canada and the data is available on a controlled access basis to researchers world-wide.||How to submit data to EyeGENE®||How to access EyeGENE® data||Yes||Yes||Yes||Yes||Yes|
|NEI||NEI Data Commons||The NEI Commons is a virtual infrastructure to enable sharing. The Commons portal provides a platform for querying and accessing vision research data and tools for data processing and analysis. It is the central location for NEI generated clinical and basic research data available to the public.||Dependent on sub-project||How to access NEI Commons data||Yes||Yes||Yes||Yes||Yes|
|NHGRI||FlyBase: A Drosophila Genomic and Genetic Database||Drosophila Genomic and Genetic database that includes proteomics data, microarrays and Tiling BAC's.||How to submit data to Flybase||How to access Flybase data||Yes||Yes||Yes||Yes||Yes|
|NHGRI||The Zebrafish Model Organism Database (ZFIN)||ZFIN serves as the zebrafish model organism database. It aims to: a) be the community database resource for the laboratory use of zebrafish, b) develop and support integrated zebrafish genetic, genomic and developmental information, c) maintain the definitive reference data sets of zebrafish research information, d) to link this information extensively to corresponding data in other model organism and human databases, e) facilitate the use of zebrafish as a model for human biology, and f) serve the needs of the research community.||How to submit data to ZFIN||How to access ZFIN data||Yes||Yes||Yes||Yes||Yes|
|NHGRI||WormBase||WormBase is an international consortium of biologists and computer scientists dedicated to providing the research community with accurate, current, accessible information concerning the genetics, genomics and biology of C. elegans and related nematodes.||How to submit data to WormBase||How to access WormBase data||Yes||Yes||Yes||Yes||Yes|
|NHGRI (NICHD, NCI)||Mouse Genome Informatics (MGI)||MGI is the international database resource for the laboratory mouse, providing integrated genetic, genomic, and biological data to facilitate the study of human health and disease.||How to submit data to MGI||How to access MGI data||Yes||Yes||Yes||Yes||Yes|
|NHGRI/NIGMS||The Universal Protein Resource (UniProt)||The Universal Protein Resource (UniProt) is a comprehensive resource for protein sequence and annotation data. The UniProt databases are the UniProt Knowledgebase (UniProtKB), the UniProt Reference Clusters (UniRef), and the UniProt Archive (UniParc).||How to submit data to UniProt||How to access UniProt data||Yes||Yes||Yes||Yes||Yes|
|NHLBI||Biologic Specimen and Data Repository Information Coordinating Center (BioLINCC)||The goal of BioLINCC is to facilitate and coordinate the existing activities of the NHLBI Biorepository and the Data Repository and to expand their scope and usability to the scientific community through a single web-based user interface.||How to submit data to BioLINCC||How to access BioLINCC data||Yes||Yes||Yes||Yes||Yes|
|NHLBI||National Sleep Research Resource||The NSRR web platform enables sharing of physiological signals and clinical data elements from well-characterized, de-identified, research cohorts and clinical trials. Data queries across studies (data pooling) are possible for demographics and data variables with shared common data elements.||How to submit data to NSRR||How to access NSRR data||Yes||Yes||Yes||Yes||Yes|
|NHLBI||Rat Genome Database (RGD)||The Rat Genome Database (RGD) was established in 1999 and is the premier site for genetic, genomic, phenotype, and disease data generated from rat research. In addition, it provides easy access to corresponding data for human and mouse, as well as multiple other models such as chinchilla and 13-lined ground squirrel, facilitating cross-species comparisons. RGD’s comprehensive data and innovative software tools make it a valuable resource for researchers worldwide.||How to submit data to RGD||How to access RGD data||Yes||Yes||Yes||Yes||Yes|
|NHLBI||The Cardiovascular Research Grid (CVRG)||The CardioVascular Research Grid (CVRG) project is creating an infrastructure for secure seamless access to study data and analysis tools.
CVRG tools are developed using the Software as a Service model, allowing users to access tools through their browser, thus eliminating the need to install and maintain complex software. The CVRG project is supported by the National Heart Lung & Blood Institute. The project is based at the Institute for Computational Medicine at the Johns Hopkins University, Department of Biomedical Informatics at Vanderbilt University Medical Center, The College of Computing and Informatics at UNC Charlotte, The Center for Comprehensive Informatics at Emory University, The College of Engineering and Applied Sciences at Stony Brook University, and the Computation Institute at The University
|No new data submission, but one could submit new tools.||How to access CVRG data||No||No||Yes||No||No|
|NIA||AMP-AD Knowledge Portal||The AMP-AD Knowledge Portal is an NIH-designated repository and the distribution site for multi-omic data from human samples, cell-based and animal models, analysis results, analytical methodology and research tools generated by multiple National Institute of Aging supported Alezheimer's disease research programs and consortia . Data is available to qualified investigators as open or controlled access depending on the data type and data source.||How to submit data to AMP-AD Knowledge Portal||How to access AMP-AD Knowledge Portal data||Yes||Yes||Yes||Yes||Yes|
|NIA||National Archive of Computerized Data on Aging (NACDA)||NACDA acquires and preserves data relevant to gerontological research, processing as needed to promote effective research use, disseminates them to researchers, and facilitates their use. By preserving and making available the largest library of electronic data on aging in the United States, NACDA offers opportunities for secondary analysis on major issues of scientific and policy relevance.||How to submit data to NACDA||How to access NACDA data||Yes||Yes||Yes||Yes||Yes|
|NIA||The National Institute on Aging Genetics of Alzheimer’s Disease Data Storage Site (NIAGADS)||The National Institute on Aging Genetics of Alzheimer's Disease Data Storage Site (NIAGADS) is a national genetics data repository facilitating access to genotypic and phenotypic data for Alzheimer's disease (AD). Data include GWAS, whole genome (WGS) and whole exome (WES), expression, RNA Seq, and CHIP Seq analyses. Data for the Alzheimer’s Disease Sequencing Project (ADSP) are available through a partnership with dbGaP (ADSP at dbGaP). Results are integrated and annotated in the searchable genomics database that also provides access to a variety of software packages, analytic pipelines, online resources, and web-based tools to facilitate analysis and interpretation of large-scale genomic data. Data are available as defined by the NIA Genomics of Alzheimer’s Disease Sharing Policy and the NIH Genomics Data Sharing Policy. Investigators return secondary analysis data to the database in keeping with the NIAGADS Data Distribution Agreement.||How to submit data to NIAGADS||How to access NIAGADS data||Yes||Yes||Yes||Yes||Yes|
|NIAID||Eukaryotic Pathogen Database Resources (EuPathDB)||EuPathDB Bioinformatics Resource Center for Biodefense and Emerging/Re-emerging Infectious Diseases is a portal for accessing genomic-scale datasets associated with the eukaryotic pathogens.||How to submit data to EuPathDB||How to access EuPathDB data||Yes||Yes||Yes||Yes||Yes|
|NIAID||Immune Epitope Database and Analysis Resource (IEDB)||This repository contains antibody/B cell and T cell epitope information and epitope prediction and analysis tools for use by the research community worldwide. Immune epitopes are defined as molecular structures recognized by specific antigen receptors of the immune system, namely antibodies, B cell receptors, and T cell receptors. Immune epitopes from infectious diseases, excluding HIV, and immune-mediated diseases and the accompanying biological information are included.||How to submit data to IEDB||How to access IEDB data||Yes||Yes||Yes||Yes||Yes|
|NIAID||Influenza Research Database (IRD)||The Influenza Research Database (IRD) serves as a public repository and analysis platform for flu sequence, experiment, surveillance and related data.||How to submit data to IRD||How to access IRD data||Yes||Yes||Yes||Yes||Yes|
|NIAID||ITN TrialShare||TrialShare is a transformative approach to data-sharing that enables clinical trial transparency. TrialShare provides clinical trial investigators the unprecedented ability to access de-identified study data, review published analysis methods, and perform real-time, interactive graphical analyses in collaboration with other researchers. Immune Tolerance Network (ITN) TrialShare shares information about ITN's clinical studies and specimen bio-repository. Data and analysis code underlying ITN-published manuscripts are publicly available with the goal of promoting transparency, reproducibility, and scientific collaboration.||How to submit data to TrialShare||How to access TrialShare data||Yes||Yes||Yes||Yes||Yes|
|NIAID||Pathosystems Resource Integration Center (PATRIC)||PATRIC, the Bacterial Bioinformatics Resource Center, supports research on bacterial infectious diseases by serving as a repository of genomic and other data with associated metadata for over 100,000 bacterial genomes. PATRIC provides an integrated suite of computational services and visualizations for users to analyze and compare their own data in a private workspace with the public data in PATRIC.||How to submit data to PATRIC||How to access PATRIC data||Yes||Yes||Yes||Yes||Yes|
|NIAID||The Immunology Database and Analysis Portal (ImmPort)||ImmPort is a public data sharing repository funded by DAIT. The data sources are primarily DAIT-funded clinical trials, associated mechanistic studies, and other basic and applied immunology research programs.||How to submit data to ImmPort||How to access ImmPort data||Yes||Yes||Yes||Yes||Yes|
|NIAID||VectorBase||VectorBase is a Bioinformatics Resource Center for invertebrate vectors. It is one of four Bioinformatics Resource Centers funded by NIAID to provide web-based resources to scientific community conducting basic and applied research on organisms considered potential agents of biowarfare or bioterrorism or causing emerging or re-emerging diseases.||How to submit data to VectorBase||How to access VectorBase data||Yes||Yes||Yes||Yes||Yes|
|NIAID||Virus Pathogen Research (ViPR)||The Virus Pathogen Resource (ViPR) provides a searchable public repository of genomic, proteomic and other important research data for more than 500,000 strains of pathogenic viruses along with a suite of tools for analyzing these data. Data can be shared, aggregated, analyzed using ViPR tools, and downloaded for local analysis.||How to submit data to ViPR||How to access ViPR data||Yes||Yes||Yes||Yes||Yes|
|NIBIB||LONI Database||The LONI Image Data Archive (IDA) is a user-friendly environment for archiving, searching, sharing, tracking and disseminating neuroimaging and related clinical data. The IDA is utilized for dozens of neuroimaging research projects across North America and Europe and accommodates MRI, PET, MRA, DTI and other imaging modalities.||How to submit data to LONI [PDF]||How to access LONI data [PDF]||Yes||Yes||Yes||Yes||Yes|
|NIBIB||Medical Information Mart for Intensive Care-III (MIMIC-III)||MIMIC-III is a large, freely-available database comprising deidentified health-related data associated with over forty thousand patients who stayed in critical care units of the Beth Israel Deaconess Medical Center between 2001 and 2012. The database includes information such as demographics, vital sign measurements made at the bedside (~1 data point per hour), laboratory test results, procedures, medications, caregiver notes, imaging reports, and mortality (both in and out of hospital). MIMIC-III supports a diverse range of analytic studies spanning epidemiology, clinical decision-rule improvement, and electronic tool development.||How to submit data of MIMIC-III||How to access MIMIC-III data||Yes||Yes||Yes||Yes||Yes|
|NIBIB (NIMH, NIDA)||NeuroImaging Tools and Resources Collaboratory (NITRC)||The NeuroImaging Tools and Resources Collaboratory (NITRC) provides free access to data and enables pay-per-use cloud-based access to unlimited computing power, enabling worldwide scientific collaboration with minimal startup and cost. With NITRC and its components—the Resources Registry (NITRC-R), Image Repository (NITRC-IR), and Computational Environment (NITRC-CE)—a researcher can obtain pilot or proof-of-concept data to validate a hypothesis for a few dollars.||How to submit data to NITRC||How to access NITRC data||Yes||Yes||Yes||Yes||Yes|
|NICHD||Biospecimen Repository Access and Data Sharing (BRADS)||The Division of Intramural Population Health Research (DIPHR) of the Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD) maintains an extensive repository of datasets from completed studies, including biospecimens and ancillary data.||Not applicable||How to access BRADS data||Yes||No||No||No||Yes|
|NICHD||Child Language Data Exchange System (CHILDES)||CHILDES is a system for sharing and analyzing conversational interactions.||How to submit data to CHILDES||How to access CHILDES data||Yes||Yes||Yes||Yes||Yes|
|NICHD||Data and Specimen Hub (DASH)||NICHD DASH is a centralized resource for researchers to store and access de-identified data from NICHD funded research studies for the purposes of secondary research use. It serves as a mechanism for NICHD-funded extramural and intramural investigators to share research data from studies in accordance with the NIH Data Sharing Policy and the NIH Genomic Data Sharing Policy.||How to submit data to DASH||How to access DASH data||Yes||Yes||Yes||Yes||Yes|
|NICHD||Data Sharing for Demographic Research (DSDR)||DSDR is a data sharing project providing curation and archiving services for the demographic and population sciences community. It focuses on data collected through funding from the NICHD Population Dynamics Branch (PDB), but also provides these services for other data sets that fall within the scientific mission of PDB.||How to submit data to DSDR||How to access DSDR data||Yes||Yes||Yes||Yes||Yes|
|NICHD||National Children’s Study (NCS) Archive||The NCS Archive, a data and sample repository for the National Children’s Study, provides access to data and samples collected from over 5,600 U.S. birth families to study environmental influences on child health and development. Data and biological and environmental samples are freely available, with an approved request, for scientific research.||How to submit data to NCS||How to access NCS data||Yes||Yes||Yes||Yes||Yes|
|NICHD||PhonBank||PhonBank is an open database for the study of early phonological development using the Phon program.||How to submit data to PhonBank||How to access PhonBank data||Yes||Yes||Yes||Yes||Yes|
|NICHD||Xenbase||Xenbase is a Xenopus laevis and Xenopus tropicalis biology and genomics resource.||How to submit data to Xenbase||How to access Xenbase data||Yes||Yes||Yes||Yes||Yes|
|NICHD (NINDS, & NIBIB)||Archive of Data on Disability to Enable Policy (ADDEP)||ADDEP provides access to data including a wide range of topics related to disability. ADDEP data can be used to better understand and inform the implementation of the Americans with Disabilities Act and other disability policies.||How to submit data to ADDEP||How to access ADDEP data||Yes||Yes||Yes||Yes||No|
|NIDA||Mouse Phenome Database (MPD)||The Mouse Phenome Database (MPD) enables the integration of genomic and phenomic data by providing access to primary experimental data, well-documented data collection protocols and analysis tools. Data are contributed by investigators from around the world and represent a broad scope of behavioral, morphological and physiological disease-related characteristics in naive mice and those exposed to drugs, environmental agents or other treatments.||How to submit data to MPD||How to access MPD data||Yes||Yes||Yes||Yes||Yes|
|NIDA||National Addiction & HIV Data Archive Program (NAHDAP)||NAHDAP acquires, preserves and disseminates data relevant to drug addiction and HIV research. By preserving and making available an easily accessible library of electronic data on drug addiction and HIV infection in the United States, NAHDAP offers scholars the opportunity to conduct secondary analysis on major issues of social and behavioral sciences and public policy.||How to submit data to NAHDAP||How to access NAHDAP data||Yes||Yes||Yes||Yes||Yes|
|NIDA||Neuroscience Information Framework (NIF)||Discovery Portal: An innovative semantically-enhanced search engine of the largest collection of neuroscience information that allows users to customize and refine their searches via NIF's terminologies.
The NIF Registry: a core resource of NIF, is a catalog of electronic resources that have been selected by NIF curators, or contributed by the community, as valuable tools for researchers and students in the field of neuroscience.
Date Sharing service: NIF maintains the largest searchable collection of neuroscience data, the largest catalog of biomedical resources, and the largest ontology for neuroscience on the web. NIF's already established and extensive infrastructure makes sharing your data easy.
LinkOut Broker: NIF provides links between PubMed articles and your data when you register to the Data Federation through the LinkOut Broker. To enable this feature, your data must include PubMed IDs.
Ontology Engineering: Expertise and experience in building and enhancing the main terminologies and ontologies.
|How to submit data to NIF||How to access NIF data||Yes||Yes||Yes||Yes||Yes|
|NIDA||NIDA Center for Genetics Studies||Blood and other biospecimens along with phenotypic data collected, archived and stored.||How to submit data to the NIDA Center for Genetics Studies||How to access NIDA Center for Genetics Studies data||Yes||Yes||No||Yes||Yes|
|NIDA||NIDA Data Share||The NIDA Data Share web site is an electronic environment that allows data from completed clinical trials to be distributed to investigators and the public in order to promote new research, encourage further analyses, and disseminate information to the community.||Data download only. Data is submitted through NIDA POC||How to access NIDA Data Share data||Yes||No||Yes||No||Yes|
|NIDCD||AphasiaBank||AphasiaBank is a shared database of multimedia interactions for the study of communication in aphasia. Access to the data in AphasiaBank is password protected and restricted to members of the AphasiaBank consortium group.||How to submit data to AphasiaBank||How to access AphasiaBank data||Yes||Yes||Yes||Yes||Yes|
|NIDCD||FluencyBank||FluencyBank is a shared database for the study of the development of fluency in both normal and disordered populations. Participants include normally-developing monolingual and bilingual children, children with disfluencies (CWD), adults with disfluencies (AWSD), and second language learners.||How to submit data to FluencyBank||How to access FluencyBank data||Yes||Yes||Yes||Yes||Yes|
|NIDCR||FaceBase||FaceBase is a NIDCR-funded data hub that hosts variety of data generated through dental, oral,and craniofacial research using model organisms and humans. The data offer spotlights high-throughput genetic, molecular, biological, imaging and computational techniques, as well as the database of 3D Facial Norms, developmental atlases and the Ontology of Craniofacial Development and Malformation (OCDM), Human Genome Analysis Interface (HGAI) and other resources.||How to submit data to FaceBase||How to access FaceBase data||Yes||Yes||Yes||Yes||Yes|
|NIDDK||NIDDK Central Repository||The NIDDK Central Repository stores biosamples, genetic and other data collected in designated NIDDK-funded clinical studies. The purpose of the NIDDK Central Repository is to expand the usefulness of these studies by allowing a wider research community to access data and materials beyond the end of the study.||How to submit data to NIDDK Central Repository||How to access NIDDK Central Repository data||Yes||Yes||Yes||Yes||Yes|
|NIDDK||NIDDK Information Network (DKnet)||The NIDDK Information Network serves the needs of basic and clinical investigators by providing seamless access to large pools of data relevant to the mission of NIDDK. The goal of DKnet is to develop a community-based network for integration across disciplines to include the larger DK universe of diseases, investigators, and potential users.||How to submit data to DKnet||How to access DKnet data||Yes||Yes||Yes||Yes||Yes|
|NIDDK||Nuclear Receptor Signaling Atlas (NURSA)||The Nuclear Receptor Signaling Atlas (NURSA) is designed to foster the development of a comprehensive understanding of the structure, function, and role in disease of nuclear receptors (NRs) and coregulators. NURSA seeks to elucidate the roles played by NRs and coregulators in metabolism and the development of metabolic disorders (including type 2 diabetes, obesity, osteoporosis, and lipid dysregulation), as well as in cardiovascular disease, oncology, regenerative medicine and the effects of environmental agents on their actions.||How to submit data to NURSA||How to access NURSA data||Yes||Yes||Yes||Yes||Yes|
|NIDDK||The AMP-T2D Knowledge Portal||The AMP-T2D Knowledge Portal is the NIDDK-designated repository being developed as part of the Accelerating Medicines Partnership (AMP), a public-private partnership between the National Institutes of Health (NIH), the U.S. Food and Drug Administration (FDA), 10 biopharmaceutical companies, and multiple non-profit organizations that is managed through the Foundation for the NIH (FNIH) for collection of various types of molecular data such as DNA sequences, functional and epigenomic information, and clinical data from studies on type 2 diabetes and its macro- and microvascular complications from human, cell-based and animal model biosamples, analytical results and research tools towards improving current efforts to develop new therapies for complex, heterogeneous diseases, including type 2 diabetes.||How to submit data to the AMP-T2D Knowledge Portal||How to access AMP-T2D Knowledge Portal data||Yes||No||Yes||Yes||Yes|
|NIEHS||Chemical Effects in Biological Systems (CEBS)||The CEBS database houses data of interest to environmental health scientists. CEBS is a public resource, and has received depositions of data from academic, industrial, and governmental laboratories. CEBS is designed to display data in the context of biology and study design, and permit data integration across studies for novel meta analysis.||How to submit data to CEBS||How to access CEBS data||Yes||Yes||Yes||Yes||Yes|
|NIGMS||Cell Image Library||The Cell Image Library, accepts image data sets that are too large for publishers to store, and provides access to the biomedical community. There are 10,000 datasets in 20TB of uploaded data as of mid-2018. The library inherits data from the Cell Centered Database at UCSD. Since its launch in 2010, the site has had 721,00 visitors and was cited by 175 research publications.||How to submit data to Cell Image Library||How to access Cell Image Library data||Yes||Yes||Yes||Yes||Yes|
|NIGMS||Database of Interacting Proteins (DIP)||The DIP database, a founding member of the International Molecular Exchange Consortium (IMEx: https://www.imexconsortium.org), catalogs experimentally determined interactions between proteins. It combines information from a variety of sources to create a single, consistent set of protein-protein interactions. The data stored within the DIP database are manually curated and include, when available, detailed information on the interacting regions, mutations and post-translational modifications that affect each interaction.||How to submit data to DIP||How to access DIP data||Yes||Yes||Yes||Yes||Yes|
|NIGMS||PhysioNet||The PhysioNet Resource is intended to stimulate current research and new investigations in the study of complex biomedical and physiologic signals. It offers free web access to large collections of recorded physiologic signals (PhysioBank) and related open-source software (PhysioToolkit).||How to submit data to PhysioNet||How to access PhysioNet data||Yes||Yes||Yes||Yes||Yes|
|NIGMS||Transporter Classification Database (TCDB)||The Transporter Classification Database details a comprehensive classification system for membrane transport proteins known as the Transporter Classification (TC) system. The TC system is analogous to the Enzyme Commission (EC) system for classification of enzymes, except that it incorporates both functional and phylogenetic information. Descriptions, TC numbers, and examples of over 600 families of transport proteins are provided.||How to submit data to TCDB||How to access TCDB data||Yes||No||Yes||No||Yes|
|NIGMS (NCI, NSF, DOE-BER)||Protein Data Bank (PDB)||The mission of the RCSB Protein Data Bank (PDB) is to Sustain a unique data resource of three-dimensional biomolecular structure
|How to submit data to PDB||How to access PDB data||Yes||Yes||Yes||Yes||No|
|NIH||Biological General Repository for Interaction Datasets (BioGRID)||BioGRID is a curated biological database of protein-protein interactions, genetic interactions, chemical interactions, and post-translational modifications from major model organism species. All interaction data are freely provided through our search index and available via download in a wide variety of standardized formats.||How to submit data to BioGRID||How to access BioGRID data||Yes||Yes||Yes||Yes||Yes|
|NIH (NIA, NICHD, NIDA)||Inter-university Consortium for Political and Social Research (ICPSR)||ICPSR provides leadership and training in data access, curation, and methods of analysis for the social science research community.
ICPSR maintains a data archive of more than 250,000 files of research in the social and behavioral sciences.
ICPSR collaborates with a number of funders, including U.S. statistical agencies and foundations, to create thematic data collections and data stewardship and research projects. NICHD (DSDR), NIA (NACDA), and NIDA (NAHDAP) currently support specialty archives that use the ICPSR infrastructure.
|How to submit data to ICPSR||How to access ICPSR data||Yes||Yes||Yes||Yes||Yes|
|NIMH||NIMH Data Archive||The National Institute of Mental Health Data Archive (NDA) makes available human subjects data collected from hundreds of research projects across many scientific domains. The NDA provides infrastructure for sharing research data, tools, methods, and analyses enabling collaborative science and discovery. De-identified human subjects data, harmonized to a common standard, are available to qualified researchers. Summary data is available to all.||How to submit data to the NIMH Data Archive||How to access NIMH Data Archive data||Yes||No||Yes||Yes||Yes|
|NIMH||NIMH Repository and Genomics Resources (NRGR)||The NIMH Repository and Genomics Resource (NRGR) is the largest biorepository supporting genomics in psychiatry, providing access to biomaterials (DNA, plasma, RNA, lymphoblastoid cell lines, induced pluripotent stem cells, etc.) and phenotypic data collected from nearly 200,000 well-characterized, high-quality control and patient samples from a wide-range of mental illnesses across ancestrally diverse populations. The NRGR also provides access to some genomic data sets, including some which do not have biosamples available in the repository, such as data sets generated by the Psychiatric Genomics Consortium, the CommonMind Consortium, and the PsychENCODE consoritum.||How to submit data to NRGR||How to access NRGR data||Yes||Yes||Yes||Yes||Yes|
|NIMH||OpenfMRI||The OpenfMRI database is a curated public repository of human and non-human brain imaging data collected using MRI technique (potentially with additional PET, EEG and MEG data). No registration or license agreement is required to obtain the data, which is distributed, by default, using a Public Domain license. This is possible because data are anonymized before distribution to protect the confidentiality of participants.||How to submit data to OpenfMRI||How to access OpenfMRI data||Yes||Yes||Yes||Yes||Yes|
|NINDS||Archived Clinical Research Datasets||The data from NINDS-supported clinical trials are an important scientific resource, made available to the wider scientific community, while ensuring that the confidentiality and privacy of study participants are protected. NINDS requires all investigators seeking access to data from archived NINDS-supported trials to agree to certain terms and conditions.||How to submit data to Archived Clinical Research Data||How to access Archived Clinical Research Datasets data||Yes||Yes||Yes||Yes||Yes|
|NINDS||Federal Interagency Traumatic Brain Injury Research (FITBIR) Informatics System||The Federal Interagency Traumatic Brain Injury Research (FITBIR) informatics system was developed to share data across the entire TBI research field and to facilitate collaboration between laboratories, as well as interconnectivity with other informatics platforms. Sharing data, methodologies, and associated tools, rather than summaries or interpretations of this information, can accelerate research progress by allowing re-analysis of data, as well as re-aggregation, integration, and rigorous comparison with other data, tools, and methods. This community-wide sharing requires common data definitions and standards, as well as comprehensive and coherent informatics approaches.||How to submit data to FITBIR||How to access FITBIR data||Yes||No||Yes||Yes||Yes|
|NINDS||NeuroMorpho.org||NeuroMorpho.Org is a centrally curated inventory of digitally reconstructed neurons associated with peer-reviewed publications. It contains contributions from over 400 laboratories worldwide and is continuously updated as new morphological reconstructions are collected, published, and shared. To date, NeuroMorpho.Org is the largest collection of publicly accessible 3D neuronal reconstructions and associated metadata.
The goal of NeuroMorpho.Org is to provide dense coverage of available reconstruction data for the neuroscience community. Data sharing through NeuroMorpho.Org enables the full and continuing research potential of existing digital reconstruction data.
|How to submit data to NeuroMorpho||How to access NeuroMorpho data||Yes||Yes||Yes||Yes||Yes|
|NINDS||Parkinson’s Disease Biomarkers Program Data Management Resource (PDBP DMR)||The NINDS Parkinson’s Disease (PD) Biomarkers Program Data Management Resource enables web-based data entry for clinical studies supporting PD biomarker development, as well as broad data sharing (imaging, clinical, genetic, and biospecimen analysis) across the entire PD research community. The PDBP DMR coordinates information and access to PD biospecimens distributed through the NINDS Human Genetics, DNA, iPSC , Cell Line and Biospecimen Repository and the Harvard Neurodiscovery Initiative.||How to submit data to PDBP DMR||How to access PDBP DMR data|
|NINDS||The NINDS Human Cell and Data Repository (NHCDR)||Cell sources currently include fibroblasts and/or induced pluripotent stem cells for Alzheimer’s Disease, Amyotrophic Lateral Sclerosis (ALS), Ataxia-telangiectasia, Frontotemporal Lobar Degeneration (FTD), Huntington’s Disease, Parkinson’s Disease, and healthy controls. Cell sources, including isogenic cell lines for current and new diseases covered by the NINDS will be added over the next several years.||How to submit data to NHCDR||How to access NHCDR data|
|NLM||ClinicalTrials.gov||ClinicalTrials.gov is a registry and results database of publicly and privately supported clinical studies of human participants conducted around the world.||How to submit data to ClinicalTrials.gov||How to access ClinicalTrials.gov data||Yes||Yes||Yes||Yes||Yes|
|NLM||database of Genotypes and Phenotypes (dbGaP)||The database of Genotypes and Phenotypes (dbGaP) was developed to archive and distribute the data and results from studies that have investigated the interaction of genotype and phenotype in Humans.||How to submit data to dbGaP||How to access dbGaP data||No||Yes||Yes||Yes||Yes|
|NLM||dbSNP||dbSNP is a general public archive of all short sequence variation, not just single nucleotide substitutions that occur frequently enough in a population to be termed polymorphic, but also rare variants, including those with clinical assertions in ClinVar. dbSNP includes a broad collection of simple genetic variations such as single-base nucleotide substitutions, small-scale multi-base deletions or insertions, and microsatellite repeats.||How to submit data to dbSNP||How to access dbSNP data||Yes||Yes||Yes||Yes||Yes|
|NLM||dbVar||dbVar is a NCBI database of genomic structural variation.||How to submit data to dbVar||How to access dbVar data||Yes||Yes||Yes||Yes||Yes|
|NLM||GenBank||GenBank is the NIH genetic sequence database, an annotated collection of all publicly available DNA sequences. GenBank is designed to provide and encourage access within the scientific community to the most up to date and comprehensive DNA sequence information.||How to submit data to GenBank||How to access GenBank data||Yes||Yes||Yes||Yes||Yes|
|NLM||Gene Expression Omnibus (GEO)||Gene Expression Omnibus is a public functional genomics data repository supporting MIAME-compliant submissions of array- and sequence-based data. Tools are provided to help users query and download experiments and curated gene expression profiles.||How to submit data to GEO||How to access GEO data||Yes||Yes||Yes||Yes||Yes|
|NLM||PubChem||PubChem provides information on the biological activities of small molecules. PubChem is organized as three linked databases (bioassay, compound, and substance data) within the NCBI's Entrez information retrieval system.||How to submit data to PubChem||How to access PubChem data||Yes||Yes||Yes||Yes||Yes|
|NLM||Sequence Read Archive (SRA)||The SRA is NIH's primary archive of high-throughput sequencing data and is part of the International Nucleotide Sequence Database Collaboration (INSDC) that includes at the NCBI Sequence Read Archive (SRA), the European Bioinformatics Institute (EBI), and the DNA Database of Japan (DDBJ). Data submitted to any of the three organizations are shared among them.||How to submit data to SRA||How to access SRA data||Yes||Yes||Yes||Yes||Yes|
Last Reviewed: October 4, 2019