Skip Navigation Bar
BMIC logo

Trans-NIH BioMedical Informatics Coordinating Committee (BMIC)

NIH Data Sharing Repositories

This table lists NIH-supported data repositories that make data accessible for reuse. Most accept submissions of appropriate data from NIH-funded investigators (and others), but some restrict data submission to only those researchers involved in a specific research network. Also included are resources that serve as a portal for information about biomedical data and information sharing systems. The table can be sorted by repository name and by NIH Institute or Center and may be searched using keywords so that you can find repositories more relevant to your data. Links are provided to information about submitting data to and accessing data from the listed repositories. Also, we have asked all repositories to indicate whether or not they have— current NIH funding support; open data submission; open data access; open time frame for data submission; and sustained support. Their Yes or No responses are included in the five columns to the right of the table. The information in the BMIC list is also available in a downloadable Excel version.

In our query to the repositories, we acknowledged that some repositories may have restrictions or limitations on data submission and requirements for data access. Additional information about the repositories and points-of-contact for further information or inquiries can be found on the websites or links shown for each. Are we missing a data sharing repository that should be included in this list? Do you have comments or feedback on this list or the website? Contact us.

 

ICO Repository Name Repository Description Data Submission Policy Access to Data Current NIH funding support Open data submission Open data access Open time frame for data deposit Sustained support
Common Fund Epigenomics Epigenomic, 6 histone modification marks, DNAse I, DNA methylation, transcriptome for wide variety of cell types and tissues. Not applicable How to access Epigenomics data No No Yes No Yes
Common Fund exRNA Atlas Includes exRNA profiles derived from various biofluids and conditions and currently stores data profiled from small RNA sequencing assays. Not applicable How to access exRNA Atlas data Yes No Yes No Yes
Common Fund GTEx The Genotype-Tissue Expression (GTEx) project aims to provide to the scientific community a resource with which to study human gene expression and regulation and its relationship to genetic variation. This project will collect and analyze multiple human tissues from donors who are also densely genotyped, to assess genetic variation within their genomes. By analyzing global RNA expression within individual tissues and treating the expression levels of genes as quantitative traits, variations in gene expression that are highly correlated with genetic variation can be identified as expression quantitative trait loci, or eQTLs. Project data only How to access GTEx data Yes No Yes No Yes
Common Fund HMP DACC The HMP DACC is a common repository for diverse human microbiome datsets and minimum reporting standards for the Common Fund Human Microbiome Project (HMP). Project data only How to access HMP DACC data No No Yes No Yes
Common Fund Illuminating Druggable Genome The goal of the Illuminating the Druggable Genome (IDG) program is to improve our understanding of the properties and functions of proteins that are currently unannotated within the three most commonly drug-targeted protein families: G-protein coupled receptors, ion channels, and protein kinases. Project data only How to access IDG data Yes No Yes Yes Yes
Common Fund International Mouse Phenotyping Consortium (IMPC) Phenotype data on knockout mouse lines. How to submit data to IMPC How to access IMPC data Yes No Yes Yes Yes
Common Fund Kids First The Common Fund’s Gabriella Miller Kids First Pediatric Research Program (Kids First) is developing a large-scale data resource to help researchers uncover new insights into the biology of childhood cancer and structural birth defects, including the discovery of shared genetic pathways between these disorders. Childhood cancers and structural birth defects have profound, lifelong effects on patients and their families. The Kids First program will stimulate research in these areas and work towards providing a better understanding of how genetics plays a role in these conditions that will lead to early detection, therapeutic interventions, and toward more effective prevention. Project data only How to access Kids First data Yes No Yes No Yes
Common Fund LINCS The Library of Integrated Network-Based Cellular Signatures (LINCS) Program aims to create a network-based understanding of biology by cataloging changes in gene expression and other cellular processes that occur when cells are exposed to a variety of perturbing agents. Project data only How to access LINCS data Yes No Yes Yes Yes
Common Fund Metabolomics Workbench (MetWB) The Metabolomics Program's Data Repository and Coordinating Center (DRCC), housed at the San Diego Supercomputer Center (SDSC), University of California, San Diego, has developed the Metabolomics Workbench. MetWB will serve as a national and international repository for metabolomics data and metadata and will provide analysis tools and access to metabolite standards, protocols, tutorials, training, and more. How to submit data to MetWB How to access MetWB data Yes Yes Yes Yes Yes
Common Fund SOBC Measures Repository The SOBC measures repository is the source for behavioral science measures that have been validated (or are in the process of being validated) in accordance with the SOBC Experimental Medicine Approach. How to submit data to SOBC How to access SOBC data Yes No Yes No No
Common Fund (NCCIH, NCI, NHLBI, NIA, NIAMS, NIDA, NIDCD, NIDDK, NIMH, NINDS, NINR, & OD) PROMIS The Patient-Reported Outcomes Measurment Informmation System (PROMIS) is a set of person-centered measures that evaluates and monitors physical, mental, and social health in adults and children.  Project data only How to access PROMIS data Yes No Yes No No
NCI Cancer Nanotechnology Laboratory (caNanoLab) caNanoLab is a data sharing portal designed to facilitate information sharing in the biomedical nanotechnology research community to expedite and validate the use of nanotechnology in biomedicine. caNanoLab provides support for the annotation of nanomaterials with characterizations resulting from physico-chemical, in vitro, and in vivo assays and the sharing of these characterizations and associated nanotechnology protocols in a secure fashion. How to submit data to caNanoLab How to access caNanoLab data Yes Yes Yes Yes Yes
NCI The Cancer Imaging Archive (TCIA) TCIA is a service which de-identifies and hosts a large archive of medical images of cancer accessible for public download. The data are organized as “Collections”, typically patients related by a common disease (e.g. lung cancer), image modality (MRI, CT, etc) or research focus. DICOM is the primary file format used by TCIA for image storage. Supporting data related to the images such as patient outcomes, treatment details, genomics, pathology, and expert analyses are also provided when available. How to submit data to TCIA How to access TCIA data Yes Yes Yes Yes Yes
NCI The Network Data Exchange (NDEx) NDEx is an online commons where scientists can upload, share, and publicly distribute biological networks and pathway models. The NDEx Project maintains a web-accessible public server, a documentation website, provides seamless connectivity to Cytoscape as well as programmatic access using a variety of languages including Python and Java. NDEx  users can easily create accounts or sign in using their Google credentials thanks to  the supported open authentication (OAUTH2) method  and mint DOIs for their networks to use in publications or include in other resources for long term access.  How to submit data to NDEx How to access NDEx data Yes Yes Yes Yes Yes
NCI (NHGRI, NIGMS) PeptideAtlas PeptideAtlas is a multi-organism, publicly accessible compendium of peptides identified in a large set of tandem mass spectrometry proteomics experiments. Mass spectrometer output files are collected for human, mouse, yeast, and several other organisms, and searched using the latest search engines and protein sequences. How to submit data to PeptideAtlas How to access Peptide Atlas data No yes Yes Yes No
NEI EyeGENE® The eyeGENE® Biorepository and corresponding Database contain family history and clinical eye exam data from subjects enrolled in eyeGENE® Program coupled to clinical grade DNA samples. This data and samples are submitted by collaborators throughout the US and Canada and the data is available on a controlled access basis to researchers world-wide. How to submit data to EyeGENE® How to access EyeGENE® data Yes Yes Yes Yes Yes
NEI NEI Data Commons The NEI Commons is a virtual infrastructure to enable sharing. The Commons portal provides a platform for querying and accessing vision research data and tools for data processing and analysis. It is the central location for NEI generated clinical and basic research data available to the public. Dependent on sub-project How to access NEI Commons data Yes Yes Yes Yes Yes
NHGRI FlyBase: A Drosophila Genomic and Genetic Database Drosophila Genomic and Genetic database that includes proteomics data, microarrays and Tiling BAC's. How to submit data to Flybase How to access Flybase data Yes Yes Yes Yes Yes
NHGRI The Zebrafish Model Organism Database (ZFIN) ZFIN serves as the zebrafish model organism database. It aims to: a) be the community database resource for the laboratory use of zebrafish, b) develop and support integrated zebrafish genetic, genomic and developmental information, c) maintain the definitive reference data sets of zebrafish research information, d) to link this information extensively to corresponding data in other model organism and human databases, e) facilitate the use of zebrafish as a model for human biology, and f) serve the needs of the research community. How to submit data to ZFIN How to access ZFIN data Yes Yes Yes Yes Yes
NHGRI WormBase WormBase is an international consortium of biologists and computer scientists dedicated to providing the research community with accurate, current, accessible information concerning the genetics, genomics and biology of C. elegans and related nematodes. How to submit data to WormBase How to access WormBase data Yes Yes Yes Yes Yes
NHGRI (NICHD, NCI) Mouse Genome Informatics (MGI) MGI is the international database resource for the laboratory mouse, providing integrated genetic, genomic, and biological data to facilitate the study of human health and disease. How to submit data to MGI How to access MGI data Yes Yes Yes Yes Yes
NHGRI/NIGMS The Universal Protein Resource (UniProt) The Universal Protein Resource (UniProt) is a comprehensive resource for protein sequence and annotation data. The UniProt databases are the UniProt Knowledgebase (UniProtKB), the UniProt Reference Clusters (UniRef), and the UniProt Archive (UniParc). How to submit data to UniProt How to access UniProt data Yes Yes Yes Yes Yes
NHLBI Biologic Specimen and Data Repository Information Coordinating Center (BioLINCC) The goal of BioLINCC is to facilitate and coordinate the existing activities of the NHLBI Biorepository and the Data Repository and to expand their scope and usability to the scientific community through a single web-based user interface. How to submit data to BioLINCC How to access BioLINCC data Yes Yes Yes Yes Yes
NHLBI National Sleep Research Resource The NSRR web platform enables sharing of physiological signals and clinical data elements from well-characterized, de-identified,  research cohorts and clinical trials.  Data queries across studies (data pooling) are possible for demographics and data variables with shared common data elements.   How to submit data to NSRR How to access NSRR data Yes Yes Yes Yes Yes
NHLBI Rat Genome Database (RGD) The Rat Genome Database (RGD) was established in 1999 and is the premier site for genetic, genomic, phenotype, and disease data generated from rat research. In addition, it provides easy access to corresponding data for human and mouse, as well as multiple other models such as chinchilla and 13-lined ground squirrel, facilitating cross-species comparisons. RGD’s comprehensive data and innovative software tools make it a valuable resource for researchers worldwide. How to submit data to RGD How to access RGD data Yes Yes Yes Yes Yes
NHLBI The Cardiovascular Research Grid (CVRG) The CardioVascular Research Grid (CVRG) project is creating an infrastructure for secure seamless access to study data and analysis tools.

CVRG tools are developed using the Software as a Service model, allowing users to access tools through their browser, thus eliminating the need to install and maintain complex software. The CVRG project is supported by the National Heart Lung & Blood Institute. The project is based at the Institute for Computational Medicine at the Johns Hopkins University, Department of Biomedical Informatics at Vanderbilt University Medical Center, The College of Computing and Informatics at UNC Charlotte, The Center for Comprehensive Informatics at Emory University, The College of Engineering and Applied Sciences at Stony Brook University, and the Computation Institute at The University
of Chicago.
No new data submission, but one could submit new tools.   How to access CVRG data No No Yes No No
NIA AMP-AD Knowledge Portal The AMP-AD Knowledge Portal is an NIH-designated repository and the distribution site for multi-omic data from human samples, cell-based and animal models, analysis results, analytical methodology and research tools generated by  multiple National Institute of Aging supported Alezheimer's disease research programs and consortia .  Data is available to qualified investigators as open or controlled access depending on the data type and data source.  How to submit data to AMP-AD Knowledge Portal How to access AMP-AD Knowledge Portal data Yes Yes Yes Yes Yes
NIA National Archive of Computerized Data on Aging (NACDA) NACDA acquires and preserves data relevant to gerontological research, processing as needed to promote effective research use, disseminates them to researchers, and facilitates their use. By preserving and making available the largest library of electronic data on aging in the United States, NACDA offers opportunities for secondary analysis on major issues of scientific and policy relevance. How to submit data to NACDA How to access NACDA data Yes Yes Yes Yes Yes
NIA The National Institute on Aging Genetics of Alzheimer’s Disease Data Storage Site (NIAGADS)  The National Institute on Aging Genetics of Alzheimer's Disease Data Storage Site (NIAGADS) is a national genetics data repository facilitating access to genotypic and phenotypic data for Alzheimer's disease (AD). Data include GWAS, whole genome (WGS) and whole exome (WES), expression, RNA Seq, and CHIP Seq analyses. Data for the Alzheimer’s Disease Sequencing Project (ADSP) are available through a partnership with dbGaP (ADSP at dbGaP). Results are integrated and annotated in the searchable genomics database that also provides access to a variety of software packages, analytic pipelines, online resources, and web-based tools to facilitate analysis and interpretation of large-scale genomic data. Data are available as defined by the NIA Genomics of Alzheimer’s Disease Sharing Policy and the NIH Genomics Data Sharing Policy. Investigators return secondary analysis data to the database in keeping with the NIAGADS Data Distribution Agreement. How to submit data to NIAGADS How to access NIAGADS data Yes Yes Yes Yes Yes
NIAID Eukaryotic Pathogen Database Resources (EuPathDB) EuPathDB Bioinformatics Resource Center for Biodefense and Emerging/Re-emerging Infectious Diseases is a portal for accessing genomic-scale datasets associated with the eukaryotic pathogens. How to submit data to EuPathDB How to access EuPathDB data Yes Yes Yes Yes Yes
NIAID Immune Epitope Database and Analysis Resource (IEDB) This repository contains antibody/B cell and T cell epitope information and epitope prediction and analysis tools for use by the research community worldwide. Immune epitopes are defined as molecular structures recognized by specific antigen receptors of the immune system, namely antibodies, B cell receptors, and T cell receptors. Immune epitopes from infectious diseases, excluding HIV, and immune-mediated diseases and the accompanying biological information are included. How to submit data to IEDB How to access IEDB data Yes Yes Yes Yes Yes
NIAID Influenza Research Database (IRD) The Influenza Research Database (IRD) serves as a public repository and analysis platform for flu sequence, experiment, surveillance and related data. How to submit data to IRD How to access IRD data Yes Yes Yes Yes Yes
NIAID ITN TrialShare TrialShare is a transformative approach to data-sharing that enables clinical trial transparency. TrialShare provides clinical trial investigators the unprecedented ability to access de-identified study data, review published analysis methods, and perform real-time, interactive graphical analyses in collaboration with other researchers. Immune Tolerance Network (ITN) TrialShare shares information about ITN's clinical studies and specimen bio-repository. Data and analysis code underlying ITN-published manuscripts are publicly available with the goal of promoting transparency, reproducibility, and scientific collaboration. How to submit data to TrialShare How to access TrialShare data Yes Yes Yes Yes Yes
NIAID Pathosystems Resource Integration Center (PATRIC) PATRIC, the Bacterial Bioinformatics Resource Center, supports research on bacterial infectious diseases by serving as a repository of genomic and other data with associated metadata for over 100,000 bacterial genomes. PATRIC provides an integrated suite of computational services and visualizations for users to analyze and compare their own data in a private workspace with the public data in PATRIC. How to submit data to PATRIC How to access PATRIC data Yes Yes Yes Yes Yes
NIAID The Immunology Database and Analysis Portal (ImmPort) ImmPort is a public data sharing repository funded by DAIT. The data sources are primarily DAIT-funded clinical trials, associated mechanistic studies, and other basic and applied immunology research programs. How to submit data to ImmPort How to access ImmPort data Yes Yes Yes Yes Yes
NIAID VectorBase VectorBase is a Bioinformatics Resource Center for invertebrate vectors. It is one of four Bioinformatics Resource Centers funded by NIAID to provide web-based resources to scientific community conducting basic and applied research on organisms considered potential agents of biowarfare or bioterrorism or causing emerging or re-emerging diseases. How to submit data to VectorBase How to access VectorBase data Yes Yes Yes Yes Yes
NIAID Virus Pathogen Research (ViPR) The Virus Pathogen Resource (ViPR) provides a searchable public repository of genomic, proteomic and other important research data for more than 500,000 strains of pathogenic viruses along with a suite of tools for analyzing these data.  Data can be shared, aggregated, analyzed using ViPR tools, and downloaded for local analysis. How to submit data to ViPR How to access ViPR data Yes Yes Yes Yes Yes
NIBIB LONI Database The LONI Image Data Archive (IDA) is a user-friendly environment for archiving, searching, sharing, tracking and disseminating neuroimaging and related clinical data. The IDA is utilized for dozens of neuroimaging research projects across North America and Europe and accommodates MRI, PET, MRA, DTI and other imaging modalities. How to submit data to LONI [PDF] How to access LONI data [PDF] Yes Yes Yes Yes Yes
NIBIB Medical Information Mart for Intensive Care-III (MIMIC-III) MIMIC-III is a large, freely-available database comprising deidentified health-related data associated with over forty thousand patients who stayed in critical care units of the Beth Israel Deaconess Medical Center between 2001 and 2012. The database includes information such as demographics, vital sign measurements made at the bedside (~1 data point per hour), laboratory test results, procedures, medications, caregiver notes, imaging reports, and mortality (both in and out of hospital). MIMIC-III supports a diverse range of analytic studies spanning epidemiology, clinical decision-rule improvement, and electronic tool development. How to submit data of MIMIC-III How to access MIMIC-III data Yes Yes Yes Yes  Yes
NIBIB (NIMH, NIDA) NeuroImaging Tools and Resources Collaboratory (NITRC) The NeuroImaging Tools and Resources Collaboratory (NITRC) provides free access to data and enables pay-per-use cloud-based access to unlimited computing power, enabling worldwide scientific collaboration with minimal startup and cost. With NITRC and its components—the Resources Registry (NITRC-R), Image Repository (NITRC-IR), and Computational Environment (NITRC-CE)—a researcher can obtain pilot or proof-of-concept data to validate a hypothesis for a few dollars. How to submit data to NITRC How to access NITRC data Yes Yes Yes Yes Yes
NICHD Biospecimen Repository Access and Data Sharing (BRADS) The Division of Intramural Population Health Research (DIPHR) of the Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD) maintains an extensive repository of datasets from completed studies, including biospecimens and ancillary data. Not applicable How to access BRADS data Yes No No No Yes
NICHD Child Language Data Exchange System (CHILDES) CHILDES is a system for sharing and analyzing conversational interactions. How to submit data to CHILDES How to access CHILDES data Yes Yes Yes Yes Yes
NICHD Data and Specimen Hub (DASH) NICHD DASH is a centralized resource for researchers to store and access de-identified data from NICHD funded research studies for the purposes of secondary research use. It serves as a mechanism for NICHD-funded extramural and intramural investigators to share research data from studies in accordance with the NIH Data Sharing Policy and the NIH Genomic Data Sharing Policy. How to submit data to DASH How to access DASH data Yes Yes Yes Yes Yes
NICHD Data Sharing for Demographic Research (DSDR) DSDR is a data sharing project providing curation and archiving services for the demographic and population sciences community.  It focuses on data collected through funding from the NICHD Population Dynamics Branch (PDB), but also provides these services for other data sets that fall within the scientific mission of PDB. How to submit data to DSDR How to access DSDR data Yes Yes Yes Yes Yes
NICHD National Children’s Study (NCS) Archive The NCS Archive, a data and sample repository for the National Children’s Study, provides access to data and samples collected from over 5,600 U.S. birth families to study environmental influences on child health and development. Data and biological and environmental samples are freely available, with an approved request, for scientific research. How to submit data to NCS How to access NCS data Yes Yes Yes Yes Yes
NICHD PhonBank PhonBank is an open database for the study of early phonological development using the Phon program. How to submit data to PhonBank How to access PhonBank data Yes Yes Yes Yes Yes
NICHD Xenbase Xenbase is a Xenopus laevis and Xenopus tropicalis biology and genomics resource. How to submit data to Xenbase How to access Xenbase data Yes Yes Yes Yes Yes
NICHD (NINDS, & NIBI) Archive of Data on Disability to Enable Policy (ADDEP) ADDEP provides access to data including a wide range of topics related to disability. ADDEP data can be used to better understand and inform the implementation of the Americans with Disabilities Act and other disability policies. How to submit data to ADDEP How to access ADDEP data Yes Yes Yes Yes No
NIDA Mouse Phenome Database (MPD) The Mouse Phenome Database (MPD) enables the integration of genomic and phenomic data by providing access to primary experimental data, well-documented data collection protocols and analysis tools. Data are contributed by investigators from around the world and represent a broad scope of behavioral, morphological and physiological disease-related characteristics in naive mice and those exposed to drugs, environmental agents or other treatments. How to submit data to MPD How to access MPD data Yes Yes Yes Yes Yes
NIDA National Addiction & HIV Data Archive Program (NAHDAP) NAHDAP acquires, preserves and disseminates data relevant to drug addiction and HIV research. By preserving and making available an easily accessible library of electronic data on drug addiction and HIV infection in the United States, NAHDAP offers scholars the opportunity to conduct secondary analysis on major issues of social and behavioral sciences and public policy. How to submit data to NAHDAP How to access NAHDAP data Yes Yes Yes Yes Yes
NIDA Neuroscience Information Framework (NIF) Discovery Portal: An innovative semantically-enhanced search engine of the largest collection of neuroscience information that allows users to customize and refine their searches via NIF's terminologies.
The NIF Registry: a core resource of NIF, is a catalog of electronic resources that have been selected by NIF curators, or contributed by the community, as valuable tools for researchers and students in the field of neuroscience.
Date Sharing service: NIF maintains the largest searchable collection of neuroscience data, the largest catalog of biomedical resources, and the largest ontology for neuroscience on the web. NIF's already established and extensive infrastructure makes sharing your data easy.
LinkOut Broker: NIF provides links between PubMed articles and your data when you register to the Data Federation through the LinkOut Broker. To enable this feature, your data must include PubMed IDs.
Ontology Engineering: Expertise and experience in building and enhancing the main terminologies and ontologies.
How to submit data to NIF How to access NIF data Yes Yes Yes Yes Yes
NIDA NIDA Center for Genetics Studies Blood and other biospecimens along with phenotypic data collected, archived and stored. How to submit data to the NIDA Center for Genetics Studies How to access NIDA Center for Genetics Studies data Yes Yes No Yes Yes
NIDA NIDA Data Share The NIDA Data Share web site is an electronic environment that allows data from completed clinical trials to be distributed to investigators and the public in order to promote new research, encourage further analyses, and disseminate information to the community.  Data download only.  Data is submitted through NIDA POC How to access NIDA Data Share data Yes No Yes No Yes
NIDCD AphasiaBank AphasiaBank is a shared database of multimedia interactions for the study of communication in aphasia. Access to the data in AphasiaBank is password protected and restricted to members of the AphasiaBank consortium group. How to submit data to AphasiaBank How to access AphasiaBank data Yes Yes Yes Yes Yes
NIDCD FluencyBank FluencyBank is a shared database for the study of the development of fluency in both normal and disordered populations. Participants include normally-developing monolingual and bilingual children, children with disfluencies (CWD), adults with disfluencies (AWSD), and second language learners. How to submit data to FluencyBank How to access FluencyBank data Yes Yes Yes Yes Yes
NIDDK NIDDK Central Repository The NIDDK Central Repository stores biosamples, genetic and other data collected in designated NIDDK-funded clinical studies. The purpose of the NIDDK Central Repository is to expand the usefulness of these studies by allowing a wider research community to access data and materials beyond the end of the study. How to submit data to NIDDK Central Repository How to access NIDDK Central Repository data Yes Yes Yes Yes Yes
NIDDK NIDDK Information Network (DKnet) The NIDDK Information Network serves the needs of basic and clinical investigators by providing seamless access to large pools of data relevant to the mission of NIDDK. The goal of DKnet is to develop a community-based network for integration across disciplines to include the larger DK universe of diseases, investigators, and potential users. How to submit data to DKnet How to access DKnet data Yes Yes Yes Yes Yes
NIDDK Nuclear Receptor Signaling Atlas (NURSA) The Nuclear Receptor Signaling Atlas (NURSA) is designed to foster the development of a comprehensive understanding of the structure, function, and role in disease of nuclear receptors (NRs) and coregulators. NURSA seeks to elucidate the roles played by NRs and coregulators in metabolism and the development of metabolic disorders (including type 2 diabetes, obesity, osteoporosis, and lipid dysregulation), as well as in cardiovascular disease, oncology, regenerative medicine and the effects of environmental agents on their actions. How to submit data to NURSA How to access NURSA data Yes Yes Yes Yes Yes
NIDDK The AMP-T2D Knowledge Portal The AMP-T2D Knowledge Portal is the NIDDK-designated repository being developed as part of the Accelerating Medicines Partnership (AMP), a public-private partnership between the National Institutes of Health (NIH), the U.S. Food and Drug Administration (FDA), 10 biopharmaceutical companies, and multiple non-profit organizations that is managed through the Foundation for the NIH (FNIH) for collection of various types of molecular data such as DNA sequences, functional and epigenomic information, and clinical data from studies on type 2 diabetes and its macro- and microvascular complications from human, cell-based and animal model biosamples, analytical results and research tools towards improving current efforts to develop new therapies for complex, heterogeneous diseases, including type 2 diabetes. How to submit data to the AMP-T2D Knowledge Portal How to access AMP-T2D Knowledge Portal data Yes No Yes Yes Yes
NIEHS Chemical Effects in Biological Systems (CEBS) The CEBS database houses data of interest to environmental health scientists. CEBS is a public resource, and has received depositions of data from academic, industrial, and governmental laboratories. CEBS is designed to display data in the context of biology and study design, and permit data integration across studies for novel meta analysis. How to submit data to CEBS How to access CEBS data Yes Yes Yes Yes Yes
NIGMS Cell Image Library The Cell Image Library, accepts image data sets that are too large for publishers to store, and provides access to the biomedical community. There are 10,000 datasets in 20TB of uploaded data as of mid-2018.  The  library inherits data from the Cell Centered Database at UCSD.  Since its launch in 2010, the site has had 721,00 visitors and  was cited by 175 research publications. How to submit data to Cell Image Library How to access Cell Image Library data Yes Yes Yes Yes Yes
NIGMS Database of Interacting Proteins (DIP) The DIP database, a founding member of the International Molecular Exchange Consortium (IMEx: https://www.imexconsortium.org),  catalogs experimentally determined interactions between proteins. It combines information from a variety of sources to create a single, consistent set of protein-protein interactions. The data stored within the DIP database are manually curated and include, when available, detailed information on the interacting regions, mutations and post-translational modifications that affect each interaction. How to submit data to DIP How to access DIP data Yes Yes Yes Yes Yes
NIGMS PhysioNet The PhysioNet Resource is intended to stimulate current research and new investigations in the study of complex biomedical and physiologic signals. It offers free web access to large collections of recorded physiologic signals (PhysioBank) and related open-source software (PhysioToolkit). How to submit data to PhysioNet How to access PhysioNet data Yes Yes Yes Yes Yes
NIGMS Transporter Classification Database (TCDB) The Transporter Classification Database details a comprehensive classification system for membrane transport proteins known as the Transporter Classification (TC) system. The TC system is analogous to the Enzyme Commission (EC) system for classification of enzymes, except that it incorporates both functional and phylogenetic information. Descriptions, TC numbers, and examples of over 600 families of transport proteins are provided. How to submit data to TCDB How to access TCDB data Yes No Yes No Yes
NIGMS (NCI, NSF, DOE-BER) Protein Data Bank (PDB) The mission of the RCSB Protein Data Bank (PDB) is to Sustain a unique data resource of three-dimensional  biomolecular structure
information.
How to submit data to PDB How to access PDB data Yes Yes Yes Yes No
NIH Biological General Repository for Interaction Datasets (BioGRID) BioGRID is a curated biological database of protein-protein interactions, genetic interactions, chemical interactions, and post-translational modifications from major model organism species. All interaction data are freely provided through our search index and available via download in a wide variety of standardized formats. How to submit data to BioGRID How to access BioGRID data Yes Yes Yes Yes Yes
NIH (NIA, NICHD, NIDA) Inter-university Consortium for Political and Social Research (ICPSR) ICPSR provides leadership and training in data access, curation, and methods of analysis for the social science research community.
ICPSR maintains a data archive of more than 250,000 files of research in the social and behavioral sciences.
ICPSR collaborates with a number of funders, including U.S. statistical agencies and foundations, to create thematic data collections and data stewardship and research projects. NICHD (DSDR), NIA (NACDA), and NIDA (NAHDAP) currently support specialty archives that use the ICPSR infrastructure. 
How to submit data to ICPSR How to access ICPSR data Yes Yes Yes Yes Yes
NIMH NIMH Data Archive The National Institute of Mental Health Data Archive (NDA) makes available human subjects data collected from hundreds of research projects across many scientific domains. The NDA provides infrastructure for sharing research data, tools, methods, and analyses enabling collaborative science and discovery. De-identified human subjects data, harmonized to a common standard, are available to qualified researchers.  Summary data is available to all. How to submit data to the NIMH Data Archive How to access NIMH Data Archive data Yes No Yes Yes Yes
NIMH NIMH Repository and Genomics Resources (NRGR) The NIMH Repository and Genomics Resource (NRGR) is the largest biorepository supporting genomics in psychiatry, providing access to biomaterials (DNA, plasma, RNA, lymphoblastoid cell lines, induced pluripotent stem cells, etc.) and phenotypic data collected from nearly 200,000 well-characterized, high-quality control and patient samples from a wide-range of mental illnesses across ancestrally diverse populations. The NRGR also provides access to some genomic data sets, including some which do not have biosamples available in the repository, such as data sets generated by the Psychiatric Genomics Consortium, the CommonMind Consortium, and the PsychENCODE consoritum.  How to submit data to NRGR How to access NRGR data Yes Yes Yes Yes Yes
NIMH OpenfMRI The OpenfMRI database is a curated public repository of human and non-human brain imaging data collected using MRI technique (potentially with additional PET, EEG and MEG data). No registration or license agreement is required to obtain the data, which is distributed, by default, using a Public Domain license. This is possible because data are anonymized before distribution to protect the confidentiality of participants. How to submit data to OpenfMRI How to access OpenfMRI data Yes Yes Yes Yes Yes
NINDS Federal Interagency Traumatic Brain Injury Research (FITBIR) Informatics System The Federal Interagency Traumatic Brain Injury Research (FITBIR) informatics system was developed to share data across the entire TBI research field and to facilitate collaboration between laboratories, as well as interconnectivity with other informatics platforms. Sharing data, methodologies, and associated tools, rather than summaries or interpretations of this information, can accelerate research progress by allowing re-analysis of data, as well as re-aggregation, integration, and rigorous comparison with other data, tools, and methods. This community-wide sharing requires common data definitions and standards, as well as comprehensive and coherent informatics approaches. How to submit data to FITBIR How to access FITBIR data Yes No Yes Yes Yes
NINDS NeuroMorpho.org NeuroMorpho.Org is a centrally curated inventory of digitally reconstructed neurons associated with peer-reviewed publications. It contains contributions from over 400 laboratories worldwide and is continuously updated as new morphological reconstructions are collected, published, and shared. To date, NeuroMorpho.Org is the largest collection of publicly accessible 3D neuronal reconstructions and associated metadata.

The goal of NeuroMorpho.Org is to provide dense coverage of available reconstruction data for the neuroscience community. Data sharing through NeuroMorpho.Org enables the full and continuing research potential of existing digital reconstruction data.
How to submit data to NeuroMorpho How to access NeuroMorpho data  Yes Yes Yes Yes Yes
NINDS Parkinson’s Disease Biomarkers Program Data Management Resource (PDBP DMR) The NINDS Parkinson’s Disease (PD) Biomarkers Program Data Management Resource enables web-based data entry for clinical studies supporting PD biomarker development, as well as broad data sharing (imaging, clinical, genetic, and biospecimen analysis) across the entire PD research community. The PDBP DMR coordinates information and access to PD biospecimens distributed through the NINDS Human Genetics, DNA, iPSC , Cell Line and Biospecimen Repository and the Harvard Neurodiscovery Initiative. How to submit data to PDBP DMR How to access PDBP DMR data          
NINDS The NINDS Human Cell and Data Repository (NHCDR) Cell sources currently include fibroblasts and/or induced pluripotent stem cells for Alzheimer’s Disease, Amyotrophic Lateral Sclerosis (ALS), Ataxia-telangiectasia, Frontotemporal Lobar Degeneration (FTD), Huntington’s Disease, Parkinson’s Disease, and healthy controls. Cell sources, including isogenic cell lines for current and new diseases covered by the NINDS will be added over the next several years. How to submit data to NHCDR How to access NHCDR data          
NLM ClinicalTrials.gov ClinicalTrials.gov is a registry and results database of publicly and privately supported clinical studies of human participants conducted around the world. How to submit data to ClinicalTrials.gov How to access ClinicalTrials.gov data Yes Yes Yes Yes Yes
NLM database of Genotypes and Phenotypes (dbGaP) The database of Genotypes and Phenotypes (dbGaP) was developed to archive and distribute the data and results from studies that have investigated the interaction of genotype and phenotype in Humans. How to submit data to dbGaP How to access dbGaP data No Yes Yes Yes Yes
NLM dbSNP dbSNP is a general public archive of all short sequence variation, not just single nucleotide substitutions that occur frequently enough in a population to be termed polymorphic, but also rare variants, including those with clinical assertions in ClinVar. dbSNP includes a broad collection of simple genetic variations such as single-base nucleotide substitutions, small-scale multi-base deletions or insertions, and microsatellite repeats. How to submit data to dbSNP How to access dbSNP data Yes Yes Yes Yes Yes
NLM dbVar dbVar is a NCBI database of genomic structural variation. How to submit data to dbVar How to access dbVar data Yes Yes Yes Yes Yes
NLM GenBank GenBank is the NIH genetic sequence database, an annotated collection of all publicly available DNA sequences. GenBank is designed to provide and encourage access within the scientific community to the most up to date and comprehensive DNA sequence information. How to submit data to GenBank  How to access GenBank data Yes Yes Yes Yes Yes
NLM Gene Expression Omnibus (GEO) Gene Expression Omnibus is a public functional genomics data repository supporting MIAME-compliant submissions of array- and sequence-based data. Tools are provided to help users query and download experiments and curated gene expression profiles. How to submit data to GEO How to access GEO data Yes Yes Yes Yes Yes
NLM PubChem PubChem provides information on the biological activities of small molecules. PubChem is organized as three linked databases (bioassay, compound, and substance data) within the NCBI's Entrez information retrieval system. How to submit data to PubChem How to access PubChem data Yes Yes Yes Yes Yes
NLM Sequence Read Archive (SRA) The SRA is NIH's primary archive of high-throughput sequencing data and is part of the International Nucleotide Sequence Database Collaboration (INSDC) that includes at the NCBI Sequence Read Archive (SRA), the European Bioinformatics Institute (EBI), and the DNA Database of Japan (DDBJ). Data submitted to any of the three organizations are shared among them. How to submit data to SRA How to access SRA data Yes Yes Yes Yes Yes