

Library Services


Data repositories

A selection of online data repositories, grouped by subject area.

Biochemistry of the Human Body repositories

  • . A repository for electron microscopy density maps of macromolecular complexes and subcellular structures.
  • . Contains detailed information about small molecule metabolites found in the human body.
  • . A database for metabolomics experiments and derived information.

Biomedicine and Health repositories

  • . A database, compiled by the U.S. Food and Drug Administration (FDA), containing information on adverse event and medication error reports submitted to the FDA.
  • . Cancer program datasets from the Broad Institute, MIT.
  • . Weekly U.S. Influenza Surveillance Report.
  • . This page compiles key sources of research data and related information from the NCHS website.    
  • This site provides access to data from listed clinical trials. Researchers can use it to request access to anonymized patient-level data and/or supporting documents to conduct further research.
  • . An online resource that enables researchers to view and appraise data from eight leading UK longitudinal studies.
  • . Allows researchers to upload and work on data relevant to biomarkers of drug toxicity, neurodegenerative diseases, and patient-reported outcomes.
  • . A website set up to allow the uploading and sharing of injury and emergency research data.
  • . A collection of longitudinal hospital care data from the Healthcare Cost and Utilization Project (HCUP), USA.    
  • . A site that provides access to health data from the U.S. government.    
  • . Contains avian and non-human mammalian influenza surveillance data, human clinical data associated with virus extracts, phenotypic characteristics of viruses isolated from extracts, and all genomic and proteomic data available in public repositories for influenza viruses.
  • . A US repository of data relevant to drug addiction and HIV research.    
  • . The Program provides surveillance and research data, statistical reports, and analytical tools on cancer.    
  • . An NIH-funded research data repository that aims to accelerate progress in autism spectrum disorders (ASD) research.
  • . A platform for the sharing of human subjects data from all clinical trials funded by the National Institute of Mental Health (NIMH), USA.
  • . A list of NIH-supported data repositories that make data accessible for reuse.   
  • . Based at Stanford University, this repository allows for deposit and sharing of complete raw fMRI datasets.    
  • . Free access to collections of recorded physiologic signals and related open-source software.


  • . Stores data from high-throughput functional genomics experiments.    
  • (Catalogue of Somatic Mutations in Cancer). Stores and displays somatic mutation information and related details and contains information relating to human cancers.    
  • . A catalogue of structural variation (SV) found in the genomes of control individuals from worldwide populations.
  • . An archive for genetic variation within and across species.    
  • . NCBI's database of genomic structural variation.
  • . Developed to archive and distribute the results of studies that have investigated the interaction of genotype and phenotype.    
  • . Enables analysis and archiving of metagenomic data.
  • . A repository for all types of sequence and genotype experiments.
  • . A database of all types of genetic variation data from all species.    
  • . A genomics data repository supporting MIAME-compliant data submissions.
  • . A database containing phenotypes from RNA interference (RNAi) screens in Drosophila and Homo sapiens.
  • - Global Proteome Machine Database. A database of proteomics experimental information and data.    
  • . A searchable database of published miRNA sequences and annotation.
  • . A database that stores sequence data obtained from next-generation sequencing technology.
  • . A repository of mass spectrometry-derived proteomics data.

Global Health repositories

  • . Contains standards-based quantitative information on government respect for 15 internationally recognized human rights for 202 countries, annually from 1981-2011.
  • . A collection of data on population, health, HIV and nutrition through more than 300 surveys in over 90 countries.
  • . Standardised data sets compiled by the Centre for Research on the Epidemiology of Disasters (CRED).
  • . This registry provides links to all raw data published using the IATI XML standard.
  • . INDEPTH is a global network of research centres that conduct longitudinal health and demographic evaluation of populations in low- and middle-income countries (LMICs). The repository aims to make data from these evaluations available to data users.
  • (Organization for Economic Co-operation and Development) Data. Data relating to a variety of subject areas, including health.
  • . The United Nations Statistical Division (UNSD) data service brings UN statistical databases within easy reach of users through a single entry point.
  • : Monitoring the Situation of Children and Women. UNICEF maintains several databases for tracking the situation of children and women globally. The databases include only statistically sound, nationally representative data from household surveys and other sources. They are updated annually.
  • (Multiple Indicator Cluster Surveys). Provides access to datasets from these household surveys.
  • . Statistics on development in countries around the globe, collated by the World Bank. Consists of a number of datasets including development indicators, debt statistics and trade logistics statistics.
  • . This site includes access to the Global Health Observatory (GHO), Global Health Estimates (GHE), and the WHO Mortality Database.

Medical Image repositories

  • . Contains medical images of cancer available for public download.
  • - a Virtual Skeleton Database. A collection of medical images.
  • . A collection of images with themes ranging from medical and social history to contemporary healthcare and biomedical science.

Neurosciences repositories

  • . A platform for large-scale, automated synthesis of functional magnetic resonance imaging (fMRI) data.
  • . A database of published functional and structural neuroimaging experiments with coordinate-based results (x,y,z) in Talairach or MNI space.
  • (Pooled Resource Open-Access ALS Clinical Trials Database). A large ALS (amyotrophic lateral sclerosis) clinical trials dataset.

Proteins, nucleoproteins, nucleotides, nucleic acids and peptides repositories

  • . An archive of genetic and protein interaction data from model organisms and humans.
  • . A repository for data from NMR spectroscopy on proteins, peptides, nucleic acids and other biomolecules.
  • . Provides nucleotide data and a supercomputer system to support researchers in life science.
  • . This database archives and evaluates experimentally determined interactions between proteins.
  • . A comprehensive record of the world's nucleotide sequencing information, including raw sequencing data.
  • . The NIH genetic sequence database.
  • . A database system and analysis tool for molecular interaction data.
  • . Provides access to information about 3D nucleic acid structures and their complexes.
  • . A compendium of peptides identified in a large set of tandem mass spectrometry experiments.
  • . Archives information about the 3D shapes of proteins, nucleic acids, and complex assemblies.
  • . Contains functional information on proteins.