Essential information > Understanding UK Biobank > Retrospective timeline of the data currently available

Timeline of additional data made available to researchers


The UK Biobank resource was launched with the data collected at baseline made available in March 2012. The following timeline indicates when additional data was first made available for researchers to download.


DateAdditional data made available
September 2012
  • Four online 24-hour recall diet questionnaires. The data was collected over the period February 2011 - June 2012

March 2013
  • Death and cancer registry data for participants in England and Wales available

December 2013
  • Death and cancer registry data for participants in Scotland

  • Repeat assessment data for 20,000 participants

  • Inpatient hospital data for participants in England

June 2014
  • Nutrient data from the 24-hour diet recall

  • Inpatient hospital data for participants in Scotland

July 2014
  • Physical activity monitor data for 30,000 participants

  • Objective measures of built environment for participants resident in Wales and Greater London Area

July 2015
  • Online occupational health questionnaire

October 2015
  • Imaging data for 5,000 participants

  • Online questionnaire data on cognitive function

March 2016
  • Updated physical activity (accelerometer) data-fields

  • Updated imaging data

  • Updated death and cancer register data

May 2016
  • Updated death and cancer data.

October 2016
  • Updated death, cancer and hospital inpatient data

  • Updated imaging data

February 2017
  • Algorithmically-defined cases of myocardial infarction and stroke

  • Address history data

  • Derived data on body composition from abdominal MRI for some participants

  • Updated imaging data

July 2017
  • Genotyping data for the full cohort

  • Mental health questionnaire data

  • Additional home location coordinates

January 2018
  • Greenspace and coastal proximity data

  • Updated imaging data

  • Updated body composition data from the MRI (for some participants)

March 2018
  • Version 3 of the imputed genetics data

October 2018
  • Online digestive health questionnaire data

  • Indices of Multiple deprivation (IMD) scores

  • 12-lead ECG metrics (imaging assessment)

  • Liver phenotypes from MRI scan (for some participants)

  • Updated imaging data

  • MET score data

  • Returned datasets

March 2019
  • Exome sequencing data for 50,000 participants.

  • Biochemistry assay data - serum and red blood cells assay data for all participants. See here for the list of biomarkers that have been measured.

  • Updated hospital inpatient and death and cancer registry data.

  • Additional summary data fields for hospital inpatient data, including corresponding first recorded diagnosis/procedure date.

  • Algorithmically-derived health outcomes for: Asthma, COPD, Dementia, End stage renal disease, Motor neurone disease, Parkinson's disease.

  • Infectious disease pilot study (10,000 participants).

  • Retinal OCT image slices in PNG format (fields 21017 & 21018 in category 100016).

September 2019
  • Primary care (GP) data for around 45% of the cohort, containing coded clinical data and prescriptions (category 3000).

  • "First occurrence" fields (category 1712) showing the first occurrence of any code mapped to 3-character ICD-10. The data-fields have been generated by mapping:

    • Read code information in the Primary Care data (Category 3000),
    • ICD-9 and ICD-10 codes in the Hospital inpatient data (Category 2000),
    • ICD-10 codes in Death Register records (Field 40001, Field 40002), and
    • Self-reported medical condition codes (Field 20002) reported at the baseline or subsequent UK Biobank assessment centre visit

    to 3-character ICD-10 codes.
  • Freesurfer segmentation data (categories 190-197).

  • The hospital inpatient data has been restructured, and a very small amount of additional data has been added. See Update of HES data - September 2019 for more details.

February 2020
  • Replacement exome sequencing data from the SPB pipeline (~50,000 participants). These will replace Data-fields 23171 - 23174.

  • Imaging data:

    • Additional NIFTI brain images.
    • Additional derived brain imaging data, including data for 10 new Data-fields (Fields 25921 - 25930).
    • Raw carotid ultrasound data (Field 20241).
  • Derived imaging variables:

  • Other data collected at the assessment centre visits:

    • Impedance data from the imaging and repeat imaging visits (Category 100009).
    • Paired associate learning cognitive function test data (Category 506).
  • Data from the online questionnaire on food preferences (Category 1039).

  • Various Returned Results via the Returns Catalogue, including returns with derived individual-level data related to: cardiac measures, eye measures, actigraphy, spirometry, and weather. See here for more details of the returned datasets that have recently been made available.

  • Additional Data-fields for the online cognitive function questionnaire (Category 116) relating to device (Field 23077 and Field 23078) and mood (Category 155).


June 2020
  • Death register data:

    • Additional death register records via Field 40023 and the Data Portal.
    • Updated death data for Field 40007 and Field 40010. Please note these were not made accessible via the Data Portal.

  • UK Biobank started making death records available on the Data Portal to allow researchers to access new data more easily. Researchers who were already approved to access any Data-field in Category 100093 were given access to the death data without needing to submit a new basket containing Field 40023. Please see here for details.

    The death tables on the Data Portal will be updated as new data becomes available, so going forward no basket refreshes will be required to access the most up-to-date death data.

    The algorithmically defined outcome fields in Category 42 and First Occurrences fields (Category 1712) were not updated to take account of the new death data at this Showcase update.


Click here for future timelines.

Improving the health of future generations