uk_biobank:downloading_the_data
This is an old revision of the document!
These procedures were all derived from the documentation at the UK Biobank.
Phenotypic data
- The phenotype file was downloaded from UK Biobank by the project PI as instructed in the data accessibility email.
- All of the utilities from the UK Biobank download page were retrieved.
- The key, k1234.key was saved from the PI's email.
- This command was run to decrypt the downloaded phenotype file
$ ./ukb_unpack ukb1234.enc k1234.key
which produced the file ukb1234.enc_ukb
- Once decrypted, the following commands were run to extract the data into useful formats
$ ./ukb_conv ukb1234.enc_ukb bulk -eencoding.ukb $ ./ukb_conv ukb1234.enc_ukb docs -eencoding.ukb $ ./ukb_conv ukb1234.enc_ukb r -eencoding.ukb
- bulk is a list of IDs for use with the ukbfetch utility
- docs produces an html file containing documentation of the variables in this dataset
- r produces a tab deliminated file and an R script for labeling and putting levels on the variables.
Genotypic data
uk_biobank/downloading_the_data.1455922169.txt.gz · Last modified: by lessem