uk_biobank:downloading_the_data

Differences

This shows you the differences between two versions of the page.

uk_biobank:downloading_the_data [2016/02/29 11:22]
scott /* Phenotypic data */
uk_biobank:downloading_the_data [2016/03/15 12:09]
luke /* Quality Control */
Line 35: Line 35:
   -  Genetic data is downloaded following the instructions at [[http://biobank.ctsu.ox.ac.uk/showcase/exinfo.cgi?src=AccessingGeneticData|the UK Biobank site]].   -  Genetic data is downloaded following the instructions at [[http://biobank.ctsu.ox.ac.uk/showcase/exinfo.cgi?src=AccessingGeneticData|the UK Biobank site]].
   -  Scripted downloads of all chromosomes were done using a command such as   -  Scripted downloads of all chromosomes were done using a command such as
-<code>+
 $ seq 1 26 | parallel -j1 ./gfetch cal {} $ seq 1 26 | parallel -j1 ./gfetch cal {}
 $ seq 1 26 | parallel -j1 ./gfetch imp {} $ seq 1 26 | parallel -j1 ./gfetch imp {}
-</code>+
   -  A single sample map (impv1.sample) for the imputed data was also downloaded   -  A single sample map (impv1.sample) for the imputed data was also downloaded
-<code>+
 $ ./gfetch imp 1 -m $ ./gfetch imp 1 -m
-</code> 
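The per-chromosome downloads above could also be driven by a plain shell loop where GNU parallel is not available. This is a minimal sketch, not the script that was actually used: `fetch_all` and the `GFETCH` variable are hypothetical names, and only the `./gfetch cal N` and `./gfetch imp N` invocations come from the commands shown above.

```shell
#!/bin/sh
# Hypothetical helper: run gfetch once per chromosome code, sequentially.
# GFETCH may be overridden (e.g. GFETCH=echo) for a dry run.
GFETCH="${GFETCH:-./gfetch}"

fetch_all() {
    kind="$1"    # "cal" or "imp", as in the commands above
    for chr in $(seq 1 26); do
        # Stop at the first failed download so a gap is not silently skipped.
        "$GFETCH" "$kind" "$chr" || return 1
    done
}

# Example usage:
# fetch_all cal
# fetch_all imp
```

Running with `GFETCH=echo` prints the arguments that would be passed to gfetch without contacting the server, which is a cheap way to check the loop before starting a long download.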
  
 +
 +
 +
 +====== Quality Control ======
 +
 +
 +
 +Using information in the UKB data and in the unimputed Axiom Array genotypes, we identified lists of individuals and positions to exclude.
 +  -  All files can be found on RC at: /work/KellerLab/UKBiobank/genetics/raw/Quality_Control
 +  -  UKB and Affymetrix performed a number of QC analyses to exclude questionable positions and identify individual samples. Additional PDFs from UK Biobank are in /work/KellerLab/UKBiobank/genetics/raw/Quality_Control/UK_Biobank_Axiom_Array
 +  -  Additional Affymetrix and UKB information can be found on their websites: [[http://www.ukbiobank.ac.uk/scientists-3/uk-biobank-axiom-array/|UK Biobank Axiom Array]], [[https://www.jiscmail.ac.uk/cgi-bin/webadmin?A0=UKB-GENETICS|UKB-Genetics Archive]]
 +  -  A list of 1068 individuals to exclude is in Exclude_individuals.poorQC.UKB_Affy_sex.id on RC.
 +  -  A list of 8010 positions to exclude is in duplicate.positions.excludesnps.txt on RC.
 +  -  A README.txt file located on RC contains the steps used and additional information.
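As a quick sanity check, the exclusion lists can be compared against the counts quoted above. This is a minimal sketch that assumes each list holds one entry per line with no header row. The file layout is not documented on this page, so check README.txt first. `count_entries` is a hypothetical helper name, and the files are assumed to sit in the Quality_Control directory on RC.

```shell
#!/bin/sh
# Hypothetical helper: count the non-blank lines in a list file.
# Assumes one entry per line and no header row.
count_entries() {
    grep -c -v '^[[:space:]]*$' "$1"
}

# Example usage, run from the Quality_Control directory:
# count_entries Exclude_individuals.poorQC.UKB_Affy_sex.id   # 1068 if the assumption holds
# count_entries duplicate.positions.excludesnps.txt          # 8010 if the assumption holds
```

A mismatch would suggest either a header line, a different layout, or a file that has changed since these counts were recorded.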
uk_biobank/downloading_the_data.txt · Last modified: 2020/04/22 11:35 by luev6784