This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision Last revision Both sides next revision | ||
homework_6_ancestry [2015/11/15 16:13] scott /* Code for Homework */ |
homework_6_ancestry [2015/11/15 18:17] scott /* Code for Homework */ |
||
---|---|---|---|
Line 29: | Line 29: | ||
### ONLY EXPLAIN COMMANDS WHERE I SPECIFICALLY REQUEST IT! YOU DO NOT | ### ONLY EXPLAIN COMMANDS WHERE I SPECIFICALLY REQUEST IT! YOU DO NOT | ||
### HAVE TO EXPLAIN EVERY COMMAND! But please run all commands to produce | ### HAVE TO EXPLAIN EVERY COMMAND! But please run all commands to produce | ||
- | ### a PCA plot containing yourself compared to all 1000 Genomes samples. | + | ### in the end a PCA plot containing yourself compared to all 1000 Genomes |
+ | ### samples. | ||
# For many questions you'll want to run analyses by chromosome. To do | # For many questions you'll want to run analyses by chromosome. To do | ||
Line 108: | Line 109: | ||
### Retain in the 1000 Genomes VCF only your SNPs that are also fairly common | ### Retain in the 1000 Genomes VCF only your SNPs that are also fairly common | ||
- | ### | + | ### because we're going to conduct PCA on these SNPs and only want common ones. |
- | ###------ QUESTION 3: WHY WOULD WE REMOVE | + | ### |
+ | ###------ QUESTION 3: WHY WOULD WE RETAIN ONLY COMMON SNPS, OTHER THAN IT | ||
### | ### | ||
for i in {1..22}; do | for i in {1..22}; do | ||
Line 269: | Line 271: | ||
### ' | ### ' | ||
- | kg_sf <- read.table('/ | + | kg_sf <- read.table('/ |
sample_ids <- unique(data.frame(IID=kg_sf$SAMPLE_NAME, | sample_ids <- unique(data.frame(IID=kg_sf$SAMPLE_NAME, |