November 2018
Intermediate to advanced
360 pages
9h 36m
English
Let's take a look at the following steps:
from collections import defaultdictimport gzipimport scipyimport scipy.stats as statsgermline_file = 'good.match.gz'sample_file = 'integrated_call_samples.20101123.ped'inds = set()ind_pop = {}selected_inds = {}pop_inds = defaultdict(list)with gzip.open(germline_file, 'rt', encoding='utf-8') as f: for l in f: toks = l.rstrip().split() inds.add(toks[1]) inds.add(toks[3])with open(sample_file, 'rt', encoding='utf-8') as f: f.readline() # header for l in f: toks ...