Skip to Content
Bioinformatics with Python Cookbook - Second Edition
book

Bioinformatics with Python Cookbook - Second Edition

by Tiago Antao
November 2018
Intermediate to advanced
360 pages
9h 36m
English
Packt Publishing
Content preview from Bioinformatics with Python Cookbook - Second Edition

Getting ready

You will need PLINK installed. Remember that we are not using a conda environment, so you have to make sure it is available for Airflow. We will define the following tasks:

  1. Downloading data
  2. Uncompressing it
  3. Sub-sampling at 10%
  4. Sub-sampling at 1%
  5. Computing PCA on the 1% sub-sample
  6. Charting the PCA

Our pipeline recipe will have two parts: the actual coding of the pipeline and making the pipeline actually execute.

The code for this can be found on Chapter08/pipelines/airflow/create_tasks.py.

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Bioinformatics with Python Cookbook

Bioinformatics with Python Cookbook

Tiago Antao

Publisher Resources

ISBN: 9781789344691Supplemental Content