November 2018
Intermediate to advanced
360 pages
9h 36m
English
Pipelines are fundamental in any data science environment, because data processing is never a single task. Many pipelines are implemented as ad hoc scripts. Such scripts can be useful, but in many cases they fall short on several fundamental criteria: reproducibility, maintainability, and extensibility.
In bioinformatics, you can find three main types of pipeline systems: