Skip to Content
云端基因组学
book

云端基因组学

by Geraldine A. Van der Auwera, Brian D. O’Connor
April 2022
Beginner to intermediate
486 pages
10h 22m
Chinese
China Electric Power Press Ltd.
Content preview from 云端基因组学
GATK
最佳实践发现种系短变异
167
-I sample_markdups.bam \
--bqsr-recal-file recal_data.table \
-O sample_markdups_recal.bam
该命令生成最终输出结果:一个
BAM
文件(
sample_markdups_recal.bam
),存放
已校正碱基质量值的读段数据,可用于后续分析。
每个样本都要单独执行该处理步骤,但算法本身在不同读段组有差别,因为它跟踪
的很多偏差因读段组而异。初始统计数据的采集可以在不同基因组坐标并行开展,
通常按染色体或染色体批次采集。但如有必要,该工作可进一步拆分以提高通量。
按区域采集的统计数据,无法以并行方式汇总到单个基因组宽度的协变量模型,但
是由于计算量较小不会变成瓶颈。重校正规则的最终应用,最好也像初始统计数据
采集那样,在基因组区域并行处理,再紧跟一步最终的文件合并操作,为每个样本
生成一个适用于后续分析的文件。
6.2
联合发现分析
终于到了真正有趣的部分!我们将识别人群变异。但深入具体做法之前,我们还是
用几分钟讨论为什么这么做。
若要理解种系变异,一个人的基因组的用途很有限。该领域大多数研究问题都将受
益于对多人数据的研究,不管是只有几个人(如调查父母遗传疾病给孩子的情况),
还是更多人(如群体遗传学)。因此从科学立场出发,我们通常想一起分析多个样本。
此外,聚集来自多个样本的数据有其技术优势,主要体现在提高统计能力和降低技
术噪音的影响等方面。
6.2.1
联合变异识别工作流概览 ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

What Successful Project Managers Do

What Successful Project Managers Do

W. Scott Cameron, Jeffrey S. Russell, Edward J. Hoffman, Alexander Laufer
How to Overcome a Power Deficit

How to Overcome a Power Deficit

Cyril Bouquet, Jean-Louis Barsoux
The Human Factor in AI-Based Decision-Making

The Human Factor in AI-Based Decision-Making

Philip Meissner, Christoph Keding

Publisher Resources

ISBN: 9787519864422