Skip to Content
云端基因组学
book

云端基因组学

by Geraldine A. Van der Auwera, Brian D. O’Connor
April 2022
Beginner to intermediate
486 pages
10h 22m
Chinese
China Electric Power Press Ltd.
Content preview from 云端基因组学
撰写可完全复现的论文
441
3.
分享:我们计划在一个
Terra
工作区内完成所有工作,以便我们将其分享给社区,
使研究者可以访问数据、
WDL
工作流和
Jupyter
笔记本。
Terra
平台易分享,有
助于我们践行
FAIR
框架的要求,使我们的案例研究符合
FAIR
框架可查找和可
访问两方面要求。
最后一点,我们为完成该案例研究在
Terra
平台重做原工作还有额外收获:完成该
工作,我们知道自己可以通过轻松克隆开发工作区来创建一个公开工作区,该工作
区将包含所有工作流、笔记本和对数据的引用。因此我们只需最少量工作,就能将
我们的私有开发环境转变为一个完全可复用的环境,供他人使用。
接下来两节,我们将解释如何复现该研究成果。在每个阶段,我们将指明我们在工
作区开发的代码和使用的数据,以便你理解我们正在讨论的内容。本章我们不再让
你运行任何内容,但你可以自行克隆工作区,并尝试运行各种不同工作流和笔记本。
现在,你准备好深入学习细节了吗?系好安全带并发动车子,因为我们将开始讲最
具挑战的部分:创建合成数据集。
14.2
生成合成数据集,替代私有数据
论文所讲原分析涵盖的
829
case
样本和
1252
control
样本,均用外显子组测序
技术生成,可是这部分数据限制访问,我们轻易无法取得授权,故而无法访问该数据。
实际上,即使我们能够访问这部分数据,我们当然也不能将其跟我们计划开发的教
学资源一起发布。拥有可用于测试的数据当然很有用,但无论如何我们需要依靠合
成数据,才能完全实现我们的目标。
但请稍等……你可能要问那具体是什么意思 ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

What Successful Project Managers Do

What Successful Project Managers Do

W. Scott Cameron, Jeffrey S. Russell, Edward J. Hoffman, Alexander Laufer
How to Overcome a Power Deficit

How to Overcome a Power Deficit

Cyril Bouquet, Jean-Louis Barsoux
The Human Factor in AI-Based Decision-Making

The Human Factor in AI-Based Decision-Making

Philip Meissner, Christoph Keding

Publisher Resources

ISBN: 9787519864422