Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program

Author:

Taliun Daniel, ,Harris Daniel N.,Kessler Michael D.,Carlson Jedidiah,Szpiech Zachary A.,Torres Raul,Taliun Sarah A. GaglianoORCID,Corvelo AndréORCID,Gogarten Stephanie M.ORCID,Kang Hyun Min,Pitsillides Achilleas N.,LeFaive JonathonORCID,Lee Seung-beenORCID,Tian Xiaowen,Browning Brian L.,Das SayantanORCID,Emde Anne-Katrin,Clarke Wayne E.,Loesch Douglas P.,Shetty Amol C.ORCID,Blackwell Thomas W.,Smith Albert V.ORCID,Wong Quenna,Liu Xiaoming,Conomos Matthew P.ORCID,Bobo Dean M.,Aguet François,Albert Christine,Alonso Alvaro,Ardlie Kristin G.,Arking Dan E.,Aslibekyan Stella,Auer Paul L.,Barnard John,Barr R. Graham,Barwick Lucas,Becker Lewis C.,Beer Rebecca L.,Benjamin Emelia J.,Bielak Lawrence F.,Blangero John,Boehnke Michael,Bowden Donald W.,Brody Jennifer A.,Burchard Esteban G.,Cade Brian E.,Casella James F.,Chalazan Brandon,Chasman Daniel I.,Chen Yii-Der Ida,Cho Michael H.,Choi Seung Hoan,Chung Mina K.,Clish Clary B.,Correa Adolfo,Curran Joanne E.,Custer Brian,Darbar Dawood,Daya Michelle,de Andrade Mariza,DeMeo Dawn L.,Dutcher Susan K.,Ellinor Patrick T.,Emery Leslie S.,Eng Celeste,Fatkin Diane,Fingerlin Tasha,Forer Lukas,Fornage Myriam,Franceschini Nora,Fuchsberger Christian,Fullerton Stephanie M.,Germer Soren,Gladwin Mark T.,Gottlieb Daniel J.,Guo Xiuqing,Hall Michael E.,He Jiang,Heard-Costa Nancy L.,Heckbert Susan R.,Irvin Marguerite R.,Johnsen Jill M.,Johnson Andrew D.,Kaplan Robert,Kardia Sharon L. R.,Kelly Tanika,Kelly Shannon,Kenny Eimear E.,Kiel Douglas P.,Klemmer Robert,Konkle Barbara A.,Kooperberg Charles,Köttgen Anna,Lange Leslie A.,Lasky-Su Jessica,Levy Daniel,Lin Xihong,Lin Keng-Han,Liu Chunyu,Loos Ruth J. F.,Garman Lori,Gerszten Robert,Lubitz Steven A.,Lunetta Kathryn L.,Mak Angel C. Y.,Manichaikul Ani,Manning Alisa K.,Mathias Rasika A.,McManus David D.,McGarvey Stephen T.,Meigs James B.,Meyers Deborah A.,Mikulla Julie L.,Minear Mollie A.,Mitchell Braxton D.,Mohanty Sanghamitra,Montasser May E.,Montgomery Courtney,Morrison Alanna C.,Murabito Joanne M.,Natale Andrea,Natarajan Pradeep,Nelson Sarah C.,North Kari E.,O’Connell Jeffrey R.,Palmer Nicholette D.,Pankratz Nathan,Peloso Gina M.,Peyser Patricia A.,Pleiness Jacob,Post Wendy S.,Psaty Bruce M.,Rao D. C.,Redline Susan,Reiner Alexander P.,Roden Dan,Rotter Jerome I.,Ruczinski Ingo,Sarnowski Chloé,Schoenherr Sebastian,Schwartz David A.,Seo Jeong-Sun,Seshadri Sudha,Sheehan Vivien A.,Sheu Wayne H.,Shoemaker M. Benjamin,Smith Nicholas L.,Smith Jennifer A.,Sotoodehnia Nona,Stilp Adrienne M.,Tang Weihong,Taylor Kent D.,Telen Marilyn,Thornton Timothy A.,Tracy Russell P.,Van Den Berg David J.,Vasan Ramachandran S.,Viaud-Martinez Karine A.,Vrieze Scott,Weeks Daniel E.,Weir Bruce S.,Weiss Scott T.,Weng Lu-Chen,Willer Cristen J.,Zhang Yingze,Zhao Xutong,Arnett Donna K.,Ashley-Koch Allison E.,Barnes Kathleen C.,Boerwinkle Eric,Gabriel Stacey,Gibbs Richard,Rice Kenneth M.,Rich Stephen S.,Silverman Edwin K.,Qasba Pankaj,Gan Weiniu,Papanicolaou George J.,Nickerson Deborah A.,Browning Sharon R.ORCID,Zody Michael C.,Zöllner Sebastian,Wilson James G.,Cupples L. AdrienneORCID,Laurie Cathy C.ORCID,Jaquish Cashell E.ORCID,Hernandez Ryan D.ORCID,O’Connor Timothy D.ORCID,Abecasis Gonçalo R.ORCID

Abstract

AbstractThe Trans-Omics for Precision Medicine (TOPMed) programme seeks to elucidate the genetic architecture and biology of heart, lung, blood and sleep disorders, with the ultimate goal of improving diagnosis, treatment and prevention of these diseases. The initial phases of the programme focused on whole-genome sequencing of individuals with rich phenotypic data and diverse backgrounds. Here we describe the TOPMed goals and design as well as the available resources and early insights obtained from the sequence data. The resources include a variant browser, a genotype imputation server, and genomic and phenotypic data that are available through dbGaP (Database of Genotypes and Phenotypes)1. In the first 53,831 TOPMed samples, we detected more than 400 million single-nucleotide and insertion or deletion variants after alignment with the reference genome. Additional previously undescribed variants were detected through assembly of unmapped reads and customized analysis in highly variable loci. Among the more than 400 million detected variants, 97% have frequencies of less than 1% and 46% are singletons that are present in only one individual (53% among unrelated individuals). These rare variants provide insights into mutational processes and recent human evolutionary history. The extensive catalogue of genetic variation in TOPMed studies provides unique opportunities for exploring the contributions of rare and noncoding sequence variants to phenotypic variation. Furthermore, combining TOPMed haplotypes with modern imputation methods improves the power and reach of genome-wide association studies to include variants down to a frequency of approximately 0.01%.

Publisher

Springer Science and Business Media LLC

Subject

Multidisciplinary

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3