GENCODE 2021

Author:

Frankish Adam1ORCID,Diekhans Mark2ORCID,Jungreis Irwin34ORCID,Lagarde Julien5,Loveland Jane E1ORCID,Mudge Jonathan M1,Sisu Cristina67,Wright James C8,Armstrong Joel2,Barnes If1,Berry Andrew1,Bignell Alexandra1,Boix Carles349,Carbonell Sala Silvia5,Cunningham Fiona1ORCID,Di Domenico Tomás10,Donaldson Sarah1,Fiddes Ian T2,García Girón Carlos1ORCID,Gonzalez Jose Manuel1,Grego Tiago1,Hardy Matthew1,Hourlier Thibaut1ORCID,Howe Kevin L1ORCID,Hunt Toby1,Izuogu Osagie G1,Johnson Rory1112ORCID,Martin Fergal J1ORCID,Martínez Laura10,Mohanan Shamika1,Muir Paul1314,Navarro Fabio C P6,Parker Anne1,Pei Baikang6,Pozo Fernando10,Riera Ferriol Calvet1,Ruffier Magali1ORCID,Schmitt Bianca M1,Stapleton Eloise1,Suner Marie-Marthe1ORCID,Sycheva Irina1,Uszczynska-Ratajczak Barbara15,Wolf Maxim Y16,Xu Jinuri6,Yang Yucheng T617,Yates Andrew1ORCID,Zerbino Daniel1ORCID,Zhang Yan618ORCID,Choudhary Jyoti S8,Gerstein Mark61719,Guigó Roderic520,Hubbard Tim J P21,Kellis Manolis34,Paten Benedict2,Tress Michael L10ORCID,Flicek Paul1ORCID

Affiliation:

1. European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK

2. UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA 95064, USA

3. MIT Computer Science and Artificial Intelligence Laboratory, 32 Vassar St, Cambridge, MA 02139, USA

4. Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, MA 02142, USA

5. Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, Dr. Aiguader 88, Barcelona, E-08003 Catalonia, Spain

6. Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA

7. Department of Bioscience, Brunel University London, Uxbridge UB8 3PH, UK

8. Functional Proteomics, Division of Cancer Biology, Institute of Cancer Research, 237 Fulham Road, London SW3 6JB, UK

9. Computational and Systems Biology Program, Massachusetts Institute of Technology, Cambridge, MA, USA

10. Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), Madrid, Spain

11. Department of Medical Oncology, Inselspital, University Hospital, University of Bern, Bern, Switzerland

12. Department of Biomedical Research (DBMR), University of Bern, Bern, Switzerland

13. Department of Molecular, Cellular & Developmental Biology, Yale University, New Haven, CT 06520, USA

14. Systems Biology Institute, Yale University, West Haven, CT 06516, USA

15. Centre of New Technologies, University of Warsaw, Warsaw, Poland

16. Department of Biomedical Informatics at Harvard Medical School, 10 Shattuck Street, Suite 514, Boston, MA 02115, USA

17. Program in Computational Biology & Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, CT 06520, USA

18. Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH 43210, USA

19. Department of Computer Science, Yale University, Bass 432, 266 Whitney Avenue, New Haven, CT 06520, USA

20. Universitat Pompeu Fabra (UPF), Barcelona, E-08003 Catalonia, Spain

21. Department of Medical and Molecular Genetics, King's College London, Guys Hospital, Great Maze Pond, London SE1 9RT, UK

Abstract

Abstract The GENCODE project annotates human and mouse genes and transcripts supported by experimental data with high accuracy, providing a foundational resource that supports genome biology and clinical genomics. GENCODE annotation processes make use of primary data and bioinformatic tools and analysis generated both within the consortium and externally to support the creation of transcript structures and the determination of their function. Here, we present improvements to our annotation infrastructure, bioinformatics tools, and analysis, and the advances they support in the annotation of the human and mouse genomes including: the completion of first pass manual annotation for the mouse reference genome; targeted improvements to the annotation of genes associated with SARS-CoV-2 infection; collaborative projects to achieve convergence across reference annotation databases for the annotation of human and mouse protein-coding genes; and the first GENCODE manually supervised automated annotation of lncRNAs. Our annotation is accessible via Ensembl, the UCSC Genome Browser and https://www.gencodegenes.org.

Funder

National Institutes of Health

Wellcome Trust

European Molecular Biology Laboratory

Swiss National Science Foundation

University of Bern

Publisher

Oxford University Press (OUP)

Subject

Genetics

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3