Functional Annotation of the Arabidopsis Genome Using Controlled Vocabularies

Author:

Berardini Tanya Z.1,Mundodi Suparna1,Reiser Leonore1,Huala Eva1,Garcia-Hernandez Margarita1,Zhang Peifen1,Mueller Lukas A.1,Yoon Jungwoon1,Doyle Aisling1,Lander Gabriel1,Moseyko Nick1,Yoo Danny1,Xu Iris1,Zoeckler Brandon1,Montoya Mary1,Miller Neil1,Weems Dan1,Rhee Seung Y.1

Affiliation:

1. Carnegie Institution, Department of Plant Biology, Stanford, California 94305 (T.Z.B., S.M., L.R., E.H., M.G.-H., P.Z., L.A.M., J.Y., A.D., G.L., N.M., D.Y., I.X., B.Z., S.Y.R.); and National Center for Genome Resources, Santa Fe, New Mexico 87505 (M.M., N.M., D.W.)

Abstract

Abstract Controlled vocabularies are increasingly used by databases to describe genes and gene products because they facilitate identification of similar genes within an organism or among different organisms. One of The Arabidopsis Information Resource's goals is to associate all Arabidopsis genes with terms developed by the Gene Ontology Consortium that describe the molecular function, biological process, and subcellular location of a gene product. We have also developed terms describing Arabidopsis anatomy and developmental stages and use these to annotate published gene expression data. As of March 2004, we used computational and manual annotation methods to make 85,666 annotations representing 26,624 unique loci. We focus on associating genes to controlled vocabulary terms based on experimental data from the literature and use The Arabidopsis Information Resource-developed PubSearch software to facilitate this process. Each annotation is tagged with a combination of evidence codes, evidence descriptions, and references that provide a robust means to assess data quality. Annotation of all Arabidopsis genes will allow quantitative comparisons between sets of genes derived from sources such as microarray experiments. The Arabidopsis annotation data will also facilitate annotation of newly sequenced plant genomes by using sequence similarity to transfer annotations to homologous genes. In addition, complete and up-to-date annotations will make unknown genes easy to identify and target for experimentation. Here, we describe the process of Arabidopsis functional annotation using a variety of data sources and illustrate several ways in which this information can be accessed and used to infer knowledge about Arabidopsis and other plant species.

Publisher

Oxford University Press (OUP)

Subject

Plant Science,Genetics,Physiology

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3