Reclassification of ASFV into 7 Biotypes Using Unsupervised Machine Learning

Author:

Dinhobl Mark123,Spinard Edward123ORCID,Tesler Nicolas14ORCID,Birtley Hillary14,Signore Anthony35ORCID,Ambagala Aruna35ORCID,Masembe Charles36ORCID,Borca Manuel V.123,Gladue Douglas P.123ORCID

Affiliation:

1. United States Department of Agriculture, Agricultural Research Service, Foreign Animal Disease Research Unit, Plum Island Animal Disease Center, Orient, NY 11957, USA

2. United States Department of Agriculture, Agricultural Research Service, Foreign Animal Disease Research Unit, National Bio and Agro-Defense Facility, Manhattan, KS 66502, USA

3. Center of Excellence for African Swine Fever Genomics, Guilford, CT 06437, USA

4. Oak Ridge Institute for Science and Education (ORISE), Oak Ridge, TN 37830, USA

5. National Centre for Foreign Animal Disease, Canadian Food Inspection Agency, Winnipeg, MB R3E 3M4, Canada

6. Department of Zoology, Entomology and Fisheries Sciences, School of Biosciences, College of Natural Sciences, Makerere University, Kampala P.O. Box 7062, Uganda

Abstract

In 2007, an outbreak of African swine fever (ASF), a deadly disease of domestic swine and wild boar caused by the African swine fever virus (ASFV), occurred in Georgia and has since spread globally. Historically, ASFV was classified into 25 different genotypes. However, a newly proposed system recategorized all ASFV isolates into 6 genotypes exclusively using the predicted protein sequences of p72. However, ASFV has a large genome that encodes between 150–200 genes, and classifications using a single gene are insufficient and misleading, as strains encoding an identical p72 often have significant mutations in other areas of the genome. We present here a new classification of ASFV based on comparisons performed considering the entire encoded proteome. A curated database consisting of the protein sequences predicted to be encoded by 220 reannotated ASFV genomes was analyzed for similarity between homologous protein sequences. Weights were applied to the protein identity matrices and averaged to generate a genome-genome identity matrix that was then analyzed by an unsupervised machine learning algorithm, DBSCAN, to separate the genomes into distinct clusters. We conclude that all available ASFV genomes can be classified into 7 distinct biotypes.

Funder

USDA

NBAF

Publisher

MDPI AG

Subject

Virology,Infectious Diseases

Reference42 articles.

1. One hundred years of African swine fever: A tribute to R. Eustace Montgomery;Penrith;Transbound. Emerg. Dis.,2021

2. Genomic analysis of highly virulent Georgia 2007/1 isolate of African swine fever virus;Chapman;Emerg. Infect. Dis.,2011

3. African swine fever in the Dominican Republic;Gonzales;Transbound. Emerg. Dis.,2021

4. Flach, B. (2023, December 01). United States Department of Agriculture (2023). First Case of African Swine Fever Found in Wild Boars in Sweden. Sweden, Available online: https://apps.fas.usda.gov/newgainapi/api/Report/DownloadReportByFileName?fileName=First%20Case%20of%20African%20Swine%20Fever%20Found%20in%20Wild%20Boars%20in%20Sweden%20_The%20Hague_Sweden_SW2023-0002.pdf.

5. Highly lethal genotype I and II recombinant African swine fever viruses detected in pigs;Zhao;Nat. Commun.,2023

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3