High-throughput nanopore sequencing of Treponema pallidum tandem repeat genes arp and tp0470 reveals clade-specific patterns and recapitulates global whole genome phylogeny

Author:

Lieberman Nicole A. P.,Armstrong Thaddeus D.,Chung Benjamin,Pfalmer Daniel,Hennelly Christopher M.,Haynes Austin,Romeis Emily,Wang Qian-Qiu,Zhang Rui-Li,Kou Cai-Xia,Ciccarese Giulia,Conte Ivano Dal,Cusini Marco,Drago Francesco,Nakayama Shu-ichi,Lee Kenichi,Ohnishi Makoto,Konda Kelika A.,Vargas Silver K.,Eguiluz Maria,Caceres Carlos F.,Klausner Jeffrey D.,Mitja Oriol,Rompalo Anne,Mulcahy Fiona,Hook Edward W.,Hoffman Irving F.,Matoga Mitch M.,Zheng Heping,Yang Bin,Lopez-Medina Eduardo,Ramirez Lady G.,Radolf Justin D.,Hawley Kelly L.,Salazar Juan C.,Lukehart Sheila A.,Seña Arlene C.,Parr Jonathan B.,Giacani Lorenzo,Greninger Alexander L.

Abstract

Sequencing of most Treponema pallidum genomes excludes repeat regions in tp0470 and the tp0433 gene, encoding the acidic repeat protein (arp). As a first step to understanding the evolution and function of these genes and the proteins they encode, we developed a protocol to nanopore sequence tp0470 and arp genes from 212 clinical samples collected from ten countries on six continents. Both tp0470 and arp repeat structures recapitulate the whole genome phylogeny, with subclade-specific patterns emerging. The number of tp0470 repeats is on average appears to be higher in Nichols-like clade strains than in SS14-like clade strains. Consistent with previous studies, we found that 14-repeat arp sequences predominate across both major clades, but the combination and order of repeat type varies among subclades, with many arp sequence variants limited to a single subclade. Although strains that were closely related by whole genome sequencing frequently had the same arp repeat length, this was not always the case. Structural modeling of TP0470 suggested that the eight residue repeats form an extended α-helix, predicted to be periplasmic. Modeling of the ARP revealed a C-terminal sporulation-related repeat (SPOR) domain, predicted to bind denuded peptidoglycan, with repeat regions possibly incorporated into a highly charged β-sheet. Outside of the repeats, all TP0470 and ARP amino acid sequences were identical. Together, our data, along with functional considerations, suggests that both TP0470 and ARP proteins may be involved in T. pallidum cell envelope remodeling and homeostasis, with their highly plastic repeat regions playing as-yet-undetermined roles.

Funder

Bill and Melinda Gates Foundation

National Institute of Allergy and Infectious Diseases

Japan Agency for Medical Research and Development

Ministry of Education, Culture, Sports, Science and Technology

Publisher

Frontiers Media SA

Subject

Microbiology (medical),Microbiology

全球学者库

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"全球学者库"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前全球学者库共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2023 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3