Improving bacterial metagenomic research through long read sequencing

Author:

Greenman Noah,Hassouneh Sayf Al-Deen,Johnston Catherine,Azarian TajORCID

Abstract

AbstractMetagenomic sequencing analysis is central to investigating microbial communities in clinical and environmental studies. Short read sequencing remains the primary data type for metagenomic research. However, long read sequencing promises the advantages of improved metagenomic assembly and resolved taxonomic identification. To assess the comparative performance of short and long read sequencing data for metagenomic analysis, we simulated short and long read datasets using increasingly complex metagenomes of 10, 20, and 50 microbial taxa. In addition, an empirical dataset of paired short and long read data from mouse fecal pellets was generated to assess feasibility. We compared metagenomic assembly quality, taxonomic classification capabilities, and metagenome-assembled genome recovery rates for both simulated and real metagenomic sequence data. We show that long read sequencing data significantly outperforms short read sequencing. For simulated long read datasets, metagenomic assemblies were completer and more contiguous with higher rates of metagenome-assembled genome recovery. This resulted in more precise taxonomic classifications and increased accuracy of relative abundance estimates. Analysis of empirical data demonstrated that sequencing technology directly affects compositional results. Overall, we highlight strengths of long read sequencing for metagenomic research of microbial communities over traditional short read approaches.

Publisher

Cold Spring Harbor Laboratory

Reference56 articles.

1. A. Murat Eren . 2016. “Assessing Completion and Contamination of Metagenome-Assembled Genomes.” https://Merenlab.Org/2016/06/09/Assessing-Completion-and-Contamination-of-MAGs/. June 9, 2016.

2. Aitchison, J . (1986). The statistical analysis of Compositional Data. Chapman and Hall.

3. Opportunities and challenges in long-read sequencing data analysis

4. long-read-tools.org: an interactive catalogue of analysis methods for long-read sequencing data

5. Amarasinghe, S. L. , Ritchie, M. E. , & Gouil, Q. (2023). Long-read-tools. Long Read Tools. https://long-read-tools.org/analysis.html

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3