iFLAS: positive‐unlabeled learning facilitates full‐length transcriptome‐based identification and functional exploration of alternatively spliced isoforms in maize

Author:

Xu Feng1,Liu Songyu1,Zhao Anwen1,Shang Meiqi1,Wang Qian1ORCID,Jiang Shuqin1,Cheng Qian1,Chen Xingming2,Zhai Xiaoguang2,Zhang Jianan2,Wang Xiangfeng1ORCID,Yan Jun1ORCID

Affiliation:

1. State Key Laboratory of Maize Bio‐Breeding, National Maize Improvement Center, Frontiers Science Center for Molecular Design Breeding, College of Agronomy and Biotechnology China Agricultural University Beijing 100094 China

2. Molbreeding Biotechnology Co., Ltd Shijiazhuang Hebei Province 051430 China

Abstract

Summary The advent of full‐length transcriptome sequencing technologies has accelerated the discovery of novel splicing isoforms. However, existing alternative splicing (AS) tools are either tailored for short‐read RNA‐Seq data or designed for human and animal studies. The disparities in AS patterns between plants and animals still pose a challenge to the reliable identification and functional exploration of novel isoforms in plants. Here, we developed integrated full‐length alternative splicing analysis (iFLAS), a plant‐optimized AS toolkit that introduced a semi‐supervised machine learning method known as positive‐unlabeled (PU) learning to accurately identify novel isoforms. iFLAS also enables the investigation of AS functions from various perspectives, such as differential AS, poly(A) tail length, and allele‐specific AS (ASAS) analyses. By applying iFLAS to three full‐length transcriptome sequencing datasets, we systematically identified and functionally characterized maize (Zea mays) AS patterns. We found intron retention not only introduces premature termination codons, resulting in lower expression levels of isoforms, but may also regulate the length of 3′UTR and poly(A) tail, thereby affecting the functional differentiation of isoforms. Moreover, we observed distinct ASAS patterns in two genes within heterosis offspring, highlighting their potential value in breeding. These results underscore the broad applicability of iFLAS in plant full‐length transcriptome‐based AS research.

Funder

Natural Science Foundation of Beijing Municipality

National Key Research and Development Program of China

Chinese Universities Scientific Fund

Publisher

Wiley

Subject

Plant Science,Physiology

Reference65 articles.

1. A survey of the sorghum transcriptome using single‐molecule long reads;Abdel‐Ghany SE;Nature Communications,2016

2. Opportunities and challenges in long‐read sequencing data analysis;Amarasinghe SL;Genome Biology,2020

3. Genome architecture and tetrasomic inheritance of autotetraploid potato;Bao Z;Molecular Plant,2022

4. Learning from positive and unlabeled data: a survey;Bekker J;Machine Learning,2020

5. The three as: alternative splicing, alternative polyadenylation and their impact on apoptosis in immune function;Blake D;Immunological Reviews,2021

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3