Spaced Out Data No More: Genomic Harmonization Meets Machine Learning in Murine Livers

Author:

Ilangovan Hari1,Kothiyal Prachi2,Hoadley Katherine3ORCID,Elgart Shona4ORCID,Eley Greg2,Eslami Parastou5

Affiliation:

1. Science Application International Corporation (SAIC)

2. Scimentis LLC

3. University of North Carolina at Chapel Hill

4. University of Houston

5. Universal Artificial Intelligence Inc.

Abstract

Abstract NASA has employed high-throughput molecular assays to identify sub-cellular changes impacting human physiology during spaceflight. Machine learning (ML) methods hold the promise to improve our ability to identify important signals within highly dimensional molecular data. However, the inherent limitation of study subject numbers within a spaceflight mission minimizes the utility of ML approaches. To overcome the sample power limitations, data from multiple spaceflight missions must be aggregated while appropriately addressing intra- and inter-study variabilities. Here we describe an approach to log transform, scale and normalize data from six heterogeneous, mouse liver derived transcriptomics datasets (ntotal=137) which enabled ML-methods to classify spaceflown vs. ground control animals (AUC ≥ 0.87) while mitigating the variability from mission-of-origin. Concordance was found between liver-specific biological processes identified from harmonized ML-based analysis and study-by-study classical omics analysis. This work demonstrates the feasibility of applying ML methods on integrated, heterogeneous datasets of small sample size.

Publisher

Research Square Platform LLC

Reference53 articles.

1. Breaking the limit: Biological countermeasures for space radiation exposure to enable long-duration spaceflight;Dynan WS;Life Sci. Space Res.,2022

2. Choi, S. Y., Beegle, J. E., Wigley, C. L., Pletcher, D. & Globus, R. K. NASA’s Rodent Research Project: Validation of Flight Hardware, Operations and Science Capabilities for Conducting Long Duration Experiments in Space. in (2015).

3. Evaluation of rodent spaceflight in the NASA animal enclosure module for an extended operational period (up to 35 days);Moyer EL;Npj Microgravity,2016

4. NASA GeneLab RNA-seq consensus pipeline: Standardized processing of short-read RNA-seq data;Overbey EG;iScience,2021

5. Cause of death and neoplasia in mice continuously exposed to very low dose rates of gamma rays;Tanaka IB;Radiat. Res.,2007

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3