Genetic algorithm‐based semisupervised convolutional neural network for real‐time monitoring of Escherichia coli fermentation of recombinant protein production using a Raman sensor

Author:

Liu Yuan1,Zhou Xiaotian1,Wang Teng123,Luo An13,Jia Zhaojun13,Pan Xingquan13,Cai Weiqi13,Sun Mengge13,Wang Xuezhong123,Wen Zhenguo123ORCID,Zhou Guangzheng23

Affiliation:

1. Department of Pharmaceutical Engineering Beijing Institute of Petrochemical Technology Beijing China

2. Beijing Key Laboratory of Enze Biomass and Fine Chemicals Beijing Institute of Petrochemical Technology Beijing China

3. Beijing Institute of Petrochemical Technology College of New Materials and Chemical Engineering Beijing China

Abstract

AbstractAs a non‐destructive sensing technique, Raman spectroscopy is often combined with regression models for real‐time detection of key components in microbial cultivation processes. However, achieving accurate model predictions often requires a large amount of offline measurement data for training, which is both time‐consuming and labor‐intensive. In order to overcome the limitations of traditional models that rely on large datasets and complex spectral preprocessing, in addition to the difficulty of training models with limited samples, we have explored a genetic algorithm‐based semi‐supervised convolutional neural network (GA‐SCNN). GA‐SCNN integrates unsupervised process spectral labeling, feature extraction, regression prediction, and transfer learning. Using only an extremely small number of offline samples of the target protein, this framework can accurately predict protein concentration, which represents a significant challenge for other models. The effectiveness of the framework has been validated in a system of Escherichia coli expressing recombinant ProA5M protein. By utilizing the labeling technique of this framework, the available dataset for glucose, lactate, ammonium ions, and optical density at 600 nm (OD600) has been expanded from 52 samples to 1302 samples. Furthermore, by introducing a small component of offline detection data for recombinant proteins into the OD600 model through transfer learning, a model for target protein detection has been retrained, providing a new direction for the development of associated models. Comparative analysis with traditional algorithms demonstrates that the GA‐SCNN framework exhibits good adaptability when there is no complex spectral preprocessing. Cross‐validation results confirm the robustness and high accuracy of the framework, with the predicted values of the model highly consistent with the offline measurement results.

Publisher

Wiley

Subject

Applied Microbiology and Biotechnology,Bioengineering,Biotechnology

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3