1
|
Keenum I, Jackson SA, Eloe-Fadrosh E, Schriml LM. A standards perspective on genomic data reusability and reproducibility. FRONTIERS IN BIOINFORMATICS 2025; 5:1572937. [PMID: 40130011 PMCID: PMC11931119 DOI: 10.3389/fbinf.2025.1572937] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2025] [Accepted: 02/21/2025] [Indexed: 03/26/2025] Open
Abstract
Genomic and metagenomic sequence data provides an unprecedented ability to re-examine findings, offering a transformative potential for advancing research, developing computational tools, enhancing clinical applications, and fostering scientific collaboration. However, effective and ethical reuse of genomics data is hampered by numerous technical and social challenges. The International Microbiome and Multi'Omics Standards Alliance (IMMSA, https://www.microbialstandards.org/) and the Genomic Standards Consortium (GSC, https://gensc.org) hosted a 5-part seminar series "A Year of Data Reuse" in 2024 to explore challenges and opportunities of data reuse and reproducibility across disparate domains of the genomic sciences. Addressing these challenges will require a multifaceted approach, including common metadata reporting, clear communication, standardized protocols, improved data management infrastructure, ethical guidelines, and collaborative policies that prioritize transparency and accessibility. We offer strategies to enable responsible and technically feasible data reuse, recognition of data reproducibility challenges, and emphasizing the importance of cross-disciplinary efforts in the pursuit of open science and data-driven innovation.
Collapse
Affiliation(s)
- Ishi Keenum
- Department of Civil, Environmental, and Geospatial Engineering, Michigan Technological University, Houghton, MI, United States
| | - Scott A. Jackson
- Complex Microbial Systems Group, National Institute of Standards and Technology, Gaithersburg, MD, United States
| | - Emiley Eloe-Fadrosh
- Lawrence Berkeley National Laboratory, Environmental Genomics and Systems Biology Division, Berkeley, CA, United States
| | - Lynn M. Schriml
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, United States
| |
Collapse
|
2
|
Wang M, Chen L, Zhang Z, Wang Q. Recent advances in genome mining and synthetic biology for discovery and biosynthesis of natural products. Crit Rev Biotechnol 2025; 45:236-256. [PMID: 39134459 DOI: 10.1080/07388551.2024.2383754] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2023] [Revised: 12/28/2023] [Accepted: 07/13/2024] [Indexed: 12/17/2024]
Abstract
Natural products have long served as critical raw materials in chemical and pharmaceutical manufacturing, primarily which can provide superior scaffolds or intermediates for drug discovery and development. Over the last century, natural products have contributed to more than a third of therapeutic drug production. However, traditional methods of producing drugs from natural products have become less efficient and more expensive over the past few decades. The combined utilization of genome mining and synthetic biology based on genome sequencing, bioinformatics tools, big data analytics, genetic engineering, metabolic engineering, and systems biology promises to counter this trend. Here, we reviewed recent (2020-2023) examples of genome mining and synthetic biology used to resolve challenges in the production of natural products, such as less variety, poor efficiency, and low yield. Additionally, the emerging efficient tools, design principles, and building strategies of synthetic biology and its application prospects in NPs synthesis have also been discussed.
Collapse
Affiliation(s)
- Mingpeng Wang
- School of Life Sciences, Qufu Normal University, Qufu, China
- Key Laboratory of Engineering Biology for Low-carbon Manufacturing, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, China
| | - Lei Chen
- School of Life Sciences, Qufu Normal University, Qufu, China
- Key Laboratory of Engineering Biology for Low-carbon Manufacturing, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, China
| | - Zhaojie Zhang
- Department of Zoology and Physiology, University of WY, Laramie, Laramie, WY, USA
| | - Qinhong Wang
- Key Laboratory of Engineering Biology for Low-carbon Manufacturing, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, China
| |
Collapse
|
3
|
Chibwe K, Mantilla-Calderon D, Ling F. Evaluating GPT Models for Automated Literature Screening in Wastewater-Based Epidemiology. ACS ENVIRONMENTAL AU 2025; 5:61-68. [PMID: 39830716 PMCID: PMC11741058 DOI: 10.1021/acsenvironau.4c00042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/25/2024] [Revised: 11/22/2024] [Accepted: 11/25/2024] [Indexed: 01/22/2025]
Abstract
Methods to quantitatively synthesize findings across multiple studies is an emerging need in wastewater-based epidemiology (WBE), where disease tracking through wastewater analysis is performed at broad geographical locations using various techniques to facilitate public health responses. Meta-analysis provides a rigorous statistical procedure for research synthesis, yet the manual process of screening large volumes of literature remains a hurdle for its application in timely evidence-based public health responses. Here, we evaluated the performance of GPT-3, GPT-3.5, and GPT-4 models in automated screening of publications for meta-analysis in the WBE literature. We show that the chat completion model in GPT-4 accurately differentiates papers that contain original data from those that did not with texts of the Abstract as the input at a Precision of 0.96 and Recall of 1.00, exceeding current quality standards for manual screening (Recall = 0.95) while costing less than $0.01 per paper. GPT models performed less accurately in detecting studies reporting relevant sampling location, highlighting the value of maintaining human intervention in AI-assisted literature screening. Importantly, we show that certain formulation and model choices generated nonsensical answers to the screening tasks, while others did not, urging the attention to robustness when employing AI-assisted literature screening. This study provided novel performance evaluation data on GPT models for document screening as a step in meta-analysis, suggesting AI-assisted literature screening a useful complementary technique to speed up research synthesis in WBE.
Collapse
Affiliation(s)
- Kaseba Chibwe
- Department of Energy, Environmental
and Chemical Engineering, Washington University
in St. Louis, St. Louis, Missouri 63130, United States
| | - David Mantilla-Calderon
- Department of Energy, Environmental
and Chemical Engineering, Washington University
in St. Louis, St. Louis, Missouri 63130, United States
| | - Fangqiong Ling
- Department of Energy, Environmental
and Chemical Engineering, Washington University
in St. Louis, St. Louis, Missouri 63130, United States
| |
Collapse
|
4
|
Wang Y, Wang Y, Cui J, Wu C, Yu B, Wang L. Non-conventional yeasts: promising cell factories for organic acid bioproduction. Trends Biotechnol 2025:S0167-7799(24)00364-0. [PMID: 39799011 DOI: 10.1016/j.tibtech.2024.12.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2024] [Revised: 11/29/2024] [Accepted: 12/13/2024] [Indexed: 01/15/2025]
Abstract
Microbial production of organic acids has been hindered by the poor acid tolerance of microorganisms and the high costs of waste salt reprocessing. The robustness of non-conventional microorganisms in an acidic environment makes it possible to produce organic acids at low pH and greatly simplifies downstream processing. In this review we discuss the environmental adaptability features of non-conventional yeasts, as well as the latest developments in genomic engineering strategies that have facilitated metabolic engineering of these strains. We also use selected examples of three-carbon (C3), C4, and C6 organic acids to illustrate the ongoing efforts and challenges of using non-conventional yeasts for organic acid production. This review provides theoretical guidance for the construction of highly robust organic acid producers.
Collapse
Affiliation(s)
- Yupeng Wang
- Department of Microbial Physiological and Metabolic Engineering, State Key Laboratory of Mycology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yi Wang
- Department of Microbial Physiological and Metabolic Engineering, State Key Laboratory of Mycology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jiakai Cui
- Department of Microbial Physiological and Metabolic Engineering, State Key Laboratory of Mycology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China; School of Life Sciences, Yunnan University, Kunming 650500, China
| | - Chenchen Wu
- Department of Microbial Physiological and Metabolic Engineering, State Key Laboratory of Mycology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Bo Yu
- Department of Microbial Physiological and Metabolic Engineering, State Key Laboratory of Mycology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China.
| | - Limin Wang
- Department of Microbial Physiological and Metabolic Engineering, State Key Laboratory of Mycology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China.
| |
Collapse
|
5
|
Sun B, Pashkova L, Pieters P, Harke A, Mohite O, Santos A, Zielinski D, Palsson B, Phaneuf P. PanKB: An interactive microbial pangenome knowledgebase for research, biotechnological innovation, and knowledge mining. Nucleic Acids Res 2025; 53:D806-D818. [PMID: 39574409 PMCID: PMC11701538 DOI: 10.1093/nar/gkae1042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2024] [Revised: 10/11/2024] [Accepted: 10/19/2024] [Indexed: 01/18/2025] Open
Abstract
The exponential growth of microbial genome data presents unprecedented opportunities for unlocking the potential of microorganisms. The burgeoning field of pangenomics offers a framework for extracting insights from this big biological data. Recent advances in microbial pangenomic research have generated substantial data and literature, yielding valuable knowledge across diverse microbial species. PanKB (pankb.org), a knowledgebase designed for microbial pangenomics research and biotechnological applications, was built to capitalize on this wealth of information. PanKB currently includes 51 pangenomes from 8 industrially relevant microbial families, comprising 8402 genomes, over 500 000 genes and over 7M mutations. To describe this data, PanKB implements four main components: (1) Interactive pangenomic analytics to facilitate exploration, intuition, and potential discoveries; (2) Alleleomic analytics, a pangenomic-scale analysis of variants, providing insights into intra-species sequence variation and potential mutations for applications; (3) A global search function enabling broad and deep investigations across pangenomes to power research and bioengineering workflows; (4) A bibliome of 833 open-access pangenomic papers and an interface with an LLM that can answer in-depth questions using its knowledge. PanKB empowers researchers and bioengineers to harness the potential of microbial pangenomics and serves as a valuable resource bridging the gap between pangenomic data and practical applications.
Collapse
Affiliation(s)
- Binhuan Sun
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Building 220 Søltofts Plads, 2800 Kongens, Lyngby, Denmark
| | - Liubov Pashkova
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Building 220 Søltofts Plads, 2800 Kongens, Lyngby, Denmark
| | - Pascal Aldo Pieters
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Building 220 Søltofts Plads, 2800 Kongens, Lyngby, Denmark
| | - Archana Sanjay Harke
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Building 220 Søltofts Plads, 2800 Kongens, Lyngby, Denmark
| | - Omkar Satyavan Mohite
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Building 220 Søltofts Plads, 2800 Kongens, Lyngby, Denmark
| | - Alberto Santos
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Building 220 Søltofts Plads, 2800 Kongens, Lyngby, Denmark
| | - Daniel C Zielinski
- Department of Bioengineering, University of California, San Diego, La Jolla, California 92093, United States
| | - Bernhard O Palsson
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Building 220 Søltofts Plads, 2800 Kongens, Lyngby, Denmark
- Department of Bioengineering, University of California, San Diego, La Jolla, California 92093, United States
- Bioinformatics and Systems Biology Program, University of California, San Diego, La Jolla, California 92093, United States
- Department of Pediatrics, University of California, San Diego, La Jolla, California 92093, United States
| | - Patrick Victor Phaneuf
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Building 220 Søltofts Plads, 2800 Kongens, Lyngby, Denmark
| |
Collapse
|
6
|
Xiao Z, Pakrasi HB, Chen Y, Tang YJ. Network for knowledge Organization (NEKO): An AI knowledge mining workflow for synthetic biology research. Metab Eng 2025; 87:60-67. [PMID: 39580108 DOI: 10.1016/j.ymben.2024.11.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2024] [Revised: 09/30/2024] [Accepted: 11/17/2024] [Indexed: 11/25/2024]
Abstract
Large language models (LLMs) can complete general scientific question-and-answer, yet they are constrained by their pretraining cut-off dates and lack the ability to provide specific, cited scientific knowledge. Here, we introduce Network for Knowledge Organization (NEKO), a workflow that uses LLM Qwen to extract knowledge through scientific literature text mining. When user inputs a keyword of interest, NEKO can generate knowledge graphs to link bioinformation entities and produce comprehensive summaries from PubMed search. NEKO significantly enhance LLM ability and has immediate applications in daily academic tasks such as education of young scientists, literature review, paper writing, experiment planning/troubleshooting, and new ideas/hypothesis generation. We exemplified this workflow's applicability through several case studies on yeast fermentation and cyanobacterial biorefinery. NEKO's output is more informative, specific, and actionable than GPT-4's zero-shot Q&A. NEKO offers flexible, lightweight local deployment options. NEKO democratizes artificial intelligence (AI) tools, making scientific foundation model more accessible to researchers without excessive computational power.
Collapse
Affiliation(s)
- Zhengyang Xiao
- Department of Energy, Environment, and Chemical Engineering, Washington University in St. Louis, St. Louis, MO, 63130, United States
| | - Himadri B Pakrasi
- Department of Biology, Washington University in St. Louis, St. Louis, MO, 63130, United States
| | - Yixin Chen
- Department of Computer Science, Washington University in St. Louis, St. Louis, MO, 63130, United States.
| | - Yinjie J Tang
- Department of Energy, Environment, and Chemical Engineering, Washington University in St. Louis, St. Louis, MO, 63130, United States.
| |
Collapse
|
7
|
Jin L, Zhou Y, Zhang S, Chen SJ. mRNA vaccine sequence and structure design and optimization: Advances and challenges. J Biol Chem 2025; 301:108015. [PMID: 39608721 PMCID: PMC11728972 DOI: 10.1016/j.jbc.2024.108015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2024] [Revised: 11/13/2024] [Accepted: 11/16/2024] [Indexed: 11/30/2024] Open
Abstract
Messenger RNA (mRNA) vaccines have emerged as a powerful tool against communicable diseases and cancers, as demonstrated by their huge success during the coronavirus disease 2019 (COVID-19) pandemic. Despite the outstanding achievements, mRNA vaccines still face challenges such as stringent storage requirements, insufficient antigen expression, and unexpected immune responses. Since the intrinsic properties of mRNA molecules significantly impact vaccine performance, optimizing mRNA design is crucial in preclinical development. In this review, we outline four key principles for optimal mRNA sequence design: enhancing ribosome loading and translation efficiency through untranslated region (UTR) optimization, improving translation efficiency via codon optimization, increasing structural stability by refining global RNA sequence and extending in-cell lifetime and expression fidelity by adjusting local RNA structures. We also explore recent advancements in computational models for designing and optimizing mRNA vaccine sequences following these principles. By integrating current mRNA knowledge, addressing challenges, and examining advanced computational methods, this review aims to promote the application of computational approaches in mRNA vaccine development and inspire novel solutions to existing obstacles.
Collapse
Affiliation(s)
- Lei Jin
- Department of Physics and Astronomy, University of Missouri, Columbia, Missouri, USA
| | - Yuanzhe Zhou
- Department of Physics and Astronomy, University of Missouri, Columbia, Missouri, USA
| | - Sicheng Zhang
- Department of Physics and Astronomy, University of Missouri, Columbia, Missouri, USA
| | - Shi-Jie Chen
- Department of Physics and Astronomy, University of Missouri, Columbia, Missouri, USA; Department of Biochemistry, MU Institute for Data Science and Informatics, University of Missouri, Columbia, Missouri, USA.
| |
Collapse
|
8
|
Stohr AM, Ma D, Chen W, Blenner M. Engineering conditional protein-protein interactions for dynamic cellular control. Biotechnol Adv 2024; 77:108457. [PMID: 39343083 DOI: 10.1016/j.biotechadv.2024.108457] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2024] [Revised: 08/28/2024] [Accepted: 09/26/2024] [Indexed: 10/01/2024]
Abstract
Conditional protein-protein interactions enable dynamic regulation of cellular activity and are an attractive approach to probe native protein interactions, improve metabolic engineering of microbial factories, and develop smart therapeutics. Conditional protein-protein interactions have been engineered to respond to various chemical, light, and nucleic acid-based stimuli. These interactions have been applied to assemble protein fragments, build protein scaffolds, and spatially organize proteins in many microbial and higher-order hosts. To foster the development of novel conditional protein-protein interactions that respond to new inputs or can be utilized in alternative settings, we provide an overview of the process of designing new engineered protein interactions while showcasing many recently developed computational tools that may accelerate protein engineering in this space.
Collapse
Affiliation(s)
- Anthony M Stohr
- Department of Chemical and Biomolecular Engineering, University of Delaware, Newark, DE 19716, USA
| | - Derron Ma
- Department of Chemical and Biomolecular Engineering, University of Delaware, Newark, DE 19716, USA
| | - Wilfred Chen
- Department of Chemical and Biomolecular Engineering, University of Delaware, Newark, DE 19716, USA.
| | - Mark Blenner
- Department of Chemical and Biomolecular Engineering, University of Delaware, Newark, DE 19716, USA.
| |
Collapse
|
9
|
Ding Z, Wei R, Xia J, Mu Y, Wang J, Lin Y. Exploring the potential of large language model-based chatbots in challenges of ribosome profiling data analysis: a review. Brief Bioinform 2024; 26:bbae641. [PMID: 39668339 PMCID: PMC11638007 DOI: 10.1093/bib/bbae641] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2024] [Revised: 11/02/2024] [Accepted: 11/27/2024] [Indexed: 12/14/2024] Open
Abstract
Ribosome profiling (Ribo-seq) provides transcriptome-wide insights into protein synthesis dynamics, yet its analysis poses challenges, particularly for nonbioinformatics researchers. Large language model-based chatbots offer promising solutions by leveraging natural language processing. This review explores their convergence, highlighting opportunities for synergy. We discuss challenges in Ribo-seq analysis and how chatbots mitigate them, facilitating scientific discovery. Through case studies, we illustrate chatbots' potential contributions, including data analysis and result interpretation. Despite the absence of applied examples, existing software underscores the value of chatbots and the large language model. We anticipate their pivotal role in future Ribo-seq analysis, overcoming limitations. Challenges such as model bias and data privacy require attention, but emerging trends offer promise. The integration of large language models and Ribo-seq analysis holds immense potential for advancing translational regulation and gene expression understanding.
Collapse
Affiliation(s)
- Zheyu Ding
- School of Pharmacy, Hangzhou Normal University, Hangzhou, Zhejiang 311121, China
- Key Laboratory of Elemene Class Anti-Cancer Chinese Medicines, Engineering Laboratory of Development and Application of Traditional Chinese Medicines, Collaborative Innovation Center of Traditional Chinese Medicines of Zhejiang Province, Hangzhou Normal University, Hangzhou, Zhejiang 311121, China
| | - Rong Wei
- School of Pharmacy, Hangzhou Normal University, Hangzhou, Zhejiang 311121, China
- Key Laboratory of Elemene Class Anti-Cancer Chinese Medicines, Engineering Laboratory of Development and Application of Traditional Chinese Medicines, Collaborative Innovation Center of Traditional Chinese Medicines of Zhejiang Province, Hangzhou Normal University, Hangzhou, Zhejiang 311121, China
| | - Jianing Xia
- School of Pharmacy, Hangzhou Normal University, Hangzhou, Zhejiang 311121, China
- Key Laboratory of Elemene Class Anti-Cancer Chinese Medicines, Engineering Laboratory of Development and Application of Traditional Chinese Medicines, Collaborative Innovation Center of Traditional Chinese Medicines of Zhejiang Province, Hangzhou Normal University, Hangzhou, Zhejiang 311121, China
| | - Yonghao Mu
- School of Pharmacy, Hangzhou Normal University, Hangzhou, Zhejiang 311121, China
- Key Laboratory of Elemene Class Anti-Cancer Chinese Medicines, Engineering Laboratory of Development and Application of Traditional Chinese Medicines, Collaborative Innovation Center of Traditional Chinese Medicines of Zhejiang Province, Hangzhou Normal University, Hangzhou, Zhejiang 311121, China
| | - Jiahuan Wang
- School of Pharmacy, Hangzhou Normal University, Hangzhou, Zhejiang 311121, China
- Key Laboratory of Elemene Class Anti-Cancer Chinese Medicines, Engineering Laboratory of Development and Application of Traditional Chinese Medicines, Collaborative Innovation Center of Traditional Chinese Medicines of Zhejiang Province, Hangzhou Normal University, Hangzhou, Zhejiang 311121, China
| | - Yingying Lin
- School of Pharmacy, Hangzhou Normal University, Hangzhou, Zhejiang 311121, China
- Key Laboratory of Elemene Class Anti-Cancer Chinese Medicines, Engineering Laboratory of Development and Application of Traditional Chinese Medicines, Collaborative Innovation Center of Traditional Chinese Medicines of Zhejiang Province, Hangzhou Normal University, Hangzhou, Zhejiang 311121, China
| |
Collapse
|
10
|
Busta L, Hall D, Johnson B, Schaut M, Hanson CM, Gupta A, Gundrum M, Wang Y, A Maeda H. Mapping of specialized metabolite terms onto a plant phylogeny using text mining and large language models. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2024; 120:406-419. [PMID: 38976238 DOI: 10.1111/tpj.16906] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/16/2024] [Revised: 05/21/2024] [Accepted: 06/19/2024] [Indexed: 07/09/2024]
Abstract
Plants produce a staggering array of chemicals that are the basis for organismal function and important human nutrients and medicines. However, it is poorly defined how these compounds evolved and are distributed across the plant kingdom, hindering a systematic view and understanding of plant chemical diversity. Recent advances in plant genome/transcriptome sequencing have provided a well-defined molecular phylogeny of plants, on which the presence of diverse natural products can be mapped to systematically determine their phylogenetic distribution. Here, we built a proof-of-concept workflow where previously reported diverse tyrosine-derived plant natural products were mapped onto the plant tree of life. Plant chemical-species associations were mined from literature, filtered, evaluated through manual inspection of over 2500 scientific articles, and mapped onto the plant phylogeny. The resulting "phylochemical" map confirmed several highly lineage-specific compound class distributions, such as betalain pigments and Amaryllidaceae alkaloids. The map also highlighted several lineages enriched in dopamine-derived compounds, including the orders Caryophyllales, Liliales, and Fabales. Additionally, the application of large language models, using our manually curated data as a ground truth set, showed that post-mining processing can largely be automated with a low false-positive rate, critical for generating a reliable phylochemical map. Although a high false-negative rate remains a challenge, our study demonstrates that combining text mining with language model-based processing can generate broader phylochemical maps, which will serve as a valuable community resource to uncover key evolutionary events that underlie plant chemical diversity and enable system-level views of nature's millions of years of chemical experimentation.
Collapse
Affiliation(s)
- Lucas Busta
- Department of Chemistry and Biochemistry, University of Minnesota Duluth, Duluth, Minnesota, USA
| | - Drew Hall
- Department of Botany, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Braidon Johnson
- Department of Chemistry and Biochemistry, University of Minnesota Duluth, Duluth, Minnesota, USA
| | - Madelyn Schaut
- Department of Botany, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Caroline M Hanson
- Department of Botany, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Anika Gupta
- Department of Botany, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Megan Gundrum
- Department of Botany, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Yuer Wang
- Department of Botany, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Hiroshi A Maeda
- Department of Botany, University of Wisconsin-Madison, Madison, Wisconsin, USA
| |
Collapse
|
11
|
Zhou H, Wang HL, Duan YY, Yan ZN, Luo R, Lv XX, Xie Y, Zhang JY, Yang JM, Xue MD, Fang Y, Lu L, Liu PR, Ye ZW. Enhancing Orthopedic Knowledge Assessments: The Performance of Specialized Generative Language Model Optimization. Curr Med Sci 2024; 44:1001-1005. [PMID: 39368054 DOI: 10.1007/s11596-024-2929-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2024] [Accepted: 08/18/2024] [Indexed: 10/07/2024]
Abstract
OBJECTIVE This study aimed to evaluate and compare the effectiveness of knowledge base-optimized and unoptimized large language models (LLMs) in the field of orthopedics to explore optimization strategies for the application of LLMs in specific fields. METHODS This research constructed a specialized knowledge base using clinical guidelines from the American Academy of Orthopaedic Surgeons (AAOS) and authoritative orthopedic publications. A total of 30 orthopedic-related questions covering aspects such as anatomical knowledge, disease diagnosis, fracture classification, treatment options, and surgical techniques were input into both the knowledge base-optimized and unoptimized versions of the GPT-4, ChatGLM, and Spark LLM, with their generated responses recorded. The overall quality, accuracy, and comprehensiveness of these responses were evaluated by 3 experienced orthopedic surgeons. RESULTS Compared with their unoptimized LLMs, the optimized version of GPT-4 showed improvements of 15.3% in overall quality, 12.5% in accuracy, and 12.8% in comprehensiveness; ChatGLM showed improvements of 24.8%, 16.1%, and 19.6%, respectively; and Spark LLM showed improvements of 6.5%, 14.5%, and 24.7%, respectively. CONCLUSION The optimization of knowledge bases significantly enhances the quality, accuracy, and comprehensiveness of the responses provided by the 3 models in the orthopedic field. Therefore, knowledge base optimization is an effective method for improving the performance of LLMs in specific fields.
Collapse
Affiliation(s)
- Hong Zhou
- Department of Orthopedics Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China
- Laboratory of Intelligent Medicine, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China
| | - Hong-Lin Wang
- Department of Orthopedics Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China
- Laboratory of Intelligent Medicine, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China
| | - Yu-Yu Duan
- Laboratory of Intelligent Medicine, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China
- College of Chinese Medicine, Hubei University of Chinese Medicine, Wuhan, 433065, China
| | - Zi-Neng Yan
- Department of Orthopedics Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China
- Laboratory of Intelligent Medicine, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China
| | - Rui Luo
- Department of Orthopedics Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China
- Laboratory of Intelligent Medicine, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China
| | - Xiang-Xin Lv
- Department of Orthopedics Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China
- Laboratory of Intelligent Medicine, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China
| | - Yi Xie
- Department of Orthopedics Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China
- Laboratory of Intelligent Medicine, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China
| | - Jia-Yao Zhang
- Department of Orthopedics Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China
- Laboratory of Intelligent Medicine, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China
| | - Jia-Ming Yang
- Department of Orthopedics Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China
- Laboratory of Intelligent Medicine, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China
| | - Ming-di Xue
- Department of Orthopedics Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China
- Laboratory of Intelligent Medicine, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China
| | - Ying Fang
- Department of Orthopedics Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China
- Laboratory of Intelligent Medicine, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China
| | - Lin Lu
- Laboratory of Intelligent Medicine, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China.
- Department of Orthopedics, Renmin Hospital of Wuhan University, Wuhan, 433060, China.
| | - Peng-Ran Liu
- Department of Orthopedics Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China.
- Laboratory of Intelligent Medicine, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China.
| | - Zhe-Wei Ye
- Department of Orthopedics Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China.
- Laboratory of Intelligent Medicine, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China.
| |
Collapse
|
12
|
Ille AM, Markosian C, Burley SK, Mathews MB, Pasqualini R, Arap W. Generative artificial intelligence performs rudimentary structural biology modeling. Sci Rep 2024; 14:19372. [PMID: 39169047 PMCID: PMC11339285 DOI: 10.1038/s41598-024-69021-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2024] [Accepted: 07/30/2024] [Indexed: 08/23/2024] Open
Abstract
Natural language-based generative artificial intelligence (AI) has become increasingly prevalent in scientific research. Intriguingly, capabilities of generative pre-trained transformer (GPT) language models beyond the scope of natural language tasks have recently been identified. Here we explored how GPT-4 might be able to perform rudimentary structural biology modeling. We prompted GPT-4 to model 3D structures for the 20 standard amino acids and an α-helical polypeptide chain, with the latter incorporating Wolfram mathematical computation. We also used GPT-4 to perform structural interaction analysis between the anti-viral nirmatrelvir and its target, the SARS-CoV-2 main protease. Geometric parameters of the generated structures typically approximated close to experimental references. However, modeling was sporadically error-prone and molecular complexity was not well tolerated. Interaction analysis further revealed the ability of GPT-4 to identify specific amino acid residues involved in ligand binding along with corresponding bond distances. Despite current limitations, we show the current capacity of natural language generative AI to perform basic structural biology modeling and interaction analysis with atomic-scale accuracy.
Collapse
Affiliation(s)
- Alexander M Ille
- School of Graduate Studies, Rutgers, The State University of New Jersey, Newark, NJ, USA
- Rutgers Cancer Institute, Newark, NJ, USA
- Division of Cancer Biology, Department of Radiation Oncology, Rutgers New Jersey Medical School, Newark, NJ, USA
| | - Christopher Markosian
- School of Graduate Studies, Rutgers, The State University of New Jersey, Newark, NJ, USA
- Rutgers Cancer Institute, Newark, NJ, USA
- Division of Cancer Biology, Department of Radiation Oncology, Rutgers New Jersey Medical School, Newark, NJ, USA
| | - Stephen K Burley
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Rutgers Cancer Institute, New Brunswick, NJ, USA
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, San Diego Supercomputer Center, University of California-San Diego, La Jolla, San Diego, CA, USA
| | - Michael B Mathews
- School of Graduate Studies, Rutgers, The State University of New Jersey, Newark, NJ, USA
- Division of Infectious Disease, Department of Medicine, Rutgers New Jersey Medical School, Newark, NJ, USA
| | - Renata Pasqualini
- Rutgers Cancer Institute, Newark, NJ, USA.
- Division of Cancer Biology, Department of Radiation Oncology, Rutgers New Jersey Medical School, Newark, NJ, USA.
| | - Wadih Arap
- Rutgers Cancer Institute, Newark, NJ, USA.
- Division of Hematology/Oncology, Department of Medicine, Rutgers New Jersey Medical School, Newark, NJ, USA.
| |
Collapse
|
13
|
Saha R, Chauhan A, Rastogi Verma S. Machine learning: an advancement in biochemical engineering. Biotechnol Lett 2024; 46:497-519. [PMID: 38902585 DOI: 10.1007/s10529-024-03499-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Revised: 02/24/2024] [Accepted: 05/18/2024] [Indexed: 06/22/2024]
Abstract
One of the most remarkable techniques recently introduced into the field of bioprocess engineering is machine learning. Bioprocess engineering has drawn much attention due to its vast application in different domains like biopharmaceuticals, fossil fuel alternatives, environmental remediation, and food and beverage industry, etc. However, due to their unpredictable mechanisms, they are very often challenging to optimize. Furthermore, biological systems are extremely complicated; hence, machine learning algorithms could potentially be utilized to improve and build new biotechnological processes. Gaining insight into the fundamental mathematical understanding of commonly used machine learning algorithms, including Support Vector Machine, Principal Component Analysis, Partial Least Squares and Reinforcement Learning, the present study aims to discuss various case studies related to the application of machine learning in bioprocess engineering. Recent advancements as well as challenges posed in this area along with their potential solutions are also presented.
Collapse
Affiliation(s)
- Ritika Saha
- Department of Biotechnology, Delhi Technological University, New Delhi, 110042, India
| | - Ashutosh Chauhan
- Department of Biotechnology, Delhi Technological University, New Delhi, 110042, India
| | - Smita Rastogi Verma
- Department of Biotechnology, Delhi Technological University, New Delhi, 110042, India.
| |
Collapse
|
14
|
Zhang Q, Hu Y, Yan J, Zhang H, Xie X, Zhu J, Li H, Niu X, Li L, Sun Y, Hu W. Large-Language-Model-Based AI Agent for Organic Semiconductor Device Research. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2024; 36:e2405163. [PMID: 38816034 DOI: 10.1002/adma.202405163] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/10/2024] [Revised: 05/23/2024] [Indexed: 06/01/2024]
Abstract
Large language models (LLMs) have attracted widespread attention recently, however, their application in specialized scientific fields still requires deep adaptation. Here, an artificial intelligence (AI) agent for organic field-effect transistors (OFETs) is designed by integrating the generative pre-trained transformer 4 (GPT-4) model with well-trained machine learning (ML) algorithms. It can efficiently extract the experimental parameters of OFETs from scientific literature and reshape them into a structured database, achieving precision and recall rates both exceeding 92%. Combined with well-trained ML models, this AI agent can further provide targeted guidance and suggestions for device design. With prompt engineering and human-in-loop strategies, the agent extracts sufficient information of 709 OFETs from 277 research articles across different publishers and gathers them into a standardized database containing more than 10 000 device parameters. Using this database, a ML model based on Extreme Gradient Boosting is trained for device performance judgment. Combined with the interpretation of the high-precision model, the agent has provided a feasible optimization scheme that has tripled the charge transport properties of 2,6-diphenyldithieno[3,2-b:2',3'-d]thiophene OFETs. This work is an effective practice of LLMs in the field of organic optoelectronic devices and expands the research paradigm of organic optoelectronic materials and devices.
Collapse
Affiliation(s)
- Qian Zhang
- Key Laboratory of Organic Integrated Circuits, Ministry of Education and Tianjin Key Laboratory of Molecular Optoelectronic Sciences, Department of Chemistry, School of Science, Tianjin University, Tianjin, 300072, China
- Haihe Lab of ITAI, Tianjin, 300051, China
| | - Yongxu Hu
- Key Laboratory of Organic Integrated Circuits, Ministry of Education and Tianjin Key Laboratory of Molecular Optoelectronic Sciences, Department of Chemistry, School of Science, Tianjin University, Tianjin, 300072, China
| | - Jiaxin Yan
- Key Laboratory of Organic Integrated Circuits, Ministry of Education and Tianjin Key Laboratory of Molecular Optoelectronic Sciences, Department of Chemistry, School of Science, Tianjin University, Tianjin, 300072, China
- Haihe Lab of ITAI, Tianjin, 300051, China
| | - Hengyue Zhang
- Key Laboratory of Organic Integrated Circuits, Ministry of Education and Tianjin Key Laboratory of Molecular Optoelectronic Sciences, Department of Chemistry, School of Science, Tianjin University, Tianjin, 300072, China
- Haihe Lab of ITAI, Tianjin, 300051, China
| | - Xinyi Xie
- Key Laboratory of Organic Integrated Circuits, Ministry of Education and Tianjin Key Laboratory of Molecular Optoelectronic Sciences, Department of Chemistry, School of Science, Tianjin University, Tianjin, 300072, China
| | - Jie Zhu
- Key Laboratory of Organic Integrated Circuits, Ministry of Education and Tianjin Key Laboratory of Molecular Optoelectronic Sciences, Department of Chemistry, School of Science, Tianjin University, Tianjin, 300072, China
- Haihe Lab of ITAI, Tianjin, 300051, China
| | - Huchao Li
- Key Laboratory of Organic Integrated Circuits, Ministry of Education and Tianjin Key Laboratory of Molecular Optoelectronic Sciences, Department of Chemistry, School of Science, Tianjin University, Tianjin, 300072, China
| | - Xinxin Niu
- Key Laboratory of Organic Integrated Circuits, Ministry of Education and Tianjin Key Laboratory of Molecular Optoelectronic Sciences, Department of Chemistry, School of Science, Tianjin University, Tianjin, 300072, China
- Haihe Lab of ITAI, Tianjin, 300051, China
| | - Liqiang Li
- Key Laboratory of Organic Integrated Circuits, Ministry of Education and Tianjin Key Laboratory of Molecular Optoelectronic Sciences, Department of Chemistry, School of Science, Tianjin University, Tianjin, 300072, China
| | - Yajing Sun
- Key Laboratory of Organic Integrated Circuits, Ministry of Education and Tianjin Key Laboratory of Molecular Optoelectronic Sciences, Department of Chemistry, School of Science, Tianjin University, Tianjin, 300072, China
- Haihe Lab of ITAI, Tianjin, 300051, China
| | - Wenping Hu
- Key Laboratory of Organic Integrated Circuits, Ministry of Education and Tianjin Key Laboratory of Molecular Optoelectronic Sciences, Department of Chemistry, School of Science, Tianjin University, Tianjin, 300072, China
- Joint School of National University of Singapore and Tianjin University, Fuzhou, Fujian, 350207, China
| |
Collapse
|
15
|
Zhang ZX, Xu YS, Li ZJ, Xu LW, Ma W, Li YF, Guo DS, Sun XM, Huang H. Turning waste into treasure: A new direction for low-cost production of lipid chemicals from Thraustochytrids. Biotechnol Adv 2024; 73:108354. [PMID: 38588906 DOI: 10.1016/j.biotechadv.2024.108354] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2024] [Revised: 03/29/2024] [Accepted: 04/03/2024] [Indexed: 04/10/2024]
Abstract
Thraustochytrids are marine microorganisms known for their fast growth and ability to store lipids, making them useful for producing polyunsaturated fatty acids (PUFAs), biodiesel, squalene, and carotenoids. However, the high cost of production, mainly due to expensive fermentation components, limits their wider use. A significant challenge in this context is the need to balance production costs with the value of the end products. This review focuses on integrating the efficient utilization of waste with Thraustochytrids fermentation, including the economic substitution of carbon sources, nitrogen sources, and fermentation water. This approach aligns with the 3Rs principles (reduction, recycling, and reuse). Furthermore, it emphasizes the role of Thraustochytrids in converting waste into lipid chemicals and promoting sustainable circular production models. The aim of this review is to emphasize the value of Thraustochytrids in converting waste into treasure, providing precise cost reduction strategies for future commercial production.
Collapse
Affiliation(s)
- Zi-Xu Zhang
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, 2 Xuelin Road, Qixia District, Nanjing, People's Republic of China
| | - Ying-Shuang Xu
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, 2 Xuelin Road, Qixia District, Nanjing, People's Republic of China
| | - Zi-Jia Li
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, 2 Xuelin Road, Qixia District, Nanjing, People's Republic of China
| | - Lu-Wei Xu
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, 2 Xuelin Road, Qixia District, Nanjing, People's Republic of China
| | - Wang Ma
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, 2 Xuelin Road, Qixia District, Nanjing, People's Republic of China
| | - Ying-Feng Li
- Zhihe Biotechnology (Changzhou) Co. Ltd, 1 Hanshan Road, Xuejia Town, Xinbei District, Changzhou, People's Republic of China
| | - Dong-Sheng Guo
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, 2 Xuelin Road, Qixia District, Nanjing, People's Republic of China; Zhihe Biotechnology (Changzhou) Co. Ltd, 1 Hanshan Road, Xuejia Town, Xinbei District, Changzhou, People's Republic of China
| | - Xiao-Man Sun
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, 2 Xuelin Road, Qixia District, Nanjing, People's Republic of China.
| | - He Huang
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, 2 Xuelin Road, Qixia District, Nanjing, People's Republic of China
| |
Collapse
|
16
|
Batalis S, Schuerger C, Gronvall GK, Walsh ME. Safeguarding Mail-Order DNA Synthesis in the Age of Artificial Intelligence. APPLIED BIOSAFETY 2024; 29:79-84. [PMID: 39131179 PMCID: PMC11313546 DOI: 10.1089/apb.2023.0020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/13/2024]
Abstract
Introduction Artificial intelligence (AI) tools continue to be developed and used within the life sciences. The impact of these tools on the biosecurity landscape surrounding mail-order DNA synthesis and how to address the impacts have not been critically examined in the literature. Methods The impacts of AI-driven chatbots and biological design tools on the biosecurity landscape surrounding mail-order DNA synthesis were analyzed and described. The findings are informed by the authors' experience in the field. Results Generally, chatbots lower barriers to access of information that could be misused while biological design tools may provide new abilities to users with the intent of misuse. Six recommendations to the United States Government that attempt to maximize the benefits of these new technologies while mitigating risks are provided. Conclusion Mandating mail-order DNA synthesis providers to screen DNA synthesis orders is a critical safeguarding step that should be taken as soon as possible. Over time, biological design tools will reduce the effectiveness of such a regulation and actions should be taken now to limit the negative impacts in the future.
Collapse
Affiliation(s)
- Stephanie Batalis
- Center for Security and Emerging Technology, Georgetown University, Washington, DC, USA
| | - Caroline Schuerger
- Center for Security and Emerging Technology, Georgetown University, Washington, DC, USA
| | - Gigi Kwik Gronvall
- Johns Hopkins Department of Environmental Health and Engineering, Johns Hopkins Center for Health Security, Baltimore, Maryland, USA
| | - Matthew E. Walsh
- Johns Hopkins Department of Environmental Health and Engineering, Johns Hopkins Center for Health Security, Baltimore, Maryland, USA
| |
Collapse
|
17
|
Undheim TA. The whack-a-mole governance challenge for AI-enabled synthetic biology: literature review and emerging frameworks. Front Bioeng Biotechnol 2024; 12:1359768. [PMID: 38481570 PMCID: PMC10933118 DOI: 10.3389/fbioe.2024.1359768] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Accepted: 02/05/2024] [Indexed: 02/08/2025] Open
Abstract
AI-enabled synthetic biology has tremendous potential but also significantly increases biorisks and brings about a new set of dual use concerns. The picture is complicated given the vast innovations envisioned to emerge by combining emerging technologies, as AI-enabled synthetic biology potentially scales up bioengineering into industrial biomanufacturing. However, the literature review indicates that goals such as maintaining a reasonable scope for innovation, or more ambitiously to foster a huge bioeconomy do not necessarily contrast with biosafety, but need to go hand in hand. This paper presents a literature review of the issues and describes emerging frameworks for policy and practice that transverse the options of command-and-control, stewardship, bottom-up, and laissez-faire governance. How to achieve early warning systems that enable prevention and mitigation of future AI-enabled biohazards from the lab, from deliberate misuse, or from the public realm, will constantly need to evolve, and adaptive, interactive approaches should emerge. Although biorisk is subject to an established governance regime, and scientists generally adhere to biosafety protocols, even experimental, but legitimate use by scientists could lead to unexpected developments. Recent advances in chatbots enabled by generative AI have revived fears that advanced biological insight can more easily get into the hands of malignant individuals or organizations. Given these sets of issues, society needs to rethink how AI-enabled synthetic biology should be governed. The suggested way to visualize the challenge at hand is whack-a-mole governance, although the emerging solutions are perhaps not so different either.
Collapse
Affiliation(s)
- Trond Arne Undheim
- Stanford University, Stanford, CA, United States
- Center for International Security and Cooperation, Freeman Spogli Institute for International Studies, Stanford University, Stanford, CA, United States
| |
Collapse
|
18
|
Moon TS. EBRC: Enhancing bioeconomy through research and communication. N Biotechnol 2023; 78:150-152. [PMID: 37918664 DOI: 10.1016/j.nbt.2023.10.012] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2023] [Revised: 10/29/2023] [Accepted: 10/30/2023] [Indexed: 11/04/2023]
Abstract
On September 12, 2022, President Biden issued Executive Order 14081 to enable the progress of biomanufacturing and biotechnology. This timely initiative will help overcome many challenging issues, and its potential impacts will be huge. This article discusses eight recommendations to make this US national initiative successful, encourage other nations to consider similar initiatives, and create a better world for the next generations.
Collapse
Affiliation(s)
- Tae Seok Moon
- Moonshot Bio, Inc., 73 Turnpike Street, North Andover, MA 01845, USA.
| |
Collapse
|