1
|
Sun H, Tian H, Hu Y, Cui Y, Chen X, Xu M, Wang X, Zhou T. Bio-Plausible Multimodal Learning with Emerging Neuromorphic Devices. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2024; 11:e2406242. [PMID: 39258724 PMCID: PMC11615814 DOI: 10.1002/advs.202406242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/06/2024] [Revised: 08/02/2024] [Indexed: 09/12/2024]
Abstract
Multimodal machine learning, as a prospective advancement in artificial intelligence, endeavors to emulate the brain's multimodal learning abilities with the objective to enhance interactions with humans. However, this approach requires simultaneous processing of diverse types of data, leading to increased model complexity, longer training times, and higher energy consumption. Multimodal neuromorphic devices have the capability to preprocess spatio-temporal information from various physical signals into unified electrical signals with high information density, thereby enabling more biologically plausible multimodal learning with low complexity and high energy-efficiency. Here, this work conducts a comparison between the expression of multimodal machine learning and multimodal neuromorphic computing, followed by an overview of the key characteristics associated with multimodal neuromorphic devices. The bio-plausible operational principles and the multimodal learning abilities of emerging devices are examined, which are classified into heterogeneous and homogeneous multimodal neuromorphic devices. Subsequently, this work provides a detailed description of the multimodal learning capabilities demonstrated by neuromorphic circuits and their respective applications. Finally, this work highlights the limitations and challenges of multimodal neuromorphic computing in order to hopefully provide insight into potential future research directions.
Collapse
Affiliation(s)
- Haonan Sun
- School of Automation EngineeringUniversity of Electronic Science and Technology of ChinaChengdu611731China
- State Key Laboratory of Electronic Thin Film and Integrated DevicesUniversity of Electronic Science and Technology of ChinaChengdu611731China
| | - Haoxiang Tian
- State Key Laboratory of Electronic Thin Film and Integrated DevicesUniversity of Electronic Science and Technology of ChinaChengdu611731China
| | - Yihao Hu
- School of Automation EngineeringUniversity of Electronic Science and Technology of ChinaChengdu611731China
- State Key Laboratory of Electronic Thin Film and Integrated DevicesUniversity of Electronic Science and Technology of ChinaChengdu611731China
| | - Yi Cui
- State Key Laboratory of Electronic Thin Film and Integrated DevicesUniversity of Electronic Science and Technology of ChinaChengdu611731China
| | - Xinrui Chen
- State Key Laboratory of Electronic Thin Film and Integrated DevicesUniversity of Electronic Science and Technology of ChinaChengdu611731China
| | - Minyi Xu
- State Key Laboratory of Electronic Thin Film and Integrated DevicesUniversity of Electronic Science and Technology of ChinaChengdu611731China
| | - Xianfu Wang
- State Key Laboratory of Electronic Thin Film and Integrated DevicesUniversity of Electronic Science and Technology of ChinaChengdu611731China
| | - Tao Zhou
- School of Automation EngineeringUniversity of Electronic Science and Technology of ChinaChengdu611731China
- State Key Laboratory of Electronic Thin Film and Integrated DevicesUniversity of Electronic Science and Technology of ChinaChengdu611731China
| |
Collapse
|
2
|
Gao J, Wang D. Quantifying the use and potential benefits of artificial intelligence in scientific research. Nat Hum Behav 2024; 8:2281-2292. [PMID: 39394445 DOI: 10.1038/s41562-024-02020-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Accepted: 09/12/2024] [Indexed: 10/13/2024]
Abstract
The rapid advancement of artificial intelligence (AI) is poised to reshape almost every line of work. Despite enormous efforts devoted to understanding AI's economic impacts, we lack a systematic understanding of the benefits to scientific research associated with the use of AI. Here we develop a measurement framework to estimate the direct use of AI and associated benefits in science. We find that the use and benefits of AI appear widespread throughout the sciences, growing especially rapidly since 2015. However, there is a substantial gap between AI education and its application in research, highlighting a misalignment between AI expertise supply and demand. Our analysis also reveals demographic disparities, with disciplines with higher proportions of women or Black scientists reaping fewer benefits from AI, potentially exacerbating existing inequalities in science. These findings have implications for the equity and sustainability of the research enterprise, especially as the integration of AI with science continues to deepen.
Collapse
Affiliation(s)
- Jian Gao
- Center for Science of Science and Innovation, Northwestern University, Evanston, IL, USA
- Kellogg School of Management, Northwestern University, Evanston, IL, USA
- Ryan Institute on Complexity, Northwestern University, Evanston, IL, USA
- Faculty of Social Sciences, The University of Hong Kong, Hong Kong SAR, China
| | - Dashun Wang
- Center for Science of Science and Innovation, Northwestern University, Evanston, IL, USA.
- Kellogg School of Management, Northwestern University, Evanston, IL, USA.
- Ryan Institute on Complexity, Northwestern University, Evanston, IL, USA.
- McCormick School of Engineering, Northwestern University, Evanston, IL, USA.
| |
Collapse
|
3
|
Krauss A. Science of science: A multidisciplinary field studying science. Heliyon 2024; 10:e36066. [PMID: 39296115 PMCID: PMC11408022 DOI: 10.1016/j.heliyon.2024.e36066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2023] [Revised: 07/24/2024] [Accepted: 08/08/2024] [Indexed: 09/21/2024] Open
Abstract
Science and knowledge are studied by researchers across many disciplines, examining how they are developed, what their current boundaries are and how we can advance them. By integrating evidence across disparate disciplines, the holistic field of science of science can address these foundational questions. This field illustrates how science is shaped by many interconnected factors: the cognitive processes of scientists, the historical evolution of science, economic incentives, institutional influences, computational approaches, statistical, mathematical and instrumental foundations of scientific inference, scientometric measures, philosophical and ethical dimensions of scientific concepts, among other influences. Achieving a comprehensive overview of a multifaceted field like the science of science requires pulling together evidence from the many sub-fields studying science across the natural and social sciences and humanities. This enables developing an interdisciplinary perspective of scientific practice, a more holistic understanding of scientific processes and outcomes, and more nuanced perspectives to how scientific research is conducted, influenced and evolves. It enables leveraging the strengths of various disciplines to create a holistic view of the foundations of science. Different researchers study science from their own disciplinary perspective and use their own methods, and there is a large divide between quantitative and qualitative researchers as they commonly do not read or cite research using other methodological approaches. A broader, synthesizing paper employing a qualitative approach can however help provide a bridge between disciplines by pulling together aspects of science (economic, scientometric, psychological, philosophical etc.). Such an approach enables identifying, across the range of fields, the powerful role of our scientific methods and instruments in shaping most aspects of our knowledge and science, whereas economic, social and historical influences help shape what knowledge we pursue. A unifying theory is then outlined for science of science - the new-methods-drive-science theory.
Collapse
Affiliation(s)
- Alexander Krauss
- London School of Economics, London, UK
- Institute for Economic Analysis, Spanish National Research Council, Barcelona, Spain
| |
Collapse
|
4
|
Zhu K, Shen Z, Wang M, Jiang L, Zhang Y, Yang T, Zhang H, Zhang M. Visual Knowledge Domain of Artificial Intelligence in Computed Tomography: A Review Based on Bibliometric Analysis. J Comput Assist Tomogr 2024; 48:652-662. [PMID: 38271538 DOI: 10.1097/rct.0000000000001585] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2024]
Abstract
ABSTRACT Artificial intelligence (AI)-assisted medical imaging technology is a new research area of great interest that has developed rapidly over the last decade. However, there has been no bibliometric analysis of published studies in this field. The present review focuses on AI-related studies on computed tomography imaging in the Web of Science database and uses CiteSpace and VOSviewer to generate a knowledge map and conduct the basic information analysis, co-word analysis, and co-citation analysis. A total of 7265 documents were included and the number of documents published had an overall upward trend. Scholars from the United States and China have made outstanding achievements, and there is a general lack of extensive cooperation in this field. In recent years, the research areas of great interest and difficulty have been the optimization and upgrading of algorithms, and the application of theoretical models to practical clinical applications. This review will help researchers understand the developments, research areas of great interest, and research frontiers in this field and provide reference and guidance for future studies.
Collapse
|
5
|
Stahl RG. Will Artificial Intelligence be Useful (or Misused) in Environmental Toxicology and Chemistry? ENVIRONMENTAL TOXICOLOGY AND CHEMISTRY 2023; 42:745-746. [PMID: 36718790 DOI: 10.1002/etc.5575] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/27/2023] [Accepted: 01/29/2023] [Indexed: 06/18/2023]
|
6
|
Porciello J, Lipper L, Ivanina M. Using machine learning to evaluate 1.2 million studies on small-scale farming and post-production food systems in low- and middle-income countries. FRONTIERS IN SUSTAINABLE FOOD SYSTEMS 2023. [DOI: 10.3389/fsufs.2022.1013701] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/11/2023] Open
Abstract
Recent developments have emphasized the need for agrifood systems to move beyond a production-oriented approach to recognize agriculture as part of a broader agrifood system that prioritizes livelihoods, social equity, diets, and climate and environmental outcomes. At the same time, the knowledge base for agriculture is growing exponentially. Using artificial intelligence and machine learning approaches, we reviewed more than 1.2 million publications from the past 20 years to assess the current landscape of agricultural research taking place in low- and middle-income countries. The result is a clearer picture of what research has been conducted on small-scale farming and post-production systems from 2000 to the present, and where persistent evidence gaps exist. We found that the greatest focus of the literature is on economic outcomes, such as productivity, yield, and incomes. There is also some emphasis on identifying and measuring environmental outcomes. However, noticeable data gaps exist for agricultural research focused on nutrition and diet, and gender and inclusivity.
Collapse
|
7
|
Xing Z, Jiang Y, Zogona D, Wu T, Xu X. Fully nondestructive analysis of capsaicinoids electrochemistry data with deep neural network enables portable system. Food Chem 2023; 417:135882. [PMID: 36934708 DOI: 10.1016/j.foodchem.2023.135882] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2022] [Revised: 03/01/2023] [Accepted: 03/04/2023] [Indexed: 03/09/2023]
Abstract
Electrochemical methods have been extensively applied for the detection of chemical information from food or other analytes. However, existing electrochemical methods are limited to focusing solely on the absorption peaks and disregard much of the hidden chemical fingerprint information. Consequently, electrochemical sensors are constrained by their ability to detect samples containing multiple source-material mixtures with overlapping constituents. We hypothesized that the target substances can be effectively identified and detected using differential sensor data combined with artificial intelligence (AI). In this study, we developed a novel signal array composed of five metal electrodes and used a convolutional neural network (CNN) model for feature extraction to detect capsaicinoids in stews. Our results indicate that the proposed method achieved satisfactory predictions with a root mean square error (RMSE) of 5.407 in independent brine samples. This provides a promising strategy and practical approach for the nondestructive analysis of multidimensional electrochemical data of mixed analytes.
Collapse
Affiliation(s)
- Zheng Xing
- Key Laboratory of Environment Correlative Dietology (Ministry of Education), Huazhong Agricultural University, Wuhan, Hubei 430072, China; Hubei Key Laboratory of Fruit & Vegetable Processing & Quality Control, Huazhong Agricultural University, Wuhan, Hubei 430072, China
| | - Ying Jiang
- Key Laboratory of Environment Correlative Dietology (Ministry of Education), Huazhong Agricultural University, Wuhan, Hubei 430072, China; Hubei Key Laboratory of Fruit & Vegetable Processing & Quality Control, Huazhong Agricultural University, Wuhan, Hubei 430072, China
| | - Daniel Zogona
- Key Laboratory of Environment Correlative Dietology (Ministry of Education), Huazhong Agricultural University, Wuhan, Hubei 430072, China; Hubei Key Laboratory of Fruit & Vegetable Processing & Quality Control, Huazhong Agricultural University, Wuhan, Hubei 430072, China
| | - Ting Wu
- Key Laboratory of Environment Correlative Dietology (Ministry of Education), Huazhong Agricultural University, Wuhan, Hubei 430072, China; Hubei Key Laboratory of Fruit & Vegetable Processing & Quality Control, Huazhong Agricultural University, Wuhan, Hubei 430072, China
| | - Xiaoyun Xu
- Key Laboratory of Environment Correlative Dietology (Ministry of Education), Huazhong Agricultural University, Wuhan, Hubei 430072, China; Hubei Key Laboratory of Fruit & Vegetable Processing & Quality Control, Huazhong Agricultural University, Wuhan, Hubei 430072, China.
| |
Collapse
|
8
|
Matassi G, Martinez P. The brain-computer analogy—“A special issue”. Front Ecol Evol 2023. [DOI: 10.3389/fevo.2022.1099253] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open
Abstract
In this review essay, we give a detailed synopsis of the twelve contributions which are collected in a Special Issue in Frontiers Ecology and Evolution, based on the research topic “Current Thoughts on the Brain-Computer Analogy—All Metaphors Are Wrong, But Some Are Useful.” The synopsis is complemented by a graphical summary, a matrix which links articles to selected concepts. As first identified by Turing, all authors in this Special Issue recognize semantics as a crucial concern in the brain-computer analogy debate, and consequently address a number of such issues. What is missing, we believe, is the distinction between metaphor and analogy, which we reevaluate, describe in some detail, and offer a definition for the latter. To enrich the debate, we also deem necessary to develop on the evolutionary theories of the brain, of which we provide an overview. This article closes with thoughts on creativity in Science, for we concur with the stance that metaphors and analogies, and their esthetic impact, are essential to the creative process, be it in Sciences as well as in Arts.
Collapse
|
9
|
Hagos DH, Rawat DB. Recent Advances in Artificial Intelligence and Tactical Autonomy: Current Status, Challenges, and Perspectives. SENSORS (BASEL, SWITZERLAND) 2022; 22:9916. [PMID: 36560285 PMCID: PMC9782095 DOI: 10.3390/s22249916] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/11/2022] [Revised: 12/13/2022] [Accepted: 12/14/2022] [Indexed: 06/17/2023]
Abstract
This paper presents the findings of detailed and comprehensive technical literature aimed at identifying the current and future research challenges of tactical autonomy. It discusses in great detail the current state-of-the-art powerful artificial intelligence (AI), machine learning (ML), and robot technologies, and their potential for developing safe and robust autonomous systems in the context of future military and defense applications. Additionally, we discuss some of the technical and operational critical challenges that arise when attempting to practically build fully autonomous systems for advanced military and defense applications. Our paper provides the state-of-the-art advanced AI methods available for tactical autonomy. To the best of our knowledge, this is the first work that addresses the important current trends, strategies, critical challenges, tactical complexities, and future research directions of tactical autonomy. We believe this work will greatly interest researchers and scientists from academia and the industry working in the field of robotics and the autonomous systems community. We hope this work encourages researchers across multiple disciplines of AI to explore the broader tactical autonomy domain. We also hope that our work serves as an essential step toward designing advanced AI and ML models with practical implications for real-world military and defense settings.
Collapse
|
10
|
Xu Y, Zhang X, Li H, Zheng H, Zhang J, Olsen MS, Varshney RK, Prasanna BM, Qian Q. Smart breeding driven by big data, artificial intelligence, and integrated genomic-enviromic prediction. MOLECULAR PLANT 2022; 15:1664-1695. [PMID: 36081348 DOI: 10.1016/j.molp.2022.09.001] [Citation(s) in RCA: 72] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Revised: 08/20/2022] [Accepted: 09/02/2022] [Indexed: 05/12/2023]
Abstract
The first paradigm of plant breeding involves direct selection-based phenotypic observation, followed by predictive breeding using statistical models for quantitative traits constructed based on genetic experimental design and, more recently, by incorporation of molecular marker genotypes. However, plant performance or phenotype (P) is determined by the combined effects of genotype (G), envirotype (E), and genotype by environment interaction (GEI). Phenotypes can be predicted more precisely by training a model using data collected from multiple sources, including spatiotemporal omics (genomics, phenomics, and enviromics across time and space). Integration of 3D information profiles (G-P-E), each with multidimensionality, provides predictive breeding with both tremendous opportunities and great challenges. Here, we first review innovative technologies for predictive breeding. We then evaluate multidimensional information profiles that can be integrated with a predictive breeding strategy, particularly envirotypic data, which have largely been neglected in data collection and are nearly untouched in model construction. We propose a smart breeding scheme, integrated genomic-enviromic prediction (iGEP), as an extension of genomic prediction, using integrated multiomics information, big data technology, and artificial intelligence (mainly focused on machine and deep learning). We discuss how to implement iGEP, including spatiotemporal models, environmental indices, factorial and spatiotemporal structure of plant breeding data, and cross-species prediction. A strategy is then proposed for prediction-based crop redesign at both the macro (individual, population, and species) and micro (gene, metabolism, and network) scales. Finally, we provide perspectives on translating smart breeding into genetic gain through integrative breeding platforms and open-source breeding initiatives. We call for coordinated efforts in smart breeding through iGEP, institutional partnerships, and innovative technological support.
Collapse
Affiliation(s)
- Yunbi Xu
- Institute of Crop Sciences, CIMMYT-China, Chinese Academy of Agricultural Sciences, Beijing 100081, China; CIMMYT-China Tropical Maize Research Center, School of Food Science and Engineering, Foshan University, Foshan, Guangdong 528231, China; Peking University Institute of Advanced Agricultural Sciences, Weifang, Shandong 261325, China.
| | - Xingping Zhang
- Peking University Institute of Advanced Agricultural Sciences, Weifang, Shandong 261325, China
| | - Huihui Li
- Institute of Crop Sciences, CIMMYT-China, Chinese Academy of Agricultural Sciences, Beijing 100081, China; National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, Hainan 572024, China
| | - Hongjian Zheng
- CIMMYT-China Specialty Maize Research Center, Shanghai Academy of Agricultural Sciences, Shanghai 201400, China
| | - Jianan Zhang
- MolBreeding Biotechnology Co., Ltd., Shijiazhuang, Hebei 050035, China
| | - Michael S Olsen
- CIMMYT (International Maize and Wheat Improvement Center), ICRAF Campus, United Nations Avenue, Nairobi, Kenya
| | - Rajeev K Varshney
- State Agricultural Biotechnology Centre, Centre for Crop and Food Innovation, Food Futures Institute, Murdoch University, Murdoch, Australia
| | - Boddupalli M Prasanna
- CIMMYT (International Maize and Wheat Improvement Center), ICRAF Campus, United Nations Avenue, Nairobi, Kenya
| | - Qian Qian
- Institute of Crop Sciences, CIMMYT-China, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| |
Collapse
|
11
|
Sheetal A, Chaudhury SH, Savani K. A deep learning model identifies emphasis on hard work as an important predictor of income inequality. Sci Rep 2022; 12:9845. [PMID: 35701456 PMCID: PMC9194778 DOI: 10.1038/s41598-022-13902-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2021] [Accepted: 05/30/2022] [Indexed: 11/29/2022] Open
Abstract
High levels of income inequality can persist in society only if people accept the inequality as justified. To identify psychological predictors of people's tendency to justify inequality, we retrained a pre-existing deep learning model to predict the extent to which World Values Survey respondents believed that income inequality is necessary. A feature importance analysis revealed multiple items associated with the importance of hard work as top predictors. As an emphasis on hard work is a key component of the Protestant Work Ethic, we formulated the hypothesis that the PWE increases acceptance of inequality. A correlational study found that the more people endorsed PWE, the less disturbed they were about factual statistics about wealth equality in the US. Two experiments found that exposing people to PWE items decreased their disturbance with income inequality. The findings indicate that machine learning models can be reused to generate viable hypotheses.
Collapse
Affiliation(s)
- Abhishek Sheetal
- School of Business and Law, Central Queensland University, Rockhampton, Australia
- Faculty of Business, The Hong Kong Polytechnic University, Kowloon, Hong Kong
| | | | - Krishna Savani
- Faculty of Business, The Hong Kong Polytechnic University, Kowloon, Hong Kong.
- Nanyang Business School, Nanyang Technological University, Singapore, Singapore.
| |
Collapse
|
12
|
ML4STEM Professional Development Program: Enriching K-12 STEM Teaching with Machine Learning. INTERNATIONAL JOURNAL OF ARTIFICIAL INTELLIGENCE IN EDUCATION 2022. [DOI: 10.1007/s40593-022-00292-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
13
|
Ament S, Amsler M, Sutherland DR, Chang MC, Guevarra D, Connolly AB, Gregoire JM, Thompson MO, Gomes CP, van Dover RB. Autonomous materials synthesis via hierarchical active learning of nonequilibrium phase diagrams. SCIENCE ADVANCES 2021; 7:eabg4930. [PMID: 34919429 PMCID: PMC8682983 DOI: 10.1126/sciadv.abg4930] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
Autonomous experimentation enabled by artificial intelligence offers a new paradigm for accelerating scientific discovery. Nonequilibrium materials synthesis is emblematic of complex, resource-intensive experimentation whose acceleration would be a watershed for materials discovery. We demonstrate accelerated exploration of metastable materials through hierarchical autonomous experimentation governed by the Scientific Autonomous Reasoning Agent (SARA). SARA integrates robotic materials synthesis using lateral gradient laser spike annealing and optical characterization along with a hierarchy of AI methods to map out processing phase diagrams. Efficient exploration of the multidimensional parameter space is achieved with nested active learning cycles built upon advanced machine learning models that incorporate the underlying physics of the experiments and end-to-end uncertainty quantification. We demonstrate SARA’s performance by autonomously mapping synthesis phase boundaries for the Bi2O3 system, leading to orders-of-magnitude acceleration in the establishment of a synthesis phase diagram that includes conditions for stabilizing δ-Bi2O3 at room temperature, a critical development for electrochemical technologies.
Collapse
Affiliation(s)
- Sebastian Ament
- Department of Computer Science, Cornell University, Ithaca, NY 14853, USA
| | - Maximilian Amsler
- Department of Materials Science and Engineering, Cornell University, Ithaca, NY 14853, USA
- Department of Chemistry and Biochemistry, University of Bern, Freiestrasse 3, CH-3012 Bern, Switzerland
- Corresponding author. (M.A.); (C.P.G.)
| | - Duncan R. Sutherland
- Department of Materials Science and Engineering, Cornell University, Ithaca, NY 14853, USA
| | - Ming-Chiang Chang
- Department of Materials Science and Engineering, Cornell University, Ithaca, NY 14853, USA
| | - Dan Guevarra
- Division of Engineering and Applied Science, California Institute of Technology, Pasadena, CA 91125, USA
| | - Aine B. Connolly
- Department of Materials Science and Engineering, Cornell University, Ithaca, NY 14853, USA
| | - John M. Gregoire
- Division of Engineering and Applied Science, California Institute of Technology, Pasadena, CA 91125, USA
| | - Michael O. Thompson
- Department of Materials Science and Engineering, Cornell University, Ithaca, NY 14853, USA
| | - Carla P. Gomes
- Department of Computer Science, Cornell University, Ithaca, NY 14853, USA
- Corresponding author. (M.A.); (C.P.G.)
| | - R. Bruce van Dover
- Department of Materials Science and Engineering, Cornell University, Ithaca, NY 14853, USA
| |
Collapse
|
14
|
Suyetin M. The application of machine learning for predicting the methane uptake and working capacity of MOFs. Faraday Discuss 2021; 231:224-234. [PMID: 34195741 DOI: 10.1039/d1fd00011j] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Multiple linear regression analysis, as a part of machine learning, is employed to develop equations for the quick and accurate prediction of the methane uptake and working capacity of metal-organic frameworks (MOFs). Only three crystal characteristics of MOFs (geometric descriptors) are employed for developing the equations: surface area, pore volume and density of the crystal structure. The values of the geometric descriptors can be obtained much more cheaply in terms of time and other resources compared to running calculations of gas sorption or performing experimental work. Within this work sets of equations are provided for the different cases studied: a series of MOFs with NbO topology, a set of benchmark MOFs with outstanding methane storage and working capacities, and the whole CoRE MOF database (11 000 structures).
Collapse
Affiliation(s)
- Mikhail Suyetin
- Institute of Nanotechnology, Karlsruhe Institute of Technology, P. O. Box 3640, 76021 Karlsruhe, Germany.
| |
Collapse
|
15
|
Automating crystal-structure phase mapping by combining deep learning with constraint reasoning. NAT MACH INTELL 2021. [DOI: 10.1038/s42256-021-00384-1] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
16
|
An Evaluation of the Yangtze River Economic Belt Manufacturing Industry Level of Intelligentization and Influencing Factors: Evidence from China. SUSTAINABILITY 2021. [DOI: 10.3390/su13168913] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Over recent decades, the application of artificial intelligence methods in manufacturing has led to new spheres of research such as the Internet of Things, Cyber–Physical Systems, and Cloud Computing and Big Data, leading to the so-called Industry 4.0. However, to date, little research has been geared towards assessing the factors that influence intelligent manufacturing on a regional scale. Addressing this problem, this paper constructs an evaluation index system for the Yangtze River Economic Belt (YREB) intelligent manufacturing sector using eleven years (2008–2018) of provincial panel data. The entropy method is applied to three evaluation criteria, namely intelligent innovation, equipment, and profit, to construct an evaluation index system. An analysis of the results revealed that the level intelligentization of the manufacturing industry of the YREB increases yearly, and that intelligent innovations are notably occurring at a faster rate than profits. Disproportional enterprise returns on investment have occurred, which decreases enterprise motivation to be innovative in the first place. Additionally, it was also observed that FDI, financial development, government intervention, and the level of opening-up were the primary factors modulating regional intelligent manufacturing levels.
Collapse
|
17
|
Kitano H. Nobel Turing Challenge: creating the engine for scientific discovery. NPJ Syst Biol Appl 2021; 7:29. [PMID: 34145287 PMCID: PMC8213706 DOI: 10.1038/s41540-021-00189-3] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2021] [Accepted: 06/03/2021] [Indexed: 12/15/2022] Open
Abstract
Scientific discovery has long been one of the central driving forces in our civilization. It uncovered the principles of the world we live in, and enabled us to invent new technologies reshaping our society, cure diseases, explore unknown new frontiers, and hopefully lead us to build a sustainable society. Accelerating the speed of scientific discovery is therefore one of the most important endeavors. This requires an in-depth understanding of not only the subject areas but also the nature of scientific discoveries themselves. In other words, the "science of science" needs to be established, and has to be implemented using artificial intelligence (AI) systems to be practically executable. At the same time, what may be implemented by "AI Scientists" may not resemble the scientific process conducted by human scientist. It may be an alternative form of science that will break the limitation of current scientific practice largely hampered by human cognitive limitation and sociological constraints. It could give rise to a human-AI hybrid form of science that shall bring systems biology and other sciences into the next stage. The Nobel Turing Challenge aims to develop a highly autonomous AI system that can perform top-level science, indistinguishable from the quality of that performed by the best human scientists, where some of the discoveries may be worthy of Nobel Prize level recognition and beyond.
Collapse
Affiliation(s)
- Hiroaki Kitano
- The Systems Biology Institute, Tokyo, Japan; Okinawa Institute of Science and Technology Graduate School, Okinawa, Japan; Sony Computer Science Laboratories, Inc., Tokyo, Japan; Sony AI, Inc., Tokyo, Japan; and The Alan Turing Institute, London, UK.
| |
Collapse
|
18
|
Debowski N, Tavanapour N, Bittner EAC. Einsatz eines virtuellen Kollaborators in analogen & digitalen Workshops im organisationalen Kontext. INFORMATIK SPEKTRUM 2021. [PMCID: PMC8101334 DOI: 10.1007/s00287-021-01361-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
ZusammenfassungIn dieser Studie stellen wir Unterschiede und Gemeinsamkeiten in der analogen sowie digitalen Zusammenarbeit hinsichtlich eines virtuellen Kollaborators (VK) dar. Konkret beobachten wir eine Kreativeinheit in einem Industrieunternehmen sowohl in der analogen als auch in der digitalen Durchführung kollaborativer Workshops. Aus den daraus resultierenden Herausforderungen und Anforderungen, die wir anhand von Interviews erheben, leiten wir Designprinzipien an einen VK ab und ziehen einen Vergleich. Gemeinsamkeiten bestehen darin, den Teilnehmenden zusätzliche Informationen und kreativen Input aus internetbasierten Quellen zu liefern. Unterschiede bestehen in der administrativen Vor- und Nachbereitung der jeweiligen Workshops sowie in der Art der Beeinflussung kollaborativer Arbeit. Während bei der digitalen Durchführung eher die Perspektiverweiterung im Vordergrund steht, ist es bei der analogen Durchführung die Ausbalancierung der Redebeiträge. Spezifika stellen sich darüber hinaus für die digitale Durchführung bei der Vernetzung der Teilnehmenden sowie beim Umgang mit digitalen Werkzeugen.
Collapse
Affiliation(s)
- Nicole Debowski
- Fakultät für Mathematik, Informatik und Naturwissenschaften Informatik, Universität Hamburg, Vogt-Kölln-Straße 30, 22527 Hamburg, Deutschland
| | - Navid Tavanapour
- Fakultät für Mathematik, Informatik und Naturwissenschaften Informatik, Universität Hamburg, Vogt-Kölln-Straße 30, 22527 Hamburg, Deutschland
| | - Eva A. C. Bittner
- Fakultät für Mathematik, Informatik und Naturwissenschaften Informatik, Universität Hamburg, Vogt-Kölln-Straße 30, 22527 Hamburg, Deutschland
| |
Collapse
|
19
|
Nazarova AL, Yang L, Liu K, Mishra A, Kalia RK, Nomura KI, Nakano A, Vashishta P, Rajak P. Dielectric Polymer Property Prediction Using Recurrent Neural Networks with Optimizations. J Chem Inf Model 2021; 61:2175-2186. [PMID: 33871989 DOI: 10.1021/acs.jcim.0c01366] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
Despite the growing success of machine learning for predicting structure-property relationships in molecules and materials, such as predicting the dielectric properties of polymers, it is still in its infancy. We report on the effectiveness of solving structure-property relationships for a computer-generated database of dielectric polymers using recurrent neural network (RNN) models. The implementation of a series of optimization strategies was crucial to achieving high learning speeds and sufficient accuracy: (1) binary and nonbinary representations of SMILES (Simplified Molecular Input Line System) fingerprints and (2) backpropagation with affine transformation of the input sequence (ATransformedBP) and resilient backpropagation with initial weight update parameter optimizations (iRPROP- optimized). For the investigated database of polymers, the binary SMILES representation was found to be superior to the decimal representation with respect to the training and prediction performance. All developed and optimized Elman-type RNN algorithms outperformed nonoptimized RNN models in the efficient prediction of nonlinear structure-activity relationships. The average relative standard deviation (RSD) remained well below 5%, and the maximum RSD did not exceed 30%. Moreover, we provide a C++ codebase as a testbed for a new generation of open programming languages that target increasingly diverse computer architectures.
Collapse
Affiliation(s)
- Antonina L Nazarova
- Department of Chemistry, Loker Hydrocarbon Research Institute, and USC Bridge Institue, University of Southern California, Los Angeles, California 90089, United States
| | - Liqiu Yang
- Collaboratory of Advanced Computing and Simulations, Department of Computer Science, Department of Physics & Astronomy, Department of Chemical Engineering & Materials Science, and Department of Biological Sciences, University of Southern California, Los Angeles, California 90089, United States
| | - Kuang Liu
- Collaboratory of Advanced Computing and Simulations, Department of Computer Science, Department of Physics & Astronomy, Department of Chemical Engineering & Materials Science, and Department of Biological Sciences, University of Southern California, Los Angeles, California 90089, United States
| | - Ankit Mishra
- Collaboratory of Advanced Computing and Simulations, Department of Computer Science, Department of Physics & Astronomy, Department of Chemical Engineering & Materials Science, and Department of Biological Sciences, University of Southern California, Los Angeles, California 90089, United States
| | - Rajiv K Kalia
- Collaboratory of Advanced Computing and Simulations, Department of Computer Science, Department of Physics & Astronomy, Department of Chemical Engineering & Materials Science, and Department of Biological Sciences, University of Southern California, Los Angeles, California 90089, United States
| | - Ken-Ichi Nomura
- Collaboratory of Advanced Computing and Simulations, Department of Computer Science, Department of Physics & Astronomy, Department of Chemical Engineering & Materials Science, and Department of Biological Sciences, University of Southern California, Los Angeles, California 90089, United States
| | - Aiichiro Nakano
- Collaboratory of Advanced Computing and Simulations, Department of Computer Science, Department of Physics & Astronomy, Department of Chemical Engineering & Materials Science, and Department of Biological Sciences, University of Southern California, Los Angeles, California 90089, United States
| | - Priya Vashishta
- Collaboratory of Advanced Computing and Simulations, Department of Computer Science, Department of Physics & Astronomy, Department of Chemical Engineering & Materials Science, and Department of Biological Sciences, University of Southern California, Los Angeles, California 90089, United States
| | - Pankaj Rajak
- Argonne Leadership Computing Facility, Argonne National Laboratory, Lemont, Illinois 60439, United States
| |
Collapse
|
20
|
Cao L, Russo D, Lapkin AA. Automated robotic platforms in design and development of formulations. AIChE J 2021. [DOI: 10.1002/aic.17248] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Affiliation(s)
- Liwei Cao
- Department of Chemical Engineering and Biotechnology University of Cambridge Cambridge UK
- Cambridge Centre for Advanced Research and Education in Singapore, CARES Ltd. Singapore
| | - Danilo Russo
- Department of Chemical Engineering and Biotechnology University of Cambridge Cambridge UK
| | - Alexei A. Lapkin
- Department of Chemical Engineering and Biotechnology University of Cambridge Cambridge UK
- Cambridge Centre for Advanced Research and Education in Singapore, CARES Ltd. Singapore
| |
Collapse
|
21
|
Kanza S, Bird CL, Niranjan M, McNeill W, Frey JG. The AI for Scientific Discovery Network . PATTERNS (NEW YORK, N.Y.) 2021; 2:100162. [PMID: 33511363 PMCID: PMC7815949 DOI: 10.1016/j.patter.2020.100162] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The Artificial Intelligence and Augmented Intelligence for Automated Investigation for Scientific Discovery Network+ (AI3SD) was established in response to the UK Engineering and Physical Sciences Research Council (EPSRC) late-2017 call for a Network+ to promote cutting-edge research in artificial intelligence to accelerate groundbreaking scientific discoveries. This article provides the philosophical, scientific, and technical underpinnings of the Network+, the history of the different domains represented in the Network+, and the specific focus of the Network+. The activities, collaborations, and research covered in the first year of the Network+ have highlighted the significant challenges in the chemistry and augmented and artificial intelligence space. These challenges are shaping the future directions of the Network+. The article concludes with a summary of the lessons learned in running this Network+ and introduces our plans for the future in a landscape redrawn by COVID-19, including rebranding into the AI 4 Scientific Discovery Network (www.ai4science.network).
Collapse
Affiliation(s)
- Samantha Kanza
- School of Chemistry, University of Southampton, Southampton SO17 1BJ, UK
| | - Colin Leonard Bird
- School of Chemistry, University of Southampton, Southampton SO17 1BJ, UK
| | - Mahesan Niranjan
- School of Electronics and Computer Science and University of Southampton, Southampton SO17 1BJ, UK
| | - William McNeill
- School of Humanities, University of Southampton, Southampton SO17 1BJ, UK
| | - Jeremy Graham Frey
- School of Chemistry, University of Southampton, Southampton SO17 1BJ, UK
| |
Collapse
|
22
|
Boetto E, Fantini MP, Gangemi A, Golinelli D, Greco M, Nuzzolese AG, Presutti V, Rallo F. Using altmetrics for detecting impactful research in quasi-zero-day time-windows: the case of COVID-19. Scientometrics 2021; 126:1189-1215. [PMID: 33424050 PMCID: PMC7779112 DOI: 10.1007/s11192-020-03809-7] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2020] [Accepted: 11/17/2020] [Indexed: 01/17/2023]
Abstract
On December 31st 2019, the World Health Organization China Country Office was informed of cases of pneumonia of unknown etiology detected in Wuhan City. The cause of the syndrome was a new type of coronavirus isolated on January 7th 2020 and named Severe Acute Respiratory Syndrome CoronaVirus 2 (SARS-CoV-2). SARS-CoV-2 is the cause of the coronavirus disease 2019 (COVID-19). Since January 2020 an ever increasing number of scientific works related to the new pathogen have appeared in literature. Identifying relevant research outcomes at very early stages is challenging. In this work we use COVID-19 as a use-case for investigating: (1) which tools and frameworks are mostly used for early scholarly communication; (2) to what extent altmetrics can be used to identify potential impactful research in tight (i.e. quasi-zero-day) time-windows. A literature review with rigorous eligibility criteria is performed for gathering a sample composed of scientific papers about SARS-CoV-2/COVID-19 appeared in literature in the tight time-window ranging from January 15th 2020 to February 24th 2020. This sample is used for building a knowledge graph that represents the knowledge about papers and indicators formally. This knowledge graph feeds a data analysis process which is applied for experimenting with altmetrics as impact indicators. We find moderate correlation among traditional citation count, citations on social media, and mentions on news and blogs. Additionally, correlation coefficients are not inflated by indicators associated with zero values, which are quite common at very early stages after an article has been published. This suggests there is a common intended meaning of the citational acts associated with aforementioned indicators. Then, we define a method, i.e. the Comprehensive Impact Score (CIS), that harmonises different indicators for providing a multi-dimensional impact indicator. CIS shows promising results as a tool for selecting relevant papers even in a tight time-window. Our results foster the development of automated frameworks aimed at helping the scientific community in identifying relevant work even in case of limited literature and observation time.
Collapse
Affiliation(s)
- Erik Boetto
- DIBINEM, University of Bologna, Bologna, Italy
| | | | - Aldo Gangemi
- STLab, ISTC-CNR, Rome, Italy.,FICLIT, University of Bologna, Bologna, Italy
| | | | | | | | | | | |
Collapse
|
23
|
Coley CW, Eyke NS, Jensen KF. Autonome Entdeckung in den chemischen Wissenschaften, Teil II: Ausblick. Angew Chem Int Ed Engl 2020. [DOI: 10.1002/ange.201909989] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Affiliation(s)
- Connor W. Coley
- Department of Chemical Engineering Massachusetts Institute of Technology Cambridge MA 02139 USA
| | - Natalie S. Eyke
- Department of Chemical Engineering Massachusetts Institute of Technology Cambridge MA 02139 USA
| | - Klavs F. Jensen
- Department of Chemical Engineering Massachusetts Institute of Technology Cambridge MA 02139 USA
| |
Collapse
|
24
|
Accelerating evidence-informed decision-making for the Sustainable Development Goals using machine learning. NAT MACH INTELL 2020. [DOI: 10.1038/s42256-020-00235-5] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
25
|
Alshehri AS, Gani R, You F. Deep learning and knowledge-based methods for computer-aided molecular design—toward a unified approach: State-of-the-art and future directions. Comput Chem Eng 2020. [DOI: 10.1016/j.compchemeng.2020.107005] [Citation(s) in RCA: 40] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
|
26
|
He G, Dang Y, Zhou L, Dai Y, Que Y, Ji X. Architecture model proposal of innovative intelligent manufacturing in the chemical industry based on multi-scale integration and key technologies. Comput Chem Eng 2020. [DOI: 10.1016/j.compchemeng.2020.106967] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
|
27
|
Collins EM, Raghavachari K. Effective Molecular Descriptors for Chemical Accuracy at DFT Cost: Fragmentation, Error-Cancellation, and Machine Learning. J Chem Theory Comput 2020; 16:4938-4950. [PMID: 32678593 DOI: 10.1021/acs.jctc.0c00236] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
Recent advances in theoretical thermochemistry have allowed the study of small organic and bio-organic molecules with high accuracy. However, applications to larger molecules are still impeded by the steep scaling problem of highly accurate quantum mechanical (QM) methods, forcing the use of approximate, more cost-effective methods at a greatly reduced accuracy. One of the most successful strategies to mitigate this error is the use of systematic error-cancellation schemes, in which highly accurate QM calculations can be performed on small portions of the molecule to construct corrections to an approximate method. Herein, we build on ideas from fragmentation and error-cancellation to introduce a new family of molecular descriptors for machine learning modeled after the Connectivity-Based Hierarchy (CBH) of generalized isodesmic reaction schemes. The best performing descriptor ML(CBH-2) is constructed from fragments preserving only the immediate connectivity of all heavy (non-H) atoms of a molecule along with overlapping regions of fragments in accordance with the inclusion-exclusion principle. Our proposed approach offers a simple, chemically intuitive grouping of atoms, tuned with an optimal amount of error-cancellation, and outperforms previous structure-based descriptors using a much smaller input vector length. For a wide variety of density functionals, DFT+ΔML(CBH-2) models, trained on a set of small- to medium-sized organic HCNOSCl-containing molecules, achieved an out-of-sample MAE within 0.5 kcal/mol and 2σ (95%) confidence interval of <1.5 kcal/mol compared to accurate G4 reference values at DFT cost.
Collapse
Affiliation(s)
- Eric M Collins
- Department of Chemistry, Indiana University, Bloomington, Indiana 47405, United States
| | - Krishnan Raghavachari
- Department of Chemistry, Indiana University, Bloomington, Indiana 47405, United States
| |
Collapse
|
28
|
Coley CW, Eyke NS, Jensen KF. Autonomous Discovery in the Chemical Sciences Part II: Outlook. Angew Chem Int Ed Engl 2020; 59:23414-23436. [PMID: 31553509 DOI: 10.1002/anie.201909989] [Citation(s) in RCA: 108] [Impact Index Per Article: 21.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2019] [Indexed: 01/19/2023]
Abstract
This two-part Review examines how automation has contributed to different aspects of discovery in the chemical sciences. In this second part, we reflect on a selection of exemplary studies. It is increasingly important to articulate what the role of automation and computation has been in the scientific process and how that has or has not accelerated discovery. One can argue that even the best automated systems have yet to "discover" despite being incredibly useful as laboratory assistants. We must carefully consider how they have been and can be applied to future problems of chemical discovery in order to effectively design and interact with future autonomous platforms. The majority of this Review defines a large set of open research directions, including improving our ability to work with complex data, build empirical models, automate both physical and computational experiments for validation, select experiments, and evaluate whether we are making progress towards the ultimate goal of autonomous discovery. Addressing these practical and methodological challenges will greatly advance the extent to which autonomous systems can make meaningful discoveries.
Collapse
Affiliation(s)
- Connor W Coley
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
| | - Natalie S Eyke
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
| | - Klavs F Jensen
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
| |
Collapse
|
29
|
Coley CW, Eyke NS, Jensen KF. Autonomous Discovery in the Chemical Sciences Part I: Progress. Angew Chem Int Ed Engl 2020; 59:22858-22893. [DOI: 10.1002/anie.201909987] [Citation(s) in RCA: 100] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2019] [Indexed: 01/05/2023]
Affiliation(s)
- Connor W. Coley
- Department of Chemical Engineering Massachusetts Institute of Technology Cambridge MA 02139 USA
| | - Natalie S. Eyke
- Department of Chemical Engineering Massachusetts Institute of Technology Cambridge MA 02139 USA
| | - Klavs F. Jensen
- Department of Chemical Engineering Massachusetts Institute of Technology Cambridge MA 02139 USA
| |
Collapse
|
30
|
Coley CW, Eyke NS, Jensen KF. Autonome Entdeckung in den chemischen Wissenschaften, Teil I: Fortschritt. Angew Chem Int Ed Engl 2020. [DOI: 10.1002/ange.201909987] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Affiliation(s)
- Connor W. Coley
- Department of Chemical Engineering Massachusetts Institute of Technology Cambridge MA 02139 USA
| | - Natalie S. Eyke
- Department of Chemical Engineering Massachusetts Institute of Technology Cambridge MA 02139 USA
| | - Klavs F. Jensen
- Department of Chemical Engineering Massachusetts Institute of Technology Cambridge MA 02139 USA
| |
Collapse
|
31
|
Chen G, Shen Z, Iyer A, Ghumman UF, Tang S, Bi J, Chen W, Li Y. Machine-Learning-Assisted De Novo Design of Organic Molecules and Polymers: Opportunities and Challenges. Polymers (Basel) 2020; 12:E163. [PMID: 31936321 PMCID: PMC7023065 DOI: 10.3390/polym12010163] [Citation(s) in RCA: 60] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2019] [Revised: 12/27/2019] [Accepted: 01/02/2020] [Indexed: 12/18/2022] Open
Abstract
Organic molecules and polymers have a broad range of applications in biomedical, chemical, and materials science fields. Traditional design approaches for organic molecules and polymers are mainly experimentally-driven, guided by experience, intuition, and conceptual insights. Though they have been successfully applied to discover many important materials, these methods are facing significant challenges due to the tremendous demand of new materials and vast design space of organic molecules and polymers. Accelerated and inverse materials design is an ideal solution to these challenges. With advancements in high-throughput computation, artificial intelligence (especially machining learning, ML), and the growth of materials databases, ML-assisted materials design is emerging as a promising tool to flourish breakthroughs in many areas of materials science and engineering. To date, using ML-assisted approaches, the quantitative structure property/activity relation for material property prediction can be established more accurately and efficiently. In addition, materials design can be revolutionized and accelerated much faster than ever, through ML-enabled molecular generation and inverse molecular design. In this perspective, we review the recent progresses in ML-guided design of organic molecules and polymers, highlight several successful examples, and examine future opportunities in biomedical, chemical, and materials science fields. We further discuss the relevant challenges to solve in order to fully realize the potential of ML-assisted materials design for organic molecules and polymers. In particular, this study summarizes publicly available materials databases, feature representations for organic molecules, open-source tools for feature generation, methods for molecular generation, and ML models for prediction of material properties, which serve as a tutorial for researchers who have little experience with ML before and want to apply ML for various applications. Last but not least, it draws insights into the current limitations of ML-guided design of organic molecules and polymers. We anticipate that ML-assisted materials design for organic molecules and polymers will be the driving force in the near future, to meet the tremendous demand of new materials with tailored properties in different fields.
Collapse
Affiliation(s)
- Guang Chen
- Department of Mechanical Engineering, University of Connecticut, Storrs, CT 06269, USA; (G.C.); (Z.S.)
| | - Zhiqiang Shen
- Department of Mechanical Engineering, University of Connecticut, Storrs, CT 06269, USA; (G.C.); (Z.S.)
| | - Akshay Iyer
- Department of Mechanical Engineering, Northwestern University, Evanston, IL 60208, USA; (A.I.); (U.F.G.)
| | - Umar Farooq Ghumman
- Department of Mechanical Engineering, Northwestern University, Evanston, IL 60208, USA; (A.I.); (U.F.G.)
| | - Shan Tang
- State Key Laboratory of Structural Analysis for Industrial Equipment, Department of Engineering Mechanics, and International Research Center for Computational Mechanics, Dalian University of Technology, Dalian 116023, China;
| | - Jinbo Bi
- Department of Computer Science and Engineering, University of Connecticut, Storrs, CT 06269, USA;
| | - Wei Chen
- Department of Mechanical Engineering, Northwestern University, Evanston, IL 60208, USA; (A.I.); (U.F.G.)
| | - Ying Li
- Department of Mechanical Engineering, University of Connecticut, Storrs, CT 06269, USA; (G.C.); (Z.S.)
- Polymer Program, Institute of Materials Science, University of Connecticut, Storrs, CT 06269, USA
| |
Collapse
|
32
|
Grizou J, Points LJ, Sharma A, Cronin L. A curious formulation robot enables the discovery of a novel protocell behavior. SCIENCE ADVANCES 2020; 6:eaay4237. [PMID: 32064348 PMCID: PMC6994213 DOI: 10.1126/sciadv.aay4237] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/20/2019] [Accepted: 11/20/2019] [Indexed: 05/11/2023]
Abstract
We describe a chemical robotic assistant equipped with a curiosity algorithm (CA) that can efficiently explore the states a complex chemical system can exhibit. The CA-robot is designed to explore formulations in an open-ended way with no explicit optimization target. By applying the CA-robot to the study of self-propelling multicomponent oil-in-water protocell droplets, we are able to observe an order of magnitude more variety in droplet behaviors than possible with a random parameter search and given the same budget. We demonstrate that the CA-robot enabled the observation of a sudden and highly specific response of droplets to slight temperature changes. Six modes of self-propelled droplet motion were identified and classified using a time-temperature phase diagram and probed using a variety of techniques including NMR. This work illustrates how CAs can make better use of a limited experimental budget and significantly increase the rate of unpredictable observations, leading to new discoveries with potential applications in formulation chemistry.
Collapse
|
33
|
Siebert M, Krennrich G, Seibicke M, Siegle AF, Trapp O. Identifying high-performance catalytic conditions for carbon dioxide reduction to dimethoxymethane by multivariate modelling. Chem Sci 2019; 10:10466-10474. [PMID: 32153745 PMCID: PMC7012071 DOI: 10.1039/c9sc04591k] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2019] [Accepted: 10/23/2019] [Indexed: 12/20/2022] Open
Abstract
In times of a warming climate due to excessive carbon dioxide production, catalytic conversion of carbon dioxide to formaldehyde is not only a process of great industrial interest, but it could also serve as a means for meeting our climate goals. Currently, formaldehyde is produced in an energetically unfavourable and atom-inefficient process. A much needed solution remains academically challenging. Here we present an algorithmic workflow to improve the ruthenium-catalysed transformation of carbon dioxide to the formaldehyde derivative dimethoxymethane. Catalytic processes are typically optimised by comprehensive screening of catalysts, substrates, reaction parameters and additives to enhance activity and selectivity. The common problem of the multidimensionality of the parameter space, leading to only incremental improvement in laborious physical investigations, was overcome by combining elements from machine learning, optimisation and experimental design, tripling the turnover number of 786 to 2761. The optimised conditions were then used in a new reaction setup tailored to the process parameters leading to a turnover number of 3874, exceeding by far those of known processes.
Collapse
Affiliation(s)
- Max Siebert
- Department Chemie , Ludwig-Maximilians-Universität München , Butenandtstr. 5-13 , 81377 München , Germany .
| | - Gerhard Krennrich
- Department Chemie , Ludwig-Maximilians-Universität München , Butenandtstr. 5-13 , 81377 München , Germany .
| | - Max Seibicke
- Department Chemie , Ludwig-Maximilians-Universität München , Butenandtstr. 5-13 , 81377 München , Germany .
| | - Alexander F Siegle
- Department Chemie , Ludwig-Maximilians-Universität München , Butenandtstr. 5-13 , 81377 München , Germany .
| | - Oliver Trapp
- Department Chemie , Ludwig-Maximilians-Universität München , Butenandtstr. 5-13 , 81377 München , Germany .
| |
Collapse
|
34
|
Comparative Analysis between International Research Hotspots and National-Level Policy Keywords on Artificial Intelligence in China from 2009 to 2018. SUSTAINABILITY 2019. [DOI: 10.3390/su11236574] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
In the last decade, artificial intelligence (AI) has undergone many important developments in China and has risen to the level of national strategy, which is closely related to the areas of research and policy promotion. The interactive relationship between the hotspots of China’s international AI research and its national-level policy keywords is the basis for further clarification and reference in academics and political circles. There has been very little research on the interaction between academic research and policy making. Understanding the relationship between the content of academic research and the content emphasized by actual operational policy will help scholars to better apply research to practice, and help decision-makers to manage effectively. Based on 3577 English publications about AI published by Chinese scholars in 2009–2018, and 262 Chinese national-level policy documents published during this period, this study carried out scientometric analysis and quantitative analysis of policy documents through the knowledge maps of AI international research hotspots in China and the co-occurrence maps of Chinese policy keywords, and conducted a comparative analysis that divided China’s AI development into three stages: the initial exploration stage, the steady rising stage, and the rapid development stage. The studies showed that in the initial exploration stage (2009–2012), research hotspots and policy keywords had a certain alienation relationship; in the steady rising stage (2013–2015), research hotspots focused more on cutting-edge technologies and policy keywords focused more on macro-guidance, and the relationship began to become close; and in the rapid development stage (2016–2018), the research hotspots and policy keywords became closely integrated, and they were mutually infiltrated and complementary, thus realizing organic integration and close connection. Through comparative analysis between international research hotspots and national-level policy keywords on AI in China from 2009 to 2018, the development of AI in China was revealed to some extent, along with the interaction between academics and politics in the past ten years, which is of great significance for the sustainable development and effective governance of China’s artificial intelligence.
Collapse
|
35
|
Harfouche AL, Jacobson DA, Kainer D, Romero JC, Harfouche AH, Scarascia Mugnozza G, Moshelion M, Tuskan GA, Keurentjes JJ, Altman A. Accelerating Climate Resilient Plant Breeding by Applying Next-Generation Artificial Intelligence. Trends Biotechnol 2019; 37:1217-1235. [DOI: 10.1016/j.tibtech.2019.05.007] [Citation(s) in RCA: 48] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2019] [Revised: 05/18/2019] [Accepted: 05/23/2019] [Indexed: 12/20/2022]
|
36
|
Smith JS, Nebgen BT, Zubatyuk R, Lubbers N, Devereux C, Barros K, Tretiak S, Isayev O, Roitberg AE. Approaching coupled cluster accuracy with a general-purpose neural network potential through transfer learning. Nat Commun 2019; 10:2903. [PMID: 31263102 PMCID: PMC6602931 DOI: 10.1038/s41467-019-10827-4] [Citation(s) in RCA: 348] [Impact Index Per Article: 58.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2018] [Accepted: 05/29/2019] [Indexed: 01/01/2023] Open
Abstract
Computational modeling of chemical and biological systems at atomic resolution is a crucial tool in the chemist's toolset. The use of computer simulations requires a balance between cost and accuracy: quantum-mechanical methods provide high accuracy but are computationally expensive and scale poorly to large systems, while classical force fields are cheap and scalable, but lack transferability to new systems. Machine learning can be used to achieve the best of both approaches. Here we train a general-purpose neural network potential (ANI-1ccx) that approaches CCSD(T)/CBS accuracy on benchmarks for reaction thermochemistry, isomerization, and drug-like molecular torsions. This is achieved by training a network to DFT data then using transfer learning techniques to retrain on a dataset of gold standard QM calculations (CCSD(T)/CBS) that optimally spans chemical space. The resulting potential is broadly applicable to materials science, biology, and chemistry, and billions of times faster than CCSD(T)/CBS calculations.
Collapse
Affiliation(s)
- Justin S Smith
- Department of Chemistry, University of Florida, Gainesville, FL, 32611, USA
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA
- Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA
| | - Benjamin T Nebgen
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA
- Center for Integrated Nanotechnologies, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA
| | - Roman Zubatyuk
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA
- Department of Chemistry, Physics, and Atmospheric Science, Jackson State University, Jackson, MS, 39217, USA
| | - Nicholas Lubbers
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA
- Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA
| | - Christian Devereux
- Department of Chemistry, University of Florida, Gainesville, FL, 32611, USA
| | - Kipton Barros
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA
| | - Sergei Tretiak
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA.
- Center for Integrated Nanotechnologies, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA.
| | - Olexandr Isayev
- UNC Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA.
| | - Adrian E Roitberg
- Department of Chemistry, University of Florida, Gainesville, FL, 32611, USA.
| |
Collapse
|
37
|
Yang JH, Wright SN, Hamblin M, McCloskey D, Alcantar MA, Schrübbers L, Lopatkin AJ, Satish S, Nili A, Palsson BO, Walker GC, Collins JJ. A White-Box Machine Learning Approach for Revealing Antibiotic Mechanisms of Action. Cell 2019; 177:1649-1661.e9. [PMID: 31080069 PMCID: PMC6545570 DOI: 10.1016/j.cell.2019.04.016] [Citation(s) in RCA: 209] [Impact Index Per Article: 34.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2018] [Revised: 03/19/2019] [Accepted: 04/08/2019] [Indexed: 12/13/2022]
Abstract
Current machine learning techniques enable robust association of biological signals with measured phenotypes, but these approaches are incapable of identifying causal relationships. Here, we develop an integrated "white-box" biochemical screening, network modeling, and machine learning approach for revealing causal mechanisms and apply this approach to understanding antibiotic efficacy. We counter-screen diverse metabolites against bactericidal antibiotics in Escherichia coli and simulate their corresponding metabolic states using a genome-scale metabolic network model. Regression of the measured screening data on model simulations reveals that purine biosynthesis participates in antibiotic lethality, which we validate experimentally. We show that antibiotic-induced adenine limitation increases ATP demand, which elevates central carbon metabolism activity and oxygen consumption, enhancing the killing effects of antibiotics. This work demonstrates how prospective network modeling can couple with machine learning to identify complex causal mechanisms underlying drug efficacy.
Collapse
Affiliation(s)
- Jason H Yang
- Institute for Medical Engineering and Science and Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Sarah N Wright
- Institute for Medical Engineering and Science and Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Meagan Hamblin
- Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Douglas McCloskey
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800 Lyngby, Denmark
| | - Miguel A Alcantar
- Institute for Medical Engineering and Science and Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Lars Schrübbers
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800 Lyngby, Denmark
| | - Allison J Lopatkin
- Institute for Medical Engineering and Science and Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA 02115, USA
| | - Sangeeta Satish
- Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA
| | - Amir Nili
- Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA
| | - Bernhard O Palsson
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800 Lyngby, Denmark; Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Graham C Walker
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - James J Collins
- Institute for Medical Engineering and Science and Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA 02115, USA.
| |
Collapse
|
38
|
|
39
|
Niel O, Bastard P, Boussard C, Hogan J, Kwon T, Deschênes G. Artificial intelligence outperforms experienced nephrologists to assess dry weight in pediatric patients on chronic hemodialysis. Pediatr Nephrol 2018; 33:1799-1803. [PMID: 29987454 DOI: 10.1007/s00467-018-4015-2] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/18/2018] [Revised: 06/24/2018] [Accepted: 06/27/2018] [Indexed: 12/14/2022]
Abstract
BACKGROUND Dry weight is the lowest weight patients on hemodialysis can tolerate; correct dry weight estimation is necessary to minimize morbi-mortality, but is difficult to achieve. Here, we used artificial intelligence to improve the accuracy of dry weight assessment in hemodialysis patients. METHODS/RESULTS We designed a neural network which used bio-impedancemetry, blood volume monitoring, and blood pressure values as inputs; output was artificial intelligence dry weight. Fourteen pediatric patients were switched from nephrologist to artificial intelligence dry weight. Artificial intelligence dry weight was higher (28.6%), lower (50%), or identical to nephrologist dry weight. Mean difference between artificial intelligence and nephrologist dry weights was 0.497 kg (- 1.33 to + 1.29 kg). In patients for whom artificial intelligence dry weight was lower than nephrologist dry weight, systolic blood pressure significantly decreased after dry weight decrease to artificial intelligence dry weight (77th to 60th percentile, p = 0.022); anti-hypertensive treatments were successfully decreased or discontinued in 28.7% of cases. In patients for whom artificial intelligence dry weight was higher than nephrologist dry weight, no hypertension was observed after dry weight increase to artificial intelligence dry weight; when present, symptoms of dry weight underestimation receded. CONCLUSIONS Neural network predictions outperformed those of experienced nephrologists in most cases, proving artificial intelligence is a powerful tool for predicting dry weight in hemodialysis patients.
Collapse
Affiliation(s)
- Olivier Niel
- Pediatric Nephrology Department, Robert Debré Hospital, 48 Boulevard Sérurier, 75019, Paris, France.
| | - Paul Bastard
- Pediatric Nephrology Department, Robert Debré Hospital, 48 Boulevard Sérurier, 75019, Paris, France
| | - Charlotte Boussard
- Pediatric Nephrology Department, Robert Debré Hospital, 48 Boulevard Sérurier, 75019, Paris, France
| | - Julien Hogan
- Pediatric Nephrology Department, Robert Debré Hospital, 48 Boulevard Sérurier, 75019, Paris, France
| | - Thérésa Kwon
- Pediatric Nephrology Department, Robert Debré Hospital, 48 Boulevard Sérurier, 75019, Paris, France
| | - Georges Deschênes
- Pediatric Nephrology Department, Robert Debré Hospital, 48 Boulevard Sérurier, 75019, Paris, France
| |
Collapse
|
40
|
Granda JM, Donina L, Dragone V, Long DL, Cronin L. Controlling an organic synthesis robot with machine learning to search for new reactivity. Nature 2018; 559:377-381. [PMID: 30022133 DOI: 10.1038/s41586-018-0307-8] [Citation(s) in RCA: 328] [Impact Index Per Article: 46.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2017] [Accepted: 05/21/2018] [Indexed: 12/22/2022]
Abstract
The discovery of chemical reactions is an inherently unpredictable and time-consuming process1. An attractive alternative is to predict reactivity, although relevant approaches, such as computer-aided reaction design, are still in their infancy2. Reaction prediction based on high-level quantum chemical methods is complex3, even for simple molecules. Although machine learning is powerful for data analysis4,5, its applications in chemistry are still being developed6. Inspired by strategies based on chemists' intuition7, we propose that a reaction system controlled by a machine learning algorithm may be able to explore the space of chemical reactions quickly, especially if trained by an expert8. Here we present an organic synthesis robot that can perform chemical reactions and analysis faster than they can be performed manually, as well as predict the reactivity of possible reagent combinations after conducting a small number of experiments, thus effectively navigating chemical reaction space. By using machine learning for decision making, enabled by binary encoding of the chemical inputs, the reactions can be assessed in real time using nuclear magnetic resonance and infrared spectroscopy. The machine learning system was able to predict the reactivity of about 1,000 reaction combinations with accuracy greater than 80 per cent after considering the outcomes of slightly over 10 per cent of the dataset. This approach was also used to calculate the reactivity of published datasets. Further, by using real-time data from our robot, these predictions were followed up manually by a chemist, leading to the discovery of four reactions.
Collapse
Affiliation(s)
| | - Liva Donina
- School of Chemistry, University of Glasgow, Glasgow, UK
| | | | - De-Liang Long
- School of Chemistry, University of Glasgow, Glasgow, UK
| | - Leroy Cronin
- School of Chemistry, University of Glasgow, Glasgow, UK.
| |
Collapse
|
41
|
|
42
|
Popova M, Isayev O, Tropsha A. Deep reinforcement learning for de novo drug design. SCIENCE ADVANCES 2018; 4:eaap7885. [PMID: 30050984 PMCID: PMC6059760 DOI: 10.1126/sciadv.aap7885] [Citation(s) in RCA: 569] [Impact Index Per Article: 81.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/26/2017] [Accepted: 06/13/2018] [Indexed: 05/20/2023]
Abstract
We have devised and implemented a novel computational strategy for de novo design of molecules with desired properties termed ReLeaSE (Reinforcement Learning for Structural Evolution). On the basis of deep and reinforcement learning (RL) approaches, ReLeaSE integrates two deep neural networks-generative and predictive-that are trained separately but are used jointly to generate novel targeted chemical libraries. ReLeaSE uses simple representation of molecules by their simplified molecular-input line-entry system (SMILES) strings only. Generative models are trained with a stack-augmented memory network to produce chemically feasible SMILES strings, and predictive models are derived to forecast the desired properties of the de novo-generated compounds. In the first phase of the method, generative and predictive models are trained separately with a supervised learning algorithm. In the second phase, both models are trained jointly with the RL approach to bias the generation of new chemical structures toward those with the desired physical and/or biological properties. In the proof-of-concept study, we have used the ReLeaSE method to design chemical libraries with a bias toward structural complexity or toward compounds with maximal, minimal, or specific range of physical properties, such as melting point or hydrophobicity, or toward compounds with inhibitory activity against Janus protein kinase 2. The approach proposed herein can find a general use for generating targeted chemical libraries of novel compounds optimized for either a single desired property or multiple properties.
Collapse
Affiliation(s)
- Mariya Popova
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599, USA
- Moscow Institute of Physics and Technology, Dolgoprudny, Moscow 141700, Russia
- Skolkovo Institute of Science and Technology, Moscow 143026, Russia
| | - Olexandr Isayev
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599, USA
- Corresponding author. (A.T.); (O.I.)
| | - Alexander Tropsha
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599, USA
- Corresponding author. (A.T.); (O.I.)
| |
Collapse
|
43
|
Vasilevich A, de Boer J. Robot-scientists will lead tomorrow's biomaterials discovery. CURRENT OPINION IN BIOMEDICAL ENGINEERING 2018. [DOI: 10.1016/j.cobme.2018.03.005] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
|
44
|
Blockchain with Artificial Intelligence to Efficiently Manage Water Use under Climate Change. ENVIRONMENTS 2018. [DOI: 10.3390/environments5030034] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
45
|
Abstract
Purpose
By establishing a conceptual path through the field of artificial intelligence for objectivistic knowledge artifacts (KAs), the purpose of this paper is to propose an extension to their design principles. The author uses these principles to deploy KAs for knowledge acquired in scientific processes, to determine whether these principles steer the design of KAs that are amenable for both human and computational manipulation.
Design/methodology/approach
Adopting the design principles mentioned above, the author describes the deployment of KAs in collaboration with a group of scientists to represent knowledge gained in scientific processes. The author then analyzes the resulting usage data.
Findings
Usage data reveal that human scientists could enter scientific KAs within the proposed structure. The scientists were able to create associations among them, search and retrieve KAs, and reuse them in drafts of reports to funding agencies. These results were observed when scientists were motivated by imminent incentives.
Research limitations/implications
Previous work has shown that objectivistic KAs are suitable for representing knowledge in computational processes. The data analyzed in this work show that they are suitable for representing knowledge in processes conducted by humans. The need for imminent incentives to motivate humans to contribute KAs suggests a limitation, which may be attributed to the exclusively objectivistic perspective in their design. The author hence discusses the adoption of situativity principles for a more beneficial implementation of KAs.
Originality/value
The suitability for interaction with both human and computational processes makes objectivistic KAs candidates for use as metadata to intersect humans and computers, particularly for scientific processes. The author found no previous work implementing objectivistic KAs for scientific knowledge.
Collapse
|
46
|
Golestan Hashemi FS, Razi Ismail M, Rafii Yusop M, Golestan Hashemi MS, Nadimi Shahraki MH, Rastegari H, Miah G, Aslani F. Intelligent mining of large-scale bio-data: Bioinformatics applications. BIOTECHNOL BIOTEC EQ 2017. [DOI: 10.1080/13102818.2017.1364977] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Affiliation(s)
- Farahnaz Sadat Golestan Hashemi
- Plant Genetics, AgroBioChem Department, Gembloux Agro-Bio Tech, University of Liege, Liege, Belgium
- Laboratory of Food Crops, Institute of Tropical Agriculture and Food Security, Universiti Putra Malaysia, Serdang, Selangor, Malaysia
| | - Mohd Razi Ismail
- Laboratory of Food Crops, Institute of Tropical Agriculture and Food Security, Universiti Putra Malaysia, Serdang, Selangor, Malaysia
- Department of Crop Science, Faculty of Agriculture, Universiti Putra Malaysia, Serdang, Selangor, Malaysia
| | - Mohd Rafii Yusop
- Laboratory of Food Crops, Institute of Tropical Agriculture and Food Security, Universiti Putra Malaysia, Serdang, Selangor, Malaysia
- Department of Crop Science, Faculty of Agriculture, Universiti Putra Malaysia, Serdang, Selangor, Malaysia
| | - Mahboobe Sadat Golestan Hashemi
- Department of Software Engineering, Faculty of Computer Engineering, Najafabad Branch, Islamic Azad University, Isfahan,Iran
- Big Data Research Center, Najafabad Branch, Islamic Azad University, Isfahan, Iran
| | - Mohammad Hossein Nadimi Shahraki
- Department of Software Engineering, Faculty of Computer Engineering, Najafabad Branch, Islamic Azad University, Isfahan,Iran
- Big Data Research Center, Najafabad Branch, Islamic Azad University, Isfahan, Iran
| | - Hamid Rastegari
- Department of Software Engineering, Faculty of Computer Engineering, Najafabad Branch, Islamic Azad University, Isfahan,Iran
| | - Gous Miah
- Laboratory of Food Crops, Institute of Tropical Agriculture and Food Security, Universiti Putra Malaysia, Serdang, Selangor, Malaysia
| | - Farzad Aslani
- Department of Crop Science, Faculty of Agriculture, Universiti Putra Malaysia, Serdang, Selangor, Malaysia
| |
Collapse
|
47
|
Edmunds RC, Su B, Balhoff JP, Eames BF, Dahdul WM, Lapp H, Lundberg JG, Vision TJ, Dunham RA, Mabee PM, Westerfield M. Phenoscape: Identifying Candidate Genes for Evolutionary Phenotypes. Mol Biol Evol 2015; 33:13-24. [PMID: 26500251 PMCID: PMC4693980 DOI: 10.1093/molbev/msv223] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Phenotypes resulting from mutations in genetic model organisms can help reveal candidate genes for evolutionarily important phenotypic changes in related taxa. Although testing candidate gene hypotheses experimentally in nonmodel organisms is typically difficult, ontology-driven information systems can help generate testable hypotheses about developmental processes in experimentally tractable organisms. Here, we tested candidate gene hypotheses suggested by expert use of the Phenoscape Knowledgebase, specifically looking for genes that are candidates responsible for evolutionarily interesting phenotypes in the ostariophysan fishes that bear resemblance to mutant phenotypes in zebrafish. For this, we searched ZFIN for genetic perturbations that result in either loss of basihyal element or loss of scales phenotypes, because these are the ancestral phenotypes observed in catfishes (Siluriformes). We tested the identified candidate genes by examining their endogenous expression patterns in the channel catfish, Ictalurus punctatus. The experimental results were consistent with the hypotheses that these features evolved through disruption in developmental pathways at, or upstream of, brpf1 and eda/edar for the ancestral losses of basihyal element and scales, respectively. These results demonstrate that ontological annotations of the phenotypic effects of genetic alterations in model organisms, when aggregated within a knowledgebase, can be used effectively to generate testable, and useful, hypotheses about evolutionary changes in morphology.
Collapse
Affiliation(s)
| | - Baofeng Su
- School of Fisheries, Aquaculture and Aquatic Sciences, Auburn University
| | | | - B Frank Eames
- Department of Anatomy and Cell Biology, University of Saskatchewan, Saskatoon, SK, Canada
| | - Wasila M Dahdul
- National Evolutionary Synthesis Center, Durham, NC Department of Biology, University of South Dakota
| | - Hilmar Lapp
- National Evolutionary Synthesis Center, Durham, NC
| | - John G Lundberg
- Department of Ichthyology, The Academy of Natural Sciences, Philadelphia, Philadelphia, PA
| | - Todd J Vision
- National Evolutionary Synthesis Center, Durham, NC Department of Biology, University of North Carolina, Chapel Hill
| | - Rex A Dunham
- School of Fisheries, Aquaculture and Aquatic Sciences, Auburn University
| | | | | |
Collapse
|
48
|
Inferring regulatory networks from experimental morphological phenotypes: a computational method reverse-engineers planarian regeneration. PLoS Comput Biol 2015; 11:e1004295. [PMID: 26042810 PMCID: PMC4456145 DOI: 10.1371/journal.pcbi.1004295] [Citation(s) in RCA: 54] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2014] [Accepted: 04/21/2015] [Indexed: 01/18/2023] Open
Abstract
Transformative applications in biomedicine require the discovery of complex regulatory networks that explain the development and regeneration of anatomical structures, and reveal what external signals will trigger desired changes of large-scale pattern. Despite recent advances in bioinformatics, extracting mechanistic pathway models from experimental morphological data is a key open challenge that has resisted automation. The fundamental difficulty of manually predicting emergent behavior of even simple networks has limited the models invented by human scientists to pathway diagrams that show necessary subunit interactions but do not reveal the dynamics that are sufficient for complex, self-regulating pattern to emerge. To finally bridge the gap between high-resolution genetic data and the ability to understand and control patterning, it is critical to develop computational tools to efficiently extract regulatory pathways from the resultant experimental shape phenotypes. For example, planarian regeneration has been studied for over a century, but despite increasing insight into the pathways that control its stem cells, no constructive, mechanistic model has yet been found by human scientists that explains more than one or two key features of its remarkable ability to regenerate its correct anatomical pattern after drastic perturbations. We present a method to infer the molecular products, topology, and spatial and temporal non-linear dynamics of regulatory networks recapitulating in silico the rich dataset of morphological phenotypes resulting from genetic, surgical, and pharmacological experiments. We demonstrated our approach by inferring complete regulatory networks explaining the outcomes of the main functional regeneration experiments in the planarian literature; By analyzing all the datasets together, our system inferred the first systems-biology comprehensive dynamical model explaining patterning in planarian regeneration. This method provides an automated, highly generalizable framework for identifying the underlying control mechanisms responsible for the dynamic regulation of growth and form. Developmental and regenerative biology experiments are producing a huge number of morphological phenotypes from functional perturbation experiments. However, existing pathway models do not generally explain the dynamic regulation of anatomical shape due to the difficulty of inferring and testing non-linear regulatory networks responsible for appropriate form, shape, and pattern. We present a method that automates the discovery and testing of regulatory networks explaining morphological outcomes directly from the resultant phenotypes, producing network models as testable hypotheses explaining regeneration data. Our system integrates a formalization of the published results in planarian regeneration, an in silico simulator in which the patterning properties of regulatory networks can be quantitatively tested in a regeneration assay, and a machine learning module that evolves networks whose behavior in this assay optimally matches the database of planarian results. We applied our method to explain the key experiments in planarian regeneration, and discovered the first comprehensive model of anterior-posterior patterning in planaria under surgical, pharmacological, and genetic manipulations. Beyond the planarian data, our approach is readily generalizable to facilitate the discovery of testable regulatory networks in developmental biology and biomedicine, and represents the first developmental model discovered de novo from morphological outcomes by an automated system.
Collapse
|
49
|
Vehlow C, Kao DP, Bristow MR, Hunter LE, Weiskopf D, Görg C. Visual analysis of biological data-knowledge networks. BMC Bioinformatics 2015; 16:135. [PMID: 25925016 PMCID: PMC4456720 DOI: 10.1186/s12859-015-0550-z] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2014] [Accepted: 03/25/2015] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The interpretation of the results from genome-scale experiments is a challenging and important problem in contemporary biomedical research. Biological networks that integrate experimental results with existing knowledge from biomedical databases and published literature can provide a rich resource and powerful basis for hypothesizing about mechanistic explanations for observed gene-phenotype relationships. However, the size and density of such networks often impede their efficient exploration and understanding. RESULTS We introduce a visual analytics approach that integrates interactive filtering of dense networks based on degree-of-interest functions with attribute-based layouts of the resulting subnetworks. The comparison of multiple subnetworks representing different analysis facets is facilitated through an interactive super-network that integrates brushing-and-linking techniques for highlighting components across networks. An implementation is freely available as a Cytoscape app. CONCLUSIONS We demonstrate the utility of our approach through two case studies using a dataset that combines clinical data with high-throughput data for studying the effect of β-blocker treatment on heart failure patients. Furthermore, we discuss our team-based iterative design and development process as well as the limitations and generalizability of our approach.
Collapse
Affiliation(s)
- Corinna Vehlow
- VISUS, University of Stuttgart, Allmandring 19, Stuttgart, Germany.
| | - David P Kao
- School of Medicine, University of Colorado, E 17th Pl, Aurora, CO, USA.
| | - Michael R Bristow
- School of Medicine, University of Colorado, E 17th Pl, Aurora, CO, USA.
| | - Lawrence E Hunter
- School of Medicine, University of Colorado, E 17th Pl, Aurora, CO, USA.
| | - Daniel Weiskopf
- VISUS, University of Stuttgart, Allmandring 19, Stuttgart, Germany.
| | - Carsten Görg
- School of Medicine, University of Colorado, E 17th Pl, Aurora, CO, USA.
| |
Collapse
|