1
|
Yang Y, Li Y, Tang L, Li J. Single-Molecule Bioelectronic Sensors with AI-Aided Data Analysis: Convergence and Challenges. PRECISION CHEMISTRY 2024; 2:518-538. [PMID: 39483271 PMCID: PMC11523000 DOI: 10.1021/prechem.4c00048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/21/2024] [Revised: 08/09/2024] [Accepted: 09/09/2024] [Indexed: 11/03/2024]
Abstract
Single-molecule bioelectronic sensing, a groundbreaking domain in biological research, has revolutionized our understanding of molecules by revealing deep insights into fundamental biological processes. The advent of emergent technologies, such as nanogapped electrodes and nanopores, has greatly enhanced this field, providing exceptional sensitivity, resolution, and integration capabilities. However, challenges persist, such as complex data sets with high noise levels and stochastic molecular dynamics. Artificial intelligence (AI) has stepped in to address these issues with its powerful data processing capabilities. AI algorithms effectively extract meaningful features, detect subtle changes, improve signal-to-noise ratios, and uncover hidden patterns in massive data. This review explores the synergy between AI and single-molecule bioelectronic sensing, focusing on how AI enhances signal processing and data analysis to boost accuracy and reliability. We also discuss current limitations and future directions for integrating AI, highlighting its potential to advance biological research and technological innovation.
Collapse
Affiliation(s)
- Yuxin Yang
- State
Key Laboratory of Extreme Photonics and Instrumentation, College of
Optical Science and Engineering, Zhejiang
University, Hangzhou 310027, China
- Nanhu
Brain-Computer Interface Institute, Hangzhou, Zhejiang 311100, China
| | - Yueqi Li
- Center
for BioAnalytical Chemistry, Hefei National Laboratory of Physical
Science at Microscale, University of Science
and Technology of China, Hefei 230026, China
| | - Longhua Tang
- State
Key Laboratory of Extreme Photonics and Instrumentation, College of
Optical Science and Engineering, Zhejiang
University, Hangzhou 310027, China
- Nanhu
Brain-Computer Interface Institute, Hangzhou, Zhejiang 311100, China
| | - Jinghong Li
- Department
of Chemistry, Center for BioAnalytical Chemistry, Key Laboratory of
Bioorganic Phosphorus Chemistry & Chemical Biology, Tsinghua University, Beijing 100084, China
- Beijing
Institute of Life Science and Technology, Beijing 102206, China
- New
Cornerstone Science Institute, Beijing 102206, China
- Center
for BioAnalytical Chemistry, Hefei National Laboratory of Physical
Science at Microscale, University of Science
and Technology of China, Hefei 230026, China
| |
Collapse
|
2
|
Gorenskaia E, Low PJ. Methods for the analysis, interpretation, and prediction of single-molecule junction conductance behaviour. Chem Sci 2024; 15:9510-9556. [PMID: 38939131 PMCID: PMC11206205 DOI: 10.1039/d4sc00488d] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2024] [Accepted: 05/06/2024] [Indexed: 06/29/2024] Open
Abstract
This article offers a broad overview of measurement methods in the field of molecular electronics, with a particular focus on the most common single-molecule junction fabrication techniques, the challenges in data analysis and interpretation of single-molecule junction current-distance traces, and a summary of simulations and predictive models aimed at establishing robust structure-property relationships of use in the further development of molecular electronics.
Collapse
Affiliation(s)
- Elena Gorenskaia
- School of Molecular Sciences, University of Western Australia 35 Stirling Highway Crawley Western Australia 6026 Australia
| | - Paul J Low
- School of Molecular Sciences, University of Western Australia 35 Stirling Highway Crawley Western Australia 6026 Australia
| |
Collapse
|
3
|
Huang Y, Darr CM, Gangopadhyay K, Gangopadhyay S, Bok S, Chakraborty S. Applications of machine learning tools for ultra-sensitive detection of lipoarabinomannan with plasmonic grating biosensors in clinical samples of tuberculosis. PLoS One 2022; 17:e0275658. [PMID: 36282804 PMCID: PMC9595565 DOI: 10.1371/journal.pone.0275658] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2021] [Accepted: 09/21/2022] [Indexed: 11/06/2022] Open
Abstract
Background Tuberculosis is one of the top ten causes of death globally and the leading cause of death from a single infectious agent. Eradicating the Tuberculosis epidemic by 2030 is one of the top United Nations Sustainable Development Goals. Early diagnosis is essential to achieving this goal because it improves individual prognosis and reduces transmission rates of asymptomatic infected. We aim to support this goal by developing rapid and sensitive diagnostics using machine learning algorithms to minimize the need for expert intervention. Methods and findings A single molecule fluorescence immunosorbent assay was used to detect Tuberculosis biomarker lipoarabinomannan from a set of twenty clinical patient samples and a control set of spiked human urine. Tuberculosis status was separately confirmed by GeneXpert MTB/RIF and cell culture. Two machine learning algorithms, an automatic and a semiautomatic model, were developed and trained by the calibrated lipoarabinomannan titration assay data and then tested against the ground truth patient data. The semiautomatic model differed from the automatic model by an expert review step in the former, which calibrated the lower threshold to determine single molecules from background noise. The semiautomatic model was found to provide 88.89% clinical sensitivity, while the automatic model resulted in 77.78% clinical sensitivity. Conclusions The semiautomatic model outperformed the automatic model in clinical sensitivity as a result of the expert intervention applied during calibration and both models vastly outperformed manual expert counting in terms of time-to-detection and completion of analysis. Meanwhile, the clinical sensitivity of the automatic model could be improved significantly with a larger training dataset. In short, semiautomatic, and automatic Gaussian Mixture Models have a place in supporting rapid detection of Tuberculosis in resource-limited settings without sacrificing clinical sensitivity.
Collapse
Affiliation(s)
- Yilun Huang
- Department of Statistics, University of Missouri, Columbia, Missouri, United States of America
| | - Charles M. Darr
- Department of Electrical Engineering & Computer Science, Center for Nano/Micro Systems & Nanotechnology, University of Missouri, Columbia, Missouri, United States of America
| | - Keshab Gangopadhyay
- Department of Electrical Engineering & Computer Science, Center for Nano/Micro Systems & Nanotechnology, University of Missouri, Columbia, Missouri, United States of America
| | - Shubhra Gangopadhyay
- Department of Electrical Engineering & Computer Science, Center for Nano/Micro Systems & Nanotechnology, University of Missouri, Columbia, Missouri, United States of America
- * E-mail: (SG); (SB); (SC)
| | - Sangho Bok
- Department of Electrical Engineering & Computer Science, Center for Nano/Micro Systems & Nanotechnology, University of Missouri, Columbia, Missouri, United States of America
- Department of Electrical & Computer Engineering, University of Denver, Denver, Colorado, United States of America
- * E-mail: (SG); (SB); (SC)
| | - Sounak Chakraborty
- Department of Statistics, University of Missouri, Columbia, Missouri, United States of America
- * E-mail: (SG); (SB); (SC)
| |
Collapse
|
4
|
Bro-Jørgensen W, Hamill JM, Bro R, Solomon GC. Trusting our machines: validating machine learning models for single-molecule transport experiments. Chem Soc Rev 2022; 51:6875-6892. [PMID: 35686581 PMCID: PMC9377421 DOI: 10.1039/d1cs00884f] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2022] [Indexed: 11/21/2022]
Abstract
In this tutorial review, we will describe crucial aspects related to the application of machine learning to help users avoid the most common pitfalls. The examples we present will be based on data from the field of molecular electronics, specifically single-molecule electron transport experiments, but the concepts and problems we explore will be sufficiently general for application in other fields with similar data. In the first part of the tutorial review, we will introduce the field of single-molecule transport, and provide an overview of the most common machine learning algorithms employed. In the second part of the tutorial review, we will show, through examples grounded in single-molecule transport, that the promises of machine learning can only be fulfilled by careful application. We will end the tutorial review with a discussion of where we, as a field, could go from here.
Collapse
Affiliation(s)
- William Bro-Jørgensen
- Department of Chemistry and Nano-Science Center, University of Copenhagen, Universitetsparken 5, DK-2100, Copenhagen Ø, Denmark.
| | - Joseph M Hamill
- Department of Chemistry and Nano-Science Center, University of Copenhagen, Universitetsparken 5, DK-2100, Copenhagen Ø, Denmark.
| | - Rasmus Bro
- Department of Food Science, University of Copenhagen, Rolighedsvej 26, 1958 Frederiksberg, Denmark.
| | - Gemma C Solomon
- Department of Chemistry and Nano-Science Center, University of Copenhagen, Universitetsparken 5, DK-2100, Copenhagen Ø, Denmark.
| |
Collapse
|
5
|
Lin D, Zhao Z, Pan H, Li S, Wang Y, Wang D, Sanvito S, Hou S. Using Weakly Supervised Deep Learning to Classify and Segment Single-Molecule Break-Junction Conductance Traces. Chemphyschem 2021; 22:2107-2114. [PMID: 34324254 DOI: 10.1002/cphc.202100414] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Revised: 07/23/2021] [Indexed: 11/11/2022]
Abstract
In order to design molecular electronic devices with high performance and stability, it is crucial to understand their structure-to-property relationships. Single-molecule break junction measurements yield a large number of conductance-distance traces, which are inherently highly stochastic. Here we propose a weakly supervised deep learning algorithm to classify and segment these conductance traces, a method that is mainly based on transfer learning with the pretrain-finetune technique. By exploiting the powerful feature extraction capabilities of the pretrained VGG-16 network, our convolutional neural network model not only achieves high accuracy in the classification of the conductance traces, but also segments precisely the conductance plateau from an entire trace with very few manually labeled traces. Thus, we can produce a more reliable estimation of the junction conductance and quantify the junction stability. These findings show that our model has achieved a better accuracy-to-manpower efficiency balance, opening up the possibility of using weakly supervised deep learning approaches in the studies of single-molecule junctions.
Collapse
Affiliation(s)
- Dongying Lin
- Department of Electronics, Peking University, Center for Nanoscale Science and Technology, Key Laboratory for the Physics and Chemistry of Nanodevices, Beijing, 100871, China
| | - Zhihao Zhao
- Chinese Academy of Sciences, CAS Key Laboratory of Molecular Nanostructure and Nanotechnology, CAS Research/Education Center for Excellence in Molecular Sciences, Beijing National Laboratory for Molecular Science (BNLMS), Beijing, 100190, China.,University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Haoyang Pan
- Department of Electronics, Peking University, Center for Nanoscale Science and Technology, Key Laboratory for the Physics and Chemistry of Nanodevices, Beijing, 100871, China
| | - Shi Li
- Department of Electronics, Peking University, Center for Nanoscale Science and Technology, Key Laboratory for the Physics and Chemistry of Nanodevices, Beijing, 100871, China
| | - Yongfeng Wang
- Department of Electronics, Peking University, Center for Nanoscale Science and Technology, Key Laboratory for the Physics and Chemistry of Nanodevices, Beijing, 100871, China
| | - Dong Wang
- Chinese Academy of Sciences, CAS Key Laboratory of Molecular Nanostructure and Nanotechnology, CAS Research/Education Center for Excellence in Molecular Sciences, Beijing National Laboratory for Molecular Science (BNLMS), Beijing, 100190, China.,University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Stefano Sanvito
- School of Physics, Trinity College, AMBER and CRANN Institute, Dublin 2, Ireland
| | - Shimin Hou
- Department of Electronics, Peking University, Center for Nanoscale Science and Technology, Key Laboratory for the Physics and Chemistry of Nanodevices, Beijing, 100871, China
| |
Collapse
|
6
|
Liu B, Murayama S, Komoto Y, Tsutsui M, Taniguchi M. Dissecting Time-Evolved Conductance Behavior of Single Molecule Junctions by Nonparametric Machine Learning. J Phys Chem Lett 2020; 11:6567-6572. [PMID: 32668163 DOI: 10.1021/acs.jpclett.0c01948] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Improved understanding of charge transport in single molecules is essential for utilizing their potential as circuit components at the nanosize limit. However, reliable analyses of varying tunneling current acquired by break junction experiments remain an ongoing challenge to find molecular feature structure-property relationships. In this work, we report on an unsupervised learning approach for investigating molecular signatures in conductance traces. Our hybrid machine learning algorithm compares grids of data in conductance-time domains and judges the similarity without any researcher-crafted parameters to identify fine molecular components that may otherwise be obscured by background fluctuations. We demonstrate its ability for classifying Au-alkanedithiol-Au conductance traces acquired with microfabricated mechanically controllable break junctions. The unbiased procedure was able to not only judge the presence or absence of the carbon chains in the electrode gap but also to identify multiple conductance states of the molecular tunneling junctions with different conformations. This finding may offer a useful tool for studying single-molecule properties using break junction methods.
Collapse
Affiliation(s)
- Bo Liu
- The Institute of Scientific and Industrial Research, Osaka University, 8-1 Mihogaoka, Ibaraki, Osaka 567-0047, Japan
| | - Sanae Murayama
- The Institute of Scientific and Industrial Research, Osaka University, 8-1 Mihogaoka, Ibaraki, Osaka 567-0047, Japan
| | - Yuki Komoto
- The Institute of Scientific and Industrial Research, Osaka University, 8-1 Mihogaoka, Ibaraki, Osaka 567-0047, Japan
| | - Makusu Tsutsui
- The Institute of Scientific and Industrial Research, Osaka University, 8-1 Mihogaoka, Ibaraki, Osaka 567-0047, Japan
| | - Masateru Taniguchi
- The Institute of Scientific and Industrial Research, Osaka University, 8-1 Mihogaoka, Ibaraki, Osaka 567-0047, Japan
| |
Collapse
|
7
|
Albrecht T, Slabaugh G, Alonso E, Al-Arif SMMR. Deep learning for single-molecule science. NANOTECHNOLOGY 2017; 28:423001. [PMID: 28762339 DOI: 10.1088/1361-6528/aa8334] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/14/2023]
Abstract
Exploring and making predictions based on single-molecule data can be challenging, not only due to the sheer size of the datasets, but also because a priori knowledge about the signal characteristics is typically limited and poor signal-to-noise ratio. For example, hypothesis-driven data exploration, informed by an expectation of the signal characteristics, can lead to interpretation bias or loss of information. Equally, even when the different data categories are known, e.g., the four bases in DNA sequencing, it is often difficult to know how to make best use of the available information content. The latest developments in machine learning (ML), so-called deep learning (DL) offer interesting, new avenues to address such challenges. In some applications, such as speech and image recognition, DL has been able to outperform conventional ML strategies and even human performance. However, to date DL has not been applied much in single-molecule science, presumably in part because relatively little is known about the 'internal workings' of such DL tools within single-molecule science as a field. In this Tutorial, we make an attempt to illustrate in a step-by-step guide how one of those, a convolutional neural network (CNN), may be used for base calling in DNA sequencing applications. We compare it with a SVM as a more conventional ML method, and discuss some of the strengths and weaknesses of the approach. In particular, a 'deep' neural network has many features of a 'black box', which has important implications on how we look at and interpret data.
Collapse
Affiliation(s)
- Tim Albrecht
- Department of Chemistry, Imperial College London, Exhibition Road, London SW7 2AZ, United Kingdom
| | | | | | | |
Collapse
|
8
|
Evers F, Venkataraman L. Preface: Special Topic on Frontiers in Molecular Scale Electronics. J Chem Phys 2017. [DOI: 10.1063/1.4977469] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|