1
Ovinnikov I, Beuret A, Cavaliere F, Buhmann JM. Fundamentals of Arthroscopic Surgery Training and beyond: a reinforcement learning exploration and benchmark. Int J Comput Assist Radiol Surg 2024. [PMID: 38684559] [DOI: 10.1007/s11548-024-03116-z] [Received: 03/03/2023] [Accepted: 03/20/2024]
Abstract
PURPOSE: This work presents FASTRL, a benchmark set of instrument manipulation tasks adapted to the domain of reinforcement learning and used in simulated surgical training. This benchmark enables and supports the design and training of human-centric reinforcement learning agents that assist and evaluate human trainees in surgical practice.
METHODS: Simulation tasks from the Fundamentals of Arthroscopic Surgery Training (FAST) program are adapted to the reinforcement learning setting for the purpose of training virtual agents capable of providing assistance and scoring to surgical trainees. A skill performance assessment protocol based on the trained virtual agents is presented.
RESULTS: The proposed benchmark suite presents an API for training reinforcement learning agents in the context of arthroscopic skill training. The evaluation scheme, based on both heuristic and learned reward functions, robustly recovers the ground-truth ranking on a diverse test set of human trajectories.
CONCLUSION: The presented benchmark enables the exploration of a novel reinforcement learning-based approach to skill performance assessment and in-procedure assistance for simulated surgical training scenarios. The evaluation protocol based on the learned reward model demonstrates potential for evaluating the performance of surgical trainees in simulation.
Affiliation(s)
- Ivan Ovinnikov
- Department of Computer Science, ETH Zürich, Zurich, Switzerland.
- Ami Beuret
- Department of Computer Science, ETH Zürich, Zurich, Switzerland
2
Alali M, Kazeminajafabadi A, Imani M. Deep Reinforcement Learning Sensor Scheduling for Effective Monitoring of Dynamical Systems. Syst Sci Control Eng 2024; 12:2329260. [PMID: 38680720] [PMCID: PMC11044865] [DOI: 10.1080/21642583.2024.2329260] [Received: 12/04/2023] [Accepted: 03/04/2024]
Abstract
Advances in technology have enabled the use of sensors with varied modalities to monitor different parts of systems, each providing a different level of information about the underlying system. However, resource limitations and computational power restrict the number of sensors/data streams that can be processed in real time in most complex systems. These challenges necessitate selecting/scheduling a subset of sensors to obtain measurements that best serve the monitoring objectives. This paper focuses on sensor scheduling for systems modeled by hidden Markov models. Despite the development of several sensor selection and scheduling methods, existing methods tend to be greedy and do not take into account the long-term impact of selected sensors on monitoring objectives. This paper formulates optimal sensor scheduling as a reinforcement learning problem defined over the posterior distribution of system states. Further, the paper derives a deep reinforcement learning method for offline learning of the sensor scheduling policy, which can then be executed in real time as new information unfolds. The proposed method applies to any monitoring objective that can be expressed in terms of the posterior distribution of the states (e.g., state estimation, information gain, etc.). The performance of the proposed method in terms of accuracy and robustness is investigated for monitoring the security of networked systems and the health monitoring of gene regulatory networks.
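The paper's central construction, using the posterior (belief) distribution over hidden states as the reinforcement learning state, rests on the standard HMM forward filter. A minimal sketch of that belief update (generic, with made-up transition/emission matrices, not the paper's networked-system or gene-network models):

```python
import numpy as np

def belief_update(belief, T, E, obs):
    """One HMM forward-filter step: predict through the transition
    matrix T, then reweight by the emission likelihoods E[:, obs]."""
    predicted = T.T @ belief            # prior over the next hidden state
    posterior = predicted * E[:, obs]   # weight by observation likelihood
    return posterior / posterior.sum()  # normalize to a distribution

# Hypothetical 2-state system observed through a noisy binary sensor.
T = np.array([[0.9, 0.1],   # T[i, j] = P(next = j | current = i)
              [0.2, 0.8]])
E = np.array([[0.7, 0.3],   # E[i, o] = P(obs = o | state = i)
              [0.1, 0.9]])

b = np.array([0.5, 0.5])    # uniform initial belief
for o in [0, 0, 1]:         # a short observation sequence
    b = belief_update(b, T, E, o)
# b is the posterior an RL scheduling policy would condition on
```

In the scheduling setting, the chosen action (which sensor to query) determines which emission model produces `obs`, and the reward can be any function of `b`, e.g. its negative entropy for an information-gain objective.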
Affiliation(s)
- Mohammad Alali
- Northeastern University, 360 Huntington Ave, Boston, MA 02115, USA
- Mahdi Imani
- Northeastern University, 360 Huntington Ave, Boston, MA 02115, USA
3
Al-Sakkari EG, Ragab A, Dagdougui H, Boffito DC, Amazouz M. Carbon capture, utilization and sequestration systems design and operation optimization: Assessment and perspectives of artificial intelligence opportunities. Sci Total Environ 2024; 917:170085. [PMID: 38224888] [DOI: 10.1016/j.scitotenv.2024.170085] [Received: 07/14/2023] [Revised: 12/10/2023] [Accepted: 01/09/2024]
Abstract
Carbon capture, utilization, and sequestration (CCUS) is a promising solution for decarbonizing the energy and industrial sectors to mitigate climate change. An integrated assessment of technological options is required for the effective deployment of large-scale CCUS infrastructure between CO2 production and utilization/sequestration nodes. However, developing cost-effective strategies, from engineering and operation perspectives, to implement CCUS is challenging. This is due to the diversity of upstream emitting processes located in different geographical areas, available downstream utilization technologies, storage site capacity/location, and current/future energy/emissions/economic conditions. This paper identifies the need for a robust hybrid assessment tool for CCUS modeling, simulation, and optimization based mainly on artificial intelligence (AI) combined with mechanistic methods. Thus, a critical literature review is conducted to assess CCUS technologies and their related process modeling/simulation/optimization techniques, while evaluating the need for improvements or new developments to reduce overall CCUS system design and operation costs. These techniques include first-principles-based and data-driven ones, i.e., AI and related machine learning (ML) methods. In addition, the paper gives an overview of the role of life cycle assessment (LCA) in evaluating CCUS systems, where the combined LCA-AI approach is assessed. Other advanced methods based on AI/ML capabilities/algorithms can be developed to optimize the whole CCUS value chain. Interpretable ML combined with explainable AI can accelerate optimal materials selection by yielding strong rules, which in turn accelerates the design of capture/utilization plants. Furthermore, deep reinforcement learning (DRL) coupled with process simulation can accelerate process design/operation optimization by considering the simultaneous optimization of equipment sizing and operating conditions. Moreover, generative deep learning (GDL) is a key solution for optimal capture/utilization materials design/discovery. The developed AI methods can be generalizable, such that the extracted knowledge can be transferred to future work to help cut the costs of the CCUS value chain.
Affiliation(s)
- Eslam G Al-Sakkari
- Department of Mathematics and Industrial Engineering, Polytechnique Montréal, 2500 Chemin de Polytechnique, Montréal, Québec H3T 1J4, Canada; CanmetENERGY, 1615 Lionel-Boulet Blvd, P.O. Box 4800, Varennes, Québec J3X 1P7, Canada
- Ahmed Ragab
- Department of Mathematics and Industrial Engineering, Polytechnique Montréal, 2500 Chemin de Polytechnique, Montréal, Québec H3T 1J4, Canada; CanmetENERGY, 1615 Lionel-Boulet Blvd, P.O. Box 4800, Varennes, Québec J3X 1P7, Canada
- Hanane Dagdougui
- Department of Mathematics and Industrial Engineering, Polytechnique Montréal, 2500 Chemin de Polytechnique, Montréal, Québec H3T 1J4, Canada
- Daria C Boffito
- Department of Chemical Engineering, Polytechnique Montréal, 2500 Chemin de Polytechnique, Montréal, Québec H3T 1J4, Canada; Canada Research Chair in Engineering Process Intensification and Catalysis (EPIC), Canada
- Mouloud Amazouz
- CanmetENERGY, 1615 Lionel-Boulet Blvd, P.O. Box 4800, Varennes, Québec J3X 1P7, Canada
4
Chen Y, Zhang F, Liu Z. Adaptive bias-variance trade-off in advantage estimator for actor-critic algorithms. Neural Netw 2024; 169:764-777. [PMID: 37981458] [DOI: 10.1016/j.neunet.2023.10.023] [Received: 07/11/2021] [Revised: 06/26/2023] [Accepted: 10/16/2023]
Abstract
Actor-critic methods are leading approaches in many challenging continuous control tasks. Advantage estimators, the most common critics in the actor-critic framework, combine state values from bootstrapped value functions with sample returns. Different combinations balance the bias introduced by state values against the variance introduced by sample returns to reduce estimation errors. The bias and variance fluctuate constantly throughout training, leading to different optimal combinations. However, existing advantage estimators usually use fixed combinations that fail to account for this trade-off when seeking the optimal estimate. Our previous work on adaptive advantage estimation (AAE) analyzed the sources of bias and variance and offered two indicators. This paper further explores the relationship between the indicators and their optimal combination through typical numerical experiments. These analyses lead to a general form of adaptive combinations of state values and sample returns that achieves low estimation errors. Empirical results on simulated robotic locomotion tasks show that our proposed estimators achieve similar or superior performance compared to the generalized advantage estimator (GAE).
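The fixed combination this abstract contrasts against is the λ parameter of the standard generalized advantage estimator (GAE). A minimal reference sketch of plain GAE (not the paper's adaptive estimator, and with made-up reward/value numbers) shows where the bias-variance knob sits:

```python
import numpy as np

def gae(rewards, values, gamma=0.99, lam=0.95):
    """Generalized advantage estimation over one trajectory.

    lam = 0 keeps only the one-step TD error (low variance, high bias);
    lam = 1 is the Monte Carlo return minus the value baseline
    (low bias, high variance). `values` carries one extra bootstrap entry.
    """
    advantages = np.zeros(len(rewards))
    running = 0.0
    for t in reversed(range(len(rewards))):
        td_error = rewards[t] + gamma * values[t + 1] - values[t]
        running = td_error + gamma * lam * running
        advantages[t] = running
    return advantages

rewards = np.array([1.0, 0.0, 1.0])
values = np.array([0.5, 0.4, 0.6, 0.0])  # V(s_0..s_2) plus bootstrap V(s_3)
adv = gae(rewards, values)               # fixed lam; AAE would adapt it
```

An adaptive estimator in the spirit of the paper would replace the fixed `lam` with a value chosen online from bias/variance indicators.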
Affiliation(s)
- Yurou Chen
- The State Key Lab of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing, China; School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, China
- Fengyi Zhang
- The State Key Lab of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing, China; School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, China
- Zhiyong Liu
- The State Key Lab of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing, China; School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, China; The Center for Excellence in Brain Science and Intelligence Technology, Chinese Academy of Sciences, Shanghai, China
5
Hassani SA, Womelsdorf T. Noradrenergic alpha-2a Receptor Stimulation Enhances Prediction Error Signaling in Anterior Cingulate Cortex and Striatum. bioRxiv 2023:2023.10.25.564052. [PMID: 37961384] [PMCID: PMC10634832] [DOI: 10.1101/2023.10.25.564052]
Abstract
The noradrenergic system has been implicated in supporting behavioral flexibility by increasing exploration during periods of uncertainty and by enhancing working memory for goal-relevant stimuli. Possible sources mediating these pro-cognitive effects are α2A adrenoceptors (α2AR) in the prefrontal cortex or the anterior cingulate cortex facilitating fronto-striatal learning processes. We tested this hypothesis by selectively stimulating α2ARs with guanfacine during feature-based attentional set shifting in nonhuman primates. We found that α2AR stimulation improved learning from errors and facilitated updating the target feature of an attentional set. Neural recordings in the anterior cingulate cortex (ACC), the dorsolateral prefrontal cortex (dlPFC), and the striatum showed that α2AR stimulation selectively enhanced the neural representation of negative reward prediction errors in neurons of the ACC and of positive prediction errors in the striatum, but not in the dlPFC. This modulation was accompanied by enhanced encoding of the feature and location of the attended target across the fronto-striatal network. Enhanced learning was paralleled by enhanced encoding of outcomes in putative fast-spiking interneurons in the ACC, dlPFC, and striatum, but not in broad-spiking cells, pointing to an interneuron-mediated mechanism of α2AR action. These results illustrate that α2A receptors causally support the noradrenergic enhancement of updating attentional sets through enhanced prediction error signaling in the ACC and the striatum.
Affiliation(s)
- Seyed A. Hassani
- Department of Psychology, Vanderbilt University, Nashville, TN 37240
- Vanderbilt Brain Institute, Nashville, TN 37240
- National Institute of Neurological Disorders and Stroke, NIH, Bethesda, MD 20824
- Thilo Womelsdorf
- Department of Psychology, Vanderbilt University, Nashville, TN 37240
- Vanderbilt Brain Institute, Nashville, TN 37240
- Department of Biomedical Engineering, Vanderbilt University, Nashville, TN 37240
6
Zhang M, Lin H, Takagi S, Cao Y, Shahabi C, Xiong L. CSGAN: Modality-Aware Trajectory Generation via Clustering-based Sequence GAN. IEEE Int Conf Mob Data Manag 2023; 2023:148-157. [PMID: 37965426] [PMCID: PMC10644148] [DOI: 10.1109/mdm58254.2023.00032]
Abstract
Human mobility data is useful for various applications in urban planning, transportation, and public health, but collecting and sharing real-world trajectories can be challenging due to privacy and data quality issues. To address these problems, recent research focuses on generating synthetic trajectories, mainly using generative adversarial networks (GANs) trained by real-world trajectories. In this paper, we hypothesize that by explicitly capturing the modality of transportation (e.g., walking, biking, driving), we can generate not only more diverse and representative trajectories for different modalities but also more realistic trajectories that preserve the geographical density, trajectory, and transition level properties by capturing both cross-modality and modality-specific patterns. Towards this end, we propose a Clustering-based Sequence Generative Adversarial Network (CSGAN) that simultaneously clusters the trajectories based on their modalities and learns the essential properties of real-world trajectories to generate realistic and representative synthetic trajectories. To measure the effectiveness of generated trajectories, in addition to typical density and trajectory level statistics, we define several new metrics for a comprehensive evaluation, including modality distribution and transition probabilities both globally and within each modality. Our extensive experiments with real-world datasets show the superiority of our model in various metrics over state-of-the-art models.
7
Zehfroosh A, Tanner HG. PAC Reinforcement Learning Algorithm for General-Sum Markov Games. IEEE Trans Automat Contr 2023; 68:2821-2831. [PMID: 37915545] [PMCID: PMC10617487] [DOI: 10.1109/tac.2022.3219340]
Abstract
This paper presents a theoretical framework for probably approximately correct (PAC) multi-agent reinforcement learning (MARL) algorithms for Markov games. Using the idea of delayed Q-learning, the paper extends the well-known Nash Q-learning algorithm to build a new PAC MARL algorithm for general-sum Markov games. In addition to guiding the design of a provably PAC MARL algorithm, the framework enables checking whether an arbitrary MARL algorithm is PAC. Comparative numerical results demonstrate the algorithm's performance and robustness.
Affiliation(s)
- Ashkan Zehfroosh
- Department of Mechanical Engineering, University of Delaware, Newark, DE 19716 USA
- Herbert G Tanner
- Department of Mechanical Engineering, University of Delaware, Newark, DE 19716 USA
8
Coelho-Magalhães T, Azevedo Coste C, Resende-Martins H. A Novel Functional Electrical Stimulation-Induced Cycling Controller Using Reinforcement Learning to Optimize Online Muscle Activation Pattern. Sensors (Basel) 2022; 22:9126. [PMID: 36501826] [PMCID: PMC9741342] [DOI: 10.3390/s22239126] [Received: 09/16/2022] [Revised: 11/02/2022] [Accepted: 11/08/2022]
Abstract
This study introduces a novel controller based on a Reinforcement Learning (RL) algorithm for real-time adaptation of the stimulation pattern during FES-cycling. Core to our approach is the introduction of an RL agent that interacts with the cycling environment and learns through trial and error how to modulate the electrical charge applied to the stimulated muscle groups according to a predefined policy while tracking a reference cadence. Instead of a static stimulation pattern to be modified by a control law, we hypothesized that a non-stationary baseline set of parameters would better adjust the amount of injected electrical charge to the time-varying characteristics of the musculature. Overground FES-assisted cycling sessions were performed by a subject with spinal cord injury (SCI AIS-A, T8). To track a predefined pedaling cadence, two closed-loop control laws were used simultaneously to modulate the pulse intensity of the stimulation channels responsible for evoking the muscle contractions. First, a Proportional-Integral (PI) controller was used to control the current amplitude of the stimulation channels over an initial parameter setting with predefined pulse amplitude, pulse width, and fixed frequency. In parallel, an RL algorithm with a decayed-epsilon-greedy strategy was implemented to randomly explore nine different variations of pulse amplitude and width parameters over the same stimulation setting, aiming to adjust the injected electrical charge according to a predefined policy. The performance of this global control strategy was evaluated in two different RL settings and explored in two different cycling scenarios. The participant was able to pedal overground for distances over 3.5 km, and the results showed that the RL agent learned to modify the stimulation pattern according to the predefined policy while simultaneously tracking the predefined pedaling cadence. Despite the simplicity of our approach and the existence of more sophisticated RL algorithms, our method can be used to reduce the time needed to define stimulation patterns. Our results suggest interesting research directions to explore in the future to improve cycling performance, since more efficient stimulation cost dynamics can be explored and implemented for the agent to learn.
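The decayed-epsilon-greedy exploration described above can be sketched in a few lines (a generic illustration; the nine actions, Q-update, reward, and decay schedule are placeholders, not the study's stimulation parameters):

```python
import random

def select_action(q_values, epsilon):
    """Epsilon-greedy: with probability epsilon pick a random action
    (explore), otherwise pick the current best-valued action (exploit)."""
    if random.random() < epsilon:
        return random.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])

# Nine candidate pulse amplitude/width variations, as in the study,
# but with invented Q-values, reward signal, and decay schedule.
q = [0.0] * 9
epsilon, decay, eps_min = 1.0, 0.995, 0.05
for step in range(1000):
    a = select_action(q, epsilon)
    reward = random.random()                 # stand-in for cadence-tracking reward
    q[a] += 0.1 * (reward - q[a])            # running-average Q update
    epsilon = max(eps_min, epsilon * decay)  # decay toward mostly-greedy
```

Early on the agent samples all nine parameter settings roughly uniformly; as epsilon decays it settles into the setting whose estimated value is highest.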
Affiliation(s)
- Tiago Coelho-Magalhães
- Graduate Program in Electrical Engineering, Universidade Federal de Minas Gerais, Av. Antônio Carlos 6627, Belo Horizonte 31270-901, MG, Brazil
- Christine Azevedo Coste
- National Institute for Research in Computer Science and Automation (Inria), Camin Team, 34090 Montpellier, France
- Henrique Resende-Martins
- Graduate Program in Electrical Engineering, Universidade Federal de Minas Gerais, Av. Antônio Carlos 6627, Belo Horizonte 31270-901, MG, Brazil
9
Barnoy Y, Erin O, Raval S, Pryor W, Mair LO, Weinberg IN, Diaz-Mercado Y, Krieger A, Hager GD. Control of Magnetic Surgical Robots With Model-Based Simulators and Reinforcement Learning. IEEE Trans Med Robot Bionics 2022; 4:945-956. [PMID: 37600471] [PMCID: PMC10438915] [DOI: 10.1109/tmrb.2022.3214426]
Abstract
Magnetically manipulated medical robots are a promising alternative to current robotic platforms, allowing for miniaturization and tetherless actuation. Controlling such systems autonomously may enable safe, accurate operation. However, classical control methods require rigorous models of magnetic fields, robot dynamics, and robot environments, which can be difficult to generate. Model-free reinforcement learning (RL) offers an alternative that can bypass these requirements. We apply RL to a robotic magnetic needle manipulation system. Reinforcement learning algorithms often require long runtimes, making them impractical for many surgical robotics applications, most of which require careful, constant monitoring. Our approach first constructs a model-based simulation (MBS) from guided real-world exploration, learning the dynamics of the environment. After intensive training in the MBS environment, we transfer the learned behavior from the MBS environment to the real world. Our MBS method applies RL roughly 200 times faster than doing so in the real world, and achieves a 6 mm root-mean-square (RMS) error for a square reference trajectory. In comparison, pure simulation-based approaches fail to transfer, producing a 31 mm RMS error. These results demonstrate that MBS environments are a good solution for domains where running model-free RL is impractical, especially if an accurate simulation is not available.
Affiliation(s)
- Yotam Barnoy
- Department of Computer Science, The Johns Hopkins University, Baltimore, MD 21287 USA
- Onder Erin
- Department of Mechanical Engineering, The Johns Hopkins University, Baltimore, MD 21287 USA
- Suraj Raval
- Department of Mechanical Engineering, University of Maryland, College Park, MD 20742 USA
- Will Pryor
- Department of Mechanical Engineering, The Johns Hopkins University, Baltimore, MD 21287 USA
- Lamar O. Mair
- Weinberg Medical Physics, Inc., North Bethesda, MD 20852 USA
- Yancy Diaz-Mercado
- Department of Mechanical Engineering, University of Maryland, College Park, MD 20742 USA
- Axel Krieger
- Department of Mechanical Engineering, The Johns Hopkins University, Baltimore, MD 21287 USA
- Gregory D. Hager
- Department of Computer Science, The Johns Hopkins University, Baltimore, MD 21287 USA
10
Thanawala R, Jesneck J, Shelton J, Rhee R, Seymour NE. Overcoming Systems Factors in Case Logging with Artificial Intelligence Tools. J Surg Educ 2022; 79:1024-1030. [PMID: 35193831] [DOI: 10.1016/j.jsurg.2022.01.013] [Received: 03/27/2021] [Revised: 11/03/2021] [Accepted: 01/30/2022]
Abstract
INTRODUCTION: Case logs are foundational data in surgical education, yet cases are consistently under-reported. Logging behavior is driven by multiple human and systems factors, including time constraints, ease of case data retrieval, access to data-entry tools, and procedural code decision tools.
METHODS: We examined case logging trends at three mid-sized general surgery training programs from September 2016-October 2020, January 2019-October 2020, and May 2019-October 2020, respectively. Across the programs, we compared the number of cases logged per week when residents logged directly to the ACGME versus via a resident education platform with machine learning-based case logging assistance tools. We examined case logging patterns across 4 consecutive phases: baseline default ACGME logging prior to platform access (P0, "Manual"), full platform logging assistance (P1, "Assisted"), partial platform assistance requiring manual data entry without data integrations (P2, "Notebook"), and resumed fully integrated platform with logging assistance (P3, "Resumed").
RESULTS: 31,385 cases were logged via the platform since 2016 by 171 residents across the 3 programs. Intelligent case logging assistance significantly increased case logging rates, from 1.44 ± 1.48 cases per resident per week by manual entry in P0 to 4.77 ± 2.45 cases per resident per week via the platform in P1 (p < 0.00001). Despite the burden of manual data entry while the platform's data connectivity was paused, the tool still helped increase overall case logging into the ACGME, to 2.85 ± 2.37 cases per week (p = 0.0002). Upon resuming data connectivity, case logging rose to 4.54 ± 3.33 cases per week via the platform, equivalent to P1 levels (non-significant difference, p = 0.57).
CONCLUSIONS: Mapping the influence of systems and human factors on high-quality case logs allows us to target interventions to continually improve the training of surgical residents. System-level factors, such as access to automation-driven tools and operative schedule-integrated platforms that assist in ACGME case logging, have a significant impact on the number of cases captured in logs.
Affiliation(s)
- Ruchi Thanawala
- Department of Surgery, Division of Cardiothoracic Surgery, Section of Thoracic Surgery, Oregon Health and Science University, Portland, Oregon
- Julia Shelton
- Department of Surgery, Division of Pediatric Surgery, University of Iowa, Iowa City, Iowa
- Rebecca Rhee
- Department of Surgery, Division of Colorectal Surgery, Maimonides Medical Center, Brooklyn, New York
- Neal E Seymour
- Department of Surgery, University of Massachusetts Medical School-Baystate Medical Center, Springfield, Massachusetts
11
Zong K, Luo C. Reinforcement learning based framework for COVID-19 resource allocation. Comput Ind Eng 2022; 167:107960. [PMID: 35125625] [PMCID: PMC8800507] [DOI: 10.1016/j.cie.2022.107960] [Received: 10/12/2021] [Revised: 12/10/2021] [Accepted: 01/13/2022]
Abstract
In this paper, a reinforcement learning based framework is developed for COVID-19 resource allocation. We first construct an agent-based epidemic environment to model the transmission dynamics in multiple states. Then, a multi-agent reinforcement-learning algorithm is proposed based on the time-varying properties of the environment, and the performance of the algorithm is compared with other algorithms. According to the age distribution of populations and their economic conditions, the optimal lockdown resource allocation strategies of Arizona, California, Nevada, and Utah in the United States are determined using the proposed reinforcement-learning algorithm. Experimental results show that the framework can adopt more flexible resource allocation strategies and help decision makers to determine the optimal deployment of limited resources in infection prevention.
Affiliation(s)
- Kai Zong
- School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing, China
- Cuicui Luo
- International College, University of Chinese Academy of Sciences, Beijing, China
12
Cheng X, Wang L, Lv Q, Wu H, Huang X, Yuan J, Sun X, Zhao X, Yan C, Yi Z. Reduced learning bias towards the reward context in medication-naive first-episode schizophrenia patients. BMC Psychiatry 2022; 22:123. [PMID: 35172748] [PMCID: PMC8851841] [DOI: 10.1186/s12888-021-03682-5] [Received: 09/22/2021] [Accepted: 12/28/2021]
Abstract
BACKGROUND: Reinforcement learning has been proposed to contribute to the development of amotivation in individuals with schizophrenia (SZ). Accumulating evidence suggests dysfunctional learning in individuals with SZ in Go/NoGo learning and expected value representation. However, previous findings might have been confounded by the effects of antipsychotic exposure. Moreover, reinforcement learning also relies on the learning context. Few studies have examined learning performance in the reward and loss-avoidance contexts separately in medication-naïve individuals with first-episode SZ. This study aimed to explore the behavioural profile of reinforcement learning performance in medication-naïve individuals with first-episode SZ, including contextual performance, Go/NoGo learning, and expected value representation.
METHODS: Twenty-nine medication-naïve individuals with first-episode SZ and 40 healthy controls (HCs), with no significant group differences in age and gender, completed the Gain and Loss Avoidance Task, a reinforcement learning task involving stimulus pairs presented in both the reward and loss-avoidance contexts. We assessed group differences in accuracy in the reward and loss-avoidance contexts, in Go/NoGo learning, and in expected value representation. Correlations between learning performance and negative symptom severity were examined.
RESULTS: Individuals with SZ showed significantly lower accuracy when learning under the reward context than under the loss-avoidance context, as compared to HCs. Accuracy under the reward context (90% win vs. 10% win) in the acquisition phase was significantly and negatively correlated with Scale for the Assessment of Negative Symptoms (SANS) avolition scores in individuals with SZ. On the other hand, individuals with SZ showed a spared ability for Go/NoGo learning and expected value representation.
CONCLUSIONS: Despite the small sample size and relatively modest findings, our results suggest a possible reduced learning bias towards the reward context among medication-naïve individuals with first-episode SZ. Reward learning performance was correlated with amotivation symptoms. This finding may facilitate our understanding of the underlying mechanism of negative symptoms. Reinforcement learning performance under the reward context may be important for better predicting and preventing the development of negative symptoms, especially amotivation, in patients with schizophrenia.
Affiliation(s)
- Xiaoyan Cheng
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, 600 South Wanping Road, Shanghai, China; Clinical Research Center for Mental Disorders, Shanghai Pudong New Area Mental Health Center, School of Medicine, Tongji University, Shanghai, China
- Lingling Wang
- Neuropsychology and Applied Cognitive Neuroscience Laboratory, CAS Key Laboratory of Mental Health, Institute of Psychology, Chinese Academy of Sciences, Beijing, China; Department of Psychology, University of Chinese Academy of Sciences, Beijing, China; Key Laboratory of Brain Functional Genomics (MOE&STCSM), Affiliated Mental Health Center (ECNU), School of Psychology and Cognitive Science, East China Normal University, 3663 North Zhongshan Road, Shanghai, 200062, China
- Qinyu Lv
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, 600 South Wanping Road, Shanghai, China
- Haisu Wu
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, 600 South Wanping Road, Shanghai, China
- Xinxin Huang
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, 600 South Wanping Road, Shanghai, China
- Jie Yuan
- Clinical Research Center for Mental Disorders, Shanghai Pudong New Area Mental Health Center, School of Medicine, Tongji University, Shanghai, China
- Xirong Sun
- Clinical Research Center for Mental Disorders, Shanghai Pudong New Area Mental Health Center, School of Medicine, Tongji University, Shanghai, China
- Xudong Zhao
- Clinical Research Center for Mental Disorders, Shanghai Pudong New Area Mental Health Center, School of Medicine, Tongji University, Shanghai, China
- Chao Yan
- Key Laboratory of Brain Functional Genomics (MOE&STCSM), Affiliated Mental Health Center (ECNU), School of Psychology and Cognitive Science, East China Normal University, 3663 North Zhongshan Road, Shanghai, 200062, China
- Zhenghui Yi
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, 600 South Wanping Road, Shanghai, China
Collapse
|
13
|
Martinez-Saito M, Andraszewicz S, Klucharev V, Rieskamp J. Mine or Ours? Neural Basis of the Exploitation of Common-Pool Resources. Soc Cogn Affect Neurosci 2022; 17:837-849. [PMID: 35104883] [PMCID: PMC9433840] [DOI: 10.1093/scan/nsac008]
Abstract
Why do people often exhaust unregulated common (shared) natural resources but manage to preserve similar private resources? To answer this question, we combine neurobiological, economic, and cognitive modeling approaches. Using functional magnetic resonance imaging on 50 participants, we show that a sharp decrease of common and private resources is associated with deactivation of the ventral striatum, a brain region involved in the valuation of outcomes. Across individuals, when facing a common resource, ventral striatal activity is anticorrelated with resource preservation (less harvesting), whereas with private resources the opposite pattern is observed. This indicates that neural value signals distinctly modulate behavior in response to the depletion of common vs. private resources. Computational modeling suggested that overharvesting of common resources was facilitated by the modulatory effect of social comparison on value signals. These results provide an explanation of people's tendency to over-exploit unregulated common natural resources.
Affiliation(s)
- Mario Martinez-Saito
- International Laboratory of Social Neurobiology, Institute of Cognitive Neuroscience, HSE University, Moscow 101000, Russia
- Sandra Andraszewicz
- Department of Humanities, Social and Political Sciences, ETH Zurich, Zurich 8006, Switzerland
- Department of Psychology, University of Basel, Basel 4055, Switzerland
- Vasily Klucharev
- International Laboratory of Social Neurobiology, Institute of Cognitive Neuroscience, HSE University, Moscow 101000, Russia
- Jörg Rieskamp
- Correspondence should be addressed to Jörg Rieskamp, Department of Psychology, University of Basel, Basel 4055, Switzerland.

14
Abdeldayem OM, Dabbish AM, Habashy MM, Mostafa MK, Elhefnawy M, Amin L, Al-Sakkari EG, Ragab A, Rene ER. Viral outbreaks detection and surveillance using wastewater-based epidemiology, viral air sampling, and machine learning techniques: A comprehensive review and outlook. Sci Total Environ 2022; 803:149834. [PMID: 34525746] [PMCID: PMC8379898] [DOI: 10.1016/j.scitotenv.2021.149834]
Abstract
A viral outbreak is a global challenge that affects public health and safety. The coronavirus disease 2019 (COVID-19) has been spreading globally, affecting millions of people worldwide, and has led to significant loss of lives and deterioration of the global economy. The adverse effects caused by the COVID-19 pandemic demand new detection methods for future viral outbreaks. Environmental transmission pathways include, but are not limited to, air, surface water, and wastewater environments. Wastewater surveillance, known as wastewater-based epidemiology (WBE), can potentially monitor viral outbreaks and provide a complementary clinical testing method. Another outbreak surveillance technique that has not yet been implemented in a sufficient number of studies is the surveillance of Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2) in the air. Artificial intelligence (AI) and the related machine learning (ML) and deep learning (DL) technologies are emerging techniques for detecting viral outbreaks using global data. To date, there are no reports that illustrate the potential of using WBE with AI to detect viral outbreaks. This study investigates the transmission pathways of SARS-CoV-2 in the environment and provides current updates on the surveillance of viral outbreaks using WBE, viral air sampling, and AI. It also proposes a novel framework based on an ensemble of ML and DL algorithms to provide a beneficial supportive tool for decision-makers. The framework exploits available data from reliable sources to discover meaningful insights and knowledge that allow researchers and practitioners to build efficient methods and protocols that accurately monitor and detect viral outbreaks. The proposed framework could provide early detection of viruses, forecast risk maps and vulnerable areas, and estimate the number of infected citizens.
Affiliation(s)
- Omar M Abdeldayem
- Department of Water Supply, Sanitation and Environmental Engineering, IHE Delft Institute for Water Education, Westvest 7, 2611AX Delft, the Netherlands
- Areeg M Dabbish
- Biotechnology Graduate Program, Biology Department, School of Science and Engineering, The American University in Cairo, New Cairo 11835, Egypt
- Mahmoud M Habashy
- Department of Water Supply, Sanitation and Environmental Engineering, IHE Delft Institute for Water Education, Westvest 7, 2611AX Delft, the Netherlands
- Mohamed K Mostafa
- Faculty of Engineering and Technology, Badr University in Cairo (BUC), Cairo 11829, Egypt
- Mohamed Elhefnawy
- CanmetENERGY, 1615 Lionel-Boulet Blvd, P.O. Box 4800, Varennes, Québec J3X 1P7, Canada; Department of Mathematics and Industrial Engineering, Polytechnique Montréal, 2500 Chemin de Polytechnique, Montréal, Québec H3T 1J4, Canada
- Lobna Amin
- Department of Water Supply, Sanitation and Environmental Engineering, IHE Delft Institute for Water Education, Westvest 7, 2611AX Delft, the Netherlands; Department of Built Environment, Aalto University, PO Box 15200, FI-00076 Aalto, Finland
- Eslam G Al-Sakkari
- Chemical Engineering Department, Cairo University, Cairo University Road, 12613 Giza, Egypt
- Ahmed Ragab
- CanmetENERGY, 1615 Lionel-Boulet Blvd, P.O. Box 4800, Varennes, Québec J3X 1P7, Canada; Department of Mathematics and Industrial Engineering, Polytechnique Montréal, 2500 Chemin de Polytechnique, Montréal, Québec H3T 1J4, Canada; Faculty of Electronic Engineering, Menoufia University, 32952 Menouf, Egypt
- Eldon R Rene
- Department of Water Supply, Sanitation and Environmental Engineering, IHE Delft Institute for Water Education, Westvest 7, 2611AX Delft, the Netherlands

15
Abstract
Artificial intelligence (AI) tools find increasing application in drug discovery, supporting every stage of the Design-Make-Test-Analyse (DMTA) cycle. The main focus of this chapter is their application to molecular generation with the aid of deep neural networks (DNN). We present a historical overview of the main advances in the field. We analyze the concepts of distribution learning and goal-directed learning and then highlight some recent applications of generative models in drug design, with a focus on research from the biopharmaceutical industry. We present in more detail REINVENT, an open-source software package developed within our group at AstraZeneca and the main platform for AI molecular design support across a number of medicinal chemistry projects in the company, and we also demonstrate some of our work in library design. Finally, we present some of the main challenges in the application of AI in drug discovery and different approaches to respond to these challenges, which define areas for current and future work.
16
Gil Ó, Garrell A, Sanfeliu A. Social Robot Navigation Tasks: Combining Machine Learning Techniques and Social Force Model. Sensors (Basel) 2021; 21:7087. [PMID: 34770395] [DOI: 10.3390/s21217087]
Abstract
Social robot navigation in public spaces, buildings, or private houses is a difficult problem that is not well solved, owing to environmental constraints (buildings, static objects, etc.), pedestrians, and other mobile vehicles. Moreover, robots have to move in a human-aware manner; that is, robots have to navigate in such a way that people feel safe and comfortable. In this work, we present two navigation tasks, social robot navigation and robot accompaniment, which combine machine learning techniques with the Social Force Model (SFM), allowing human-aware social navigation. The robots in both approaches use data from different sensors to capture knowledge of the environment as well as information on pedestrian motion. Both navigation tasks make use of the SFM, a general framework in which human motion behaviors can be expressed through a set of functions depending on the pedestrians' relative and absolute positions and velocities. Additionally, in both tasks the robot's motion behavior is learned using machine learning techniques: in the first case using supervised deep learning and, in the second case, using Reinforcement Learning (RL). The machine learning techniques are combined with the SFM to create navigation models that behave in a social manner when the robot is navigating in an environment with pedestrians or accompanying a person. The systems were validated with a large set of simulations and real-life experiments with a new humanoid robot named IVO and with an aerial robot. The experiments show that the combination of SFM and machine learning can solve human-aware robot navigation in complex dynamic environments.
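The SFM mentioned in this abstract models pedestrians and robots as point masses driven by a goal-attraction term plus exponential repulsion from nearby agents (following Helbing and Molnár's classic formulation). The sketch below is illustrative, not the paper's implementation; the parameter values (`v0`, `tau`, `A`, `B`) are assumptions chosen for readability.

```python
import numpy as np

def social_force(pos, vel, goal, peds, v0=1.3, tau=0.5, A=2.0, B=0.3):
    """Net force on the robot: goal attraction plus pedestrian repulsion.

    pos, vel: robot position/velocity (2-vectors); goal: goal position;
    peds: pedestrian positions, shape (n, 2). v0, tau, A, B are
    illustrative SFM parameters (desired speed, relaxation time,
    repulsion strength and range), not values from the paper.
    """
    # Attractive term: relax toward the desired velocity v0 * e_goal.
    e_goal = (goal - pos) / np.linalg.norm(goal - pos)
    f = (v0 * e_goal - vel) / tau
    # Repulsive term: exponential decay with distance to each pedestrian.
    for p in np.atleast_2d(peds):
        d = pos - p
        dist = np.linalg.norm(d)
        f += A * np.exp(-dist / B) * d / dist
    return f

force = social_force(np.array([0.0, 0.0]), np.array([0.0, 0.0]),
                     np.array([5.0, 0.0]), np.array([[1.0, 0.5]]))
```

In the paper's setting, learned components (deep networks or RL policies) replace or modulate hand-tuned terms like these rather than discard the SFM structure.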
17
Eckstein MK, Wilbrecht L, Collins AGE. What do Reinforcement Learning Models Measure? Interpreting Model Parameters in Cognition and Neuroscience. Curr Opin Behav Sci 2021; 41:128-137. [PMID: 34984213] [PMCID: PMC8722372] [DOI: 10.1016/j.cobeha.2021.06.004]
Abstract
Reinforcement learning (RL) is a concept that has been invaluable to fields including machine learning, neuroscience, and cognitive science. However, what RL entails differs between fields, leading to difficulties when interpreting and translating findings. After laying out these differences, this paper focuses on cognitive (neuro)science to discuss how we as a field might over-interpret RL modeling results. We too often assume, implicitly, that modeling results generalize between tasks, models, and participant populations, despite negative empirical evidence for this assumption. We also often assume that parameters measure specific, unique (neuro)cognitive processes, a concept we call interpretability, when evidence suggests that they capture different functions across studies and tasks. We conclude that future computational research needs to pay increased attention to implicit assumptions when using RL models, and suggest that a more systematic understanding of contextual factors will help address these issues and improve the ability of RL to explain brain and behavior.
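The "parameters" whose interpretation this review scrutinizes are typically those of a delta-rule learner with softmax choice, such as the learning rate and inverse temperature. A minimal sketch of that standard cognitive RL model, on an assumed two-armed bandit with illustrative parameter values:

```python
import numpy as np

def softmax(q, beta):
    """Choice probabilities; beta is the inverse temperature."""
    e = np.exp(beta * (q - q.max()))
    return e / e.sum()

def simulate(alpha=0.3, beta=5.0, trials=200, p_reward=(0.8, 0.2), seed=0):
    """Two-armed bandit with a delta-rule learner.

    alpha (learning rate) and beta (inverse temperature) are exactly the
    kind of fitted parameters whose cross-task meaning the review questions.
    """
    rng = np.random.default_rng(seed)
    q = np.zeros(2)
    choices = []
    for _ in range(trials):
        a = rng.choice(2, p=softmax(q, beta))
        r = float(rng.random() < p_reward[a])
        q[a] += alpha * (r - q[a])   # delta rule: prediction-error update
        choices.append(int(a))
    return q, choices

q, choices = simulate()
```

Fitting `alpha` and `beta` to human choice data, and then interpreting them as stable (neuro)cognitive traits, is the inferential step the paper argues deserves more caution.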
Affiliation(s)
- Maria K Eckstein
- Department of Psychology, UC Berkeley, 2121 Berkeley Way West, Berkeley, CA 94720, USA
- Linda Wilbrecht
- Department of Psychology, UC Berkeley, 2121 Berkeley Way West, Berkeley, CA 94720, USA
- Helen Wills Neuroscience Institute, UC Berkeley, 175 Li Ka Shing Center, Berkeley, CA 94720, USA
- Anne G E Collins
- Department of Psychology, UC Berkeley, 2121 Berkeley Way West, Berkeley, CA 94720, USA
- Helen Wills Neuroscience Institute, UC Berkeley, 175 Li Ka Shing Center, Berkeley, CA 94720, USA

18
Pereira T, Abbasi M, Ribeiro B, Arrais JP. Diversity oriented Deep Reinforcement Learning for targeted molecule generation. J Cheminform 2021; 13:21. [PMID: 33750461] [PMCID: PMC7944916] [DOI: 10.1186/s13321-021-00498-z]
Abstract
In this work, we explore the potential of deep learning to streamline the process of identifying new potential drugs through the computational generation of molecules with interesting biological properties. Two deep neural networks compose our targeted generation framework: the Generator, which is trained to learn the building rules of valid molecules employing SMILES string notation, and the Predictor, which evaluates the newly generated compounds by predicting their affinity for the desired target. The Generator is then optimized through Reinforcement Learning to produce molecules with bespoke properties. The innovation of this approach is the exploratory strategy applied during the reinforcement training process, which seeks to add novelty to the generated compounds. This training strategy employs two Generators interchangeably to sample new SMILES: the initially trained model, which remains fixed, and a copy of it that is updated during training to uncover the most promising molecules. The evolution of the reward assigned by the Predictor determines how often each one is employed to select the next token of the molecule. This strategy establishes a compromise between the need to acquire more information about the chemical space and the need to sample new molecules with the experience gained so far. To demonstrate the effectiveness of the method, the Generator is trained to design molecules with an optimized partition coefficient and also high inhibitory power against the Adenosine [Formula: see text] and [Formula: see text] opioid receptors. The results reveal that the model can effectively adjust the newly generated molecules towards the wanted direction. More importantly, it was possible to find promising sets of unique and diverse molecules, which was the main purpose of the newly implemented strategy.
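The core loop described here, sampling token sequences and pushing the generator toward higher Predictor reward, is a policy-gradient (REINFORCE) update. The toy below is only a sketch of that mechanism under heavy simplifications: a three-token vocabulary stands in for SMILES, a token-counting function stands in for the Predictor, and the paper's two-generator exploration scheme is not reproduced.

```python
import numpy as np

rng = np.random.default_rng(1)
VOCAB = ["A", "B", "C"]          # toy stand-in for SMILES tokens
logits = np.zeros(len(VOCAB))    # unconditional token policy (toy Generator)

def sample_seq(logits, length=8):
    p = np.exp(logits - logits.max())
    p /= p.sum()
    return rng.choice(len(VOCAB), size=length, p=p), p

def reward(seq):
    # Toy "Predictor": fraction of the desired token 'A' in the sequence.
    return float(np.mean(seq == 0))

def train(logits, steps=300, lr=0.5):
    baseline = 0.0
    for _ in range(steps):
        seq, p = sample_seq(logits)
        r = reward(seq)
        baseline = 0.9 * baseline + 0.1 * r      # variance-reduction baseline
        # REINFORCE: raise log-prob of sampled tokens, weighted by advantage.
        grad = np.zeros_like(logits)
        for t in seq:
            grad += np.eye(len(VOCAB))[t] - p
        logits += lr * (r - baseline) * grad / len(seq)
    return logits

before = np.mean([reward(sample_seq(np.zeros(3))[0]) for _ in range(200)])
logits = train(logits)
after = np.mean([reward(sample_seq(logits)[0]) for _ in range(200)])
```

In the paper, the policy is a recurrent network over real SMILES and the reward is a learned affinity predictor; the update principle is the same.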
Affiliation(s)
- Tiago Pereira
- Department of Informatics Engineering, Centre for Informatics and Systems of the University of Coimbra, University of Coimbra, Pinhal de Marrocos, Coimbra, Portugal
- Maryam Abbasi
- Department of Informatics Engineering, Centre for Informatics and Systems of the University of Coimbra, University of Coimbra, Pinhal de Marrocos, Coimbra, Portugal
- Bernardete Ribeiro
- Department of Informatics Engineering, Centre for Informatics and Systems of the University of Coimbra, University of Coimbra, Pinhal de Marrocos, Coimbra, Portugal
- Joel P. Arrais
- Department of Informatics Engineering, Centre for Informatics and Systems of the University of Coimbra, University of Coimbra, Pinhal de Marrocos, Coimbra, Portugal

19
Khadilkar H, Ganu T, Seetharam DP. Optimising Lockdown Policies for Epidemic Control using Reinforcement Learning: An AI-Driven Control Approach Compatible with Existing Disease and Network Models. Trans Indian Natl Acad Eng 2020; 5:129-132. [PMID: 38624387] [PMCID: PMC7311597] [DOI: 10.1007/s41403-020-00129-3]
Abstract
There has been intense debate about lockdown policies in the context of Covid-19 for limiting damage both to health and to the economy. We present an AI-driven approach for generating optimal lockdown policies that control the spread of the disease while balancing both health and economic costs. Furthermore, the proposed reinforcement learning approach automatically learns those policies, as a function of disease and population parameters. The approach accounts for imperfect lockdowns, can be used to explore a range of policies using tunable parameters, and can be easily extended to fine-grained lockdown strictness. The control approach can be used with any compatible disease and network simulation models.
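The control setup described, an RL agent choosing lockdown actions against a disease simulator, with a reward that trades infection burden against economic cost, can be sketched with tabular Q-learning over a coarsely discretized SIR model. Everything below is illustrative (bucket count, contact rates, the "imperfect lockdown" factor, the economic-cost weight), not the paper's model or parameters.

```python
import numpy as np

def sir_step(s, i, beta, gamma=0.1):
    """One step of a discrete-time SIR model (fractions of population)."""
    new_inf = beta * s * i
    rec = gamma * i
    return s - new_inf, i + new_inf - rec

def train(episodes=400, alpha=0.2, gamma_rl=0.95, eps=0.1,
          econ_cost=0.02, seed=0):
    """Tabular Q-learning: state = infection-level bucket, action = lockdown on/off."""
    rng = np.random.default_rng(seed)
    q = np.zeros((10, 2))                      # 10 infection buckets x 2 actions
    for _ in range(episodes):
        s, i = 0.99, 0.01
        for _ in range(50):
            b = min(int(i * 10), 9)
            a = int(rng.integers(2)) if rng.random() < eps else int(q[b].argmax())
            beta = 0.12 if a == 1 else 0.35    # lockdown imperfectly cuts contacts
            s, i = sir_step(s, i, beta)
            r = -i - econ_cost * a             # infection burden + economic cost
            b2 = min(int(i * 10), 9)
            q[b, a] += alpha * (r + gamma_rl * q[b2].max() - q[b, a])
    return q

q = train()
```

Because any simulator exposing a step function can slot into `sir_step`, the scheme is compatible with richer disease and network models, which is the compatibility point the abstract emphasizes.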
20
Xu H, Liu X, Yu W, Griffith D, Golmie N. Reinforcement Learning-Based Control and Networking Co-design for Industrial Internet of Things. IEEE J Sel Areas Commun 2020; 38. [PMID: 37555009] [PMCID: PMC10408385] [DOI: 10.1109/jsac.2020.2980909]
Abstract
Industrial Internet-of-Things (IIoT), also known as Industry 4.0, is the integration of Internet of Things (IoT) technology into industrial manufacturing systems so that the connectivity, efficiency, and intelligence of factories and plants can be improved. From a cyber-physical system (CPS) perspective, multiple systems (e.g., control, networking, and computing systems) are synthesized interactively into IIoT systems to achieve the operator's design goals. The interactions among different systems are a non-negligible factor that affects IIoT design and requirements, such as automation, especially under dynamic industrial operations. In this paper, we leverage reinforcement learning techniques to automatically configure the control and networking systems under a dynamic industrial environment. We design three new policies based on the characteristics of industrial systems so that the reinforcement learning converges rapidly. We implement and integrate the reinforcement learning-based co-design approach on a realistic wireless cyber-physical simulator to conduct extensive experiments. Our experimental results demonstrate that our approach can effectively and quickly reconfigure the control and networking systems automatically in a dynamic industrial environment.
Affiliation(s)
- David Griffith
- National Institute of Standards and Technology (NIST), USA
- Nada Golmie
- National Institute of Standards and Technology (NIST), USA

21
Liao P, Greenewald K, Klasnja P, Murphy S. Personalized HeartSteps: A Reinforcement Learning Algorithm for Optimizing Physical Activity. Proc ACM Interact Mob Wearable Ubiquitous Technol 2020; 4:18. [PMID: 34527853] [PMCID: PMC8439432] [DOI: 10.1145/3381007]
Abstract
With the recent proliferation of mobile health technologies, health scientists are increasingly interested in developing just-in-time adaptive interventions (JITAIs), typically delivered via notifications on mobile devices and designed to help users prevent negative health outcomes and to promote the adoption and maintenance of healthy behaviors. A JITAI involves a sequence of decision rules (i.e., treatment policies) that take the user's current context as input and specify whether and what type of intervention should be provided at the moment. In this work, we describe a reinforcement learning (RL) algorithm that continuously learns and improves the treatment policy embedded in the JITAI as data is collected from the user. This work is motivated by our collaboration on designing an RL algorithm for HeartSteps V2 based on data collected in HeartSteps V1. HeartSteps is a physical activity mobile health application. The RL algorithm developed in this work is being used in HeartSteps V2 to decide, five times per day, whether to deliver a context-tailored activity suggestion.
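Online decision rules of this kind, "given the current context, send a suggestion or not", are commonly built on Bayesian contextual bandits with Thompson sampling. The sketch below is a generic such bandit, not the actual HeartSteps algorithm; the context features, priors, and parameter values are assumptions for illustration.

```python
import numpy as np

class ThompsonLinearBandit:
    """Bayesian linear bandit: a Gaussian posterior over reward weights per action."""

    def __init__(self, dim, n_actions=2, noise=1.0, prior_var=1.0, seed=0):
        self.rng = np.random.default_rng(seed)
        self.noise = noise
        # Posterior precision matrix and weighted-observation vector per action.
        self.A = [np.eye(dim) / prior_var for _ in range(n_actions)]
        self.b = [np.zeros(dim) for _ in range(n_actions)]

    def act(self, context):
        """Draw weights from each action's posterior; pick the best-scoring action."""
        scores = []
        for A, b in zip(self.A, self.b):
            cov = np.linalg.inv(A)
            mean = cov @ b
            w = self.rng.multivariate_normal(mean, cov)  # posterior sample
            scores.append(context @ w)
        return int(np.argmax(scores))

    def update(self, context, action, reward):
        """Standard Bayesian linear-regression update for the chosen action."""
        self.A[action] += np.outer(context, context) / self.noise
        self.b[action] += context * reward / self.noise

bandit = ThompsonLinearBandit(dim=3)
ctx = np.array([1.0, 0.2, -0.5])   # hypothetical features: bias, recent activity, weather
a = bandit.act(ctx)                # 0 = no notification, 1 = send suggestion
bandit.update(ctx, a, reward=1.0)  # e.g. subsequent step count as reward
```

Because the posterior tightens as data accrues, the policy personalizes to each user over time, which is the continuous-learning property the abstract highlights.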
22
Alexiadis A. Deep multiphysics: Coupling discrete multiphysics with machine learning to attain self-learning in-silico models replicating human physiology. Artif Intell Med 2019; 98:27-34. [PMID: 31521250] [DOI: 10.1016/j.artmed.2019.06.005]
Abstract
OBJECTIVES The objective of this study is to devise a modelling strategy for attaining in-silico models replicating human physiology and, in particular, the activity of the autonomic nervous system. METHOD Discrete Multiphysics (a multiphysics modelling technique) and Reinforcement Learning (a machine learning algorithm) are combined to achieve an in-silico model with the ability to self-learn and replicate feedback loops occurring in human physiology. Computational particles, used in Discrete Multiphysics to model biological systems, are associated with (computational) neurons: Reinforcement Learning trains these neurons to behave as they would in real biological systems. RESULTS As benchmark/validation, we use the case of peristalsis in the oesophagus. Results show that the in-silico model effectively learns by itself how to propel the bolus in the oesophagus. CONCLUSIONS The combination of first-principles modelling (e.g. multiphysics) and machine learning (e.g. Reinforcement Learning) represents a new powerful tool for in-silico modelling of human physiology. Biological feedback loops occurring, for instance, in peristaltic or metachronal motion, which until now could not be accounted for in in-silico models, can be tackled by the proposed technique.
Affiliation(s)
- Alessio Alexiadis
- School of Chemical Engineering, University of Birmingham, Edgbaston, Birmingham B15 2TT, United Kingdom

23
Abstract
In this paper, we introduce a new type of tree-based method, reinforcement learning trees (RLT), which exhibits significantly improved performance over traditional methods such as random forests (Breiman, 2001) under high-dimensional settings. The innovations are three-fold. First, the new method implements reinforcement learning at each selection of a splitting variable during the tree construction process. By splitting on the variable that brings the greatest future improvement in later splits, rather than choosing the one with the largest marginal effect from the immediate split, the constructed tree utilizes the available samples more efficiently. Moreover, such an approach enables linear combination cuts at little extra computational cost. Second, we propose a variable muting procedure that progressively eliminates noise variables during the construction of each individual tree. The muting procedure also takes advantage of reinforcement learning and prevents noise variables from being considered in the search for splitting rules, so that towards terminal nodes, where the sample size is small, the splitting rules are still constructed from only strong variables. Last, we investigate asymptotic properties of the proposed method under basic assumptions and discuss the rationale in general settings.
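The "greatest future improvement" idea can be made concrete with a one-step lookahead: score a candidate split not by its own variance reduction but by the best reduction it enables in the children. The toy below (an XOR-style target, where the truly informative variables have no marginal effect) illustrates that distinction only; it is not the authors' algorithm, and the fixed threshold at 0 is a simplifying assumption.

```python
import numpy as np

def sse(y):
    """Sum of squared errors around the mean; 0 for an empty node."""
    return ((y - y.mean()) ** 2).sum() if len(y) else 0.0

def marginal_gain(x, y):
    """Variance reduction from splitting on x at threshold 0."""
    left, right = y[x <= 0], y[x > 0]
    return sse(y) - sse(left) - sse(right)

def lookahead_gain(X, y, j, others):
    """Split on variable j, then credit the best follow-up split in each child."""
    gain = marginal_gain(X[:, j], y)
    for mask in (X[:, j] <= 0, X[:, j] > 0):
        child_y = y[mask]
        if len(child_y) < 2:
            continue
        gain += max(marginal_gain(X[mask, k], child_y) for k in others)
    return gain

rng = np.random.default_rng(0)
X = rng.choice([-1.0, 1.0], size=(400, 3))   # column 2 is pure noise
y = X[:, 0] * X[:, 1]                        # XOR-like: no marginal effect

strong = lookahead_gain(X, y, 0, others=[1, 2])  # informative variable
noise = lookahead_gain(X, y, 2, others=[0, 1])   # noise variable
```

A marginal criterion scores all three variables near zero here, while the lookahead score clearly separates the informative variable from the noise one, which is why the immediate-gain heuristic wastes samples in such settings.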
Affiliation(s)
- Ruoqing Zhu
- Department of Biostatistics, CB#7420, University of North Carolina, Chapel Hill, NC 27599-7420
- Donglin Zeng
- Department of Biostatistics, CB#7420, University of North Carolina, Chapel Hill, NC 27599-7420
- Michael R Kosorok
- Department of Biostatistics, CB#7420, University of North Carolina, Chapel Hill, NC 27599-7420

24
Balasubramani PP, Chakravarthy VS, Ravindran B, Moustafa AA. An extended reinforcement learning model of basal ganglia to understand the contributions of serotonin and dopamine in risk-based decision making, reward prediction, and punishment learning. Front Comput Neurosci 2014; 8:47. [PMID: 24795614] [PMCID: PMC3997037] [DOI: 10.3389/fncom.2014.00047]
Abstract
Although empirical and neural studies show that serotonin (5HT) plays many functional roles in the brain, prior computational models mostly focus on its role in behavioral inhibition. In this study, we present a model of risk-based decision making in a modified Reinforcement Learning (RL) framework. The model depicts the roles of dopamine (DA) and serotonin (5HT) in the Basal Ganglia (BG). In this model, the DA signal is represented by the temporal difference error (δ), while the 5HT signal is represented by a parameter (α) that controls risk prediction error. This formulation, which accommodates both 5HT and DA, reconciles some of the diverse roles of 5HT, particularly in connection with the BG system. We apply the model to different experimental paradigms used to study the role of 5HT: (1) risk-sensitive decision making, where 5HT controls risk assessment; (2) temporal reward prediction, where 5HT controls the time-scale of reward prediction; and (3) reward/punishment sensitivity, in which the punishment prediction error depends on 5HT levels. Thus the proposed integrated RL model reconciles several existing theories of 5HT and DA in the BG.
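The ingredients named in the abstract, a TD error δ (the DA-like signal), a risk estimate updated by a risk prediction error, and a parameter α (the 5HT-like signal) weighting risk in the choice utility, can be sketched on a risk-sensitive bandit. This is a loose illustration of those ingredients under assumed parameter values, not the paper's full BG model.

```python
import numpy as np

def run(alpha, trials=2000, lam=0.1, seed=0):
    """Two arms with equal mean reward but different reward variance.

    alpha plays the 5HT-like role: it weights the risk term in the
    utility, so a larger alpha should bias choice toward the safe arm.
    """
    rng = np.random.default_rng(seed)
    q = np.zeros(2)      # expected reward per arm
    h = np.ones(2)       # risk (reward-variance) estimate per arm
    picks = np.zeros(2)
    for t in range(trials):
        # Utility trades expected value against estimated risk.
        u = q - alpha * np.sqrt(h)
        explore = t < 100 or rng.random() < 0.1
        a = int(rng.integers(2)) if explore else int(u.argmax())
        r = rng.normal(1.0, 0.1 if a == 0 else 2.0)  # arm 0 safe, arm 1 risky
        delta = r - q[a]                   # TD error: the DA-like signal
        q[a] += lam * delta
        h[a] += lam * (delta ** 2 - h[a])  # risk prediction error update
        picks[a] += 1
    return picks

safe_seeking = run(alpha=1.0)
```

With α = 1 the agent settles on the low-variance arm despite equal means; setting α near 0 removes the risk penalty, mimicking how the model maps 5HT level onto risk sensitivity.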
Affiliation(s)
- Balaraman Ravindran
- Department of Computer Science and Engineering, Indian Institute of Technology Madras, Chennai, India
- Ahmed A Moustafa
- Foundational Processes of Behaviour Research Concentration, Marcs Institute for Brain and Behaviour & School of Social Sciences and Psychology, University of Western Sydney, Sydney, NSW, Australia

25
Abstract
In this paper, we consider the batch mode reinforcement learning setting, where the central problem is to learn from a sample of trajectories a policy that satisfies or optimizes a performance criterion. We focus on the continuous state space case for which usual resolution schemes rely on function approximators either to represent the underlying control problem or to represent its value function. As an alternative to the use of function approximators, we rely on the synthesis of "artificial trajectories" from the given sample of trajectories, and show that this idea opens new avenues for designing and analyzing algorithms for batch mode reinforcement learning.
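For context, the standard batch-mode scheme this work departs from is fitted Q iteration: repeatedly regress Bellman targets computed from the fixed sample of transitions. The sketch below uses a tabular average as the "regression" step on a two-state toy problem; the paper's artificial-trajectory idea replaces exactly this function-approximation step, so treat the code as the baseline being contrasted, not the authors' method.

```python
import numpy as np

def fitted_q_iteration(transitions, n_states, n_actions, gamma=0.9, iters=50):
    """Batch-mode RL: learn Q from a fixed sample of (s, a, r, s') tuples only."""
    q = np.zeros((n_states, n_actions))
    for _ in range(iters):
        # Build regression targets r + gamma * max_a' Q(s', a') from the batch.
        targets = {}
        for s, a, r, s2 in transitions:
            targets.setdefault((s, a), []).append(r + gamma * q[s2].max())
        new_q = np.zeros_like(q)
        for (s, a), ys in targets.items():
            new_q[s, a] = np.mean(ys)   # "fit" = average of the targets
        q = new_q
    return q

# Two-state chain: action 1 moves toward state 1, which pays reward 1.
batch = [(0, 0, 0.0, 0), (0, 1, 0.0, 1), (1, 0, 0.0, 0), (1, 1, 1.0, 1)]
q = fitted_q_iteration(batch, n_states=2, n_actions=2)
policy = q.argmax(axis=1)
```

In continuous state spaces the averaging step becomes a supervised regressor, and the quality of that approximator dominates the result, which motivates the artificial-trajectory alternative analyzed in the paper.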