1
|
Fetecau C, Stan F, Boazu D. Artificial Neural Network Modeling of Mechanical Properties of 3D-Printed Polyamide 12 and Its Fiber-Reinforced Composites. Polymers (Basel) 2025; 17:677. [PMID: 40076169 PMCID: PMC11902404 DOI: 10.3390/polym17050677] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2025] [Revised: 02/19/2025] [Accepted: 02/28/2025] [Indexed: 03/14/2025] Open
Abstract
Fused filament fabrication (FFF) has recently emerged as a sustainable digital manufacturing technology to fabricate polymer composite parts with complex structures and minimal waste. However, FFF-printed composite parts frequently exhibit heterogeneous structures with low mechanical properties. To manufacture high-end parts with good mechanical properties, advanced predictive tools are required. In this paper, Artificial Neural Network (ANN) models were developed to evaluate the mechanical properties of 3D-printed polyamide 12 (PA) and carbon fiber (CF) and glass fiber (GF) reinforced PA composites. Tensile samples were fabricated by FFF, considering two input parameters, such as printing orientation and infill density, and tested to determine the mechanical properties. Then, single- and multi-target ANN models were trained using the forward propagation Levenberg-Marquardt algorithm. Post-training performance analysis indicated that the ANN models work efficiently and accurately in predicting Young's modulus and tensile strength of the 3D-printed PA and fiber-reinforced PA composites, with most relative errors being far less than 5%. In terms of mechanical properties, such as Young's modulus and tensile strength, the 3D-printed composites outperform the unreinforced PA. Printing PA composites with 0° orientation and 100% infill density results in a maximum increase in Young's modulus (up to 98% for CF/PA and 32% for GF/PA) and tensile strength (up to 36% for CF/PA and 18% for GF/PA) compared to the unreinforced PA. This study underscores the potential of the ANN models to predict the mechanical properties of 3D-printed parts, enhancing the use of 3D-printed PA composite components in structural applications.
Collapse
Affiliation(s)
- Catalin Fetecau
- Center of Excellence Polymer Processing, Dunarea de Jos University of Galati, Domneasca 47, 800 008 Galati, Romania;
| | - Felicia Stan
- Center of Excellence Polymer Processing, Dunarea de Jos University of Galati, Domneasca 47, 800 008 Galati, Romania;
| | - Doina Boazu
- Faculty of Engineering, Dunarea de Jos University of Galati, Domneasca 47, 800 008 Galati, Romania;
| |
Collapse
|
2
|
Jean H, Wallaert N, Dreumont A, Creff G, Godey B, Paraouty N. Automating Speech Audiometry in Quiet and in Noise Using a Deep Neural Network. BIOLOGY 2025; 14:191. [PMID: 40001959 PMCID: PMC11851792 DOI: 10.3390/biology14020191] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/13/2025] [Revised: 02/10/2025] [Accepted: 02/11/2025] [Indexed: 02/27/2025]
Abstract
In addition to pure-tone audiometry tests and electrophysiological tests, a comprehensive hearing evaluation includes assessing a subject's ability to understand speech in quiet and in noise. In fact, speech audiometry tests are commonly used in clinical practice; however, they are time-consuming as they require manual scoring by a hearing professional. To address this issue, we developed an automated speech recognition (ASR) system for scoring subject responses at the phonetic level. The ASR was built using a deep neural network and trained with pre-recorded French speech materials: Lafon's cochlear lists and Dodelé logatoms. Next, we tested the performance and reliability of the ASR in clinical settings with both normal-hearing and hearing-impaired listeners. Our findings indicate that the ASR's performance is statistically similar to manual scoring by expert hearing professionals, both in quiet and in noisy conditions. Moreover, the test-retest reliability of the automated scoring closely matches that of manual scoring. Together, our results validate the use of this deep neural network in both clinical and research contexts for conducting speech audiometry tests in quiet and in noise.
Collapse
Affiliation(s)
- Hadrien Jean
- R&D Department, My Medical Assistant SAS, 51100 Reims, France
| | - Nicolas Wallaert
- R&D Department, My Medical Assistant SAS, 51100 Reims, France
- Department of Otorhinolaryngology-Head and Neck Surgery, Rennes University Hospital, 35000 Rennes, France
| | | | - Gwenaelle Creff
- Department of Otorhinolaryngology-Head and Neck Surgery, Rennes University Hospital, 35000 Rennes, France
| | - Benoit Godey
- Department of Otorhinolaryngology-Head and Neck Surgery, Rennes University Hospital, 35000 Rennes, France
| | - Nihaad Paraouty
- R&D Department, My Medical Assistant SAS, 51100 Reims, France
| |
Collapse
|
3
|
Faber K, Corizzo R, Sniezynski B, Japkowicz N. VLAD: Task-agnostic VAE-based lifelong anomaly detection. Neural Netw 2023; 165:248-273. [PMID: 37307668 DOI: 10.1016/j.neunet.2023.05.032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2022] [Revised: 04/19/2023] [Accepted: 05/17/2023] [Indexed: 06/14/2023]
Abstract
Lifelong learning represents an emerging machine learning paradigm that aims at designing new methods providing accurate analyses in complex and dynamic real-world environments. Although a significant amount of research has been conducted in image classification and reinforcement learning, very limited work has been done to solve lifelong anomaly detection problems. In this context, a successful method has to detect anomalies while adapting to changing environments and preserving knowledge to avoid catastrophic forgetting. While state-of-the-art online anomaly detection methods are able to detect anomalies and adapt to a changing environment, they are not designed to preserve past knowledge. On the other hand, while lifelong learning methods are focused on adapting to changing environments and preserving knowledge, they are not tailored for detecting anomalies, and often require task labels or task boundaries which are not available in task-agnostic lifelong anomaly detection scenarios. This paper proposes VLAD, a novel VAE-based Lifelong Anomaly Detection method addressing all these challenges simultaneously in complex task-agnostic scenarios. VLAD leverages the combination of lifelong change point detection and an effective model update strategy supported by experience replay with a hierarchical memory maintained by means of consolidation and summarization. An extensive quantitative evaluation showcases the merit of the proposed method in a variety of applied settings. VLAD outperforms state-of-the-art methods for anomaly detection, presenting increased robustness and performance in complex lifelong settings.
Collapse
Affiliation(s)
- Kamil Faber
- AGH University of Science and Technology, Institute of Computer Science, Adama Mickiewicza 30, Krakow, 30-059, Poland.
| | - Roberto Corizzo
- American University, Department of Computer Science, 4400 Massachusetts Ave NW, Washington, 20016, DC, United States.
| | - Bartlomiej Sniezynski
- AGH University of Science and Technology, Institute of Computer Science, Adama Mickiewicza 30, Krakow, 30-059, Poland.
| | - Nathalie Japkowicz
- American University, Department of Computer Science, 4400 Massachusetts Ave NW, Washington, 20016, DC, United States.
| |
Collapse
|
4
|
Yuen S, Ezard THG, Sobey AJ. Epigenetic opportunities for evolutionary computation. ROYAL SOCIETY OPEN SCIENCE 2023; 10:221256. [PMID: 37181799 PMCID: PMC10170609 DOI: 10.1098/rsos.221256] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Accepted: 04/20/2023] [Indexed: 05/16/2023]
Abstract
Evolutionary computation is a group of biologically inspired algorithms used to solve complex optimization problems. It can be split into evolutionary algorithms, which take inspiration from genetic inheritance, and swarm intelligence algorithms, that take inspiration from cultural inheritance. However, much of the modern evolutionary literature remains relatively unexplored. To understand which evolutionary mechanisms have been considered, and which have been overlooked, this paper breaks down successful bioinspired algorithms under a contemporary biological framework based on the extended evolutionary synthesis, an extension of the classical, genetics focused, modern synthesis. Although the idea of the extended evolutionary synthesis has not been fully accepted in evolutionary theory, it presents many interesting concepts that could provide benefits to evolutionary computation. The analysis shows that Darwinism and the modern synthesis have been incorporated into evolutionary computation but the extended evolutionary synthesis has been broadly ignored beyond: cultural inheritance, incorporated in the sub-set of swarm intelligence algorithms, evolvability, through covariance matrix adaptation evolution strategy (CMA-ES), and multilevel selection, through multilevel selection genetic algorithm (MLSGA). The framework shows a gap in epigenetic inheritance for evolutionary computation, despite being a key building block in modern interpretations of evolution. This leaves a diverse range of biologically inspired mechanisms as low hanging fruit that should be explored further within evolutionary computation and illustrates the potential of epigenetic based approaches through the recent benchmarks in the literature.
Collapse
Affiliation(s)
- Sizhe Yuen
- Maritime Engineering, University of Southampton, Southampton SO17 1BJ, UK
| | - Thomas H. G. Ezard
- Ocean and Earth Science, National Oceanography Centre Southampton, European Way, University of Southampton, Southampton SO14 3ZH, UK
| | - Adam J. Sobey
- Maritime Engineering, University of Southampton, Southampton SO17 1BJ, UK
- Marine and Maritime Group, Data-centric Engineering, The Alan Turing Institute, The British Library, London NW1 2DB, UK
| |
Collapse
|
5
|
Li Z, Yu S, Lu Y, Chen P. Plastic Gating Network: Adapting to Personal Development and Individual Differences in Knowledge Tracing. Inf Sci (N Y) 2023. [DOI: 10.1016/j.ins.2023.01.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
|
6
|
Alnowaiser KK, Ahmed MA. Digital Twin: Current Research Trends and Future Directions. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING 2022. [DOI: 10.1007/s13369-022-07459-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
7
|
Hospedales T, Antoniou A, Micaelli P, Storkey A. Meta-Learning in Neural Networks: A Survey. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2022; 44:5149-5169. [PMID: 33974543 DOI: 10.1109/tpami.2021.3079209] [Citation(s) in RCA: 82] [Impact Index Per Article: 27.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
The field of meta-learning, or learning-to-learn, has seen a dramatic rise in interest in recent years. Contrary to conventional approaches to AI where tasks are solved from scratch using a fixed learning algorithm, meta-learning aims to improve the learning algorithm itself, given the experience of multiple learning episodes. This paradigm provides an opportunity to tackle many conventional challenges of deep learning, including data and computation bottlenecks, as well as generalization. This survey describes the contemporary meta-learning landscape. We first discuss definitions of meta-learning and position it with respect to related fields, such as transfer learning and hyperparameter optimization. We then propose a new taxonomy that provides a more comprehensive breakdown of the space of meta-learning methods today. We survey promising applications and successes of meta-learning such as few-shot learning and reinforcement learning. Finally, we discuss outstanding challenges and promising areas for future research.
Collapse
|
8
|
|
9
|
Lessons from infant learning for unsupervised machine learning. NAT MACH INTELL 2022. [DOI: 10.1038/s42256-022-00488-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
10
|
Evolutionary neural networks for deep learning: a review. INT J MACH LEARN CYB 2022. [DOI: 10.1007/s13042-022-01578-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
11
|
Yoder JA, Anderson CB, Wang C, Izquierdo EJ. Reinforcement Learning for Central Pattern Generation in Dynamical Recurrent Neural Networks. Front Comput Neurosci 2022; 16:818985. [PMID: 35465269 PMCID: PMC9028035 DOI: 10.3389/fncom.2022.818985] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2021] [Accepted: 03/10/2022] [Indexed: 11/21/2022] Open
Abstract
Lifetime learning, or the change (or acquisition) of behaviors during a lifetime, based on experience, is a hallmark of living organisms. Multiple mechanisms may be involved, but biological neural circuits have repeatedly demonstrated a vital role in the learning process. These neural circuits are recurrent, dynamic, and non-linear and models of neural circuits employed in neuroscience and neuroethology tend to involve, accordingly, continuous-time, non-linear, and recurrently interconnected components. Currently, the main approach for finding configurations of dynamical recurrent neural networks that demonstrate behaviors of interest is using stochastic search techniques, such as evolutionary algorithms. In an evolutionary algorithm, these dynamic recurrent neural networks are evolved to perform the behavior over multiple generations, through selection, inheritance, and mutation, across a population of solutions. Although, these systems can be evolved to exhibit lifetime learning behavior, there are no explicit rules built into these dynamic recurrent neural networks that facilitate learning during their lifetime (e.g., reward signals). In this work, we examine a biologically plausible lifetime learning mechanism for dynamical recurrent neural networks. We focus on a recently proposed reinforcement learning mechanism inspired by neuromodulatory reward signals and ongoing fluctuations in synaptic strengths. Specifically, we extend one of the best-studied and most-commonly used dynamic recurrent neural networks to incorporate the reinforcement learning mechanism. First, we demonstrate that this extended dynamical system (model and learning mechanism) can autonomously learn to perform a central pattern generation task. Second, we compare the robustness and efficiency of the reinforcement learning rules in relation to two baseline models, a random walk and a hill-climbing walk through parameter space. Third, we systematically study the effect of the different meta-parameters of the learning mechanism on the behavioral learning performance. Finally, we report on preliminary results exploring the generality and scalability of this learning mechanism for dynamical neural networks as well as directions for future work.
Collapse
Affiliation(s)
- Jason A. Yoder
- Computer Science and Software Engineering Department, Rose-Hulman Institute of Technology, Terre Haute, IN, United States
- *Correspondence: Jason A. Yoder
| | - Cooper B. Anderson
- Computer Science and Software Engineering Department, Rose-Hulman Institute of Technology, Terre Haute, IN, United States
| | - Cehong Wang
- Computer Science and Software Engineering Department, Rose-Hulman Institute of Technology, Terre Haute, IN, United States
| | - Eduardo J. Izquierdo
- Computational Neuroethology Lab, Cognitive Science Program, Indiana University, Bloomington, IN, United States
| |
Collapse
|
12
|
Kudithipudi D, Aguilar-Simon M, Babb J, Bazhenov M, Blackiston D, Bongard J, Brna AP, Chakravarthi Raja S, Cheney N, Clune J, Daram A, Fusi S, Helfer P, Kay L, Ketz N, Kira Z, Kolouri S, Krichmar JL, Kriegman S, Levin M, Madireddy S, Manicka S, Marjaninejad A, McNaughton B, Miikkulainen R, Navratilova Z, Pandit T, Parker A, Pilly PK, Risi S, Sejnowski TJ, Soltoggio A, Soures N, Tolias AS, Urbina-Meléndez D, Valero-Cuevas FJ, van de Ven GM, Vogelstein JT, Wang F, Weiss R, Yanguas-Gil A, Zou X, Siegelmann H. Biological underpinnings for lifelong learning machines. NAT MACH INTELL 2022. [DOI: 10.1038/s42256-022-00452-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
|
13
|
The Imitation Game in Children With Tourette Syndrome: A Lack of Impulse Control to Mirror Environmental Stimuli. Motor Control 2021; 26:92-96. [PMID: 34768240 DOI: 10.1123/mc.2021-0064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2021] [Revised: 09/07/2021] [Accepted: 09/09/2021] [Indexed: 11/18/2022]
Abstract
The learning process in humans requires continuous contacts with environmental stimuli, especially during neurodevelopmental growth. These functions are assisted by the coding potential of mirror neurons to serve social interactions. This ability to learn imitating the observed behavior is no longer necessary during adulthood, and control mechanisms prevent automatic mirroring. However, children with Gilles de la Tourette syndrome could encounter coding errors at the level of the mirror neurons system as these cortical regions are themselves the ones affected in the syndrome. Combined with impulsivity, the resulting sign would be a manifest echopraxia that persists throughout adulthood, averting these individuals from the appraisal of a spot-on motor control.
Collapse
|
14
|
Jordan J, Schmidt M, Senn W, Petrovici MA. Evolving interpretable plasticity for spiking networks. eLife 2021; 10:66273. [PMID: 34709176 PMCID: PMC8553337 DOI: 10.7554/elife.66273] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2021] [Accepted: 08/19/2021] [Indexed: 11/25/2022] Open
Abstract
Continuous adaptation allows survival in an ever-changing world. Adjustments in the synaptic coupling strength between neurons are essential for this capability, setting us apart from simpler, hard-wired organisms. How these changes can be mathematically described at the phenomenological level, as so-called ‘plasticity rules’, is essential both for understanding biological information processing and for developing cognitively performant artificial systems. We suggest an automated approach for discovering biophysically plausible plasticity rules based on the definition of task families, associated performance measures and biophysical constraints. By evolving compact symbolic expressions, we ensure the discovered plasticity rules are amenable to intuitive understanding, fundamental for successful communication and human-guided generalization. We successfully apply our approach to typical learning scenarios and discover previously unknown mechanisms for learning efficiently from rewards, recover efficient gradient-descent methods for learning from target signals, and uncover various functionally equivalent STDP-like rules with tuned homeostatic mechanisms. Our brains are incredibly adaptive. Every day we form memories, acquire new knowledge or refine existing skills. This stands in contrast to our current computers, which typically can only perform pre-programmed actions. Our own ability to adapt is the result of a process called synaptic plasticity, in which the strength of the connections between neurons can change. To better understand brain function and build adaptive machines, researchers in neuroscience and artificial intelligence (AI) are modeling the underlying mechanisms. So far, most work towards this goal was guided by human intuition – that is, by the strategies scientists think are most likely to succeed. Despite the tremendous progress, this approach has two drawbacks. First, human time is limited and expensive. And second, researchers have a natural – and reasonable – tendency to incrementally improve upon existing models, rather than starting from scratch. Jordan, Schmidt et al. have now developed a new approach based on ‘evolutionary algorithms’. These computer programs search for solutions to problems by mimicking the process of biological evolution, such as the concept of survival of the fittest. The approach exploits the increasing availability of cheap but powerful computers. Compared to its predecessors (or indeed human brains), it also uses search strategies that are less biased by previous models. The evolutionary algorithms were presented with three typical learning scenarios. In the first, the computer had to spot a repeating pattern in a continuous stream of input without receiving feedback on how well it was doing. In the second scenario, the computer received virtual rewards whenever it behaved in the desired manner – an example of reinforcement learning. Finally, in the third ‘supervised learning’ scenario, the computer was told exactly how much its behavior deviated from the desired behavior. For each of these scenarios, the evolutionary algorithms were able to discover mechanisms of synaptic plasticity to solve the new task successfully. Using evolutionary algorithms to study how computers ‘learn’ will provide new insights into how brains function in health and disease. It could also pave the way for developing intelligent machines that can better adapt to the needs of their users.
Collapse
Affiliation(s)
- Jakob Jordan
- Department of Physiology, University of Bern, Bern, Switzerland
| | - Maximilian Schmidt
- Ascent Robotics, Tokyo, Japan.,RIKEN Center for Brain Science, Tokyo, Japan
| | - Walter Senn
- Department of Physiology, University of Bern, Bern, Switzerland
| | - Mihai A Petrovici
- Department of Physiology, University of Bern, Bern, Switzerland.,Kirchhoff-Institute for Physics, Heidelberg University, Heidelberg, Germany
| |
Collapse
|
15
|
Eiben A. Real-World Robot Evolution: Why Would it (not) Work? Front Robot AI 2021; 8:696452. [PMID: 34386525 PMCID: PMC8353392 DOI: 10.3389/frobt.2021.696452] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2021] [Accepted: 07/16/2021] [Indexed: 01/26/2023] Open
Abstract
This paper takes a critical look at the concept of real-world robot evolution discussing specific challenges for making it practicable. After a brief review of the state of the art several enablers are discussed in detail. It is noted that sample efficient evolution is one of the key prerequisites and there are various promising directions towards this in different stages of maturity, including learning as part of the evolutionary system, genotype filtering, and hybridizing real-world evolution with simulations in a new way. Furthermore, it is emphasized that an evolutionary system that works in the real world needs robots that work in the real world. Obvious as it may seem, to achieve this significant complexification of the robots and their tasks is needed compared to the current practice. Finally, the importance of not only building but also understanding evolving robot systems is emphasised, stating that in order to have the technology work we also need the science behind it.
Collapse
Affiliation(s)
- A.E. Eiben
- Department of Computer Science, Vrije Universiteit Amsterdam, Amsterdam, Netherlands
| |
Collapse
|
16
|
Neuromodulated Dopamine Plastic Networks for Heterogeneous Transfer Learning with Hebbian Principle. Symmetry (Basel) 2021. [DOI: 10.3390/sym13081344] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
The plastic modifications in synaptic connectivity is primarily from changes triggered by neuromodulated dopamine signals. These activities are controlled by neuromodulation, which is itself under the control of the brain. The subjective brain’s self-modifying abilities play an essential role in learning and adaptation. The artificial neural networks with neuromodulated plasticity are used to implement transfer learning in the image classification domain. In particular, this has application in image detection, image segmentation, and transfer of learning parameters with significant results. This paper proposes a novel approach to enhance transfer learning accuracy in a heterogeneous source and target, using the neuromodulation of the Hebbian learning principle, called NDHTL (Neuromodulated Dopamine Hebbian Transfer Learning). Neuromodulation of plasticity offers a powerful new technique with applications in training neural networks implementing asymmetric backpropagation using Hebbian principles in transfer learning motivated CNNs (Convolutional neural networks). Biologically motivated concomitant learning, where connected brain cells activate positively, enhances the synaptic connection strength between the network neurons. Using the NDHTL algorithm, the percentage of change of the plasticity between the neurons of the CNN layer is directly managed by the dopamine signal’s value. The discriminative nature of transfer learning fits well with the technique. The learned model’s connection weights must adapt to unseen target datasets with the least cost and effort in transfer learning. Using distinctive learning principles such as dopamine Hebbian learning in transfer learning for asymmetric gradient weights update is a novel approach. The paper emphasizes the NDHTL algorithmic technique as synaptic plasticity controlled by dopamine signals in transfer learning to classify images using source-target datasets. The standard transfer learning using gradient backpropagation is a symmetric framework. Experimental results using CIFAR-10 and CIFAR-100 datasets show that the proposed NDHTL algorithm can enhance transfer learning efficiency compared to existing methods.
Collapse
|
17
|
Zhang T, Mo H. Reinforcement learning for robot research: A comprehensive review and open issues. INT J ADV ROBOT SYST 2021. [DOI: 10.1177/17298814211007305] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Applying the learning mechanism of natural living beings to endow intelligent robots with humanoid perception and decision-making wisdom becomes an important force to promote the revolution of science and technology in robot domains. Advances in reinforcement learning (RL) over the past decades have led robotics to be highly automated and intelligent, which ensures safety operation instead of manual work and implementation of more intelligence for many challenging tasks. As an important branch of machine learning, RL can realize sequential decision-making under uncertainties through end-to-end learning and has made a series of significant breakthroughs in robot applications. In this review article, we cover RL algorithms from theoretical background to advanced learning policies in different domains, which accelerate to solving practical problems in robotics. The challenges, open issues, and our thoughts on future research directions of RL are also presented to discover new research areas with the objective to motivate new interest.
Collapse
Affiliation(s)
- Tengteng Zhang
- College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin, China
| | - Hongwei Mo
- College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin, China
| |
Collapse
|
18
|
|
19
|
Alzubaidi L, Zhang J, Humaidi AJ, Al-Dujaili A, Duan Y, Al-Shamma O, Santamaría J, Fadhel MA, Al-Amidie M, Farhan L. Review of deep learning: concepts, CNN architectures, challenges, applications, future directions. JOURNAL OF BIG DATA 2021; 8:53. [PMID: 33816053 PMCID: PMC8010506 DOI: 10.1186/s40537-021-00444-8] [Citation(s) in RCA: 1011] [Impact Index Per Article: 252.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/21/2021] [Accepted: 03/22/2021] [Indexed: 05/04/2023]
Abstract
In the last few years, the deep learning (DL) computing paradigm has been deemed the Gold Standard in the machine learning (ML) community. Moreover, it has gradually become the most widely used computational approach in the field of ML, thus achieving outstanding results on several complex cognitive tasks, matching or even beating those provided by human performance. One of the benefits of DL is the ability to learn massive amounts of data. The DL field has grown fast in the last few years and it has been extensively used to successfully address a wide range of traditional applications. More importantly, DL has outperformed well-known ML techniques in many domains, e.g., cybersecurity, natural language processing, bioinformatics, robotics and control, and medical information processing, among many others. Despite it has been contributed several works reviewing the State-of-the-Art on DL, all of them only tackled one aspect of the DL, which leads to an overall lack of knowledge about it. Therefore, in this contribution, we propose using a more holistic approach in order to provide a more suitable starting point from which to develop a full understanding of DL. Specifically, this review attempts to provide a more comprehensive survey of the most important aspects of DL and including those enhancements recently added to the field. In particular, this paper outlines the importance of DL, presents the types of DL techniques and networks. It then presents convolutional neural networks (CNNs) which the most utilized DL network type and describes the development of CNNs architectures together with their main features, e.g., starting with the AlexNet network and closing with the High-Resolution network (HR.Net). Finally, we further present the challenges and suggested solutions to help researchers understand the existing research gaps. It is followed by a list of the major DL applications. Computational tools including FPGA, GPU, and CPU are summarized along with a description of their influence on DL. The paper ends with the evolution matrix, benchmark datasets, and summary and conclusion.
Collapse
Affiliation(s)
- Laith Alzubaidi
- School of Computer Science, Queensland University of Technology, Brisbane, QLD 4000 Australia
- AlNidhal Campus, University of Information Technology & Communications, Baghdad, 10001 Iraq
| | - Jinglan Zhang
- School of Computer Science, Queensland University of Technology, Brisbane, QLD 4000 Australia
| | - Amjad J. Humaidi
- Control and Systems Engineering Department, University of Technology, Baghdad, 10001 Iraq
| | - Ayad Al-Dujaili
- Electrical Engineering Technical College, Middle Technical University, Baghdad, 10001 Iraq
| | - Ye Duan
- Faculty of Electrical Engineering & Computer Science, University of Missouri, Columbia, MO 65211 USA
| | - Omran Al-Shamma
- AlNidhal Campus, University of Information Technology & Communications, Baghdad, 10001 Iraq
| | - J. Santamaría
- Department of Computer Science, University of Jaén, 23071 Jaén, Spain
| | - Mohammed A. Fadhel
- College of Computer Science and Information Technology, University of Sumer, Thi Qar, 64005 Iraq
| | - Muthana Al-Amidie
- Faculty of Electrical Engineering & Computer Science, University of Missouri, Columbia, MO 65211 USA
| | - Laith Farhan
- School of Engineering, Manchester Metropolitan University, Manchester, M1 5GD UK
| |
Collapse
|
20
|
Tek FB. An adaptive locally connected neuron model: Focusing neuron. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2020.08.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
21
|
Wang L, Zheng J, Orchard J. Evolving Generalized Modulatory Learning: Unifying Neuromodulation and Synaptic Plasticity. IEEE Trans Cogn Dev Syst 2020. [DOI: 10.1109/tcds.2019.2960766] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
22
|
Abstract
Evolution gave rise to creatures that are arguably more sophisticated than the greatest human-designed systems. This feat has inspired computer scientists since the advent of computing and led to optimization tools that can evolve complex neural networks for machines-an approach known as "neuroevolution." After a few successes in designing evolvable representations for high-dimensional artifacts, the field has been recently revitalized by going beyond optimization: to many, the wonder of evolution is less in the perfect optimization of each species than in the creativity of such a simple iterative process, that is, in the diversity of species. This modern view of artificial evolution is moving the field away from microevolution, following a fitness gradient in a niche, to macroevolution, filling many niches with highly different species. It already opened promising applications, like evolving gait repertoires, video game levels for different tastes, and diverse designs for aerodynamic bikes.
Collapse
|
23
|
Risi S, Togelius J. Increasing generality in machine learning through procedural content generation. NAT MACH INTELL 2020. [DOI: 10.1038/s42256-020-0208-z] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
24
|
Neural memory plasticity for medical anomaly detection. Neural Netw 2020; 127:67-81. [DOI: 10.1016/j.neunet.2020.04.011] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2019] [Revised: 02/18/2020] [Accepted: 04/12/2020] [Indexed: 11/18/2022]
|
25
|
|
26
|
Shi M, Zhang T, Zeng Y. A Curiosity-Based Learning Method for Spiking Neural Networks. Front Comput Neurosci 2020; 14:7. [PMID: 32116621 PMCID: PMC7020337 DOI: 10.3389/fncom.2020.00007] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2019] [Accepted: 01/20/2020] [Indexed: 01/12/2023] Open
Abstract
Spiking Neural Networks (SNNs) have shown favorable performance recently. Nonetheless, the time-consuming computation on neuron level and complex optimization limit their real-time application. Curiosity has shown great performance in brain learning, which helps biological brains grasp new knowledge efficiently and actively. Inspired by this leaning mechanism, we propose a curiosity-based SNN (CBSNN) model, which contains four main learning processes. Firstly, the network is trained with biologically plausible plasticity principles to get the novelty estimations of all samples in only one epoch; secondly, the CBSNN begins to repeatedly learn the samples whose novelty estimations exceed the novelty threshold and dynamically update the novelty estimations of samples according to the learning results in five epochs; thirdly, in order to avoid the overfitting of the novel samples and forgetting of the learned samples, CBSNN retrains all samples in one epoch; finally, step two and step three are periodically taken until network convergence. Compared with the state-of-the-art Voltage-driven Plasticity-centric SNN (VPSNN) under standard architecture, our model achieves a higher accuracy of 98.55% with only 54.95% of its computation cost on the MNIST hand-written digit recognition dataset. Similar conclusion can also be found out in other datasets, i.e., Iris, NETtalk, Fashion-MNIST, and CIFAR-10, respectively. More experiments and analysis further prove that such curiosity-based learning theory is helpful in improving the efficiency of SNNs. As far as we know, this is the first practical combination of the curiosity mechanism and SNN, and these improvements will make the realistic application of SNNs possible on more specific tasks within the von Neumann framework.
Collapse
Affiliation(s)
- Mengting Shi
- Research Center for Brain-inspired Intelligence, Institute of Automation, Chinese Academy of Sciences, Beijing, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Tielin Zhang
- Research Center for Brain-inspired Intelligence, Institute of Automation, Chinese Academy of Sciences, Beijing, China
| | - Yi Zeng
- Research Center for Brain-inspired Intelligence, Institute of Automation, Chinese Academy of Sciences, Beijing, China.,University of Chinese Academy of Sciences, Beijing, China.,Center for Excellence in Brain Science and Intelligence Technology, Chinese Academy of Sciences, Shanghai, China.,National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
27
|
Liu B, Chi W, Li X, Li P, Liang W, Liu H, Wang W, He J. Evolving the pulmonary nodules diagnosis from classical approaches to deep learning-aided decision support: three decades' development course and future prospect. J Cancer Res Clin Oncol 2020; 146:153-185. [PMID: 31786740 DOI: 10.1007/s00432-019-03098-5] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2019] [Accepted: 11/25/2019] [Indexed: 02/06/2023]
Abstract
PURPOSE Lung cancer is the commonest cause of cancer deaths worldwide, and its mortality can be reduced significantly by performing early diagnosis and screening. Since the 1960s, driven by the pressing needs to accurately and effectively interpret the massive volume of chest images generated daily, computer-assisted diagnosis of pulmonary nodule has opened up new opportunities to relax the limitation from physicians' subjectivity, experiences and fatigue. And the fair access to the reliable and affordable computer-assisted diagnosis will fight the inequalities in incidence and mortality between populations. It has been witnessed that significant and remarkable advances have been achieved since the 1980s, and consistent endeavors have been exerted to deal with the grand challenges on how to accurately detect the pulmonary nodules with high sensitivity at low false-positive rate as well as on how to precisely differentiate between benign and malignant nodules. There is a lack of comprehensive examination of the techniques' development which is evolving the pulmonary nodules diagnosis from classical approaches to machine learning-assisted decision support. The main goal of this investigation is to provide a comprehensive state-of-the-art review of the computer-assisted nodules detection and benign-malignant classification techniques developed over three decades, which have evolved from the complicated ad hoc analysis pipeline of conventional approaches to the simplified seamlessly integrated deep learning techniques. This review also identifies challenges and highlights opportunities for future work in learning models, learning algorithms and enhancement schemes for bridging current state to future prospect and satisfying future demand. CONCLUSION It is the first literature review of the past 30 years' development in computer-assisted diagnosis of lung nodules. The challenges indentified and the research opportunities highlighted in this survey are significant for bridging current state to future prospect and satisfying future demand. The values of multifaceted driving forces and multidisciplinary researches are acknowledged that will make the computer-assisted diagnosis of pulmonary nodules enter into the main stream of clinical medicine and raise the state-of-the-art clinical applications as well as increase both welfares of physicians and patients. We firmly hold the vision that fair access to the reliable, faithful, and affordable computer-assisted diagnosis for early cancer diagnosis would fight the inequalities in incidence and mortality between populations, and save more lives.
Collapse
Affiliation(s)
- Bo Liu
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China.
| | - Wenhao Chi
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Xinran Li
- Department of Mathematics, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Peng Li
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
| | - Wenhua Liang
- Department of Thoracic Surgery and Oncology, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
- China State Key Laboratory of Respiratory Disease, Guangzhou, China
| | - Haiping Liu
- PET/CT Center, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
- China State Key Laboratory of Respiratory Disease, Guangzhou, China
| | - Wei Wang
- Department of Thoracic Surgery and Oncology, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
- China State Key Laboratory of Respiratory Disease, Guangzhou, China
| | - Jianxing He
- Department of Thoracic Surgery and Oncology, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou, China.
- China State Key Laboratory of Respiratory Disease, Guangzhou, China.
| |
Collapse
|
28
|
Barros P, Eppe M, Parisi GI, Liu X, Wermter S. Expectation Learning for Stimulus Prediction Across Modalities Improves Unisensory Classification. Front Robot AI 2019; 6:137. [PMID: 33501152 PMCID: PMC7806099 DOI: 10.3389/frobt.2019.00137] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2019] [Accepted: 11/25/2019] [Indexed: 11/25/2022] Open
Abstract
Expectation learning is a unsupervised learning process which uses multisensory bindings to enhance unisensory perception. For instance, as humans, we learn to associate a barking sound with the visual appearance of a dog, and we continuously fine-tune this association over time, as we learn, e.g., to associate high-pitched barking with small dogs. In this work, we address the problem of developing a computational model that addresses important properties of expectation learning, in particular focusing on the lack of explicit external supervision other than temporal co-occurrence. To this end, we present a novel hybrid neural model based on audio-visual autoencoders and a recurrent self-organizing network for multisensory bindings that facilitate stimulus reconstructions across different sensory modalities. We refer to this mechanism as stimulus prediction across modalities and demonstrate that the proposed model is capable of learning concept bindings by evaluating it on unisensory classification tasks for audio-visual stimuli using the 43,500 Youtube videos from the animal subset of the AudioSet corpus.
Collapse
Affiliation(s)
- Pablo Barros
- Knowledge Technology, Department of Informatics, University of Hamburg, Hamburg, Germany
| | - Manfred Eppe
- Knowledge Technology, Department of Informatics, University of Hamburg, Hamburg, Germany
| | - German I Parisi
- Knowledge Technology, Department of Informatics, University of Hamburg, Hamburg, Germany
| | - Xun Liu
- Department of Psychology, University of CAS, Beijing, China
| | - Stefan Wermter
- Knowledge Technology, Department of Informatics, University of Hamburg, Hamburg, Germany
| |
Collapse
|
29
|
|
30
|
Continual lifelong learning with neural networks: A review. Neural Netw 2019; 113:54-71. [DOI: 10.1016/j.neunet.2019.01.012] [Citation(s) in RCA: 322] [Impact Index Per Article: 53.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2018] [Revised: 01/18/2019] [Accepted: 01/22/2019] [Indexed: 10/27/2022]
|
31
|
|
32
|
Faghihi F, Moustafa AA. Combined Computational Systems Biology and Computational Neuroscience Approaches Help Develop of Future "Cognitive Developmental Robotics". Front Neurorobot 2017; 11:63. [PMID: 29276486 PMCID: PMC5727420 DOI: 10.3389/fnbot.2017.00063] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2017] [Accepted: 10/24/2017] [Indexed: 11/13/2022] Open
Affiliation(s)
- Faramarz Faghihi
- Department for Cognitive Modeling, Institute for Cognitive and Brain Sciences, Shahid Beheshti University, Tehran, Iran
| | - Ahmed A Moustafa
- School of Social Sciences and Psychology and Marcs Institute for Brain and Behavior, Western Sydney University, Sydney, NSW, Australia
| |
Collapse
|