19251
|
Di Cataldo S, Ficarra E. Mining textural knowledge in biological images: Applications, methods and trends. Comput Struct Biotechnol J 2016; 15:56-67. [PMID: 27994798 PMCID: PMC5155047 DOI: 10.1016/j.csbj.2016.11.002] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2016] [Revised: 11/14/2016] [Accepted: 11/15/2016] [Indexed: 12/18/2022] Open
Abstract
Texture analysis is a major task in many areas of computer vision and pattern recognition, including biological imaging. Indeed, visual textures can be exploited to distinguish specific tissues or cells in a biological sample, to highlight chemical reactions between molecules, as well as to detect subcellular patterns that can be evidence of certain pathologies. This makes automated texture analysis fundamental in many applications of biomedicine, such as the accurate detection and grading of multiple types of cancer, the differential diagnosis of autoimmune diseases, or the study of physiological processes. Due to their specific characteristics and challenges, the design of texture analysis systems for biological images has attracted ever-growing attention in the last few years. In this paper, we perform a critical review of this important topic. First, we provide a general definition of texture analysis and discuss its role in the context of bioimaging, with examples of applications from the recent literature. Then, we review the main approaches to automated texture analysis, with special attention to the methods of feature extraction and encoding that can be successfully applied to microscopy images of cells or tissues. Our aim is to provide an overview of the state of the art, as well as a glimpse into the latest and future trends of research in this area.
Collapse
Affiliation(s)
- Santa Di Cataldo
- Dept. of Computer and Control Engineering, Politecnico di Torino, Cso Duca degli Abruzzi 24, Torino 10129, Italy
| | | |
Collapse
|
19252
|
Xia Y, Zhang B, Coenen F. Face Occlusion Detection Using Deep Convolutional Neural Networks. INT J PATTERN RECOGN 2016. [DOI: 10.1142/s0218001416600107] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
With the rise of crimes associated with Automated Teller Machines (ATMs), security reinforcement by surveillance techniques has been a hot topic on the security agenda. As a result, cameras are frequently installed with ATMs, so as to capture the facial images of users. The main objective is to support follow-up criminal investigations in the event of an incident. However, in the case of miss-use, the user’s face is often occluded. Therefore, face occlusion detection has become very important to prevent crimes connected with ATM usage. Traditional approaches to solving the problem typically comprise a succession of steps: localization, segmentation, feature extraction and recognition. This paper proposes an end-to-end facial occlusion detection framework, which is robust and effective by combining region proposal algorithm and Convolutional Neural Networks (CNN). The framework utilizes a coarse-to-fine strategy, which consists of two CNNs. The first CNN detects the head element within an upper body image while the second distinguishes which facial part is occluded from the head image. In comparison with previous approaches, the usage of CNN is optimal from a system point of view as the design is based on the end-to-end principle and the model operates directly on image pixels. For evaluation purposes, a face occlusion database consisting of over 50[Formula: see text]000 images, with annotated facial parts, was used. Experimental results revealed that the proposed framework is very effective. Using the bespoke face occlusion dataset, Aleix and Robert (AR) face dataset and the Labeled Face in the Wild (LFW) database, we achieved over 85.61%, 97.58% and 100% accuracies for head detection when the Intersection over Union-section (IoU) is larger than 0.5, and 94.55%, 98.58% and 95.41% accuracies for occlusion discrimination, respectively.
Collapse
Affiliation(s)
- Yizhang Xia
- Department of Computer Science and Software Engineering, Xian Jiaotong-Liverpool University, SIP, Suzhou 215123, P. R. China
| | - Bailing Zhang
- Department of Computer Science and Software Engineering, Xian Jiaotong-Liverpool University, SIP, Suzhou 215123, P. R. China
| | - Frans Coenen
- Department of Computer Science, University of Liverpool, Liverpool L69 3BX, UK
| |
Collapse
|
19253
|
Lekadir K, Galimzianova A, Betriu A, Del Mar Vila M, Igual L, Rubin DL, Fernandez E, Radeva P, Napel S. A Convolutional Neural Network for Automatic Characterization of Plaque Composition in Carotid Ultrasound. IEEE J Biomed Health Inform 2016; 21:48-55. [PMID: 27893402 DOI: 10.1109/jbhi.2016.2631401] [Citation(s) in RCA: 99] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Characterization of carotid plaque composition, more specifically the amount of lipid core, fibrous tissue, and calcified tissue, is an important task for the identification of plaques that are prone to rupture, and thus for early risk estimation of cardiovascular and cerebrovascular events. Due to its low costs and wide availability, carotid ultrasound has the potential to become the modality of choice for plaque characterization in clinical practice. However, its significant image noise, coupled with the small size of the plaques and their complex appearance, makes it difficult for automated techniques to discriminate between the different plaque constituents. In this paper, we propose to address this challenging problem by exploiting the unique capabilities of the emerging deep learning framework. More specifically, and unlike existing works which require a priori definition of specific imaging features or thresholding values, we propose to build a convolutional neural network (CNN) that will automatically extract from the images the information that is optimal for the identification of the different plaque constituents. We used approximately 90 000 patches extracted from a database of images and corresponding expert plaque characterizations to train and to validate the proposed CNN. The results of cross-validation experiments show a correlation of about 0.90 with the clinical assessment for the estimation of lipid core, fibrous cap, and calcified tissue areas, indicating the potential of deep learning for the challenging task of automatic characterization of plaque composition in carotid ultrasound.
Collapse
|
19254
|
Ionescu RT, Ionescu AL, Mothe J, Popescu D. Patch Autocorrelation Features: a translation and rotation invariant approach for image classification. Artif Intell Rev 2016. [DOI: 10.1007/s10462-016-9532-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
19255
|
Shishkin SL, Nuzhdin YO, Svirin EP, Trofimov AG, Fedorova AA, Kozyrskiy BL, Velichkovsky BM. EEG Negativity in Fixations Used for Gaze-Based Control: Toward Converting Intentions into Actions with an Eye-Brain-Computer Interface. Front Neurosci 2016; 10:528. [PMID: 27917105 PMCID: PMC5114310 DOI: 10.3389/fnins.2016.00528] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2016] [Accepted: 10/31/2016] [Indexed: 11/13/2022] Open
Abstract
We usually look at an object when we are going to manipulate it. Thus, eye tracking can be used to communicate intended actions. An effective human-machine interface, however, should be able to differentiate intentional and spontaneous eye movements. We report an electroencephalogram (EEG) marker that differentiates gaze fixations used for control from spontaneous fixations involved in visual exploration. Eight healthy participants played a game with their eye movements only. Their gaze-synchronized EEG data (fixation-related potentials, FRPs) were collected during game's control-on and control-off conditions. A slow negative wave with a maximum in the parietooccipital region was present in each participant's averaged FRPs in the control-on conditions and was absent or had much lower amplitude in the control-off condition. This wave was similar but not identical to stimulus-preceding negativity, a slow negative wave that can be observed during feedback expectation. Classification of intentional vs. spontaneous fixations was based on amplitude features from 13 EEG channels using 300 ms length segments free from electrooculogram contamination (200-500 ms relative to the fixation onset). For the first fixations in the fixation triplets required to make moves in the game, classified against control-off data, a committee of greedy classifiers provided 0.90 ± 0.07 specificity and 0.38 ± 0.14 sensitivity. Similar (slightly lower) results were obtained for the shrinkage Linear Discriminate Analysis (LDA) classifier. The second and third fixations in the triplets were classified at lower rate. We expect that, with improved feature sets and classifiers, a hybrid dwell-based Eye-Brain-Computer Interface (EBCI) can be built using the FRP difference between the intended and spontaneous fixations. If this direction of BCI development will be successful, such a multimodal interface may improve the fluency of interaction and can possibly become the basis for a new input device for paralyzed and healthy users, the EBCI "Wish Mouse."
Collapse
Affiliation(s)
- Sergei L Shishkin
- Department of Neurocognitive Technologies, Kurchatov Complex of NBICS Technologies, National Research Centre "Kurchatov Institute," Moscow, Russia
| | - Yuri O Nuzhdin
- Department of Neurocognitive Technologies, Kurchatov Complex of NBICS Technologies, National Research Centre "Kurchatov Institute," Moscow, Russia
| | - Evgeny P Svirin
- Department of Neurocognitive Technologies, Kurchatov Complex of NBICS Technologies, National Research Centre "Kurchatov Institute," Moscow, Russia
| | - Alexander G Trofimov
- Department of Cybernetics, National Research Nuclear University MEPhI Moscow, Russia
| | - Anastasia A Fedorova
- Department of Neurocognitive Technologies, Kurchatov Complex of NBICS Technologies, National Research Centre "Kurchatov Institute," Moscow, Russia
| | - Bogdan L Kozyrskiy
- Department of Neurocognitive Technologies, Kurchatov Complex of NBICS Technologies, National Research Centre "Kurchatov Institute,"Moscow, Russia; Department of Computer Systems and Technologies, National Research Nuclear University MEPhIMoscow, Russia
| | - Boris M Velichkovsky
- Department of Neurocognitive Technologies, Kurchatov Complex of NBICS Technologies, National Research Centre "Kurchatov Institute,"Moscow, Russia; Centre for Cognitive Programs and Technologies, Russian State University for HumanitiesMoscow, Russia; Department of Psychology, Technische Universität DresdenDresden, Germany
| |
Collapse
|
19256
|
Unger J, Merhof D, Renner S. Computer vision applied to herbarium specimens of German trees: testing the future utility of the millions of herbarium specimen images for automated identification. BMC Evol Biol 2016; 16:248. [PMID: 27852219 PMCID: PMC5112707 DOI: 10.1186/s12862-016-0827-5] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2016] [Accepted: 11/10/2016] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Global Plants, a collaborative between JSTOR and some 300 herbaria, now contains about 2.48 million high-resolution images of plant specimens, a number that continues to grow, and collections that are digitizing their specimens at high resolution are allocating considerable recourses to the maintenance of computer hardware (e.g., servers) and to acquiring digital storage space. We here apply machine learning, specifically the training of a Support-Vector-Machine, to classify specimen images into categories, ideally at the species level, using the 26 most common tree species in Germany as a test case. RESULTS We designed an analysis pipeline and classification system consisting of segmentation, normalization, feature extraction, and classification steps and evaluated the system in two test sets, one with 26 species, the other with 17, in each case using 10 images per species of plants collected between 1820 and 1995, which simulates the empirical situation that most named species are represented in herbaria and databases, such as JSTOR, by few specimens. We achieved 73.21% accuracy of species assignments in the larger test set, and 84.88% in the smaller test set. CONCLUSIONS The results of this first application of a computer vision algorithm trained on images of herbarium specimens shows that despite the problem of overlapping leaves, leaf-architectural features can be used to categorize specimens to species with good accuracy. Computer vision is poised to play a significant role in future rapid identification at least for frequently collected genera or species in the European flora.
Collapse
Affiliation(s)
- Jakob Unger
- Institute of Imaging and Computer Vision, RWTH Aachen University, Kopernikusstr. 16, 52074, Aachen, Germany
| | - Dorit Merhof
- Institute of Imaging and Computer Vision, RWTH Aachen University, Kopernikusstr. 16, 52074, Aachen, Germany.
| | - Susanne Renner
- Systematic Botany and Mycology, University of Munich (LMU), Menzinger-Str. 67, 80638, Munich, Germany.
| |
Collapse
|
19257
|
Geng W, Du Y, Jin W, Wei W, Hu Y, Li J. Gesture recognition by instantaneous surface EMG images. Sci Rep 2016; 6:36571. [PMID: 27845347 PMCID: PMC5109222 DOI: 10.1038/srep36571] [Citation(s) in RCA: 206] [Impact Index Per Article: 22.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2016] [Accepted: 10/17/2016] [Indexed: 11/09/2022] Open
Abstract
Gesture recognition in non-intrusive muscle-computer interfaces is usually based on windowed descriptive and discriminatory surface electromyography (sEMG) features because the recorded amplitude of a myoelectric signal may rapidly fluctuate between voltages above and below zero. Here, we present that the patterns inside the instantaneous values of high-density sEMG enables gesture recognition to be performed merely with sEMG signals at a specific instant. We introduce the concept of an sEMG image spatially composed from high-density sEMG and verify our findings from a computational perspective with experiments on gesture recognition based on sEMG images with a classification scheme of a deep convolutional network. Without any windowed features, the resultant recognition accuracy of an 8-gesture within-subject test reached 89.3% on a single frame of sEMG image and reached 99.0% using simple majority voting over 40 frames with a 1,000 Hz sampling rate. Experiments on the recognition of 52 gestures of NinaPro database and 27 gestures of CSL-HDEMG database also validated that our approach outperforms state-of-the-arts methods. Our findings are a starting point for the development of more fluid and natural muscle-computer interfaces with very little observational latency. For example, active prostheses and exoskeletons based on high-density electrodes could be controlled with instantaneous responses.
Collapse
Affiliation(s)
- Weidong Geng
- Zhejiang University, College of Computer Science, Hangzhou, 310027, China
| | - Yu Du
- Zhejiang University, College of Computer Science, Hangzhou, 310027, China
| | - Wenguang Jin
- Zhejiang University, College of Computer Science, Hangzhou, 310027, China
| | - Wentao Wei
- Zhejiang University, College of Computer Science, Hangzhou, 310027, China
| | - Yu Hu
- Zhejiang University, College of Computer Science, Hangzhou, 310027, China
| | - Jiajun Li
- Zhejiang University, College of Computer Science, Hangzhou, 310027, China
| |
Collapse
|
19258
|
Rectified-Linear-Unit-Based Deep Learning for Biomedical Multi-label Data. Interdiscip Sci 2016; 9:419-422. [PMID: 27837428 DOI: 10.1007/s12539-016-0196-1] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2016] [Revised: 10/03/2016] [Accepted: 10/25/2016] [Indexed: 12/15/2022]
Abstract
Disease diagnosis is one of the major data mining questions by the clinicians. The current diagnosis models usually have a strong assumption that one patient has only one disease, i.e. a single-label data mining problem. But the patients, especially when at the late stages, may have more than one disease and require a multi-label diagnosis. The multi-label data mining is much more difficult than a single-label one, and very few algorithms have been developed for this situation. Deep learning is a data mining algorithm with highly dense inner structure and has achieved many successful applications in the other areas. We propose a hypothesis that rectified-linear-unit-based deep learning algorithm may also be good at the clinical questions, by revising the last layer as a multi-label output. The proof-of-concept experimental data support the hypothesis, and the community may be interested in trying more applications.
Collapse
|
19259
|
Abstract
OBJECTIVES The aim of this manuscript is to provide a brief overview of the scientific challenges that should be addressed in order to unlock the full potential of using data from a general point of view, as well as to present some ideas that could help answer specific needs for data understanding in the field of health sciences and epidemiology. METHODS A survey of uses and challenges of big data analyses for medicine and public health was conducted. The first part of the paper focuses on big data techniques, algorithms, and statistical approaches to identify patterns in data. The second part describes some cutting-edge applications of analyses and predictive modeling in public health. RESULTS In recent years, we witnessed a revolution regarding the nature, collection, and availability of data in general. This was especially striking in the health sector and particularly in the field of epidemiology. Data derives from a large variety of sources, e.g. clinical settings, billing claims, care scheduling, drug usage, web based search queries, and Tweets. CONCLUSION The exploitation of the information (data mining, artificial intelligence) relevant to these data has become one of the most promising as well challenging tasks from societal and scientific viewpoints in order to leverage the information available and making public health more efficient.
Collapse
Affiliation(s)
- A Flahault
- Prof. Antoine Flahault, Chair Louis Jeantet in Public Health, Director of the Institute of Global Health, Faculté de Médecine, Université de Genève, Campus Biotech, Cheimin des Mines 9, 1202 Geneva, Switzerland, E-mail:
| | | | | |
Collapse
|
19260
|
Nair SS, Paré D, Vicentic A. Biologically based neural circuit modelling for the study of fear learning and extinction. NPJ SCIENCE OF LEARNING 2016; 1:16015. [PMID: 29541482 PMCID: PMC5846682 DOI: 10.1038/npjscilearn.2016.15] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/25/2016] [Revised: 09/09/2016] [Accepted: 09/19/2016] [Indexed: 05/25/2023]
Abstract
The neuronal systems that promote protective defensive behaviours have been studied extensively using Pavlovian conditioning. In this paradigm, an initially neutral-conditioned stimulus is paired with an aversive unconditioned stimulus leading the subjects to display behavioural signs of fear. Decades of research into the neural bases of this simple behavioural paradigm uncovered that the amygdala, a complex structure comprised of several interconnected nuclei, is an essential part of the neural circuits required for the acquisition, consolidation and expression of fear memory. However, emerging evidence from the confluence of electrophysiological, tract tracing, imaging, molecular, optogenetic and chemogenetic methodologies, reveals that fear learning is mediated by multiple connections between several amygdala nuclei and their distributed targets, dynamical changes in plasticity in local circuit elements as well as neuromodulatory mechanisms that promote synaptic plasticity. To uncover these complex relations and analyse multi-modal data sets acquired from these studies, we argue that biologically realistic computational modelling, in conjunction with experiments, offers an opportunity to advance our understanding of the neural circuit mechanisms of fear learning and to address how their dysfunction may lead to maladaptive fear responses in mental disorders.
Collapse
Affiliation(s)
- Satish S Nair
- Department of Electrical and Computer Engineering, University of Missouri, Columbia, MO, USA
| | - Denis Paré
- Center for Molecular and Behavioral Neuroscience, Rutgers University—Newark, Newark, NJ, USA
| | - Aleksandra Vicentic
- Division of Neuroscience and Basic Behavioral Science, National Institute of Mental Health, Rockville, MD, USA
| |
Collapse
|
19261
|
Lee JH, Delbruck T, Pfeiffer M. Training Deep Spiking Neural Networks Using Backpropagation. Front Neurosci 2016; 10:508. [PMID: 27877107 PMCID: PMC5099523 DOI: 10.3389/fnins.2016.00508] [Citation(s) in RCA: 227] [Impact Index Per Article: 25.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2016] [Accepted: 10/24/2016] [Indexed: 11/25/2022] Open
Abstract
Deep spiking neural networks (SNNs) hold the potential for improving the latency and energy efficiency of deep neural networks through data-driven event-based computation. However, training such networks is difficult due to the non-differentiable nature of spike events. In this paper, we introduce a novel technique, which treats the membrane potentials of spiking neurons as differentiable signals, where discontinuities at spike times are considered as noise. This enables an error backpropagation mechanism for deep SNNs that follows the same principles as in conventional deep networks, but works directly on spike signals and membrane potentials. Compared with previous methods relying on indirect training and conversion, our technique has the potential to capture the statistics of spikes more precisely. We evaluate the proposed framework on artificially generated events from the original MNIST handwritten digit benchmark, and also on the N-MNIST benchmark recorded with an event-based dynamic vision sensor, in which the proposed method reduces the error rate by a factor of more than three compared to the best previous SNN, and also achieves a higher accuracy than a conventional convolutional neural network (CNN) trained and tested on the same data. We demonstrate in the context of the MNIST task that thanks to their event-driven operation, deep SNNs (both fully connected and convolutional) trained with our method achieve accuracy equivalent with conventional neural networks. In the N-MNIST example, equivalent accuracy is achieved with about five times fewer computational operations.
Collapse
Affiliation(s)
- Jun Haeng Lee
- Samsung Advanced Institute of Technology, Samsung ElectronicsSuwon, South Korea; Institute of Neuroinformatics, University of Zurich and ETH ZurichZurich, Switzerland
| | - Tobi Delbruck
- Institute of Neuroinformatics, University of Zurich and ETH Zurich Zurich, Switzerland
| | - Michael Pfeiffer
- Institute of Neuroinformatics, University of Zurich and ETH Zurich Zurich, Switzerland
| |
Collapse
|
19262
|
Kimura R, Saiki A, Fujiwara-Tsukamoto Y, Sakai Y, Isomura Y. Large-scale analysis reveals populational contributions of cortical spike rate and synchrony to behavioural functions. J Physiol 2016; 595:385-413. [PMID: 27488936 DOI: 10.1113/jp272794] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2016] [Accepted: 08/01/2016] [Indexed: 11/08/2022] Open
Abstract
KEY POINTS There have been few systematic population-wide analyses of relationships between spike synchrony within a period of several milliseconds and behavioural functions. In this study, we obtained a large amount of spike data from > 23,000 neuron pairs by multiple single-unit recording from deep layer neurons in motor cortical areas in rats performing a forelimb movement task. The temporal changes of spike synchrony in the whole neuron pairs were statistically independent of behavioural changes during the task performance, although some neuron pairs exhibited correlated changes in spike synchrony. Mutual information analyses revealed that spike synchrony made a smaller contribution than spike rate to behavioural functions. The strength of spike synchrony between two neurons was statistically independent of the spike rate-based preferences of the pair for behavioural functions. ABSTRACT Spike synchrony within a period of several milliseconds in presynaptic neurons enables effective integration of functional information in the postsynaptic neuron. However, few studies have systematically analysed the population-wide relationships between spike synchrony and behavioural functions. Here we obtained a sufficiently large amount of spike data among regular-spiking (putatively excitatory) and fast-spiking (putatively inhibitory) neuron subtypes (> 23,000 pairs) by multiple single-unit recording from deep layers in motor cortical areas (caudal forelimb area, rostral forelimb area) in rats performing a forelimb movement task. After holding a lever, rats pulled the lever either in response to a cue tone (external-trigger trials) or spontaneously without any cue (internal-trigger trials). Many neurons exhibited functional spike activity in association with forelimb movements, and the preference of regular-spiking neurons in the rostral forelimb area was more biased toward externally triggered movement than that in the caudal forelimb area. We found that a population of neuron pairs with spike synchrony does exist, and that some neuron pairs exhibit a dependence on movement phase during task performance. However, the population-wide analysis revealed that spike synchrony was statistically independent of the movement phase and the spike rate-based preferences of the pair for behavioural functions, whereas spike rates were clearly dependent on the movement phase. In fact, mutual information analyses revealed that the contribution of spike synchrony to the behavioural functions was small relative to the contribution of spike rate. Our large-scale analysis revealed that cortical spike rate, rather than spike synchrony, contributes to population coding for movement.
Collapse
Affiliation(s)
- Rie Kimura
- Brain Science Institute, Tamagawa University, Tokyo, Japan.,JST CREST, Tokyo, Japan.,Division of Visual Information Processing, National Institute for Physiological Sciences, National Institutes of Natural Sciences, Okazaki, Japan.,Department of Physiological Sciences, SOKENDAI (The Graduate University for Advanced Studies), Okazaki, Japan
| | - Akiko Saiki
- Brain Science Institute, Tamagawa University, Tokyo, Japan.,JST CREST, Tokyo, Japan
| | - Yoko Fujiwara-Tsukamoto
- Brain Science Institute, Tamagawa University, Tokyo, Japan.,JST CREST, Tokyo, Japan.,Laboratory of Neural Circuitry, Graduate School of Brain Science, Doshisha University, Kyoto, Japan.,Present address: Faculty of Human Life Studies, Department of Food and Nutrition, Hagoromo University of International Studies, Osaka, Japan
| | - Yutaka Sakai
- Brain Science Institute, Tamagawa University, Tokyo, Japan.,JST CREST, Tokyo, Japan
| | - Yoshikazu Isomura
- Brain Science Institute, Tamagawa University, Tokyo, Japan.,JST CREST, Tokyo, Japan
| |
Collapse
|
19263
|
|
19264
|
Hatt M, Tixier F, Visvikis D, Cheze Le Rest C. Radiomics in PET/CT: More Than Meets the Eye? J Nucl Med 2016; 58:365-366. [PMID: 27811126 DOI: 10.2967/jnumed.116.184655] [Citation(s) in RCA: 69] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2016] [Accepted: 10/11/2016] [Indexed: 01/07/2023] Open
Affiliation(s)
- Mathieu Hatt
- LaTIM, INSERM, UMR 1101, University of Brest, IBSAM, Brest, France; and
| | - Florent Tixier
- Academic Department of Nuclear Medicine, CHU Poitiers, Poitiers, France
| | - Dimitris Visvikis
- LaTIM, INSERM, UMR 1101, University of Brest, IBSAM, Brest, France; and
| | | |
Collapse
|
19265
|
Rezaeilouyeh H, Mollahosseini A, Mahoor MH. Microscopic medical image classification framework via deep learning and shearlet transform. J Med Imaging (Bellingham) 2016; 3:044501. [PMID: 27872871 DOI: 10.1117/1.jmi.3.4.044501] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2016] [Accepted: 10/11/2016] [Indexed: 11/14/2022] Open
Abstract
Cancer is the second leading cause of death in US after cardiovascular disease. Image-based computer-aided diagnosis can assist physicians to efficiently diagnose cancers in early stages. Existing computer-aided algorithms use hand-crafted features such as wavelet coefficients, co-occurrence matrix features, and recently, histogram of shearlet coefficients for classification of cancerous tissues and cells in images. These hand-crafted features often lack generalizability since every cancerous tissue and cell has a specific texture, structure, and shape. An alternative approach is to use convolutional neural networks (CNNs) to learn the most appropriate feature abstractions directly from the data and handle the limitations of hand-crafted features. A framework for breast cancer detection and prostate Gleason grading using CNN trained on images along with the magnitude and phase of shearlet coefficients is presented. Particularly, we apply shearlet transform on images and extract the magnitude and phase of shearlet coefficients. Then we feed shearlet features along with the original images to our CNN consisting of multiple layers of convolution, max pooling, and fully connected layers. Our experiments show that using the magnitude and phase of shearlet coefficients as extra information to the network can improve the accuracy of detection and generalize better compared to the state-of-the-art methods that rely on hand-crafted features. This study expands the application of deep neural networks into the field of medical image analysis, which is a difficult domain considering the limited medical data available for such analysis.
Collapse
Affiliation(s)
- Hadi Rezaeilouyeh
- University of Denver , Department of Electrical and Computer Engineering, 2155 East Wesley Avenue, Denver, Colorado 80208, United States
| | - Ali Mollahosseini
- University of Denver , Department of Electrical and Computer Engineering, 2155 East Wesley Avenue, Denver, Colorado 80208, United States
| | - Mohammad H Mahoor
- University of Denver , Department of Electrical and Computer Engineering, 2155 East Wesley Avenue, Denver, Colorado 80208, United States
| |
Collapse
|
19266
|
Van Valen DA, Kudo T, Lane KM, Macklin DN, Quach NT, DeFelice MM, Maayan I, Tanouchi Y, Ashley EA, Covert MW. Deep Learning Automates the Quantitative Analysis of Individual Cells in Live-Cell Imaging Experiments. PLoS Comput Biol 2016; 12:e1005177. [PMID: 27814364 PMCID: PMC5096676 DOI: 10.1371/journal.pcbi.1005177] [Citation(s) in RCA: 301] [Impact Index Per Article: 33.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2016] [Accepted: 10/03/2016] [Indexed: 02/01/2023] Open
Abstract
Live-cell imaging has opened an exciting window into the role cellular heterogeneity plays in dynamic, living systems. A major critical challenge for this class of experiments is the problem of image segmentation, or determining which parts of a microscope image correspond to which individual cells. Current approaches require many hours of manual curation and depend on approaches that are difficult to share between labs. They are also unable to robustly segment the cytoplasms of mammalian cells. Here, we show that deep convolutional neural networks, a supervised machine learning method, can solve this challenge for multiple cell types across the domains of life. We demonstrate that this approach can robustly segment fluorescent images of cell nuclei as well as phase images of the cytoplasms of individual bacterial and mammalian cells from phase contrast images without the need for a fluorescent cytoplasmic marker. These networks also enable the simultaneous segmentation and identification of different mammalian cell types grown in co-culture. A quantitative comparison with prior methods demonstrates that convolutional neural networks have improved accuracy and lead to a significant reduction in curation time. We relay our experience in designing and optimizing deep convolutional neural networks for this task and outline several design rules that we found led to robust performance. We conclude that deep convolutional neural networks are an accurate method that require less curation time, are generalizable to a multiplicity of cell types, from bacteria to mammalian cells, and expand live-cell imaging capabilities to include multi-cell type systems.
Collapse
Affiliation(s)
- David A. Van Valen
- Department of Bioengineering, Stanford University, Stanford, California, United States of America
| | - Takamasa Kudo
- Department of Chemical and Systems Biology, Stanford University, Stanford, California, United States of America
| | - Keara M. Lane
- Department of Bioengineering, Stanford University, Stanford, California, United States of America
| | - Derek N. Macklin
- Department of Bioengineering, Stanford University, Stanford, California, United States of America
| | - Nicolas T. Quach
- Department of Bioengineering, Stanford University, Stanford, California, United States of America
| | - Mialy M. DeFelice
- Department of Bioengineering, Stanford University, Stanford, California, United States of America
| | - Inbal Maayan
- Department of Bioengineering, Stanford University, Stanford, California, United States of America
| | - Yu Tanouchi
- Department of Bioengineering, Stanford University, Stanford, California, United States of America
| | - Euan A. Ashley
- Department of Genetics, Stanford University, Stanford, California, United States of America
- Department of Cardiovascular Medicine, Stanford University, Stanford, California, United States of America
| | - Markus W. Covert
- Department of Bioengineering, Stanford University, Stanford, California, United States of America
- Department of Chemical and Systems Biology, Stanford University, Stanford, California, United States of America
| |
Collapse
|
19267
|
Wang H, Xu Z, Fujita H, Liu S. Towards felicitous decision making: An overview on challenges and trends of Big Data. Inf Sci (N Y) 2016. [DOI: 10.1016/j.ins.2016.07.007] [Citation(s) in RCA: 143] [Impact Index Per Article: 15.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
|
19268
|
Li X, Zhang Y, Marsic I, Sarcevic A, Burd RS. Deep Learning for RFID-Based Activity Recognition. PROCEEDINGS OF THE ... INTERNATIONAL CONFERENCE ON EMBEDDED NETWORKED SENSOR SYSTEMS. INTERNATIONAL CONFERENCE ON EMBEDDED NETWORKED SENSOR SYSTEMS 2016; 2016:164-175. [PMID: 30381808 PMCID: PMC6205502 DOI: 10.1145/2994551.2994569] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Abstract
We present a system for activity recognition from passive RFID data using a deep convolutional neural network. We directly feed the RFID data into a deep convolutional neural network for activity recognition instead of selecting features and using a cascade structure that first detects object use from RFID data followed by predicting the activity. Because our system treats activity recognition as a multi-class classification problem, it is scalable for applications with large number of activity classes. We tested our system using RFID data collected in a trauma room, including 14 hours of RFID data from 16 actual trauma resuscitations. Our system outperformed existing systems developed for activity recognition and achieved similar performance with process-phase detection as systems that require wearable sensors or manually-generated input. We also analyzed the strengths and limitations of our current deep learning architecture for activity recognition from RFID data.
Collapse
Affiliation(s)
- Xinyu Li
- Department of Electrical and Computer Engineering, Rutgers University, New Brunswick, NJ, USA
| | - Yanyi Zhang
- Department of Electrical and Computer Engineering, Rutgers University, New Brunswick, NJ, USA
| | - Ivan Marsic
- Department of Electrical and Computer Engineering, Rutgers University, New Brunswick, NJ, USA
| | - Aleksandra Sarcevic
- College of Computing and Informatics, Drexel University, Philadelphia, PA, USA
| | - Randall S Burd
- Division of Trauma and Burn Surgery, Children's National Medical Center, Washington, D.C., USA
| |
Collapse
|
19269
|
Abstract
In 2007, we proposed an explanation of delusion formation as aberrant prediction error-driven associative learning. Further, we argued that the NMDA receptor antagonist ketamine provided a good model for this process. Subsequently, we validated the model in patients with psychosis, relating aberrant prediction error signals to delusion severity. During the ensuing period, we have developed these ideas, drawing on the simple principle that brains build a model of the world and refine it by minimising prediction errors, as well as using it to guide perceptual inferences. While previously we focused on the prediction error signal per se, an updated view takes into account its precision, as well as the precision of prior expectations. With this expanded perspective, we see several possible routes to psychotic symptoms - which may explain the heterogeneity of psychotic illness, as well as the fact that other drugs, with different pharmacological actions, can produce psychotomimetic effects. In this article, we review the basic principles of this model and highlight specific ways in which prediction errors can be perturbed, in particular considering the reliability and uncertainty of predictions. The expanded model explains hallucinations as perturbations of the uncertainty mediated balance between expectation and prediction error. Here, expectations dominate and create perceptions by suppressing or ignoring actual inputs. Negative symptoms may arise due to poor reliability of predictions in service of action. By mapping from biology to belief and perception, the account proffers new explanations of psychosis. However, challenges remain. We attempt to address some of these concerns and suggest future directions, incorporating other symptoms into the model, building towards better understanding of psychosis.
Collapse
Affiliation(s)
| | | | - Paul C Fletcher
- Department of Psychiatry, University of Cambridge, Cambridge, UK .,Cambridgeshire and Peterborough NHS Foundation Trust, Cambridge, UK
| |
Collapse
|
19270
|
|
19271
|
Winkler DA, Le TC. Performance of Deep and Shallow Neural Networks, the Universal Approximation Theorem, Activity Cliffs, and QSAR. Mol Inform 2016; 36. [DOI: 10.1002/minf.201600118] [Citation(s) in RCA: 51] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2016] [Accepted: 10/04/2016] [Indexed: 12/17/2022]
Affiliation(s)
- David A. Winkler
- CSIRO Manufacturing; Clayton 3168 Australia
- Monash Institute of Pharmaceutical Sciences; Monash University; Parkville 3052 Australia
- Latrobe Institute for Molecular Science; Latrobe University; Bundoora 3082 Australia
- School of Chemical and Physical Science; Flinders University; Bedford Park 5042 Australia
| | - Tu C. Le
- CSIRO Manufacturing; Clayton 3168 Australia
| |
Collapse
|
19272
|
A New Data Representation Based on Training Data Characteristics to Extract Drug Name Entity in Medical Text. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2016; 2016:3483528. [PMID: 27843447 PMCID: PMC5098107 DOI: 10.1155/2016/3483528] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/27/2016] [Revised: 08/08/2016] [Accepted: 09/18/2016] [Indexed: 11/18/2022]
Abstract
One essential task in information extraction from the medical corpus is drug name recognition. Compared with text sources come from other domains, the medical text mining poses more challenges, for example, more unstructured text, the fast growing of new terms addition, a wide range of name variation for the same drug, the lack of labeled dataset sources and external knowledge, and the multiple token representations for a single drug name. Although many approaches have been proposed to overwhelm the task, some problems remained with poor F-score performance (less than 0.75). This paper presents a new treatment in data representation techniques to overcome some of those challenges. We propose three data representation techniques based on the characteristics of word distribution and word similarities as a result of word embedding training. The first technique is evaluated with the standard NN model, that is, MLP. The second technique involves two deep network classifiers, that is, DBN and SAE. The third technique represents the sentence as a sequence that is evaluated with a recurrent NN model, that is, LSTM. In extracting the drug name entities, the third technique gives the best F-score performance compared to the state of the art, with its average F-score being 0.8645.
Collapse
|
19273
|
Gauthier I, Tarr MJ. Visual Object Recognition: Do We (Finally) Know More Now Than We Did? Annu Rev Vis Sci 2016; 2:377-396. [DOI: 10.1146/annurev-vision-111815-114621] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Isabel Gauthier
- Department of Psychology, Vanderbilt University, Nashville, Tennessee 37240-7817;
| | - Michael J. Tarr
- Department of Psychology, Center for the Neural Basis of Cognition, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213
| |
Collapse
|
19274
|
Abstract
Deep convolutional neural networks (CNNs) are widely used in modern AI systems for their superior accuracy but at the cost of high computational complexity. The complexity comes from the need to simultaneously process hundreds of filters and channels in the high-dimensional convolutions, which involve a significant amount of data movement. Although highly-parallel compute paradigms, such as SIMD/SIMT, effectively address the computation requirement to achieve high throughput, energy consumption still remains high as data movement can be more expensive than computation. Accordingly, finding a dataflow that supports parallel processing with minimal data movement cost is crucial to achieving energy-efficient CNN processing without compromising accuracy.
In this paper, we present a novel dataflow, called
row-stationary
(RS), that minimizes data movement energy consumption on a spatial architecture. This is realized by exploiting local data reuse of filter weights and feature map pixels, i.e., activations, in the high-dimensional convolutions, and minimizing data movement of partial sum accumulations. Unlike dataflows used in existing designs, which only reduce certain types of data movement, the proposed RS dataflow can adapt to different CNN shape configurations and reduces all types of data movement through maximally utilizing the processing engine (PE) local storage, direct inter-PE communication and spatial parallelism. To evaluate the energy efficiency of the different dataflows, we propose an analysis framework that compares energy cost under the same hardware area and processing parallelism constraints. Experiments using the CNN configurations of AlexNet show that the proposed RS dataflow is more energy efficient than existing dataflows in both convolutional (1.4× to 2.5×) and fully-connected layers (at least 1.3× for batch size larger than 16). The RS dataflow has also been demonstrated on a fabricated chip, which verifies our energy analysis.
Collapse
|
19275
|
Abstract
This work observes that a large fraction of the computations performed by Deep Neural Networks (DNNs) are intrinsically ineffectual as they involve a multiplication where one of the inputs is zero. This observation motivates
Cnvlutin
(
CNV
), a value-based approach to hardware acceleration that eliminates most of these ineffectual operations, improving performance and energy over a state-of-the-art accelerator with no accuracy loss.
CNV
uses hierarchical data-parallel units, allowing groups of lanes to proceed mostly independently enabling them to skip over the ineffectual computations. A co-designed data storage format encodes the computation elimination decisions taking them off the critical path while avoiding control divergence in the data parallel units. Combined, the units and the data storage format result in a data-parallel architecture that maintains wide, aligned accesses to its memory hierarchy and that keeps its data lanes busy. By loosening the ineffectual computation identification criterion,
CNV
enables further performance and energy efficiency improvements, and more so if a loss in accuracy is acceptable. Experimental measurements over a set of state-of-the-art DNNs for image classification show that
CNV
improves performance over a state-of-the-art accelerator from 1.24× to 1.55× and by 1.37× on average without any loss in accuracy by removing zero-valued operand multiplications alone. While
CNV
incurs an area overhead of 4.49%, it improves overall
EDP
(Energy Delay Product) and
ED
2
P
(Energy Delay Squared Product) on average by 1.47× and 2.01×, respectively. The average performance improvements increase to 1.52× without any loss in accuracy with a broader ineffectual identification policy. Further improvements are demonstrated with a loss in accuracy.
Collapse
|
19276
|
Energy-efficient neural network chips approach human recognition capabilities. Proc Natl Acad Sci U S A 2016; 113:11387-11389. [PMID: 27702894 DOI: 10.1073/pnas.1614109113] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
|
19277
|
Esser SK, Merolla PA, Arthur JV, Cassidy AS, Appuswamy R, Andreopoulos A, Berg DJ, McKinstry JL, Melano T, Barch DR, di Nolfo C, Datta P, Amir A, Taba B, Flickner MD, Modha DS. Convolutional networks for fast, energy-efficient neuromorphic computing. Proc Natl Acad Sci U S A 2016; 113:11441-11446. [PMID: 27651489 PMCID: PMC5068316 DOI: 10.1073/pnas.1604850113] [Citation(s) in RCA: 171] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Deep networks are now able to achieve human-level performance on a broad spectrum of recognition tasks. Independently, neuromorphic computing has now demonstrated unprecedented energy-efficiency through a new chip architecture based on spiking neurons, low precision synapses, and a scalable communication network. Here, we demonstrate that neuromorphic computing, despite its novel architectural primitives, can implement deep convolution networks that (i) approach state-of-the-art classification accuracy across eight standard datasets encompassing vision and speech, (ii) perform inference while preserving the hardware's underlying energy-efficiency and high throughput, running on the aforementioned datasets at between 1,200 and 2,600 frames/s and using between 25 and 275 mW (effectively >6,000 frames/s per Watt), and (iii) can be specified and trained using backpropagation with the same ease-of-use as contemporary deep learning. This approach allows the algorithmic power of deep learning to be merged with the efficiency of neuromorphic processors, bringing the promise of embedded, intelligent, brain-inspired computing one step closer.
Collapse
Affiliation(s)
- Steven K Esser
- Brain-Inspired Computing, IBM Research-Almaden, San Jose, CA 95120
| | - Paul A Merolla
- Brain-Inspired Computing, IBM Research-Almaden, San Jose, CA 95120
| | - John V Arthur
- Brain-Inspired Computing, IBM Research-Almaden, San Jose, CA 95120
| | - Andrew S Cassidy
- Brain-Inspired Computing, IBM Research-Almaden, San Jose, CA 95120
| | | | | | - David J Berg
- Brain-Inspired Computing, IBM Research-Almaden, San Jose, CA 95120
| | | | - Timothy Melano
- Brain-Inspired Computing, IBM Research-Almaden, San Jose, CA 95120
| | - Davis R Barch
- Brain-Inspired Computing, IBM Research-Almaden, San Jose, CA 95120
| | - Carmelo di Nolfo
- Brain-Inspired Computing, IBM Research-Almaden, San Jose, CA 95120
| | - Pallab Datta
- Brain-Inspired Computing, IBM Research-Almaden, San Jose, CA 95120
| | - Arnon Amir
- Brain-Inspired Computing, IBM Research-Almaden, San Jose, CA 95120
| | - Brian Taba
- Brain-Inspired Computing, IBM Research-Almaden, San Jose, CA 95120
| | - Myron D Flickner
- Brain-Inspired Computing, IBM Research-Almaden, San Jose, CA 95120
| | | |
Collapse
|
19278
|
Wang Q, Zhang R. Double JPEG compression forensics based on a convolutional neural network. EURASIP JOURNAL ON INFORMATION SECURITY 2016. [DOI: 10.1186/s13635-016-0047-y] [Citation(s) in RCA: 84] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
|
19279
|
Kim M, Rai N, Zorraquino V, Tagkopoulos I. Multi-omics integration accurately predicts cellular state in unexplored conditions for Escherichia coli. Nat Commun 2016; 7:13090. [PMID: 27713404 PMCID: PMC5059772 DOI: 10.1038/ncomms13090] [Citation(s) in RCA: 103] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2016] [Accepted: 09/01/2016] [Indexed: 12/20/2022] Open
Abstract
A significant obstacle in training predictive cell models is the lack of integrated data sources. We develop semi-supervised normalization pipelines and perform experimental characterization (growth, transcriptional, proteome) to create Ecomics, a consistent, quality-controlled multi-omics compendium for Escherichia coli with cohesive meta-data information. We then use this resource to train a multi-scale model that integrates four omics layers to predict genome-wide concentrations and growth dynamics. The genetic and environmental ontology reconstructed from the omics data is substantially different and complementary to the genetic and chemical ontologies. The integration of different layers confers an incremental increase in the prediction performance, as does the information about the known gene regulatory and protein-protein interactions. The predictive performance of the model ranges from 0.54 to 0.87 for the various omics layers, which far exceeds various baselines. This work provides an integrative framework of omics-driven predictive modelling that is broadly applicable to guide biological discovery. Multi-omics data integration is a great challenge. Here, the authors compile a database of E. coli proteomics, transcriptomics, metabolomics and fluxomics data to train models of recurrent neural network and constrained regression, enabling prediction of bacterial responses to perturbations.
Collapse
Affiliation(s)
- Minseung Kim
- Department of Computer Science, University of California, Davis, California 95616, USA.,Genome Center, University of California, Davis, California 95616, USA
| | - Navneet Rai
- Genome Center, University of California, Davis, California 95616, USA
| | | | - Ilias Tagkopoulos
- Department of Computer Science, University of California, Davis, California 95616, USA.,Genome Center, University of California, Davis, California 95616, USA
| |
Collapse
|
19280
|
Yaniv Z, Faruque J, Howe S, Dunn K, Sharlip D, Bond A, Perillan P, Bodenreider O, Ackerman MJ, Yoo TS. The National Library of Medicine Pill Image Recognition Challenge: An Initial Report. IEEE APPLIED IMAGERY PATTERN RECOGNITION WORKSHOP : [PROCEEDINGS]. IEEE APPLIED IMAGERY PATTERN RECOGNITION WORKSHOP 2016; 2016:10.1109/AIPR.2016.8010584. [PMID: 29854569 PMCID: PMC5973812 DOI: 10.1109/aipr.2016.8010584] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
In January 2016 the U.S. National Library of Medicine announced a challenge competition calling for the development and discovery of high-quality algorithms and software that rank how well consumer images of prescription pills match reference images of pills in its authoritative RxIMAGE collection. This challenge was motivated by the need to easily identify unknown prescription pills both by healthcare personnel and the general public. Potential benefits of this capability include confirmation of the pill in settings where the documentation and medication have been separated, such as in a disaster or emergency; and confirmation of a pill when the prescribed medication changes from brand to generic, or for any other reason the shape and color of the pill change. The data for the competition consisted of two types of images, high quality macro photographs, reference images, and consumer quality photographs of the quality we expect users of a proposed application to acquire. A training dataset consisting of 2000 reference images and 5000 corresponding consumer quality images acquired from 1000 pills was provided to challenge participants. A second dataset acquired from 1000 pills with similar distributions of shape and color was reserved as a segregated testing set. Challenge submissions were required to produce a ranking of the reference images, given a consumer quality image as input. Determination of the winning teams was done using the mean average precision quality metric, with the three winners obtaining mean average precision scores of 0.27, 0.09, and 0.08. In the retrieval results, the correct image was amongst the top five ranked images 43%, 12%, and 11% of the time, out of 5000 query/consumer images. This is an initial promising step towards development of an NLM software system and application-programming interface facilitating pill identification. The training dataset will continue to be freely available online at: http://pir.nlm.nih.gov/challenge/submission.html.
Collapse
Affiliation(s)
- Ziv Yaniv
- National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- TAJ Technologies Inc., Mendota Heights, MN, USA
| | - Jessica Faruque
- National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- CytoVale Inc., San Francisco, CA, USA
| | - Sally Howe
- National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Kathel Dunn
- National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - David Sharlip
- National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | | | | | - Olivier Bodenreider
- National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Michael J Ackerman
- National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- TAJ Technologies Inc., Mendota Heights, MN, USA
| | - Terry S Yoo
- National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| |
Collapse
|
19281
|
|
19282
|
Wang Y, Widrow B, Zadeh LA, Howard N, Wood S, Bhavsar VC, Budin G, Chan C, Fiorini RA, Gavrilova ML, Shell DF. Cognitive Intelligence. INTERNATIONAL JOURNAL OF COGNITIVE INFORMATICS AND NATURAL INTELLIGENCE 2016. [DOI: 10.4018/ijcini.2016100101] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
The theme of IEEE ICCI*CC'16 on Cognitive Informatics (CI) and Cognitive Computing (CC) was on cognitive computers, big data cognition, and machine learning. CI and CC are a contemporary field not only for basic studies on the brain, computational intelligence theories, and denotational mathematics, but also for engineering applications in cognitive systems towards deep learning, deep thinking, and deep reasoning. This paper reports a set of position statements presented in the plenary panel (Part I) in IEEE ICCI*CC'16 at Stanford University. The summary is contributed by invited panelists who are part of the world's renowned scholars in the transdisciplinary field of CI and CC.
Collapse
Affiliation(s)
- Yingxu Wang
- International Institute of Cognitive Informatics and Cognitive Computing (ICIC),Laboratory for Computational Intelligence, Denotational Mathematics, and Software Science, Department of Electrical and Computer Engineering, Schulich School of Engineering and Hotchkiss Brain Institute, University of Calgary, Calgary, Canada & Information Systems Lab, Stanford University, Stanford, CA, USA
| | | | | | | | - Sally Wood
- Santa Clara University, Santa Carla, CA, USA
| | | | - Gerhard Budin
- Center for Translation Studies, Vienna University, Vienna, Austria
| | | | - Rodolfo A. Fiorini
- Department of Electronics, Information and Bioengineering (DEIB), Politecnico di Milano University, Milano, Italy
| | | | - Duane F. Shell
- Department of Educational Psychology, University of Nebraska-Lincoln, Lincoln, NE, USA
| |
Collapse
|
19283
|
Multani N, Rudzicz F, Wong WYS, Namasivayam AK, van Lieshout P. Random Item Generation Is Affected by Age. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2016; 59:1172-1178. [PMID: 27681687 DOI: 10.1044/2016_jslhr-l-15-0077] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/20/2015] [Accepted: 01/25/2016] [Indexed: 06/06/2023]
Abstract
PURPOSE Random item generation (RIG) involves central executive functioning. Measuring aspects of random sequences can therefore provide a simple method to complement other tools for cognitive assessment. We examine the extent to which RIG relates to specific measures of cognitive function, and whether those measures can be estimated using RIG only. METHOD Twelve healthy older adults (age: M = 70.3 years, SD = 4.9; 8 women and 4 men) and 20 healthy young adults (age: M = 24 years, SD = 4.0; 12 women and 8 men) participated in this pilot study. Each completed a RIG task, along with the color Stroop test, the Repeatable Battery for the Assessment of Neuropsychological Status, and the Peabody Picture Vocabulary Test-Fourth Edition (Dunn & Dunn, 2007). Several statistical features extracted from RIG sequences, including recurrence quantification, were found to be related to the other measures through correlation, regression, and a neural-network model. RESULTS The authors found significant effects of age in RIG and demonstrate that nonlinear machine learning can use measures of RIG to accurately predict outcomes from other tools. CONCLUSIONS These results suggest that RIG can be used as a relatively simple predictor for other tools and in particular seems promising as a potential screening tool for selective attention in healthy aging.
Collapse
Affiliation(s)
- Namita Multani
- Oral Dynamics Lab, University of Toronto, Ontario, Canada
| | - Frank Rudzicz
- University of Toronto, Ontario, CanadaRehabilitation Sciences Institute, University of Toronto, Ontario, CanadaToronto Rehabilitation Institute-University Health Network, Ontario, Canada
| | | | - Aravind Kumar Namasivayam
- Oral Dynamics Lab, University of Toronto, Ontario, CanadaToronto Rehabilitation Institute-University Health Network, Ontario, Canada
| | - Pascal van Lieshout
- Oral Dynamics Lab, University of Toronto, Ontario, CanadaUniversity of Toronto, Ontario, CanadaRehabilitation Sciences Institute, University of Toronto, Ontario, CanadaToronto Rehabilitation Institute-University Health Network, Ontario, CanadaInstitute of Biomaterials and Biomedical Engineering, University of Toronto, Ontario, Canada
| |
Collapse
|
19284
|
|
19285
|
Grollier J, Querlioz D, Stiles MD. Spintronic Nanodevices for Bioinspired Computing. PROCEEDINGS OF THE IEEE. INSTITUTE OF ELECTRICAL AND ELECTRONICS ENGINEERS 2016; 104:2024-2039. [PMID: 27881881 PMCID: PMC5117478 DOI: 10.1109/jproc.2016.2597152] [Citation(s) in RCA: 85] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
Bioinspired hardware holds the promise of low-energy, intelligent, and highly adaptable computing systems. Applications span from automatic classification for big data management, through unmanned vehicle control, to control for biomedical prosthesis. However, one of the major challenges of fabricating bioinspired hardware is building ultra-high-density networks out of complex processing units interlinked by tunable connections. Nanometer-scale devices exploiting spin electronics (or spintronics) can be a key technology in this context. In particular, magnetic tunnel junctions (MTJs) are well suited for this purpose because of their multiple tunable functionalities. One such functionality, non-volatile memory, can provide massive embedded memory in unconventional circuits, thus escaping the von-Neumann bottleneck arising when memory and processors are located separately. Other features of spintronic devices that could be beneficial for bioinspired computing include tunable fast nonlinear dynamics, controlled stochasticity, and the ability of single devices to change functions in different operating conditions. Large networks of interacting spintronic nanodevices can have their interactions tuned to induce complex dynamics such as synchronization, chaos, soliton diffusion, phase transitions, criticality, and convergence to multiple metastable states. A number of groups have recently proposed bioinspired architectures that include one or several types of spintronic nanodevices. In this paper, we show how spintronics can be used for bioinspired computing. We review the different approaches that have been proposed, the recent advances in this direction, and the challenges toward fully integrated spintronics complementary metal-oxide-semiconductor (CMOS) bioinspired hardware.
Collapse
Affiliation(s)
- Julie Grollier
- Unité Mixte de Physique CNRS, Thales, Univ. Paris-Sud, Université Paris-Saclay, 91767 Palaiseau, France
| | - Damien Querlioz
- Centre de Nanosciences et de Nanotechnologies, CNRS, Université Paris-Saclay, 91405 Orsay, France
| | - Mark D. Stiles
- Center for Nanoscale Science and Technology, National Institute of Standards and Technology, Gaithersburg, MD 20899-6202 USA
| |
Collapse
|
19286
|
Jackups R, Savage W. Gaps in Research on Adverse Events to Transfusion in Pediatrics. Transfus Med Rev 2016; 30:209-12. [DOI: 10.1016/j.tmrv.2016.06.001] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2016] [Accepted: 06/15/2016] [Indexed: 01/28/2023]
|
19287
|
Kawahara J, Brown CJ, Miller SP, Booth BG, Chau V, Grunau RE, Zwicker JG, Hamarneh G. BrainNetCNN: Convolutional neural networks for brain networks; towards predicting neurodevelopment. Neuroimage 2016; 146:1038-1049. [PMID: 27693612 DOI: 10.1016/j.neuroimage.2016.09.046] [Citation(s) in RCA: 280] [Impact Index Per Article: 31.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2016] [Revised: 08/26/2016] [Accepted: 09/19/2016] [Indexed: 01/01/2023] Open
Abstract
We propose BrainNetCNN, a convolutional neural network (CNN) framework to predict clinical neurodevelopmental outcomes from brain networks. In contrast to the spatially local convolutions done in traditional image-based CNNs, our BrainNetCNN is composed of novel edge-to-edge, edge-to-node and node-to-graph convolutional filters that leverage the topological locality of structural brain networks. We apply the BrainNetCNN framework to predict cognitive and motor developmental outcome scores from structural brain networks of infants born preterm. Diffusion tensor images (DTI) of preterm infants, acquired between 27 and 46 weeks gestational age, were used to construct a dataset of structural brain connectivity networks. We first demonstrate the predictive capabilities of BrainNetCNN on synthetic phantom networks with simulated injury patterns and added noise. BrainNetCNN outperforms a fully connected neural-network with the same number of model parameters on both phantoms with focal and diffuse injury patterns. We then apply our method to the task of joint prediction of Bayley-III cognitive and motor scores, assessed at 18 months of age, adjusted for prematurity. We show that our BrainNetCNN framework outperforms a variety of other methods on the same data. Furthermore, BrainNetCNN is able to identify an infant's postmenstrual age to within about 2 weeks. Finally, we explore the high-level features learned by BrainNetCNN by visualizing the importance of each connection in the brain with respect to predicting the outcome scores. These findings are then discussed in the context of the anatomy and function of the developing preterm infant brain.
Collapse
Affiliation(s)
- Jeremy Kawahara
- Medical Image Analysis Lab, Simon Fraser University, Burnaby, BC, Canada
| | - Colin J Brown
- Medical Image Analysis Lab, Simon Fraser University, Burnaby, BC, Canada
| | - Steven P Miller
- Department of Paediatrics, The Hospital for Sick Children and the University of Toronto, Toronto, ON, Canada
| | - Brian G Booth
- Medical Image Analysis Lab, Simon Fraser University, Burnaby, BC, Canada
| | - Vann Chau
- Department of Paediatrics, The Hospital for Sick Children and the University of Toronto, Toronto, ON, Canada
| | - Ruth E Grunau
- Child and Family Research Institute and the University of British Columbia, Vancouver, BC, Canada
| | - Jill G Zwicker
- Child and Family Research Institute and the University of British Columbia, Vancouver, BC, Canada
| | - Ghassan Hamarneh
- Medical Image Analysis Lab, Simon Fraser University, Burnaby, BC, Canada.
| |
Collapse
|
19288
|
|
19289
|
Estimating CT Image from MRI Data Using 3D Fully Convolutional Networks. DEEP LEARNING AND DATA LABELING FOR MEDICAL APPLICATIONS : FIRST INTERNATIONAL WORKSHOP, LABELS 2016, AND SECOND INTERNATIONAL WORKSHOP, DLMIA 2016, HELD IN CONJUNCTION WITH MICCAI 2016, ATHENS, GREECE, OCTOBER 21, 2016, PROCEEDINGS 2016; 2016:170-178. [PMID: 29075680 DOI: 10.1007/978-3-319-46976-8_18] [Citation(s) in RCA: 117] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Computed tomography (CT) is critical for various clinical applications, e.g., radiotherapy treatment planning and also PET attenuation correction. However, CT exposes radiation during CT imaging, which may cause side effects to patients. Compared to CT, magnetic resonance imaging (MRI) is much safer and does not involve any radiation. Therefore, recently researchers are greatly motivated to estimate CT image from its corresponding MR image of the same subject for the case of radiotherapy planning. In this paper, we propose a 3D deep learning based method to address this challenging problem. Specifically, a 3D fully convolutional neural network (FCN) is adopted to learn an end-to-end nonlinear mapping from MR image to CT image. Compared to the conventional convolutional neural network (CNN), FCN generates structured output and can better preserve the neighborhood information in the predicted CT image. We have validated our method in a real pelvic CT/MRI dataset. Experimental results show that our method is accurate and robust for predicting CT image from MRI image, and also outperforms three state-of-the-art methods under comparison. In addition, the parameters, such as network depth and activation function, are extensively studied to give an insight for deep learning based regression tasks in our application.
Collapse
|
19290
|
Heyde KC, Gallagher PW, Ruder WC. Bioinspired decision architectures containing host and microbiome processing units. BIOINSPIRATION & BIOMIMETICS 2016; 11:056017. [PMID: 27677187 DOI: 10.1088/1748-3190/11/5/056017] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Biomimetic robots have been used to explore and explain natural phenomena ranging from the coordination of ants to the locomotion of lizards. Here, we developed a series of decision architectures inspired by the information exchange between a host organism and its microbiome. We first modeled the biochemical exchanges of a population of synthetically engineered E. coli. We then built a physical, differential drive robot that contained an integrated, onboard computer vision system. A relay was established between the simulated population of cells and the robot's microcontroller. By placing the robot within a target-containing a two-dimensional arena, we explored how different aspects of the simulated cells and the robot's microcontroller could be integrated to form hybrid decision architectures. We found that distinct decision architectures allow for us to develop models of computation with specific strengths such as runtime efficiency or minimal memory allocation. Taken together, our hybrid decision architectures provide a new strategy for developing bioinspired control systems that integrate both living and nonliving components.
Collapse
Affiliation(s)
- K C Heyde
- Engineering Science and Mechanics Program, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061, USA
| | | | | |
Collapse
|
19291
|
Abstract
Zebrafish ( Danio rerio) is an important vertebrate model organism in biomedical research, especially suitable for morphological screening due to its transparent body during early development. Deep learning has emerged as a dominant paradigm for data analysis and found a number of applications in computer vision and image analysis. Here we demonstrate the potential of a deep learning approach for accurate high-throughput classification of whole-body zebrafish deformations in multifish microwell plates. Deep learning uses the raw image data as an input, without the need of expert knowledge for feature design or optimization of the segmentation parameters. We trained the deep learning classifier on as few as 84 images (before data augmentation) and achieved a classification accuracy of 92.8% on an unseen test data set that is comparable to the previous state of the art (95%) based on user-specified segmentation and deformation metrics. Ablation studies by digitally removing whole fish or parts of the fish from the images revealed that the classifier learned discriminative features from the image foreground, and we observed that the deformations of the head region, rather than the visually apparent bent tail, were more important for good classification performance.
Collapse
Affiliation(s)
- Omer Ishaq
- 1 Centre for Image Analysis/SciLifeLab, Uppsala University, Uppsala, Sweden
| | | | - Carolina Wählby
- 1 Centre for Image Analysis/SciLifeLab, Uppsala University, Uppsala, Sweden
| |
Collapse
|
19292
|
Singer W, Lazar A. Does the Cerebral Cortex Exploit High-Dimensional, Non-linear Dynamics for Information Processing? Front Comput Neurosci 2016; 10:99. [PMID: 27713697 PMCID: PMC5031693 DOI: 10.3389/fncom.2016.00099] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2016] [Accepted: 09/02/2016] [Indexed: 12/04/2022] Open
Abstract
The discovery of stimulus induced synchronization in the visual cortex suggested the possibility that the relations among low-level stimulus features are encoded by the temporal relationship between neuronal discharges. In this framework, temporal coherence is considered a signature of perceptual grouping. This insight triggered a large number of experimental studies which sought to investigate the relationship between temporal coordination and cognitive functions. While some core predictions derived from the initial hypothesis were confirmed, these studies, also revealed a rich dynamical landscape beyond simple coherence whose role in signal processing is still poorly understood. In this paper, a framework is presented which establishes links between the various manifestations of cortical dynamics by assigning specific coding functions to low-dimensional dynamic features such as synchronized oscillations and phase shifts on the one hand and high-dimensional non-linear, non-stationary dynamics on the other. The data serving as basis for this synthetic approach have been obtained with chronic multisite recordings from the visual cortex of anesthetized cats and from monkeys trained to solve cognitive tasks. It is proposed that the low-dimensional dynamics characterized by synchronized oscillations and large-scale correlations are substates that represent the results of computations performed in the high-dimensional state-space provided by recurrently coupled networks.
Collapse
Affiliation(s)
- Wolf Singer
- Ernst Strüngmann Institute for Neuroscience in Cooperation with Max Planck SocietyFrankfurt am Main, Germany; Max Planck Institute for Brain ResearchFrankfurt am Main, Germany; Frankfurt Institute for Advanced StudiesFrankfurt am Main, Germany
| | - Andreea Lazar
- Ernst Strüngmann Institute for Neuroscience in Cooperation with Max Planck SocietyFrankfurt am Main, Germany; Max Planck Institute for Brain ResearchFrankfurt am Main, Germany; Frankfurt Institute for Advanced StudiesFrankfurt am Main, Germany
| |
Collapse
|
19293
|
Gastounioti A, Conant EF, Kontos D. Beyond breast density: a review on the advancing role of parenchymal texture analysis in breast cancer risk assessment. Breast Cancer Res 2016; 18:91. [PMID: 27645219 PMCID: PMC5029019 DOI: 10.1186/s13058-016-0755-8] [Citation(s) in RCA: 84] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
BACKGROUND The assessment of a woman's risk for developing breast cancer has become increasingly important for establishing personalized screening recommendations and forming preventive strategies. Studies have consistently shown a strong relationship between breast cancer risk and mammographic parenchymal patterns, typically assessed by percent mammographic density. This paper will review the advancing role of mammographic texture analysis as a potential novel approach to characterize the breast parenchymal tissue to augment conventional density assessment in breast cancer risk estimation. MAIN TEXT The analysis of mammographic texture provides refined, localized descriptors of parenchymal tissue complexity. Currently, there is growing evidence in support of textural features having the potential to augment the typically dichotomized descriptors (dense or not dense) of area or volumetric measures of breast density in breast cancer risk assessment. Therefore, a substantial research effort has been devoted to automate mammographic texture analysis, with the aim of ultimately incorporating such quantitative measures into breast cancer risk assessment models. In this paper, we review current and emerging approaches in this field, summarizing key methodological details and related studies using novel computerized approaches. We also discuss research challenges for advancing the role of parenchymal texture analysis in breast cancer risk stratification and accelerating its clinical translation. CONCLUSIONS The objective is to provide a comprehensive reference for researchers in the field of parenchymal pattern analysis in breast cancer risk assessment, while indicating key directions for future research.
Collapse
Affiliation(s)
- Aimilia Gastounioti
- Department of Radiology, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Emily F Conant
- Department of Radiology, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Despina Kontos
- Department of Radiology, University of Pennsylvania, Philadelphia, PA, 19104, USA.
| |
Collapse
|
19294
|
Hermans M, Antonik P, Haelterman M, Massar S. Embodiment of Learning in Electro-Optical Signal Processors. PHYSICAL REVIEW LETTERS 2016; 117:128301. [PMID: 27689299 DOI: 10.1103/physrevlett.117.128301] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/04/2016] [Indexed: 06/06/2023]
Abstract
Delay-coupled electro-optical systems have received much attention for their dynamical properties and their potential use in signal processing. In particular, it has recently been demonstrated, using the artificial intelligence algorithm known as reservoir computing, that photonic implementations of such systems solve complex tasks such as speech recognition. Here, we show how the backpropagation algorithm can be physically implemented on the same electro-optical delay-coupled architecture used for computation with only minor changes to the original design. We find that, compared to when the backpropagation algorithm is not used, the error rate of the resulting computing device, evaluated on three benchmark tasks, decreases considerably. This demonstrates that electro-optical analog computers can embody a large part of their own training process, allowing them to be applied to new, more difficult tasks.
Collapse
Affiliation(s)
- Michiel Hermans
- Laboratoire d'Information Quantique, Université libre de Bruxelles, 50 Avenue F. D. Roosevelt, CP 224, B-1050 Brussels, Belgium
| | - Piotr Antonik
- Laboratoire d'Information Quantique, Université libre de Bruxelles, 50 Avenue F. D. Roosevelt, CP 224, B-1050 Brussels, Belgium
| | - Marc Haelterman
- Service OPERA-Photonique, Université libre de Bruxelles, 50 Avenue F. D. Roosevelt, CP 194/5, B-1050 Brussels, Belgium
| | - Serge Massar
- Laboratoire d'Information Quantique, Université libre de Bruxelles, 50 Avenue F. D. Roosevelt, CP 224, B-1050 Brussels, Belgium
| |
Collapse
|
19295
|
Cui Y, Ahmad S, Hawkins J. Continuous Online Sequence Learning with an Unsupervised Neural Network Model. Neural Comput 2016; 28:2474-2504. [PMID: 27626963 DOI: 10.1162/neco_a_00893] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
The ability to recognize and predict temporal sequences of sensory inputs is vital for survival in natural environments. Based on many known properties of cortical neurons, hierarchical temporal memory (HTM) sequence memory recently has been proposed as a theoretical framework for sequence learning in the cortex. In this letter, we analyze properties of HTM sequence memory and apply it to sequence learning and prediction problems with streaming data. We show the model is able to continuously learn a large number of variable order temporal sequences using an unsupervised Hebbian-like learning rule. The sparse temporal codes formed by the model can robustly handle branching temporal sequences by maintaining multiple predictions until there is sufficient disambiguating evidence. We compare the HTM sequence memory with other sequence learning algorithms, including statistical methods-autoregressive integrated moving average; feedforward neural networks-time delay neural network and online sequential extreme learning machine; and recurrent neural networks-long short-term memory and echo-state networks on sequence prediction problems with both artificial and real-world data. The HTM model achieves comparable accuracy to other state-of-the-art algorithms. The model also exhibits properties that are critical for sequence learning, including continuous online learning, the ability to handle multiple predictions and branching sequences with high-order statistics, robustness to sensor noise and fault tolerance, and good performance without task-specific hyperparameter tuning. Therefore, the HTM sequence memory not only advances our understanding of how the brain may solve the sequence learning problem but is also applicable to real-world sequence learning problems from continuous data streams.
Collapse
Affiliation(s)
- Yuwei Cui
- Numenta, Inc. Redwood City, CA 94063, U.S.A.
| | | | | |
Collapse
|
19296
|
Marblestone AH, Wayne G, Kording KP. Toward an Integration of Deep Learning and Neuroscience. Front Comput Neurosci 2016; 10:94. [PMID: 27683554 PMCID: PMC5021692 DOI: 10.3389/fncom.2016.00094] [Citation(s) in RCA: 251] [Impact Index Per Article: 27.9] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2016] [Accepted: 08/24/2016] [Indexed: 01/22/2023] Open
Abstract
Neuroscience has focused on the detailed implementation of computation, studying neural codes, dynamics and circuits. In machine learning, however, artificial neural networks tend to eschew precisely designed codes, dynamics or circuits in favor of brute force optimization of a cost function, often using simple and relatively uniform initial architectures. Two recent developments have emerged within machine learning that create an opportunity to connect these seemingly divergent perspectives. First, structured architectures are used, including dedicated systems for attention, recursion and various forms of short- and long-term memory storage. Second, cost functions and training procedures have become more complex and are varied across layers and over time. Here we think about the brain in terms of these ideas. We hypothesize that (1) the brain optimizes cost functions, (2) the cost functions are diverse and differ across brain locations and over development, and (3) optimization operates within a pre-structured architecture matched to the computational problems posed by behavior. In support of these hypotheses, we argue that a range of implementations of credit assignment through multiple layers of neurons are compatible with our current knowledge of neural circuitry, and that the brain's specialized systems can be interpreted as enabling efficient optimization for specific problem classes. Such a heterogeneously optimized system, enabled by a series of interacting cost functions, serves to make learning data-efficient and precisely targeted to the needs of the organism. We suggest directions by which neuroscience could seek to refine and test these hypotheses.
Collapse
Affiliation(s)
- Adam H. Marblestone
- Synthetic Neurobiology Group, Massachusetts Institute of Technology, Media LabCambridge, MA, USA
| | | | - Konrad P. Kording
- Rehabilitation Institute of Chicago, Northwestern UniversityChicago, IL, USA
| |
Collapse
|
19297
|
Wronkiewicz M, Larson E, Lee AKC. Incorporating modern neuroscience findings to improve brain–computer interfaces: tracking auditory attention. J Neural Eng 2016; 13:056017. [DOI: 10.1088/1741-2560/13/5/056017] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
|
19298
|
Ekins S, Diaz N, Chung J, Mathews P, McMurtray A. Enabling Anyone to Translate Clinically Relevant Ideas to Therapies. Pharm Res 2016; 34:1-6. [PMID: 27620174 DOI: 10.1007/s11095-016-2039-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2016] [Accepted: 09/07/2016] [Indexed: 11/25/2022]
Abstract
How do we inspire new ideas that could lead to potential treatments for rare or neglected diseases, and allow for serendipity that could help to catalyze them? How many potentially good ideas are lost because they are never tested? What if those ideas could have lead to new therapeutic approaches and major healthcare advances? If a clinician or anyone for that matter, has a new idea they want to test to develop a molecule or therapeutic that they could translate to the clinic, how would they do it without a laboratory or funding? These are not idle theoretical questions but addressing them could have potentially huge economic implications for nations. If we fail to capture the diversity of ideas and test them we may also lose out on the next blockbuster treatments. Many of those involved in the process of ideation may be discouraged and simply not know where to go. We try to address these questions and describe how there are options to raising funding, how even small scale investments can foster preclinical or clinical translation, and how there are several approaches to outsourcing the experiments, whether to collaborators or commercial enterprises. While these are not new or far from complete solutions, they are first steps that can be taken by virtually anyone while we work on other solutions to build a more concrete structure for the "idea-hypothesis testing-proof of concept-translation-breakthrough pathway".
Collapse
Affiliation(s)
- Sean Ekins
- Collaborations Pharmaceuticals, Inc., 5616 Hilltop Needmore Road, Fuquay-Varina, Noth Carolina, 27526, USA.
- Phoenix Nest, Inc., P.O. BOX 150057, Brooklyn, New York, 11215, USA.
| | - Natalie Diaz
- Department of Neurology, Los Angeles Biomedical Research Institute, Torrance, California, 90502, USA
- Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, California, 90095, USA
- Department of Neurology, Harbor-UCLA Medical Center, Torrance, California, 90509, USA
| | - Julia Chung
- Department of Psychiatry, Los Angeles Biomedical Research Institute, Torrance, California, 90502, USA
- Department of Psychiatry, Harbor-UCLA Medical Center, Torrance, California, 90509, USA
- Department of Psychiatry, David Geffen School of Medicine, University of California, Los Angeles, California, 90095, USA
| | - Paul Mathews
- Department of Neurology, Los Angeles Biomedical Research Institute, Torrance, California, 90502, USA
- Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, California, 90095, USA
| | - Aaron McMurtray
- Department of Neurology, Los Angeles Biomedical Research Institute, Torrance, California, 90502, USA
- Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, California, 90095, USA
- Department of Neurology, Harbor-UCLA Medical Center, Torrance, California, 90509, USA
| |
Collapse
|
19299
|
Recent developments in human gait research: parameters, approaches, applications, machine learning techniques, datasets and challenges. Artif Intell Rev 2016. [DOI: 10.1007/s10462-016-9514-6] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
19300
|
Abstract
The launch of the United States' BRAIN Initiative brings with it a new era in systems neuroscience that is being driven by innovative neurotechnologies, increases in computational power and network-style artificial intelligence. A new conceptual framework for understanding cognitive behaviours based on the dynamical patterns of activity in large populations of neurons is emerging.
Collapse
|