51
|
Nanni L, Brahnam S, Ghidoni S, Lumini A. Bioimage Classification with Handcrafted and Learned Features. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 16:874-885. [PMID: 29994096 DOI: 10.1109/tcbb.2018.2821127] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Bioimage classification is increasingly becoming more important in many biological studies including those that require accurate cell phenotype recognition, subcellular localization, and histopathological classification. In this paper, we present a new General Purpose (GenP) bioimage classification method that can be applied to a large range of classification problems. The GenP system we propose is an ensemble that combines multiple texture features (both handcrafted and learned descriptors) for superior and generalizable discriminative power. Our ensemble obtains a boosting of performance by combining local features, dense sampling features, and deep learning features. Each descriptor is used to train a different Support Vector Machine that is then combined by sum rule. We evaluate our method on a diverse set of bioimage classification tasks each represented by a benchmark database, including some of those available in the IICBU 2008 database. Each bioimage classification task represents a typical subcellular, cellular, and tissue level classification problem. Our evaluation on these datasets demonstrates that the proposed GenP bioimage ensemble obtains state-of-the-art performance without any ad-hoc dataset tuning of the parameters (thereby avoiding any risk of overfitting/overtraining). To reproduce the experiments reported in this paper, the MATLAB code of all the descriptors is available at https://github.com/LorisNanni and https://www.dropbox.com/s/bguw035yrqz0pwp/ElencoCode.docx?dl=0.
Collapse
|
52
|
Shankar A, Fernandes JL, Kaur K, Sharma M, Kundu S, Pandey GK. Rice phytoglobins regulate responses under low mineral nutrients and abiotic stresses in Arabidopsis thaliana. PLANT, CELL & ENVIRONMENT 2018; 41:215-230. [PMID: 29044557 DOI: 10.1111/pce.13081] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/22/2016] [Revised: 09/25/2017] [Accepted: 09/27/2017] [Indexed: 06/07/2023]
Abstract
Just like animals, plants also contain haemoglobins (known as phytoglobins in plants). Plant phytoglobins (Pgbs) have been categorized into 6 different classes, namely, Phytogb0 (Pgb0), Phytogb1 (Pgb1), Phytogb2 (Pgb2), SymPhytogb (sPgb), Leghaemoglobin (Lb), and Phytogb3 (Pgb3). Among the 6 Phytogbs, sPgb and Lb have been functionally characterized, whereas understanding of the roles of other Pgbs is still evolving. In our present study, we have explored the function of 2 rice Pgbs (OsPgb1.1 and OsPgb1.2). OsPgb1.1, OsPgb1.2, OsPgb1.3, and OsPgb1.4 displayed increased level of transcript upon salt, drought, cold, and ABA treatment. The overexpression (OX) lines of OsPgb1.2 in Arabidopsis showed a tolerant phenotype in terms of better root growth in low potassium (K+ ) conditions. The expression of the known K+ gene markers such as LOX2, HAK5, and CAX3 was much higher in the OsPgb1.2 OX as compared to wild type. Furthermore, the OsPgb1.2 OX lines showed a decrease in reactive oxygen species (ROS) production and conversely an increase in the K+ content, both in root and shoot, as compared to wild type in K+ limiting condition. Our results indicated the potential involvement of OsPgb1.2 in signalling networks triggered by the nutrient deficiency stresses.
Collapse
Affiliation(s)
- Alka Shankar
- Department of Plant Molecular Biology, University of Delhi South Campus, Benito Juarez Road, New Delhi, 110021, India
| | - Joel Lars Fernandes
- Department of Plant Molecular Biology, University of Delhi South Campus, Benito Juarez Road, New Delhi, 110021, India
| | - Kanwaljeet Kaur
- Department of Plant Molecular Biology, University of Delhi South Campus, Benito Juarez Road, New Delhi, 110021, India
| | - Manisha Sharma
- Department of Plant Molecular Biology, University of Delhi South Campus, Benito Juarez Road, New Delhi, 110021, India
| | - Suman Kundu
- Department of Biochemistry, University of Delhi South Campus, Benito Juarez Road, New Delhi, 110021, India
| | - Girdhar K Pandey
- Department of Plant Molecular Biology, University of Delhi South Campus, Benito Juarez Road, New Delhi, 110021, India
| |
Collapse
|
53
|
pLoc-mEuk: Predict subcellular localization of multi-label eukaryotic proteins by extracting the key GO information into general PseAAC. Genomics 2018; 110:50-58. [DOI: 10.1016/j.ygeno.2017.08.005] [Citation(s) in RCA: 180] [Impact Index Per Article: 25.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2017] [Revised: 08/10/2017] [Accepted: 08/11/2017] [Indexed: 11/22/2022]
|
54
|
Wan S, Duan Y, Zou Q. HPSLPred: An Ensemble Multi-Label Classifier for Human Protein Subcellular Location Prediction with Imbalanced Source. Proteomics 2017; 17. [PMID: 28776938 DOI: 10.1002/pmic.201700262] [Citation(s) in RCA: 70] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2017] [Revised: 07/19/2017] [Indexed: 11/11/2022]
Abstract
Predicting the subcellular localization of proteins is an important and challenging problem. Traditional experimental approaches are often expensive and time-consuming. Consequently, a growing number of research efforts employ a series of machine learning approaches to predict the subcellular location of proteins. There are two main challenges among the state-of-the-art prediction methods. First, most of the existing techniques are designed to deal with multi-class rather than multi-label classification, which ignores connections between multiple labels. In reality, multiple locations of particular proteins imply that there are vital and unique biological significances that deserve special focus and cannot be ignored. Second, techniques for handling imbalanced data in multi-label classification problems are necessary, but never employed. For solving these two issues, we have developed an ensemble multi-label classifier called HPSLPred, which can be applied for multi-label classification with an imbalanced protein source. For convenience, a user-friendly webserver has been established at http://server.malab.cn/HPSLPred.
Collapse
Affiliation(s)
- Shixiang Wan
- School of Computer Science and Technology, Tianjin University, Tianjin, P. R. China
| | - Yucong Duan
- State Key Laboratory of Marine Resource Utilization in the South China Sea, College of Information and Technology, Hainan University, Haikou, Hainan, P. R. China
| | - Quan Zou
- School of Computer Science and Technology, Tianjin University, Tianjin, P. R. China
| |
Collapse
|
55
|
Cheng X, Xiao X, Chou KC. pLoc-mHum: predict subcellular localization of multi-location human proteins via general PseAAC to winnow out the crucial GO information. Bioinformatics 2017; 34:1448-1456. [DOI: 10.1093/bioinformatics/btx711] [Citation(s) in RCA: 127] [Impact Index Per Article: 15.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2017] [Accepted: 10/31/2017] [Indexed: 01/19/2023] Open
Affiliation(s)
- Xiang Cheng
- Computer Science, Jingdezhen Ceramic Institute, Jingdezhen, China
- Computational Biology, Gordon Life Science Institute, Boston, MA, USA
| | - Xuan Xiao
- Computer Science, Jingdezhen Ceramic Institute, Jingdezhen, China
- Computational Biology, Gordon Life Science Institute, Boston, MA, USA
| | - Kuo-Chen Chou
- Computer Science, Jingdezhen Ceramic Institute, Jingdezhen, China
- Computational Biology, Gordon Life Science Institute, Boston, MA, USA
- Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| |
Collapse
|
56
|
Cheng X, Xiao X, Chou KC. pLoc-mGneg: Predict subcellular localization of Gram-negative bacterial proteins by deep gene ontology learning via general PseAAC. Genomics 2017; 110:S0888-7543(17)30102-7. [PMID: 28989035 DOI: 10.1016/j.ygeno.2017.10.002] [Citation(s) in RCA: 92] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2017] [Revised: 09/28/2017] [Accepted: 10/04/2017] [Indexed: 01/21/2023]
Abstract
Information of the proteins' subcellular localization is crucially important for revealing their biological functions in a cell, the basic unit of life. With the avalanche of protein sequences generated in the postgenomic age, it is highly desired to develop computational tools for timely identifying their subcellular locations based on the sequence information alone. The current study is focused on the Gram-negative bacterial proteins. Although considerable efforts have been made in protein subcellular prediction, the problem is far from being solved yet. This is because mounting evidences have indicated that many Gram-negative bacterial proteins exist in two or more location sites. Unfortunately, most existing methods can be used to deal with single-location proteins only. Actually, proteins with multi-locations may have some special biological functions important for both basic research and drug design. In this study, by using the multi-label theory, we developed a new predictor called "pLoc-mGneg" for predicting the subcellular localization of Gram-negative bacterial proteins with both single and multiple locations. Rigorous cross-validation on a high quality benchmark dataset indicated that the proposed predictor is remarkably superior to "iLoc-Gneg", the state-of-the-art predictor for the same purpose. For the convenience of most experimental scientists, a user-friendly web-server for the novel predictor has been established at http://www.jci-bioinfo.cn/pLoc-mGneg/, by which users can easily get their desired results without the need to go through the complicated mathematics involved.
Collapse
Affiliation(s)
- Xiang Cheng
- Computer Department, Jingdezhen Ceramic Institute, Jingdezhen, China; The Gordon Life Science Institute, Boston, MA 02478, USA.
| | - Xuan Xiao
- Computer Department, Jingdezhen Ceramic Institute, Jingdezhen, China; The Gordon Life Science Institute, Boston, MA 02478, USA.
| | - Kuo-Chen Chou
- The Gordon Life Science Institute, Boston, MA 02478, USA; Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China; Faculty of Computing and Information Technology in Rabigh, King Abdulaziz University, Jeddah, Saudi Arabia.
| |
Collapse
|
57
|
pLoc-mVirus: Predict subcellular localization of multi-location virus proteins via incorporating the optimal GO information into general PseAAC. Gene 2017; 628:315-321. [DOI: 10.1016/j.gene.2017.07.036] [Citation(s) in RCA: 135] [Impact Index Per Article: 16.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2017] [Revised: 07/08/2017] [Accepted: 07/11/2017] [Indexed: 12/25/2022]
|
58
|
Cheng X, Zhao SG, Lin WZ, Xiao X, Chou KC. pLoc-mAnimal: predict subcellular localization of animal proteins with both single and multiple sites. Bioinformatics 2017; 33:3524-3531. [DOI: 10.1093/bioinformatics/btx476] [Citation(s) in RCA: 167] [Impact Index Per Article: 20.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2017] [Accepted: 07/22/2017] [Indexed: 12/24/2022] Open
Affiliation(s)
- Xiang Cheng
- College of Information Science and Technology, Donghua University, Shanghai, China
- Computer Department, Jingdezhen Ceramic Institute, Jingdezhen, China
| | - Shu-Guang Zhao
- College of Information Science and Technology, Donghua University, Shanghai, China
| | - Wei-Zhong Lin
- Computer Department, Jingdezhen Ceramic Institute, Jingdezhen, China
| | - Xuan Xiao
- Computer Department, Jingdezhen Ceramic Institute, Jingdezhen, China
- The Gordon Life Science Institute, Boston, MA, USA
| | - Kuo-Chen Chou
- The Gordon Life Science Institute, Boston, MA, USA
- Center of Excellence in Genomic Medicine Research (CEGMR), King Abdulaziz University, Jeddah, Saudi Arabia
- Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| |
Collapse
|
59
|
Pärnamaa T, Parts L. Accurate Classification of Protein Subcellular Localization from High-Throughput Microscopy Images Using Deep Learning. G3 (BETHESDA, MD.) 2017; 7:1385-1392. [PMID: 28391243 PMCID: PMC5427497 DOI: 10.1534/g3.116.033654] [Citation(s) in RCA: 88] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/18/2016] [Accepted: 11/22/2016] [Indexed: 11/29/2022]
Abstract
High-throughput microscopy of many single cells generates high-dimensional data that are far from straightforward to analyze. One important problem is automatically detecting the cellular compartment where a fluorescently-tagged protein resides, a task relatively simple for an experienced human, but difficult to automate on a computer. Here, we train an 11-layer neural network on data from mapping thousands of yeast proteins, achieving per cell localization classification accuracy of 91%, and per protein accuracy of 99% on held-out images. We confirm that low-level network features correspond to basic image characteristics, while deeper layers separate localization classes. Using this network as a feature calculator, we train standard classifiers that assign proteins to previously unseen compartments after observing only a small number of training examples. Our results are the most accurate subcellular localization classifications to date, and demonstrate the usefulness of deep learning for high-throughput microscopy.
Collapse
Affiliation(s)
- Tanel Pärnamaa
- Institute of Computer Science, University of Tartu, 50409, Estonia
| | - Leopold Parts
- Institute of Computer Science, University of Tartu, 50409, Estonia
- Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire CB10 1SA, United Kingdom
| |
Collapse
|
60
|
Xue J, Balamurugan S, Li DW, Liu YH, Zeng H, Wang L, Yang WD, Liu JS, Li HY. Glucose-6-phosphate dehydrogenase as a target for highly efficient fatty acid biosynthesis in microalgae by enhancing NADPH supply. Metab Eng 2017; 41:212-221. [PMID: 28465173 DOI: 10.1016/j.ymben.2017.04.008] [Citation(s) in RCA: 113] [Impact Index Per Article: 14.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2016] [Revised: 04/27/2017] [Accepted: 04/28/2017] [Indexed: 01/10/2023]
Abstract
Oleaginous microalgae have great prospects in the fields of feed, nutrition, biofuel, etc. However, biomass and lipid productivity in microalgae remain a major economic and technological bottleneck. Here we present a novel regulatory target, glucose-6-phosphate dehydrogenase (G6PD) from the pentose phosphate pathway (PPP), in boosting microalgal lipid accumulation. G6PD, involved in the formation of NADPH demanded in fatty acid biosynthesis as reducing power, was characterized in oleaginous microalga Phaeodactylum tricornutum. In G6PD overexpressing microalgae, transcript abundance of G6PD increased by 4.4-fold, and G6PD enzyme activity increased by more than 3.1-fold with enhanced NADPH production. Consequently, the lipid content increased by 2.7-fold and reached up to 55.7% of dry weight, while cell growth was not apparently affected. The fatty acid composition exhibited significant changes, including a remarkable increase in monounsaturated fatty acids C16:1 and C18:1 concomitant with a decrease in polyunsaturated fatty acids C20:5 and C22:6. G6PD was localized to the chloroplast and its overexpression stimulated an increase in the number and size of oil bodies. Proteomic and metabolomic analyzes revealed that G6PD play a key role in regulating pentose phosphate pathway and subsequently upregulating NADPH consuming pathways such as fatty acid synthesis, thus eventually leading to lipid accumulation. Our findings show the critical role of G6PD in microalgal lipid accumulation by enhancing NADPH supply and demonstrate that G6PD is a promising target for metabolic engineering.
Collapse
Affiliation(s)
- Jiao Xue
- Key Laboratory of Eutrophication and Red Tide Prevention of Guangdong Higher Education Institutes, College of Life Science and Technology, Jinan University, Guangzhou 510632, China
| | - Srinivasan Balamurugan
- Key Laboratory of Eutrophication and Red Tide Prevention of Guangdong Higher Education Institutes, College of Life Science and Technology, Jinan University, Guangzhou 510632, China
| | - Da-Wei Li
- Key Laboratory of Eutrophication and Red Tide Prevention of Guangdong Higher Education Institutes, College of Life Science and Technology, Jinan University, Guangzhou 510632, China
| | - Yu-Hong Liu
- Key Laboratory of Eutrophication and Red Tide Prevention of Guangdong Higher Education Institutes, College of Life Science and Technology, Jinan University, Guangzhou 510632, China
| | - Hao Zeng
- Key Laboratory of Eutrophication and Red Tide Prevention of Guangdong Higher Education Institutes, College of Life Science and Technology, Jinan University, Guangzhou 510632, China
| | - Lan Wang
- Key Laboratory of Eutrophication and Red Tide Prevention of Guangdong Higher Education Institutes, College of Life Science and Technology, Jinan University, Guangzhou 510632, China
| | - Wei-Dong Yang
- Key Laboratory of Eutrophication and Red Tide Prevention of Guangdong Higher Education Institutes, College of Life Science and Technology, Jinan University, Guangzhou 510632, China
| | - Jie-Sheng Liu
- Key Laboratory of Eutrophication and Red Tide Prevention of Guangdong Higher Education Institutes, College of Life Science and Technology, Jinan University, Guangzhou 510632, China
| | - Hong-Ye Li
- Key Laboratory of Eutrophication and Red Tide Prevention of Guangdong Higher Education Institutes, College of Life Science and Technology, Jinan University, Guangzhou 510632, China.
| |
Collapse
|
61
|
Wu W, Lin J, Wang S, Li Y, Liu M, Liu G, Cai J, Chen G, Chen R. A novel multiphoton microscopy images segmentation method based on superpixel and watershed. JOURNAL OF BIOPHOTONICS 2017; 10:532-541. [PMID: 27090206 DOI: 10.1002/jbio.201600007] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/11/2016] [Revised: 03/24/2016] [Accepted: 03/28/2016] [Indexed: 06/05/2023]
Abstract
Multiphoton microscopy (MPM) imaging technique based on two-photon excited fluorescence (TPEF) and second harmonic generation (SHG) shows fantastic performance for biological imaging. The automatic segmentation of cellular architectural properties for biomedical diagnosis based on MPM images is still a challenging issue. A novel multiphoton microscopy images segmentation method based on superpixels and watershed (MSW) is presented here to provide good segmentation results for MPM images. The proposed method uses SLIC superpixels instead of pixels to analyze MPM images for the first time. The superpixels segmentation based on a new distance metric combined with spatial, CIE Lab color space and phase congruency features, divides the images into patches which keep the details of the cell boundaries. Then the superpixels are used to reconstruct new images by defining an average value of superpixels as image pixels intensity level. Finally, the marker-controlled watershed is utilized to segment the cell boundaries from the reconstructed images. Experimental results show that cellular boundaries can be extracted from MPM images by MSW with higher accuracy and robustness.
Collapse
Affiliation(s)
- Weilin Wu
- Key Laboratory of OptoElectronic Science and Technology for Medicine of Ministry of Education Fujian Normal University, Fuzhou, Fujian, 350007, China
- Department of Network and Communication Engineering, Fujian Normal University, Fuzhou, Fujian, 350007, China
| | - Jinyong Lin
- Department of Radiation Oncology, Fujian Provincial Cancer Hospital, Fuzhou, Fujian, 350014, China
| | - Shu Wang
- Key Laboratory of OptoElectronic Science and Technology for Medicine of Ministry of Education Fujian Normal University, Fuzhou, Fujian, 350007, China
- Department of Network and Communication Engineering, Fujian Normal University, Fuzhou, Fujian, 350007, China
| | - Yan Li
- Key Laboratory of OptoElectronic Science and Technology for Medicine of Ministry of Education Fujian Normal University, Fuzhou, Fujian, 350007, China
- Department of Network and Communication Engineering, Fujian Normal University, Fuzhou, Fujian, 350007, China
| | - Mingyu Liu
- Key Laboratory of OptoElectronic Science and Technology for Medicine of Ministry of Education Fujian Normal University, Fuzhou, Fujian, 350007, China
- Department of Network and Communication Engineering, Fujian Normal University, Fuzhou, Fujian, 350007, China
| | - Gaoqiang Liu
- Key Laboratory of OptoElectronic Science and Technology for Medicine of Ministry of Education Fujian Normal University, Fuzhou, Fujian, 350007, China
- Department of Network and Communication Engineering, Fujian Normal University, Fuzhou, Fujian, 350007, China
| | - Jianyong Cai
- Key Laboratory of OptoElectronic Science and Technology for Medicine of Ministry of Education Fujian Normal University, Fuzhou, Fujian, 350007, China
- Department of Network and Communication Engineering, Fujian Normal University, Fuzhou, Fujian, 350007, China
| | - Guannan Chen
- Key Laboratory of OptoElectronic Science and Technology for Medicine of Ministry of Education Fujian Normal University, Fuzhou, Fujian, 350007, China
- Department of Network and Communication Engineering, Fujian Normal University, Fuzhou, Fujian, 350007, China
| | - Rong Chen
- Key Laboratory of OptoElectronic Science and Technology for Medicine of Ministry of Education Fujian Normal University, Fuzhou, Fujian, 350007, China
- Department of Network and Communication Engineering, Fujian Normal University, Fuzhou, Fujian, 350007, China
| |
Collapse
|
62
|
Bower AJ, Chidester B, Li J, Zhao Y, Marjanovic M, Chaney EJ, Do MN, Boppart SA. A quantitative framework for the analysis of multimodal optical microscopy images. Quant Imaging Med Surg 2017; 7:24-37. [PMID: 28275557 DOI: 10.21037/qims.2017.02.07] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
BACKGROUND Multimodal optical microscopy, a set of imaging techniques based on unique, yet complementary contrast mechanisms and spatially and temporally co-registered data acquisition, has emerged as a powerful biomedical tool. However, the analysis of the dense, high-dimensional datasets acquired by these instruments remains mostly qualitative and restricted to analysis of each modality individually. METHODS Using a custom-built multimodal nonlinear optical microscope, high dimensional datasets were acquired for automated classification of functional cell states as well as identification of histopathological features in tissues slices. Supervised classification of cell death modes was performed through support vector machines (SVM) and semi-supervised classification of tissue slices was performed through the use of the expectation maximization (EM) algorithm. RESULTS Applications of these techniques to the automated classification of cell death modes as well as to the identification of tissue components in fixed ex vivo tissue slices are presented. The analysis techniques developed provide a direct link between multimodal image contrast and biological structure and function, resulting in highly accurate classification in both settings. CONCLUSIONS Quantification of multimodal optical microscopy images through statistical modeling of the high dimensional data acquired gives a strong correlation between biological structure and function and image contrast. These methods are sensitive to the identification of diagnostic, cellular-level features important in a variety of clinical settings.
Collapse
Affiliation(s)
- Andrew J Bower
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL, USA; Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Benjamin Chidester
- Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Joanne Li
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL, USA; Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Youbo Zhao
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Marina Marjanovic
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL, USA; Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Eric J Chaney
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Minh N Do
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL, USA; Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, IL, USA; Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Stephen A Boppart
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL, USA; Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, IL, USA; Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, IL, USA; Department of Internal Medicine, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| |
Collapse
|
63
|
Xiao X, Cheng X, Su S, Mao Q, Chou KC. pLoc-mGpos: Incorporate Key Gene Ontology Information into General PseAAC for Predicting Subcellular Localization of Gram-Positive Bacterial Proteins. ACTA ACUST UNITED AC 2017. [DOI: 10.4236/ns.2017.99032] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
64
|
Cheng X, Xiao X, Chou KC. pLoc-mPlant: predict subcellular localization of multi-location plant proteins by incorporating the optimal GO information into general PseAAC. MOLECULAR BIOSYSTEMS 2017; 13:1722-1727. [DOI: 10.1039/c7mb00267j] [Citation(s) in RCA: 172] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
One of the fundamental goals in cellular biochemistry is to identify the functions of proteins in the context of compartments that organize them in the cellular environment.
Collapse
Affiliation(s)
- Xiang Cheng
- Computer Department
- Jingdezhen Ceramic Institute
- Jingdezhen
- China
| | - Xuan Xiao
- Computer Department
- Jingdezhen Ceramic Institute
- Jingdezhen
- China
- The Gordon Life Science Institute
| | - Kuo-Chen Chou
- The Gordon Life Science Institute
- Boston
- USA
- Center for Informational Biology
- University of Electronic Science and Technology of China
| |
Collapse
|
65
|
Xu S, Yang X, Yu H, Yu DJ, Yang J, Tsang EC. Multi-label learning with label-specific feature reduction. Knowl Based Syst 2016. [DOI: 10.1016/j.knosys.2016.04.012] [Citation(s) in RCA: 102] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
66
|
Biot E, Crowell E, Burguet J, Höfte H, Vernhettes S, Andrey P. Strategy and software for the statistical spatial analysis of 3D intracellular distributions. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2016; 87:230-242. [PMID: 27121260 DOI: 10.1111/tpj.13189] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/12/2015] [Revised: 04/05/2016] [Accepted: 04/07/2016] [Indexed: 06/05/2023]
Abstract
The localization of proteins in specific domains or compartments in the 3D cellular space is essential for many fundamental processes in eukaryotic cells. Deciphering spatial organization principles within cells is a challenging task, in particular because of the large morphological variations between individual cells. We present here an approach for normalizing variations in cell morphology and for statistically analyzing spatial distributions of intracellular compartments from collections of 3D images. The method relies on the processing and analysis of 3D geometrical models that are generated from image stacks and that are used to build representations at progressively increasing levels of integration, ultimately revealing statistical significant traits of spatial distributions. To make this methodology widely available to end-users, we implemented our algorithmic pipeline into a user-friendly, multi-platform, and freely available software. To validate our approach, we generated 3D statistical maps of endomembrane compartments at subcellular resolution within an average epidermal root cell from collections of image stacks. This revealed unsuspected polar distribution patterns of organelles that were not detectable in individual images. By reversing the classical 'measure-then-average' paradigm, one major benefit of the proposed strategy is the production and display of statistical 3D representations of spatial organizations, thus fully preserving the spatial dimension of image data and at the same time allowing their integration over individual observations. The approach and software are generic and should be of general interest for experimental and modeling studies of spatial organizations at multiple scales (subcellular, cellular, tissular) in biological systems.
Collapse
Affiliation(s)
- Eric Biot
- Institut Jean-Pierre Bourgin, INRA, AgroParisTech, CNRS, Université Paris-Saclay, RD10, 78000, Versailles, France
| | - Elizabeth Crowell
- Institut Jean-Pierre Bourgin, INRA, AgroParisTech, CNRS, Université Paris-Saclay, RD10, 78000, Versailles, France
| | - Jasmine Burguet
- Institut Jean-Pierre Bourgin, INRA, AgroParisTech, CNRS, Université Paris-Saclay, RD10, 78000, Versailles, France
- INRA, Neurobiologie de l'Olfaction, UR1197, F-78350, Jouy-en-Josas, France
| | - Herman Höfte
- Institut Jean-Pierre Bourgin, INRA, AgroParisTech, CNRS, Université Paris-Saclay, RD10, 78000, Versailles, France
| | - Samantha Vernhettes
- Institut Jean-Pierre Bourgin, INRA, AgroParisTech, CNRS, Université Paris-Saclay, RD10, 78000, Versailles, France
| | - Philippe Andrey
- Institut Jean-Pierre Bourgin, INRA, AgroParisTech, CNRS, Université Paris-Saclay, RD10, 78000, Versailles, France
- INRA, Neurobiologie de l'Olfaction, UR1197, F-78350, Jouy-en-Josas, France
- Sorbonne Universités, UPMC Univ Paris 06, UFR927, F-75005, Paris, France
| |
Collapse
|
67
|
Classifying Multifunctional Enzymes by Incorporating Three Different Models into Chou’s General Pseudo Amino Acid Composition. J Membr Biol 2016; 249:551-7. [DOI: 10.1007/s00232-016-9904-3] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2015] [Accepted: 04/11/2016] [Indexed: 10/21/2022]
|
68
|
Xu YY, Yang F, Shen HB. Incorporating organelle correlations into semi-supervised learning for protein subcellular localization prediction. Bioinformatics 2016; 32:2184-92. [DOI: 10.1093/bioinformatics/btw219] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2015] [Accepted: 04/18/2016] [Indexed: 01/08/2023] Open
|
69
|
Roybal KT, Buck TE, Ruan X, Cho BH, Clark DJ, Ambler R, Tunbridge HM, Zhang J, Verkade P, Wülfing C, Murphy RF. Computational spatiotemporal analysis identifies WAVE2 and cofilin as joint regulators of costimulation-mediated T cell actin dynamics. Sci Signal 2016; 9:rs3. [PMID: 27095595 DOI: 10.1126/scisignal.aad4149] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Fluorescence microscopy is one of the most important tools in cell biology research because it provides spatial and temporal information to investigate regulatory systems inside cells. This technique can generate data in the form of signal intensities at thousands of positions resolved inside individual live cells. However, given extensive cell-to-cell variation, these data cannot be readily assembled into three- or four-dimensional maps of protein concentration that can be compared across different cells and conditions. We have developed a method to enable comparison of imaging data from many cells and applied it to investigate actin dynamics in T cell activation. Antigen recognition in T cells by the T cell receptor (TCR) is amplified by engagement of the costimulatory receptor CD28. We imaged actin and eight core actin regulators to generate over a thousand movies of T cells under conditions in which CD28 was either engaged or blocked in the context of a strong TCR signal. Our computational analysis showed that the primary effect of costimulation blockade was to decrease recruitment of the activator of actin nucleation WAVE2 (Wiskott-Aldrich syndrome protein family verprolin-homologous protein 2) and the actin-severing protein cofilin to F-actin. Reconstitution of WAVE2 and cofilin activity restored the defect in actin signaling dynamics caused by costimulation blockade. Thus, we have developed and validated an approach to quantify protein distributions in time and space for the analysis of complex regulatory systems.
Collapse
Affiliation(s)
- Kole T Roybal
- School of Cellular and Molecular Medicine, University of Bristol, Bristol BS8 1TD, UK. Department of Immunology, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Taráz E Buck
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Xiongtao Ruan
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Baek Hwan Cho
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Danielle J Clark
- School of Cellular and Molecular Medicine, University of Bristol, Bristol BS8 1TD, UK
| | - Rachel Ambler
- School of Cellular and Molecular Medicine, University of Bristol, Bristol BS8 1TD, UK
| | - Helen M Tunbridge
- School of Cellular and Molecular Medicine, University of Bristol, Bristol BS8 1TD, UK
| | - Jianwei Zhang
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Paul Verkade
- School of Biochemistry, University of Bristol, Bristol BS8 1TD, UK
| | - Christoph Wülfing
- School of Cellular and Molecular Medicine, University of Bristol, Bristol BS8 1TD, UK. Department of Immunology, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA. Department of Cell Biology, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA.
| | - Robert F Murphy
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA. Departments of Biological Sciences, Biomedical Engineering, and Machine Learning, Carnegie Mellon University, Pittsburgh, PA 15213, USA. Freiburg Institute for Advanced Studies and Faculty of Biology, Albert Ludwig University of Freiburg, Freiburg im Breisgau 79104, Baden-Württemberg, Germany.
| |
Collapse
|
70
|
Murphy RF. Building cell models and simulations from microscope images. Methods 2016; 96:33-39. [PMID: 26484733 PMCID: PMC4766043 DOI: 10.1016/j.ymeth.2015.10.011] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2015] [Revised: 10/15/2015] [Accepted: 10/16/2015] [Indexed: 01/13/2023] Open
Abstract
The use of fluorescence microscopy has undergone a major revolution over the past twenty years, both with the development of dramatic new technologies and with the widespread adoption of image analysis and machine learning methods. Many open source software tools provide the ability to use these methods in a wide range of studies, and many molecular and cellular phenotypes can now be automatically distinguished. This article presents the next major challenge in microscopy automation, the creation of accurate models of cell organization directly from images, and reviews the progress that has been made towards this challenge.
Collapse
Affiliation(s)
- Robert F Murphy
- Computational Biology Department, Center for Bioimage Informatics, and Departments of Biological Sciences, Biomedical Engineering and Machine Learning, Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, PA, USA; Freiburg Institute for Advanced Studies and Faculty of Biology, Albert Ludwig University of Freiburg, Germany.
| |
Collapse
|
71
|
Abstract
Data visualization is a fundamental aspect of science. In the context of microscopy-based studies, visualization typically involves presentation of the images themselves. However, data visualization is challenging when microscopy experiments entail imaging of millions of cells, and complex cellular phenotypes are quantified in a high-content manner. Most well-established visualization tools are inappropriate for displaying high-content data, which has driven the development of new visualization methodology. In this review, we discuss how data has been visualized in both classical and high-content microscopy studies; as well as the advantages, and disadvantages, of different visualization methods.
Collapse
Affiliation(s)
- Heba Z Sailem
- a Department of Engineering Science , University of Oxford , Oxford , UK
| | - Sam Cooper
- b Department of Computational Systems Medicine , Imperial College, South Kensington Campus , London , UK , and.,c Division of Cancer Biology , Chester Beatty Laboratories, Institute of Cancer Research , London , UK
| | - Chris Bakal
- c Division of Cancer Biology , Chester Beatty Laboratories, Institute of Cancer Research , London , UK
| |
Collapse
|
72
|
CP-CHARM: segmentation-free image classification made accessible. BMC Bioinformatics 2016; 17:51. [PMID: 26817459 PMCID: PMC4729047 DOI: 10.1186/s12859-016-0895-y] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2015] [Accepted: 01/18/2016] [Indexed: 11/10/2022] Open
Abstract
Background Automated classification using machine learning often relies on features derived from segmenting individual objects, which can be difficult to automate. WND-CHARM is a previously developed classification algorithm in which features are computed on the whole image, thereby avoiding the need for segmentation. The algorithm obtained encouraging results but requires considerable computational expertise to execute. Furthermore, some benchmark sets have been shown to be subject to confounding artifacts that overestimate classification accuracy. Results We developed CP-CHARM, a user-friendly image-based classification algorithm inspired by WND-CHARM in (i) its ability to capture a wide variety of morphological aspects of the image, and (ii) the absence of requirement for segmentation. In order to make such an image-based classification method easily accessible to the biological research community, CP-CHARM relies on the widely-used open-source image analysis software CellProfiler for feature extraction. To validate our method, we reproduced WND-CHARM’s results and ensured that CP-CHARM obtained comparable performance. We then successfully applied our approach on cell-based assay data and on tissue images. We designed these new training and test sets to reduce the effect of batch-related artifacts. Conclusions The proposed method preserves the strengths of WND-CHARM - it extracts a wide variety of morphological features directly on whole images thereby avoiding the need for cell segmentation, but additionally, it makes the methods easily accessible for researchers without computational expertise by implementing them as a CellProfiler pipeline. It has been demonstrated to perform well on a wide range of bioimage classification problems, including on new datasets that have been carefully selected and annotated to minimize batch effects. This provides for the first time a realistic and reliable assessment of the whole image classification strategy. Electronic supplementary material The online version of this article (doi:10.1186/s12859-016-0895-y) contains supplementary material, which is available to authorized users.
Collapse
|
73
|
Wang F, Wang C, Yan Y, Jia H, Guo X. Overexpression of Cotton GhMPK11 Decreases Disease Resistance through the Gibberellin Signaling Pathway in Transgenic Nicotiana benthamiana. FRONTIERS IN PLANT SCIENCE 2016; 7:689. [PMID: 27242882 PMCID: PMC4876126 DOI: 10.3389/fpls.2016.00689] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/01/2016] [Accepted: 05/05/2016] [Indexed: 05/21/2023]
Abstract
Many changes in development, growth, hormone activity and environmental stimuli responses are mediated by mitogen-activated protein kinase (MAPK) cascades. However, in plants, studies on MAPKs have mainly focused on MPK3, MPK4 and MPK6. Here, a novel group B MAPK gene, GhMPK11, was isolated from cotton (Gossypium hirsutum L.) and characterized. Both promoter and expression pattern analyses revealed that GhMPK11 is involved in defense responses and signaling pathways. GhMPK11 overexpression in Nicotiana benthamiana plants could increase gibberellin 3 (GA3) content through the regulation of GA-related genes. Interestingly, either GhMPK11 overexpression or exogenous GA3 treatment in N. benthamiana plants could enhance the susceptibility of these plants to the infectious pathogens Ralstonia solanacearum and Rhizoctonia solani. Moreover, reactive oxygen species (ROS) accumulation was increased after pathogen infiltration due to the increased expression of ROS-related gene respiratory burst oxidative homologs (RbohB) and the decreased expression or activity of ROS detoxification enzymes regulated by GA3, such as superoxide dismutases (SODs), peroxidases (PODs), catalase (CAT) and glutathione S-transferase (GST). Taken together, these results suggest that GhMPK11 overexpression could enhance the susceptibility of tobacco to pathogen infection through the GA3 signaling pathway via down-regulation of ROS detoxification enzymes.
Collapse
|
74
|
Yang Q, Zou HY, Zhang Y, Tang LJ, Shen GL, Jiang JH, Yu RQ. Multiplex protein pattern unmixing using a non-linear variable-weighted support vector machine as optimized by a particle swarm optimization algorithm. Talanta 2016; 147:609-14. [DOI: 10.1016/j.talanta.2015.10.047] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2015] [Revised: 10/14/2015] [Accepted: 10/18/2015] [Indexed: 11/30/2022]
|
75
|
Gordonov S, Hwang MK, Wells A, Gertler FB, Lauffenburger DA, Bathe M. Time series modeling of live-cell shape dynamics for image-based phenotypic profiling. Integr Biol (Camb) 2016; 8:73-90. [PMID: 26658688 PMCID: PMC5058786 DOI: 10.1039/c5ib00283d] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Live-cell imaging can be used to capture spatio-temporal aspects of cellular responses that are not accessible to fixed-cell imaging. As the use of live-cell imaging continues to increase, new computational procedures are needed to characterize and classify the temporal dynamics of individual cells. For this purpose, here we present the general experimental-computational framework SAPHIRE (Stochastic Annotation of Phenotypic Individual-cell Responses) to characterize phenotypic cellular responses from time series imaging datasets. Hidden Markov modeling is used to infer and annotate morphological state and state-switching properties from image-derived cell shape measurements. Time series modeling is performed on each cell individually, making the approach broadly useful for analyzing asynchronous cell populations. Two-color fluorescent cells simultaneously expressing actin and nuclear reporters enabled us to profile temporal changes in cell shape following pharmacological inhibition of cytoskeleton-regulatory signaling pathways. Results are compared with existing approaches conventionally applied to fixed-cell imaging datasets, and indicate that time series modeling captures heterogeneous dynamic cellular responses that can improve drug classification and offer additional important insight into mechanisms of drug action. The software is available at http://saphire-hcs.org.
Collapse
Affiliation(s)
- Simon Gordonov
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
- The David H. Koch Institute for Integrative Cancer Research, Cambridge, MA, USA
| | - Mun Kyung Hwang
- The David H. Koch Institute for Integrative Cancer Research, Cambridge, MA, USA
| | - Alan Wells
- Department of Pathology, University of Pittsburgh, and Pittsburgh VA Health System, Pittsburgh, PA, USA
| | - Frank B. Gertler
- The David H. Koch Institute for Integrative Cancer Research, Cambridge, MA, USA
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Douglas A. Lauffenburger
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
- The David H. Koch Institute for Integrative Cancer Research, Cambridge, MA, USA
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Mark Bathe
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
| |
Collapse
|
76
|
Schoenauer Sebag A, Plancade S, Raulet-Tomkiewicz C, Barouki R, Vert JP, Walter T. A generic methodological framework for studying single cell motility in high-throughput time-lapse data. Bioinformatics 2015; 31:i320-8. [PMID: 26072499 PMCID: PMC4765885 DOI: 10.1093/bioinformatics/btv225] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Motivation: Motility is a fundamental cellular attribute, which plays a major part in processes ranging from embryonic development to metastasis. Traditionally, single cell motility is often studied by live cell imaging. Yet, such studies were so far limited to low throughput. To systematically study cell motility at a large scale, we need robust methods to quantify cell trajectories in live cell imaging data. Results: The primary contribution of this article is to present Motility study Integrated Workflow (MotIW), a generic workflow for the study of single cell motility in high-throughput time-lapse screening data. It is composed of cell tracking, cell trajectory mapping to an original feature space and hit detection according to a new statistical procedure. We show that this workflow is scalable and demonstrates its power by application to simulated data, as well as large-scale live cell imaging data. This application enables the identification of an ontology of cell motility patterns in a fully unsupervised manner. Availability and implementation: Python code and examples are available online (http://cbio.ensmp.fr/∼aschoenauer/motiw.html) Contact:thomas.walter@mines-paristech.fr Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Alice Schoenauer Sebag
- MINES ParisTech, PSL-Research University, CBIO-Centre for Computational Biology, Fontainebleau, Institut Curie, Paris, INSERM U900, Paris, Université Paris Descartes, Paris, INSERM UMR-S 1124, Paris, Agro ParisTech, Paris and Mathématiques et Informatique Appliquées, INRA, Jouy-en-Josas, France MINES ParisTech, PSL-Research University, CBIO-Centre for Computational Biology, Fontainebleau, Institut Curie, Paris, INSERM U900, Paris, Université Paris Descartes, Paris, INSERM UMR-S 1124, Paris, Agro ParisTech, Paris and Mathématiques et Informatique Appliquées, INRA, Jouy-en-Josas, France MINES ParisTech, PSL-Research University, CBIO-Centre for Computational Biology, Fontainebleau, Institut Curie, Paris, INSERM U900, Paris, Université Paris Descartes, Paris, INSERM UMR-S 1124, Paris, Agro ParisTech, Paris and Mathématiques et Informatique Appliquées, INRA, Jouy-en-Josas, France MINES ParisTech, PSL-Research University, CBIO-Centre for Computational Biology, Fontainebleau, Institut Curie, Paris, INSERM U900, Paris, Université Paris Descartes, Paris, INSERM UMR-S 1124, Paris, Agro ParisTech, Paris and Mathématiques et Informatique Appliquées, INRA, Jouy-en-Josas, France MINES ParisTech, PSL-Research University, CBIO-Centre for Computational Biology, Fontainebleau, Institut Curie, Paris, INSERM U900, Paris, Université Paris Descartes, Paris, INSERM UMR-S 1124, Paris, Agro ParisTech, Paris and Mathématiques et Informatique Appliquées, INRA, Jouy-en-Josas, France MINES ParisTech, PSL-Research University, CBIO-Centre for Computational Biology, Fontainebleau, Institut Curie, Paris, INSERM U900, Paris, Université Paris Descartes, Paris, INSERM UMR-S 1124, Paris, Agro ParisTech, Paris and Mathématiques et Informatique Appliquées, INRA, Jouy-en-Josas, France
| | - Sandra Plancade
- MINES ParisTech, PSL-Research University, CBIO-Centre for Computational Biology, Fontainebleau, Institut Curie, Paris, INSERM U900, Paris, Université Paris Descartes, Paris, INSERM UMR-S 1124, Paris, Agro ParisTech, Paris and Mathématiques et Informatique Appliquées, INRA, Jouy-en-Josas, France
| | - Céline Raulet-Tomkiewicz
- MINES ParisTech, PSL-Research University, CBIO-Centre for Computational Biology, Fontainebleau, Institut Curie, Paris, INSERM U900, Paris, Université Paris Descartes, Paris, INSERM UMR-S 1124, Paris, Agro ParisTech, Paris and Mathématiques et Informatique Appliquées, INRA, Jouy-en-Josas, France MINES ParisTech, PSL-Research University, CBIO-Centre for Computational Biology, Fontainebleau, Institut Curie, Paris, INSERM U900, Paris, Université Paris Descartes, Paris, INSERM UMR-S 1124, Paris, Agro ParisTech, Paris and Mathématiques et Informatique Appliquées, INRA, Jouy-en-Josas, France
| | - Robert Barouki
- MINES ParisTech, PSL-Research University, CBIO-Centre for Computational Biology, Fontainebleau, Institut Curie, Paris, INSERM U900, Paris, Université Paris Descartes, Paris, INSERM UMR-S 1124, Paris, Agro ParisTech, Paris and Mathématiques et Informatique Appliquées, INRA, Jouy-en-Josas, France MINES ParisTech, PSL-Research University, CBIO-Centre for Computational Biology, Fontainebleau, Institut Curie, Paris, INSERM U900, Paris, Université Paris Descartes, Paris, INSERM UMR-S 1124, Paris, Agro ParisTech, Paris and Mathématiques et Informatique Appliquées, INRA, Jouy-en-Josas, France
| | - Jean-Philippe Vert
- MINES ParisTech, PSL-Research University, CBIO-Centre for Computational Biology, Fontainebleau, Institut Curie, Paris, INSERM U900, Paris, Université Paris Descartes, Paris, INSERM UMR-S 1124, Paris, Agro ParisTech, Paris and Mathématiques et Informatique Appliquées, INRA, Jouy-en-Josas, France MINES ParisTech, PSL-Research University, CBIO-Centre for Computational Biology, Fontainebleau, Institut Curie, Paris, INSERM U900, Paris, Université Paris Descartes, Paris, INSERM UMR-S 1124, Paris, Agro ParisTech, Paris and Mathématiques et Informatique Appliquées, INRA, Jouy-en-Josas, France MINES ParisTech, PSL-Research University, CBIO-Centre for Computational Biology, Fontainebleau, Institut Curie, Paris, INSERM U900, Paris, Université Paris Descartes, Paris, INSERM UMR-S 1124, Paris, Agro ParisTech, Paris and Mathématiques et Informatique Appliquées, INRA, Jouy-en-Josas, France
| | - Thomas Walter
- MINES ParisTech, PSL-Research University, CBIO-Centre for Computational Biology, Fontainebleau, Institut Curie, Paris, INSERM U900, Paris, Université Paris Descartes, Paris, INSERM UMR-S 1124, Paris, Agro ParisTech, Paris and Mathématiques et Informatique Appliquées, INRA, Jouy-en-Josas, France MINES ParisTech, PSL-Research University, CBIO-Centre for Computational Biology, Fontainebleau, Institut Curie, Paris, INSERM U900, Paris, Université Paris Descartes, Paris, INSERM UMR-S 1124, Paris, Agro ParisTech, Paris and Mathématiques et Informatique Appliquées, INRA, Jouy-en-Josas, France MINES ParisTech, PSL-Research University, CBIO-Centre for Computational Biology, Fontainebleau, Institut Curie, Paris, INSERM U900, Paris, Université Paris Descartes, Paris, INSERM UMR-S 1124, Paris, Agro ParisTech, Paris and Mathématiques et Informatique Appliquées, INRA, Jouy-en-Josas, France
| |
Collapse
|
77
|
Shao W, Liu M, Zhang D. Human cell structure-driven model construction for predicting protein subcellular location from biological images. Bioinformatics 2015; 32:114-21. [PMID: 26363175 DOI: 10.1093/bioinformatics/btv521] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2015] [Accepted: 08/31/2015] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION The systematic study of subcellular location pattern is very important for fully characterizing the human proteome. Nowadays, with the great advances in automated microscopic imaging, accurate bioimage-based classification methods to predict protein subcellular locations are highly desired. All existing models were constructed on the independent parallel hypothesis, where the cellular component classes are positioned independently in a multi-class classification engine. The important structural information of cellular compartments is missed. To deal with this problem for developing more accurate models, we proposed a novel cell structure-driven classifier construction approach (SC-PSorter) by employing the prior biological structural information in the learning model. Specifically, the structural relationship among the cellular components is reflected by a new codeword matrix under the error correcting output coding framework. Then, we construct multiple SC-PSorter-based classifiers corresponding to the columns of the error correcting output coding codeword matrix using a multi-kernel support vector machine classification approach. Finally, we perform the classifier ensemble by combining those multiple SC-PSorter-based classifiers via majority voting. RESULTS We evaluate our method on a collection of 1636 immunohistochemistry images from the Human Protein Atlas database. The experimental results show that our method achieves an overall accuracy of 89.0%, which is 6.4% higher than the state-of-the-art method. AVAILABILITY AND IMPLEMENTATION The dataset and code can be downloaded from https://github.com/shaoweinuaa/. CONTACT dqzhang@nuaa.edu.cn SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Wei Shao
- School of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China
| | - Mingxia Liu
- School of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China
| | - Daoqiang Zhang
- School of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China
| |
Collapse
|
78
|
Štěpka K, Matula P, Matula P, Wörz S, Rohr K, Kozubek M. Performance and sensitivity evaluation of 3D spot detection methods in confocal microscopy. Cytometry A 2015; 87:759-72. [PMID: 26033916 DOI: 10.1002/cyto.a.22692] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2014] [Revised: 03/06/2015] [Accepted: 04/30/2015] [Indexed: 12/11/2022]
Abstract
Reliable 3D detection of diffraction-limited spots in fluorescence microscopy images is an important task in subcellular observation. Generally, fluorescence microscopy images are heavily degraded by noise and non-specifically stained background, making reliable detection a challenging task. In this work, we have studied the performance and parameter sensitivity of eight recent methods for 3D spot detection. The study is based on both 3D synthetic image data and 3D real confocal microscopy images. The synthetic images were generated using a simulator modeling the complete imaging setup, including the optical path as well as the image acquisition process. We studied the detection performance and parameter sensitivity under different noise levels and under the influence of uneven background signal. To evaluate the parameter sensitivity, we propose a novel measure based on the gradient magnitude of the F1 score. We measured the success rate of the individual methods for different types of the image data and found that the type of image degradation is an important factor. Using the F1 score and the newly proposed sensitivity measure, we found that the parameter sensitivity is not necessarily proportional to the success rate of a method. This also provided an explanation why the best performing method for synthetic data was outperformed by other methods when applied to the real microscopy images. On the basis of the results obtained, we conclude with the recommendation of the HDome method for data with relatively low variations in quality, or the Sorokin method for image sets in which the quality varies more. We also provide alternative recommendations for high-quality images, and for situations in which detailed parameter tuning might be deemed expensive.
Collapse
Affiliation(s)
- Karel Štěpka
- Centre for Biomedical Image Analysis, Faculty of Informatics, Masaryk University, Brno, Czech Republic
| | - Pavel Matula
- Centre for Biomedical Image Analysis, Faculty of Informatics, Masaryk University, Brno, Czech Republic
| | - Petr Matula
- Centre for Biomedical Image Analysis, Faculty of Informatics, Masaryk University, Brno, Czech Republic
| | - Stefan Wörz
- IPMB and BIOQUANT, Department of Bioinformatics and Functional Genomics, and DKFZ, University of Heidelberg, Heidelberg, Germany
| | - Karl Rohr
- IPMB and BIOQUANT, Department of Bioinformatics and Functional Genomics, and DKFZ, University of Heidelberg, Heidelberg, Germany
| | - Michal Kozubek
- Centre for Biomedical Image Analysis, Faculty of Informatics, Masaryk University, Brno, Czech Republic
| |
Collapse
|
79
|
Gertych A, Ma Z, Tajbakhsh J, Velásquez-Vacca A, Knudsen BS. Rapid 3-D delineation of cell nuclei for high-content screening platforms. Comput Biol Med 2015; 69:328-38. [PMID: 25982066 DOI: 10.1016/j.compbiomed.2015.04.025] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2014] [Revised: 04/08/2015] [Accepted: 04/16/2015] [Indexed: 12/17/2022]
Abstract
High-resolution three-dimensional (3-D) microscopy combined with multiplexing of fluorescent labels allows high-content analysis of large numbers of cell nuclei. The full automation of 3-D screening platforms necessitates image processing algorithms that can accurately and robustly delineate nuclei in images with little to no human intervention. Imaging-based high-content screening was originally developed as a powerful tool for drug discovery. However, cell confluency, complexity of nuclear staining as well as poor contrast between nuclei and background result in slow and unreliable 3-D image processing and therefore negatively affect the performance of studying a drug response. Here, we propose a new method, 3D-RSD, to delineate nuclei by means of 3-D radial symmetries and test it on high-resolution image data of human cancer cells treated by drugs. The nuclei detection performance was evaluated by means of manually generated ground truth from 2351 nuclei (27 confocal stacks). When compared to three other nuclei segmentation methods, 3D-RSD possessed a better true positive rate of 83.3% and F-score of 0.895±0.045 (p-value=0.047). Altogether, 3D-RSD is a method with a very good overall segmentation performance. Furthermore, implementation of radial symmetries offers good processing speed, and makes 3D-RSD less sensitive to staining patterns. In particular, the 3D-RSD method performs well in cell lines, which are often used in imaging-based HCS platforms and are afflicted by nuclear crowding and overlaps that hinder feature extraction.
Collapse
Affiliation(s)
- Arkadiusz Gertych
- Department of Surgery, Cedars-Sinai Medical Center, Los Angeles, CA, USA; Department of Pathology and Laboratory Medicine, Cedars-Sinai Medical Center, Los Angeles, CA, USA.
| | - Zhaoxuan Ma
- Department of Biomedical Sciences, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - Jian Tajbakhsh
- Department of Surgery, Cedars-Sinai Medical Center, Los Angeles, CA, USA; Samuel Oschin Comprehensive Cancer Institute, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | | | - Beatrice S Knudsen
- Department of Pathology and Laboratory Medicine, Cedars-Sinai Medical Center, Los Angeles, CA, USA; Department of Biomedical Sciences, Cedars-Sinai Medical Center, Los Angeles, CA, USA; Samuel Oschin Comprehensive Cancer Institute, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| |
Collapse
|
80
|
Finkbeiner S, Frumkin M, Kassner PD. Cell-based screening: extracting meaning from complex data. Neuron 2015; 86:160-74. [PMID: 25856492 PMCID: PMC4457442 DOI: 10.1016/j.neuron.2015.02.023] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2014] [Revised: 09/23/2014] [Accepted: 01/22/2015] [Indexed: 01/23/2023]
Abstract
Unbiased discovery approaches have the potential to uncover neurobiological insights into CNS disease and lead to the development of therapies. Here, we review lessons learned from imaging-based screening approaches and recent advances in these areas, including powerful new computational tools to synthesize complex data into more useful knowledge that can reliably guide future research and development.
Collapse
Affiliation(s)
- Steven Finkbeiner
- Director of the Taube/Koret Center for Neurodegenerative Disease and the Hellman Family Foundation Program in Alzheimer's Disease Research, Gladstone Institute of Neurological Disease, San Francisco, CA 94158, USA; Departments of Neurology and Physiology, University of California, San Francisco, San Francisco, CA 94143, USA.
| | - Michael Frumkin
- Director of Engineering, Research, Google, Inc., 1600 Amphitheatre Parkway, Mountain View, CA 94043, USA
| | - Paul D Kassner
- Director of Research, Amgen, Inc., 1120 Veterans Boulevard South, San Francisco, CA 94080, USA
| |
Collapse
|
81
|
Krause C, Ens K, Fechner K, Voigt J, Fraune J, Rohwäder E, Hahn M, Danckwardt M, Feirer C, Barth E, Martinetz T, Stöcker W. EUROPattern Suite technology for computer-aided immunofluorescence microscopy in autoantibody diagnostics. Lupus 2015; 24:516-29. [DOI: 10.1177/0961203314559635] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Antinuclear autoantibodies (ANA) are highly informative biomarkers in autoimmune diagnostics. The increasing demand for effective test systems, however, has led to the development of a confusingly large variety of different platforms. One of them, the indirect immunofluorescence (IIF), is regarded as the common gold standard for ANA screening, as described in a position statement by the American College of Rheumatology in 2009. Technological solutions have been developed aimed at standardization and automation of IIF to overcome methodological limitations and subjective bias in IIF interpretation. In this review, we present the EUROPattern Suite, a system for computer-aided immunofluorescence microscopy (CAIFM) including automated acquisition of digital images and evaluation of IIF results. The system was originally designed for ANA diagnostics on human epithelial cells, but its applications have been extended with the latest system update version 1.5 to the analysis of antineutrophil cytoplasmic antibodies (ANCA) and anti-dsDNA antibodies.
Collapse
Affiliation(s)
- C Krause
- Institute for Experimental Immunology, Euroimmun AG, Lübeck, Germany
| | - K Ens
- Institute for Experimental Immunology, Euroimmun AG, Lübeck, Germany
| | - K Fechner
- Institute for Experimental Immunology, Euroimmun AG, Lübeck, Germany
| | - J Voigt
- Institute for Experimental Immunology, Euroimmun AG, Lübeck, Germany
| | - J Fraune
- Institute for Experimental Immunology, Euroimmun AG, Lübeck, Germany
| | - E Rohwäder
- Institute for Experimental Immunology, Euroimmun AG, Lübeck, Germany
| | - M Hahn
- Institute for Experimental Immunology, Euroimmun AG, Lübeck, Germany
| | - M Danckwardt
- Institute for Experimental Immunology, Euroimmun AG, Lübeck, Germany
| | - C Feirer
- Institute for Experimental Immunology, Euroimmun AG, Lübeck, Germany
| | - E Barth
- Institute for Neuro- and Bioinformatics, University of Lübeck, Lübeck, Germany
| | - T Martinetz
- Institute for Neuro- and Bioinformatics, University of Lübeck, Lübeck, Germany
| | - W Stöcker
- Institute for Experimental Immunology, Euroimmun AG, Lübeck, Germany
| |
Collapse
|
82
|
Coelho LP, Pato C, Friães A, Neumann A, von Köckritz-Blickwede M, Ramirez M, Carriço JA. Automatic determination of NET (neutrophil extracellular traps) coverage in fluorescent microscopy images. Bioinformatics 2015; 31:2364-70. [PMID: 25792554 DOI: 10.1093/bioinformatics/btv156] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2014] [Accepted: 02/16/2015] [Indexed: 01/07/2023] Open
Abstract
MOTIVATION Neutrophil extracellular traps (NETs) are believed to be essential in controlling several bacterial pathogens. Quantification of NETs in vitro is an important tool in studies aiming to clarify the biological and chemical factors contributing to NET production, stabilization and degradation. This estimation can be performed on the basis of fluorescent microscopy images using appropriate labelings. In this context, it is desirable to automate the analysis to eliminate both the tedious process of manual annotation and possible operator-specific biases. RESULTS We propose a framework for the automated determination of NET content, based on visually annotated images which are used to train a supervised machine-learning method. We derive several methods in this framework. The best results are obtained by combining these into a single prediction. The overall Q(2) of the combined method is 93%. By having two experts label part of the image set, we were able to compare the performance of the algorithms to the human interoperator variability. We find that the two operators exhibited a very high correlation on their overall assessment of the NET coverage area in the images (R(2) is 97%), although there were consistent differences in labeling at pixel level (Q(2), which unlike R(2) does not correct for additive and multiplicative biases, was only 89%). AVAILABILITY AND IMPLEMENTATION Open source software (under the MIT license) is available at https://github.com/luispedro/Coelho2015_NetsDetermination for both reproducibility and application to new data.
Collapse
Affiliation(s)
- Luis Pedro Coelho
- Unidade de Biofísica e Expressão Genética, Instituto de Medicina Molecular and
| | - Catarina Pato
- Unidade de Biofísica e Expressão Genética, Instituto de Medicina Molecular and
| | - Ana Friães
- Unidade de Biofísica e Expressão Genética, Instituto de Medicina Molecular and
| | - Ariane Neumann
- Unidade de Biofísica e Expressão Genética, Instituto de Medicina Molecular and
| | | | - Mário Ramirez
- Unidade de Biofísica e Expressão Genética, Instituto de Medicina Molecular and
| | - João André Carriço
- Unidade de Biofísica e Expressão Genética, Instituto de Medicina Molecular and
| |
Collapse
|
83
|
Sailem HZ, Sero JE, Bakal C. Visualizing cellular imaging data using PhenoPlot. Nat Commun 2015; 6:5825. [PMID: 25569359 PMCID: PMC4354266 DOI: 10.1038/ncomms6825] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2014] [Accepted: 11/11/2014] [Indexed: 11/24/2022] Open
Abstract
Visualization is essential for data interpretation, hypothesis formulation and communication of results. However, there is a paucity of visualization methods for image-derived data sets generated by high-content analysis in which complex cellular phenotypes are described as high-dimensional vectors of features. Here we present a visualization tool, PhenoPlot, which represents quantitative high-content imaging data as easily interpretable glyphs, and we illustrate how PhenoPlot can be used to improve the exploration and interpretation of complex breast cancer cell phenotypes.
Collapse
Affiliation(s)
- Heba Z. Sailem
- Dynamical Cell Systems, Division of Cancer Biology, Institute of Cancer Research, 237 Fulham Road, London SW3 6JB, UK
| | - Julia E. Sero
- Dynamical Cell Systems, Division of Cancer Biology, Institute of Cancer Research, 237 Fulham Road, London SW3 6JB, UK
| | - Chris Bakal
- Dynamical Cell Systems, Division of Cancer Biology, Institute of Cancer Research, 237 Fulham Road, London SW3 6JB, UK
| |
Collapse
|
84
|
Chen J, Tang YY, Chen CLP, Fang B, Lin Y, Shang Z. Multi-Label Learning With Fuzzy Hypergraph Regularization for Protein Subcellular Location Prediction. IEEE Trans Nanobioscience 2014; 13:438-47. [DOI: 10.1109/tnb.2014.2341111] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
85
|
A new multi-label classifier in identifying the functional types of human membrane proteins. J Membr Biol 2014; 248:179-86. [PMID: 25433431 DOI: 10.1007/s00232-014-9755-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2014] [Accepted: 11/11/2014] [Indexed: 10/24/2022]
Abstract
Membrane proteins were found to be involved in various cellular processes performing various important functions, which are mainly associated to their type. Given a membrane protein sequence, how can we identify its type(s)? Particularly, how can we deal with the multi-type problem since one membrane protein may simultaneously belong to two or more different types? To address these problems, which are obviously very important to both basic research and drug development, a new multi-label classifier was developed based on pseudo amino acid composition with multi-label k-nearest neighbor algorithm. The success rate achieved by the new predictor on the benchmark dataset by jackknife test is 73.94%, indicating that the method is promising and the predictor may become a very useful high-throughput tool, or at least play a complementary role to the existing predictors in identifying functional types of membrane proteins.
Collapse
|
86
|
Steininger RJ, Rajaram S, Girard L, Minna JD, Wu LF, Altschuler SJ. On comparing heterogeneity across biomarkers. Cytometry A 2014; 87:558-67. [PMID: 25425168 DOI: 10.1002/cyto.a.22599] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2014] [Revised: 10/30/2014] [Accepted: 11/06/2014] [Indexed: 01/28/2023]
Abstract
Microscopy reveals complex patterns of cellular heterogeneity that can be biologically informative. However, a limitation of microscopy is that only a small number of biomarkers can typically be monitored simultaneously. Thus, a natural question is whether additional biomarkers provide a deeper characterization of the distribution of cellular states in a population. How much information about a cell's phenotypic state in one biomarker is gained by knowing its state in another biomarker? Here, we describe a framework for comparing phenotypic states across biomarkers. Our approach overcomes the current limitation of microscopy by not requiring costaining biomarkers on the same cells; instead, we require staining of biomarkers (possibly separately) on a common collection of phenotypically diverse cell lines. We evaluate our approach on two image datasets: 33 oncogenically diverse lung cancer cell lines stained with 7 biomarkers, and 49 less diverse subclones of one lung cancer cell line stained with 12 biomarkers. We first validate our method by comparing it to the "gold standard" of costaining. We then apply our approach to all pairs of biomarkers and use it to identify biomarkers that yield similar patterns of heterogeneity. The results presented in this work suggest that many biomarkers provide redundant information about heterogeneity. Thus, our approach provides a practical guide for selecting independently informative biomarkers and, more generally, will yield insights into both the connectivity of biological networks and the complexity of the state space of biological systems.
Collapse
Affiliation(s)
- Robert J Steininger
- Green Center for Systems Biology, University of Texas Southwestern Medical Center, Dallas, Texas
| | - Satwik Rajaram
- Department of Pharmaceutical Chemistry, University of California at San Francisco, San Francisco, California
| | - Luc Girard
- Hamon Center for Therapeutic Oncology Research and Simmons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, Texas
| | - John D Minna
- Hamon Center for Therapeutic Oncology Research and Simmons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, Texas.,Departments of Pharmacology and Internal Medicine, University of Texas Southwestern Medical Center, Dallas, Texas
| | - Lani F Wu
- Green Center for Systems Biology, University of Texas Southwestern Medical Center, Dallas, Texas.,Department of Pharmaceutical Chemistry, University of California at San Francisco, San Francisco, California
| | - Steven J Altschuler
- Green Center for Systems Biology, University of Texas Southwestern Medical Center, Dallas, Texas.,Department of Pharmaceutical Chemistry, University of California at San Francisco, San Francisco, California
| |
Collapse
|
87
|
Xu YY, Yang F, Zhang Y, Shen HB. Bioimaging-based detection of mislocalized proteins in human cancers by semi-supervised learning. ACTA ACUST UNITED AC 2014; 31:1111-9. [PMID: 25414362 DOI: 10.1093/bioinformatics/btu772] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2014] [Accepted: 11/15/2014] [Indexed: 12/30/2022]
Abstract
MOTIVATION There is a long-term interest in the challenging task of finding translocated and mislocated cancer biomarker proteins. Bioimages of subcellular protein distribution are new data sources which have attracted much attention in recent years because of their intuitive and detailed descriptions of protein distribution. However, automated methods in large-scale biomarker screening suffer significantly from the lack of subcellular location annotations for bioimages from cancer tissues. The transfer prediction idea of applying models trained on normal tissue proteins to predict the subcellular locations of cancerous ones is arbitrary because the protein distribution patterns may differ in normal and cancerous states. RESULTS We developed a new semi-supervised protocol that can use unlabeled cancer protein data in model construction by an iterative and incremental training strategy. Our approach enables us to selectively use the low-quality images in normal states to expand the training sample space and provides a general way for dealing with the small size of annotated images used together with large unannotated ones. Experiments demonstrate that the new semi-supervised protocol can result in improved accuracy and sensitivity of subcellular location difference detection. AVAILABILITY AND IMPLEMENTATION The data and code are available at: www.csbio.sjtu.edu.cn/bioinf/SemiBiomarker/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Ying-Ying Xu
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China and Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Fan Yang
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China and Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Yang Zhang
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China and Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Hong-Bin Shen
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China and Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China and Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
88
|
Xue J, Niu YF, Huang T, Yang WD, Liu JS, Li HY. Genetic improvement of the microalga Phaeodactylum tricornutum for boosting neutral lipid accumulation. Metab Eng 2014; 27:1-9. [PMID: 25447640 DOI: 10.1016/j.ymben.2014.10.002] [Citation(s) in RCA: 145] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2014] [Revised: 09/18/2014] [Accepted: 10/14/2014] [Indexed: 11/18/2022]
Abstract
To obtain fast growing oil-rich microalgal strains has been urgently demanded for microalgal biofuel. Malic enzyme (ME), which is involved in pyruvate metabolism and carbon fixation, was first characterized in microalgae here. Overexpression of Phaeodactylum tricornutum ME (PtME) significantly enhanced the expression of PtME and its enzymatic activity in transgenic P. tricornutum. The total lipid content in transgenic cells markedly increased by 2.5-fold and reached a record 57.8% of dry cell weight with a similar growth rate to wild type, thus keeping a high biomass. The neutral lipid content was further increased by 31% under nitrogen-deprivation treatment, still 66% higher than that of wild type. Transgenic microalgae cells exhibited obvious morphological changes, as the cells were shorter and thicker and contained larger oil bodies. Immuno-electron microscopy targeted PtME to the mitochondrion. This study markedly increased the oil content in microalgae, suggesting a new route for developing ideal microalgal strains for industrial biodiesel production.
Collapse
Affiliation(s)
- Jiao Xue
- Key Laboratory of Eutrophication and Red Tide Prevention of Guangdong Higher Education Institutes, College of Life Science, Jinan University, Guangzhou 510632, China
| | - Ying-Fang Niu
- Key Laboratory of Eutrophication and Red Tide Prevention of Guangdong Higher Education Institutes, College of Life Science, Jinan University, Guangzhou 510632, China
| | - Tan Huang
- Key Laboratory of Eutrophication and Red Tide Prevention of Guangdong Higher Education Institutes, College of Life Science, Jinan University, Guangzhou 510632, China
| | - Wei-Dong Yang
- Key Laboratory of Eutrophication and Red Tide Prevention of Guangdong Higher Education Institutes, College of Life Science, Jinan University, Guangzhou 510632, China
| | - Jie-Sheng Liu
- Key Laboratory of Eutrophication and Red Tide Prevention of Guangdong Higher Education Institutes, College of Life Science, Jinan University, Guangzhou 510632, China
| | - Hong-Ye Li
- Key Laboratory of Eutrophication and Red Tide Prevention of Guangdong Higher Education Institutes, College of Life Science, Jinan University, Guangzhou 510632, China.
| |
Collapse
|
89
|
Pacharawongsakda E, Theeramunkong T. Predict subcellular locations of singleplex and multiplex proteins by semi-supervised learning and dimension-reducing general mode of Chou's PseAAC. IEEE Trans Nanobioscience 2014; 12:311-20. [PMID: 23864226 DOI: 10.1109/tnb.2013.2272014] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Predicting protein subcellular location is one of major challenges in Bioinformatics area since such knowledge helps us understand protein functions and enables us to select the targeted proteins during drug discovery process. While many computational techniques have been proposed to improve predictive performance for protein subcellular location, they have several shortcomings. In this work, we propose a method to solve three main issues in such techniques; i) manipulation of multiplex proteins which may exist or move between multiple cellular compartments, ii) handling of high dimensionality in input and output spaces and iii) requirement of sufficient labeled data for model training. Towards these issues, this work presents a new computational method for predicting proteins which have either single or multiple locations. The proposed technique, namely iFLAST-CORE, incorporates the dimensionality reduction in the feature and label spaces with co-training paradigm for semi-supervised multi-label classification. For this purpose, the Singular Value Decomposition (SVD) is applied to transform the high-dimensional feature space and label space into the lower-dimensional spaces. After that, due to limitation of labeled data, the co-training regression makes use of unlabeled data by predicting the target values in the lower-dimensional spaces of unlabeled data. In the last step, the component of SVD is used to project labels in the lower-dimensional space back to those in the original space and an adaptive threshold is used to map a numeric value to a binary value for label determination. A set of experiments on viral proteins and gram-negative bacterial proteins evidence that our proposed method improve the classification performance in terms of various evaluation metrics such as Aiming (or Precision), Coverage (or Recall) and macro F-measure, compared to the traditional method that uses only labeled data.
Collapse
|
90
|
Chen X, Shi SP, Suo SB, Xu HD, Qiu JD. Proteomic analysis and prediction of human phosphorylation sites in subcellular level reveal subcellular specificity. Bioinformatics 2014; 31:194-200. [DOI: 10.1093/bioinformatics/btu598] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
|
91
|
Peng KT, Zheng CN, Xue J, Chen XY, Yang WD, Liu JS, Bai W, Li HY. Delta 5 fatty acid desaturase upregulates the synthesis of polyunsaturated fatty acids in the marine diatom Phaeodactylum tricornutum. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2014; 62:8773-6. [PMID: 25109502 DOI: 10.1021/jf5031086] [Citation(s) in RCA: 60] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]
Abstract
Microalgae are important primary producers in the marine ecosystem and excellent sources of lipids and other bioactive compounds. The marine diatom Phaeodactylum tricornutum accumulates eicosapentaenoic acid (EPA, 20:5n-3) as its major component of fatty acids. To improve the EPA production, delta 5 desaturase, which plays a role in EPA biosynthetic pathway, was characterized in P. tricornutum. An annotated delta 5 desaturase PtD5b gene was cloned and overexpressed in P. tricornutum. The transgene was integrated into the genome demonstrated by Southern blot, and the overexpression of PtD5b was verified by qPCR and Western blot analysis. Fatty acid composition exhibited a significant increase in the unsaturated fatty acids. Monounsaturated fatty acids (MUFA) and polyunsaturated fatty acids (PUFA) showed an increase of 75% and 64%, respectively. In particular, EPA showed an increase of 58% in engineered microalgae. Meanwhile, neutral lipid content showed an increase up to 65% in engineered microalgae. More importantly, engineered cells showed a similar growth rate with the wild type, thus keeping high biomass productivity. This work provides an effective way to improve the production of microalgal value-added bioproducts by metabolic engineering.
Collapse
Affiliation(s)
- Kun-Tao Peng
- Key Laboratory of Eutrophication and Red Tide Prevention of Guangdong Higher Education Institutes, College of Life Science and §Department of Food Science, Jinan University , Guangzhou, Guangdong 510632, China
| | | | | | | | | | | | | | | |
Collapse
|
92
|
A Multi-label Classifier for Prediction Membrane Protein Functional Types in Animal. J Membr Biol 2014; 247:1141-8. [DOI: 10.1007/s00232-014-9708-2] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2014] [Accepted: 07/14/2014] [Indexed: 11/26/2022]
|
93
|
Yang F, Xu YY, Shen HB. Many local pattern texture features: which is better for image-based multilabel human protein subcellular localization classification? ScientificWorldJournal 2014; 2014:429049. [PMID: 25050396 PMCID: PMC4094881 DOI: 10.1155/2014/429049] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2014] [Accepted: 05/22/2014] [Indexed: 01/14/2023] Open
Abstract
Human protein subcellular location prediction can provide critical knowledge for understanding a protein's function. Since significant progress has been made on digital microscopy, automated image-based protein subcellular location classification is urgently needed. In this paper, we aim to investigate more representative image features that can be effectively used for dealing with the multilabel subcellular image samples. We prepared a large multilabel immunohistochemistry (IHC) image benchmark from the Human Protein Atlas database and tested the performance of different local texture features, including completed local binary pattern, local tetra pattern, and the standard local binary pattern feature. According to our experimental results from binary relevance multilabel machine learning models, the completed local binary pattern, and local tetra pattern are more discriminative for describing IHC images when compared to the traditional local binary pattern descriptor. The combination of these two novel local pattern features and the conventional global texture features is also studied. The enhanced performance of final binary relevance classification model trained on the combined feature space demonstrates that different features are complementary to each other and thus capable of improving the accuracy of classification.
Collapse
Affiliation(s)
- Fan Yang
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, Shanghai 200240, China
- Key Laboratory of Optic-Electronic and Communication, Jiangxi Science & Technology Normal University, Nanchang 330013, China
- Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China
| | - Ying-Ying Xu
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, Shanghai 200240, China
- Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China
| | - Hong-Bin Shen
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, Shanghai 200240, China
- Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China
| |
Collapse
|
94
|
Roybal KT, Sinai P, Verkade P, Murphy RF, Wülfing C. The actin-driven spatiotemporal organization of T-cell signaling at the system scale. Immunol Rev 2014; 256:133-47. [PMID: 24117818 DOI: 10.1111/imr.12103] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
T cells are activated through interaction with antigen-presenting cells (APCs). During activation, receptors and signaling intermediates accumulate in diverse spatiotemporal distributions. These distributions control the probability of signaling interactions and thus govern information flow through the signaling system. Spatiotemporally resolved system-scale investigation of signaling can extract the regulatory information thus encoded, allowing unique insight into the control of T-cell function. Substantial technical challenges exist, and these are briefly discussed herein. While much of the work assessing T-cell spatiotemporal organization uses planar APC substitutes, we focus here on B-cell APCs with often stark differences. Spatiotemporal signaling distributions are driven by cell biologically distinct structures, a large protein assembly at the interface center, a large invagination, the actin-supported interface periphery as extended by smaller individual lamella, and a newly discovered whole-interface actin-driven lamellum. The more than 60 elements of T-cell activation studied to date are dynamically distributed between these structures, generating a complex organization of the signaling system. Signal initiation and core signaling prefer the interface center, while signal amplification is localized in the transient lamellum. Actin dynamics control signaling distributions through regulation of the underlying structures and drive a highly undulating T-cell/APC interface that imposes substantial constraints on T-cell organization. We suggest that the regulation of actin dynamics, by controlling signaling distributions and membrane topology, is an important rheostat of T-cell signaling.
Collapse
Affiliation(s)
- Kole T Roybal
- Department of Immunology, UT Southwestern Medical Center, Dallas, TX, USA
| | | | | | | | | |
Collapse
|
95
|
Tozzoli R, Bonaguri C, Melegari A, Antico A, Bassetti D, Bizzaro N. Current state of diagnostic technologies in the autoimmunology laboratory. Clin Chem Lab Med 2014; 51:129-38. [PMID: 23092800 DOI: 10.1515/cclm-2012-0191] [Citation(s) in RCA: 61] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2012] [Accepted: 05/04/2012] [Indexed: 12/21/2022]
Abstract
The methods for detecting and measuring autoantibodies have evolved markedly in recent years, encompassing three generations of analytical technologies. Many different immunoassay methods have been developed and used for research and laboratory practice purposes, from the early conventional (or monoplex) analytical methods able to detect single autoantibodies to the more recent multiplex platforms that can quantify tens of molecules. Although it has been in use for over 50 years, indirect immunofluorescence remains the standard method for research on many types of autoantibodies, due to its characteristics of diagnostic sensitivity and also to recent technological innovations which permit it a greater level of automation and standardization. The recent multiplex immunometric methods, with varying levels of automation, present characteristics of higher diagnostic accuracy, but are not yet widely diffused in autoimmunology laboratories due to the limited number of autoantibodies that are detectable, and due to the high cost of reagents and systems. Technological advancement in autoimmunology continues to evolve rapidly, and in the coming years new proteomic techniques will be able to radically change the approach to diagnostics and possibly also clinical treatment of autoimmune diseases. The scope of this review is to update the state of the art of technologies and methods for the measurement of autoantibodies, with special reference to innovations in indirect immunofluorescence and in multiple proteomic methods.
Collapse
Affiliation(s)
- Renato Tozzoli
- Laboratorio di Patologia Clinica, Dipartimento di Medicina di Laboratorio, Azienda Ospedaliera S. Maria degli Angeli, Pordenone, Italy
| | | | | | | | | | | |
Collapse
|
96
|
Gong X, Zhao Y, Cai S, Fu S, Yang C, Zhang S, Zhang X. Single cell analysis with probe ESI-mass spectrometry: detection of metabolites at cellular and subcellular levels. Anal Chem 2014; 86:3809-16. [PMID: 24641101 DOI: 10.1021/ac500882e] [Citation(s) in RCA: 158] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
Molecular analysis at cellular and subcellular levels, whether on selected molecules or at the metabolomics scale, is still a challenge now. Here we propose a method based on probe ESI mass spectrometry (PESI-MS) for single cell analysis. Detection of metabolites at cellular and subcellular levels was successfully achieved. In our work, tungsten probes with a tip diameter of about 1 μm were directly inserted into live cells to enrich metabolites. Then the enriched metabolites were directly desorbed/ionized from the tip of the probe for mass spectrometry (MS) detection. The direct desorption/ionization of the enriched metabolites from the tip of the probe greatly improved the sensitivity by a factor of about 30 fold compared to those methods that eluted the enriched analytes into a liquid phase for subsequent MS detection. We applied the PESI-MS to the detection of metabolites in single Allium cepa cells. Different kinds of metabolites, including 6 fructans, 4 lipids, and 8 flavone derivatives in single cells, have been successfully detected. Significant metabolite diversity was observed among different cells types of A. cepa bulb and different subcellular compartments of the same cell. We found that the inner epidermal cells had about 20 fold more fructans than the outer epidermal cells, while the outer epidermal cells had more lipids. We expected that PESI-MS might be a candidate in the future studies of single cell "omics".
Collapse
Affiliation(s)
- Xiaoyun Gong
- Beijing Key Laboratory for Microanalytical Methods and Instrumentation, Department of Chemistry, Tsinghua University , Beijing China
| | | | | | | | | | | | | |
Collapse
|
97
|
Abraham Y, Zhang X, Parker CN. Multiparametric Analysis of Screening Data: Growing Beyond the Single Dimension to Infinity and Beyond. ACTA ACUST UNITED AC 2014; 19:628-39. [PMID: 24598104 DOI: 10.1177/1087057114524987] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2013] [Accepted: 01/14/2014] [Indexed: 11/16/2022]
Abstract
Advances in instrumentation now allow the development of screening assays that are capable of monitoring multiple readouts such as transcript or protein levels, or even multiple parameters derived from images. Such advances in assay technologies highlight the complex nature of biology and disease. Harnessing this complexity requires integration of all the different parameters that can be measured rather than just monitoring a single dimension as is commonly used. Although some of the methods used to combine multiple measurements, such as principal component analysis, are commonly used for microarray analysis, biologists are not yet using many of the tools that have been developed in other fields to address such issues. Visualization of multiparametric data sets is one of the major challenges in this field, and a depiction of the results in a manner that can be readily interpreted is essential. This article describes a number of assay systems being used to generate such data sets en masse, and the methods being applied to their visualization and analysis. We also discuss some of the challenges of applying methods developed in other fields to biology.
Collapse
Affiliation(s)
- Yann Abraham
- Novartis Institute for Biomedical Research, Basel, Switzerland
| | - Xian Zhang
- Novartis Institute for Biomedical Research, Basel, Switzerland
| | | |
Collapse
|
98
|
Zhang SW, Liu YF, Yu Y, Zhang TH, Fan XN. MSLoc-DT: A new method for predicting the protein subcellular location of multispecies based on decision templates. Anal Biochem 2014; 449:164-71. [DOI: 10.1016/j.ab.2013.12.013] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2013] [Revised: 11/08/2013] [Accepted: 12/12/2013] [Indexed: 12/12/2022]
|
99
|
Wu B, Xie J, Du Z, Wu J, Zhang P, Xu L, Li E. PPI network analysis of mRNA expression profile of ezrin knockdown in esophageal squamous cell carcinoma. BIOMED RESEARCH INTERNATIONAL 2014; 2014:651954. [PMID: 25126570 PMCID: PMC4122099 DOI: 10.1155/2014/651954] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/03/2014] [Revised: 06/13/2014] [Accepted: 06/17/2014] [Indexed: 02/05/2023]
Abstract
Ezrin, coding protein EZR which cross-links actin filaments, overexpresses and involves invasion, metastasis, and poor prognosis in various cancers including esophageal squamous cell carcinoma (ESCC). In our previous study, Ezrin was knock down and analyzed by mRNA expression profile which has not been fully mined. In this study, we applied protein-protein interactions (PPI) network knowledge and methods to explore our understanding of these differentially expressed genes (DEGs). PPI subnetworks showed that hundreds of DEGs interact with thousands of other proteins. Subcellular localization analyses found that the DEGs and their directly or indirectly interacting proteins distribute in multiple layers, which was applied to analyze the shortest paths between EZR and other DEGs. Gene ontology annotation generated a functional annotation map and found hundreds of significant terms, especially those associated with cytoskeleton organization of Ezrin protein, such as "cytoskeleton organization," "regulation of actin filament-based process," and "regulation of actin cytoskeleton organization." The algorithm of Random Walk with Restart was applied to prioritize the DEGs and identified several cancer related DEGs ranked closest to EZR. These analyses based on PPI network have greatly expanded our comprehension of the mRNA expression profile of Ezrin knockdown for future examination of the roles and mechanisms of Ezrin.
Collapse
Affiliation(s)
- Bingli Wu
- Department of Biochemistry and Molecular Biology, Shantou University Medical College, Shantou 515041, China
| | - Jianjun Xie
- Department of Biochemistry and Molecular Biology, Shantou University Medical College, Shantou 515041, China
| | - Zepeng Du
- Department of Pathology, Shantou Central Hospital, Shantou 515041, China
| | - Jianyi Wu
- Department of Biochemistry and Molecular Biology, Shantou University Medical College, Shantou 515041, China
| | - Pixian Zhang
- Department of Biochemistry and Molecular Biology, Shantou University Medical College, Shantou 515041, China
| | - Liyan Xu
- Institute of Oncologic Pathology, Shantou University Medical College, Shantou 515041, China
- *Liyan Xu: and
| | - Enmin Li
- Department of Biochemistry and Molecular Biology, Shantou University Medical College, Shantou 515041, China
- *Enmin Li:
| |
Collapse
|
100
|
Comparative genomics study for identification of drug and vaccine targets in Vibrio cholerae: MurA ligase as a case study. Genomics 2013; 103:83-93. [PMID: 24368230 DOI: 10.1016/j.ygeno.2013.12.002] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2013] [Revised: 12/11/2013] [Accepted: 12/12/2013] [Indexed: 12/15/2022]
Abstract
A systematic workflow consisting of comparative genomics, metabolic pathways analysis and additional drug prioritization parameters identified 264 proteins of Vibrio cholerae which were predicted to be absent in Homo sapiens. Among these, 40 proteins were identified as essential proteins that could serve as potential drug and vaccine targets. Additional prioritization parameters characterized 11 proteins as vaccine candidates while druggability of each of the identified proteins as evaluated by the Drug Bank database which prioritized 16 proteins suitable for drug targets. As a case study, we built a homology model of one of the potential drug targets, MurA ligase, using MODELLER (9v12) software. The model has been further explored for in silico docking with inhibitors having druggability potential from the Drug Bank database. Results from this study could facilitate selecting V. cholerae proteins for drug design and vaccine production pipelines in future.
Collapse
|