1
|
Chen K, Bhunia RK, Wendt MM, Campidilli G, McNinch C, Hassan A, Li L, Nikolau BJ, Yandeau-Nelson MD. Cuticle development and the underlying transcriptome-metabolome associations during early seedling establishment. JOURNAL OF EXPERIMENTAL BOTANY 2024; 75:6500-6522. [PMID: 39031128 PMCID: PMC11522977 DOI: 10.1093/jxb/erae311] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Accepted: 07/18/2024] [Indexed: 07/22/2024]
Abstract
The plant cuticle is a complex extracellular lipid barrier that has multiple protective functions. This study investigated cuticle deposition by integrating metabolomics and transcriptomics data gathered from six different maize seedling organs of four genotypes, the inbred lines B73 and Mo17, and their reciprocal hybrids. These datasets captured the developmental transition of the seedling from heterotrophic skotomorphogenic growth to autotrophic photomorphogenic growth, a transition that is highly vulnerable to environmental stresses. Statistical interrogation of these data revealed that the predominant determinant of cuticle composition is seedling organ type, whereas the seedling genotype has a smaller effect on this phenotype. Gene-to-metabolite associations assessed by integrated statistical analyses identified three gene networks associated with the deposition of different elements of the cuticle: cuticular waxes; monomers of lipidized cell wall biopolymers, including cutin and suberin; and both of these elements. These gene networks reveal three metabolic programs that appear to support cuticle deposition, including processes of chloroplast biogenesis, lipid metabolism, and molecular regulation (e.g. transcription factors, post-translational regulators, and phytohormones). This study demonstrates the wider physiological metabolic context that can determine cuticle deposition and lays the groundwork for new targets for modulating the properties of this protective barrier.
Collapse
Affiliation(s)
- Keting Chen
- Department of Genetics, Development & Cell Biology, Iowa State University, Ames, IA, USA
- Bioinformatics & Computational Biology Graduate Program, Iowa State University, Ames, IA, USA
| | - Rupam Kumar Bhunia
- Roy J. Carver Department of Biochemistry, Biophysics & Molecular Biology, Iowa State University, Ames, IA, USA
| | - Matthew M Wendt
- Department of Genetics, Development & Cell Biology, Iowa State University, Ames, IA, USA
- Interdepartmental Genetics and Genomics Graduate Program, Iowa State University, Ames, IA, USA
| | - Grace Campidilli
- Department of Genetics, Development & Cell Biology, Iowa State University, Ames, IA, USA
- Undergraduate Genetics Major, Iowa State University, Ames, IA, USA
| | - Colton McNinch
- Molecular, Cellular, and Developmental Biology Graduate Program, Iowa State University, Ames, IA, USA
| | - Ahmed Hassan
- Department of Genetics, Development & Cell Biology, Iowa State University, Ames, IA, USA
- Undergraduate Data Science Major, Iowa State University, Ames, IA, USA
| | - Ling Li
- Department of Biological Sciences, Mississippi State University, Mississippi State, MS, USA
| | - Basil J Nikolau
- Roy J. Carver Department of Biochemistry, Biophysics & Molecular Biology, Iowa State University, Ames, IA, USA
- Interdepartmental Genetics and Genomics Graduate Program, Iowa State University, Ames, IA, USA
- Molecular, Cellular, and Developmental Biology Graduate Program, Iowa State University, Ames, IA, USA
- Center for Metabolic Biology, Iowa State University, Ames, IA, USA
| | - Marna D Yandeau-Nelson
- Department of Genetics, Development & Cell Biology, Iowa State University, Ames, IA, USA
- Bioinformatics & Computational Biology Graduate Program, Iowa State University, Ames, IA, USA
- Interdepartmental Genetics and Genomics Graduate Program, Iowa State University, Ames, IA, USA
- Molecular, Cellular, and Developmental Biology Graduate Program, Iowa State University, Ames, IA, USA
- Center for Metabolic Biology, Iowa State University, Ames, IA, USA
| |
Collapse
|
2
|
Biswal JK, Das S, Mohapatra JK, Rout M, Ranjan R, Singh RP. A species-independent indirect-ELISA for detection of antibodies to the non-structural protein 2B of foot-and-mouth disease virus. Biologicals 2024; 87:101785. [PMID: 39121525 DOI: 10.1016/j.biologicals.2024.101785] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2024] [Revised: 07/24/2024] [Accepted: 08/04/2024] [Indexed: 08/12/2024] Open
Abstract
Diagnostic assays that are able to detect foot-and-mouth disease (FMD) virus infection in the vaccinated population are essential tools in the progressive control pathway for the FMD. However, testing of serum samples using a single diagnostic assay may not completely substantiate freedom from the virus infection. Therefore, viral non-structural proteins (NSPs)-based various serological assays have been developed for the detection of FMD infection. Nevertheless, the NSPs-based ELISAs have been developed in the indirect-ELISA format, thereby necessitating the use of species-specific conjugated secondary-antibodies for the detection of anti-NSP antibodies in various FMD-susceptible species. Therefore, this study presents a novel recombinant 2B-NSP-based indirect ELISA, employing HRP-conjugated protein-A/G detection system which can detect anti-NSPs antibodies from multiple FMD-susceptible species in a single ELISA platform. Recombinant 2B (r2B) protein was expressed as His-SUMO tagged protein in the E. Coli cells and purified using NI-NTA affinity column chromatography. Using the r2B protein and HRP-conjugated protein A/G, an indirect ELISA was developed and validated for the detection of anti-2B antibodies in serum samples collected from multiple FMD-susceptible animal species with known FMD status. Further, a resampling based statistical technique has been reported for determination of optimal cut-off value for the diagnostic assay. Through this technique, the optimal cut-off of 44 percentage of positivity value was determined for the assay. At this optimal cut-off value, the developed diagnostic assay provided diagnostic sensitivity, specificity, and accuracy, positive and negative predictive values (PPV and NPV) of 92.35 %, 98.41 %, 95.21 %, 98.58 %, and 91.67 %, respectively. The assay was validated further by analyzing random serum samples collected across multi-locations in India. The assay can be used as a single platform for testing serum samples from different species of FMDV-susceptible animals and will be useful for NSP-based serosurveillance of FMDV.
Collapse
Affiliation(s)
- Jitendra K Biswal
- ICAR-National Institute on Foot-and-mouth Disease, Arugul, Bhubaneswar, Odisha, 752050, India.
| | - Samarendra Das
- ICAR-National Institute on Foot-and-mouth Disease, Arugul, Bhubaneswar, Odisha, 752050, India
| | - Jajati K Mohapatra
- ICAR-National Institute on Foot-and-mouth Disease, Arugul, Bhubaneswar, Odisha, 752050, India
| | - Manoranjan Rout
- ICAR-National Institute on Foot-and-mouth Disease, Arugul, Bhubaneswar, Odisha, 752050, India
| | - Rajeev Ranjan
- ICAR-National Institute on Foot-and-mouth Disease, Arugul, Bhubaneswar, Odisha, 752050, India
| | - Rabindra Prasad Singh
- ICAR-National Institute on Foot-and-mouth Disease, Arugul, Bhubaneswar, Odisha, 752050, India
| |
Collapse
|
3
|
Saballos AI, Brooks MD, Tranel PJ, Williams MM. Mapping of flumioxazin tolerance in a snap bean diversity panel leads to the discovery of a master genomic region controlling multiple stress resistance genes. FRONTIERS IN PLANT SCIENCE 2024; 15:1404889. [PMID: 39015289 PMCID: PMC11250381 DOI: 10.3389/fpls.2024.1404889] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/21/2024] [Accepted: 06/12/2024] [Indexed: 07/18/2024]
Abstract
Introduction Effective weed management tools are crucial for maintaining the profitable production of snap bean (Phaseolus vulgaris L.). Preemergence herbicides help the crop to gain a size advantage over the weeds, but the few preemergence herbicides registered in snap bean have poor waterhemp (Amaranthus tuberculatus) control, a major pest in snap bean production. Waterhemp and other difficult-to-control weeds can be managed by flumioxazin, an herbicide that inhibits protoporphyrinogen oxidase (PPO). However, there is limited knowledge about crop tolerance to this herbicide. We aimed to quantify the degree of snap bean tolerance to flumioxazin and explore the underlying mechanisms. Methods We investigated the genetic basis of herbicide tolerance using genome-wide association mapping approach utilizing field-collected data from a snap bean diversity panel, combined with gene expression data of cultivars with contrasting response. The response to a preemergence application of flumioxazin was measured by assessing plant population density and shoot biomass variables. Results Snap bean tolerance to flumioxazin is associated with a single genomic location in chromosome 02. Tolerance is influenced by several factors, including those that are indirectly affected by seed size/weight and those that directly impact the herbicide's metabolism and protect the cell from reactive oxygen species-induced damage. Transcriptional profiling and co-expression network analysis identified biological pathways likely involved in flumioxazin tolerance, including oxidoreductase processes and programmed cell death. Transcriptional regulation of genes involved in those processes is possibly orchestrated by a transcription factor located in the region identified in the GWAS analysis. Several entries belonging to the Romano class, including Bush Romano 350, Roma II, and Romano Purpiat presented high levels of tolerance in this study. The alleles identified in the diversity panel that condition snap bean tolerance to flumioxazin shed light on a novel mechanism of herbicide tolerance and can be used in crop improvement.
Collapse
Affiliation(s)
- Ana I. Saballos
- Global Change and Photosynthesis Research Unit, United States Department of Agriculture–Agricultural Research Service, Urbana, IL, United States
| | - Matthew D. Brooks
- Global Change and Photosynthesis Research Unit, United States Department of Agriculture–Agricultural Research Service, Urbana, IL, United States
| | - Patrick J. Tranel
- Department of Crop Sciences, University of Illinois, Urbana, IL, United States
| | - Martin M. Williams
- Global Change and Photosynthesis Research Unit, United States Department of Agriculture–Agricultural Research Service, Urbana, IL, United States
| |
Collapse
|
4
|
Wei H, Teng F, Wang X, Hou X, Wang H, Wang H, Sun H, Zhou X. Identification of a prognosis-related gene signature and ceRNA regulatory networks in lung adenocarcinoma. Heliyon 2024; 10:e28084. [PMID: 38601687 PMCID: PMC11004716 DOI: 10.1016/j.heliyon.2024.e28084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Revised: 02/23/2024] [Accepted: 03/12/2024] [Indexed: 04/12/2024] Open
Abstract
The ceRNA network, consisting of both noncoding RNA and protein-coding RNA, governs the occurrence, progression, metastasis, and infiltration of lung adenocarcinoma. Signatures comprising multiple genes can effectively determine survival stratification and prognosis of patients with lung adenocarcinoma. To explore the mechanisms of lung adenocarcinoma progression and identify potential biological targets, we carried out systematic bioinformatics analyses of the genetic profiles of lung adenocarcinoma, such as weighted gene co-expression network analysis (WGCNA), differential expression (DE) assessment, univariate and multivariate Cox proportional hazard regression models, ceRNA modulatory networks generated using the ENCORI and miRcode databases, nomogram models, ROC curve assessment, and Kaplan-Meier survival curve analysis. The ceRNA network encompassed 37 nodes, comprising 12 mRNAs, 22 lncRNAs, and three miRNAs. Simultaneously, we performed integration analysis using the 12 genes from the ceRNA network. Our findings revealed that the signature established by these 12 genes serves as an adverse element in lung adenocarcinoma, contributing to unfavorable patient prognosis. To ensure the credibility of our results, we used in vitro experiments for further verification. In conclusion, our study delved into the potential mechanisms underlying lung adenocarcinoma via the ceRNA regulatory network, specifically focusing on the PIF1 and has-miR-125a-5p axis. Additionally, a signature comprising 12 genes was identified as a biomarker related to the prognosis of lung adenocarcinoma.
Collapse
Affiliation(s)
- Hong Wei
- In-Patient Ultrasound Department, The Second Affiliated Hospital of Harbin Medical University, Harbin, 150081, China
| | - Fei Teng
- In-Patient Ultrasound Department, The Second Affiliated Hospital of Harbin Medical University, Harbin, 150081, China
| | - XiaoLei Wang
- In-Patient Ultrasound Department, The Second Affiliated Hospital of Harbin Medical University, Harbin, 150081, China
| | - XiuJuan Hou
- In-Patient Ultrasound Department, The Second Affiliated Hospital of Harbin Medical University, Harbin, 150081, China
| | - HongBo Wang
- In-Patient Ultrasound Department, The Second Affiliated Hospital of Harbin Medical University, Harbin, 150081, China
| | - Hong Wang
- In-Patient Ultrasound Department, The Second Affiliated Hospital of Harbin Medical University, Harbin, 150081, China
| | - Hui Sun
- In-Patient Ultrasound Department, The Second Affiliated Hospital of Harbin Medical University, Harbin, 150081, China
| | - XianLi Zhou
- In-Patient Ultrasound Department, The Second Affiliated Hospital of Harbin Medical University, Harbin, 150081, China
| |
Collapse
|
5
|
Li T, Ling J, Du X, Zhang S, Yang Y, Zhang L. Exploring the underlying mechanisms of fisetin in the treatment of hepatic insulin resistance via network pharmacology and in vitro validation. Nutr Metab (Lond) 2023; 20:51. [PMID: 37996895 PMCID: PMC10666360 DOI: 10.1186/s12986-023-00770-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2023] [Accepted: 11/16/2023] [Indexed: 11/25/2023] Open
Abstract
OBJECTIVE To characterize potential mechanisms of fisetin on hepatic insulin resistance (IR) using network pharmacology and in vitro validation. METHODS Putative targets of fisetin were retrieved from the Traditional Chinese Medicine Systems Pharmacology database, whereas the potential genes of hepatic IR were obtained from GeneCards database. A protein-protein interaction (PPI) network was constructed according to the intersection targets of fisetin and hepatic IR using the Venn diagram. The biological functions and potential pathways related to genes were determined using Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses. Cell experiments were also conducted to further verify the mechanism of fisetin on hepatic IR. RESULTS A total of 118 potential targets from fisetin were associated with hepatic IR. The areas of nodes and corresponding degree values of TP53, AKT1, TNF, IL6, CASP3, CTNNB1, JUN, SRC, epidermal growth factor receptor (EGFR), and HSP90AA1 were larger and could be easily found in the PPI network. Furthermore, GO analysis revealed that these key targets were significantly involved in multiple biological processes that participated in oxidative stress and serine/threonine kinase activity. KEGG enrichment analysis showed that the PI3K/AKT signaling pathway was a significant pathway involved in hepatic IR. Our in vitro results demonstrated that fisetin treatment increased the expressions of EGFR and IRS in HepG2 and L02 cells under normal or IR conditions. Western blot results revealed that p-AKT/AKT levels were significantly up-regulated, suggesting that fisetin was involved in the PI3K/AKT signaling pathway to regulate insulin signaling. CONCLUSION We explored the pharmacological actions and the potential molecular mechanism of fisetin in treating hepatic IR from a holistic perspective. Our study lays a theoretical foundation for the development of fisetin for type 2 diabetes.
Collapse
Affiliation(s)
- Tian Li
- Metabilic Vascular Disease Key Laboratory of Sichuan Province, Luzhou, 646000, China
- Drug Discovery Research Center, Southwest Medical University, Luzhou, 646000, China
- School of Medicine, University of Electronic Science and Technology of China, Chengdu, 610000, China
| | - Junjun Ling
- Affiliated Hospital of Guizhou Medical University, Guiyang, 550000, China
| | - Xingrong Du
- Metabilic Vascular Disease Key Laboratory of Sichuan Province, Luzhou, 646000, China
- Drug Discovery Research Center, Southwest Medical University, Luzhou, 646000, China
| | - Siyu Zhang
- Drug Discovery Research Center, Southwest Medical University, Luzhou, 646000, China
| | - Yan Yang
- Chongqing Tongnan NO.1 Middle School, Tongnan, 402660, China
| | - Liang Zhang
- Metabilic Vascular Disease Key Laboratory of Sichuan Province, Luzhou, 646000, China.
- Affiliated Hospital of Guizhou Medical University, Guiyang, 550000, China.
| |
Collapse
|
6
|
Rani R, Raza G, Ashfaq H, Rizwan M, Razzaq MK, Waheed MQ, Shimelis H, Babar AD, Arif M. Genome-wide association study of soybean ( Glycine max [L.] Merr.) germplasm for dissecting the quantitative trait nucleotides and candidate genes underlying yield-related traits. FRONTIERS IN PLANT SCIENCE 2023; 14:1229495. [PMID: 37636105 PMCID: PMC10450938 DOI: 10.3389/fpls.2023.1229495] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Accepted: 07/25/2023] [Indexed: 08/29/2023]
Abstract
Soybean (Glycine max [L.] Merr.) is one of the most significant crops in the world in terms of oil and protein. Owing to the rising demand for soybean products, there is an increasing need for improved varieties for more productive farming. However, complex correlation patterns among quantitative traits along with genetic interactions pose a challenge for soybean breeding. Association studies play an important role in the identification of accession with useful alleles by locating genomic sites associated with the phenotype in germplasm collections. In the present study, a genome-wide association study was carried out for seven agronomic and yield-related traits. A field experiment was conducted in 2015/2016 at two locations that include 155 diverse soybean germplasm. These germplasms were genotyped using SoySNP50K Illumina Infinium Bead-Chip. A total of 51 markers were identified for node number, plant height, pods per plant, seeds per plant, seed weight per plant, hundred-grain weight, and total yield using a multi-locus linear mixed model (MLMM) in FarmCPU. Among these significant SNPs, 18 were putative novel QTNs, while 33 co-localized with previously reported QTLs. A total of 2,356 genes were found in 250 kb upstream and downstream of significant SNPs, of which 17 genes were functional and the rest were hypothetical proteins. These 17 candidate genes were located in the region of 14 QTNs, of which ss715580365, ss715608427, ss715632502, and ss715620131 are novel QTNs for PH, PPP, SDPP, and TY respectively. Four candidate genes, Glyma.01g199200, Glyma.10g065700, Glyma.18g297900, and Glyma.14g009900, were identified in the vicinity of these novel QTNs, which encode lsd one like 1, Ergosterol biosynthesis ERG4/ERG24 family, HEAT repeat-containing protein, and RbcX2, respectively. Although further experimental validation of these candidate genes is required, several appear to be involved in growth and developmental processes related to the respective agronomic traits when compared with their homologs in Arabidopsis thaliana. This study supports the usefulness of association studies and provides valuable data for functional markers and investigating candidate genes within a diverse germplasm collection in future breeding programs.
Collapse
Affiliation(s)
- Reena Rani
- Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering (NIBGE), Constituent College Pakistan Institute of Engineering and Applied Sciences (PIEAS), Faisalabad, Pakistan
| | - Ghulam Raza
- Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering (NIBGE), Constituent College Pakistan Institute of Engineering and Applied Sciences (PIEAS), Faisalabad, Pakistan
| | - Hamza Ashfaq
- Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering (NIBGE), Constituent College Pakistan Institute of Engineering and Applied Sciences (PIEAS), Faisalabad, Pakistan
| | - Muhammad Rizwan
- Plant Breeding and Genetics Division, Nuclear Institute of Agriculture (NIA), Tando Jam, Pakistan
| | - Muhammad Khuram Razzaq
- Soybean Research Institute, National Center for Soybean Improvement, Nanjing Agricultural University, Nanjing, China
| | - Muhammad Qandeel Waheed
- Plant Breeding and Genetics Division, Nuclear Institute for Agriculture and Biology (NIAB), Constituent College Pakistan Institute of Engineering and Applied Sciences (PIEAS), Faisalabad, Pakistan
| | - Hussein Shimelis
- School of Agricultural, Earth and Environmental Sciences, African Centre for Crop Improvement, University of KwaZulu-Natal, Pietermaritzburg, South Africa
| | - Allah Ditta Babar
- Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering (NIBGE), Constituent College Pakistan Institute of Engineering and Applied Sciences (PIEAS), Faisalabad, Pakistan
| | - Muhammad Arif
- Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering (NIBGE), Constituent College Pakistan Institute of Engineering and Applied Sciences (PIEAS), Faisalabad, Pakistan
| |
Collapse
|
7
|
Guo X, Yan N, Liu L, Yin X, Chen Y, Zhang Y, Wang J, Cao G, Fan C, Hu Z. Transcriptomic comparison of seeds and silique walls from two rapeseed genotypes with contrasting seed oil content. FRONTIERS IN PLANT SCIENCE 2023; 13:1082466. [PMID: 36714692 PMCID: PMC9880416 DOI: 10.3389/fpls.2022.1082466] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Accepted: 12/22/2022] [Indexed: 06/18/2023]
Abstract
Silique walls play pivotal roles in contributing photoassimilates and nutrients to fuel seed growth. However, the interaction between seeds and silique walls impacting oil biosynthesis is not clear during silique development. Changes in sugar, fatty acid and gene expression during Brassica napus silique development of L192 with high oil content and A260 with low oil content were investigated to identify key factors affecting difference of their seed oil content. During the silique development, silique walls contained more hexose and less sucrose than seeds, and glucose and fructose contents in seeds and silique walls of L192 were higher than that of A260 at 15 DAF, and sucrose content in the silique walls of L192 were lower than that of A260 at three time points. Genes related to fatty acid biosynthesis were activated over time, and differences on fatty acid content between the two genotypes occurred after 25 DAF. Genes related to photosynthesis expressed more highly in silique walls than in contemporaneous seeds, and were inhibited over time. Gene set enrichment analysis suggested photosynthesis were activated in L192 at 25 and 35 DAF in silique walls and at both 15 and 35 DAF in the seed. Expressions of sugar transporter genes in L192 was higher than that in A260, especially at 35 DAF. Expressions of genes related to fatty acid biosynthesis, such as BCCP2s, bZIP67 and LEC1s were higher in L192 than in A260, especially at 35 DAF. Meanwhile, genes related to oil body proteins were expressed at much lower levels in L192 than in A260. According to the WGCNA results, hub modules, such as ME.turquoise relative to photosynthesis, ME.green relative to embryo development and ME.yellow relative to lipid biosynthesis, were identified and synergistically regulated seed development and oil accumulation. Our results are helpful for understanding the mechanism of oil accumulation of seeds in oilseed rape for seed oil content improvement.
Collapse
Affiliation(s)
- Xupeng Guo
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, China
- Hybrid Rapeseed Research Center of Shaanxi Province, Yangling, Shaanxi, China
| | - Na Yan
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, China
| | - Linpo Liu
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, China
| | - Xiangzhen Yin
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, China
| | - Yuhong Chen
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, China
| | - Yanfeng Zhang
- Hybrid Rapeseed Research Center of Shaanxi Province, Yangling, Shaanxi, China
| | - Jingqiao Wang
- Institute of Economical Crops, Yunnan Agricultural Academy, Kunming, Yunnan, China
| | - Guozhi Cao
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, China
| | - Chengming Fan
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, China
| | - Zanmin Hu
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, China
- College of Advanced Agricultural Sciences, University of Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
8
|
Rai SN, Das S, Pan J, Mishra DC, Fu XA. Multigroup prediction in lung cancer patients and comparative controls using signature of volatile organic compounds in breath samples. PLoS One 2022; 17:e0277431. [PMID: 36449484 PMCID: PMC9710764 DOI: 10.1371/journal.pone.0277431] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2022] [Accepted: 10/26/2022] [Indexed: 12/05/2022] Open
Abstract
Early detection of lung cancer is a crucial factor for increasing its survival rates among the detected patients. The presence of carbonyl volatile organic compounds (VOCs) in exhaled breath can play a vital role in early detection of lung cancer. Identifying these VOC markers in breath samples through innovative statistical and machine learning techniques is an important task in lung cancer research. Therefore, we proposed an experimental approach for generation of VOC molecular concentration data using unique silicon microreactor technology and further identification and characterization of key relevant VOCs important for lung cancer detection through statistical and machine learning algorithms. We reported several informative VOCs and tested their effectiveness in multi-group classification of patients. Our analytical results indicated that seven key VOCs, including C4H8O2, C13H22O, C11H22O, C2H4O2, C7H14O, C6H12O, and C5H8O, are sufficient to detect the lung cancer patients with higher mean classification accuracy (92%) and lower standard error (0.03) compared to other combinations. In other words, the molecular concentrations of these VOCs in exhaled breath samples were able to discriminate the patients with lung cancer (n = 156) from the healthy smoker and nonsmoker controls (n = 193) and patients with benign pulmonary nodules (n = 65). The quantification of carbonyl VOC profiles from breath samples and identification of crucial VOCs through our experimental approach paves the way forward for non-invasive lung cancer detection. Further, our experimental and analytical approach of VOC quantitative analysis in breath samples may be extended to other diseases, including COVID-19 detection.
Collapse
Affiliation(s)
- Shesh N. Rai
- Biostatistics and Bioinformatics Facility, Brown Cancer Center, University of Louisville, Louisville, KY, United States of America
- School of Interdisciplinary and Graduate Studies, University of Louisville, Louisville, KY, United States of America
- Hepatobiology and Toxicology Center, University of Louisville, Louisville, KY, United States of America
- Department of Bioinformatics and Biostatistics, University of Louisville, Louisville, KY, United States of America
- Biostatistics and Informatics Facility, Center for Integrative Environmental Research Sciences, University of Louisville, Louisville, KY, United States of America
- Christina Lee Brown Envirome Institute, University of Louisville, Louisville, KY, United States of America
- * E-mail: (SNR); (SD)
| | - Samarendra Das
- Biostatistics and Bioinformatics Facility, Brown Cancer Center, University of Louisville, Louisville, KY, United States of America
- School of Interdisciplinary and Graduate Studies, University of Louisville, Louisville, KY, United States of America
- ICAR-Directorate of Foot and Mouth Disease, Arugul, Bhubaneswar, Odisha, India
- International Centre for Foot and Mouth Disease, Arugul, Bhubaneswar, Odisha, India
- ICAR-Indian Agricultural Statistics Research Institute, PUSA, New Delhi, India
- * E-mail: (SNR); (SD)
| | - Jianmin Pan
- Biostatistics and Bioinformatics Facility, Brown Cancer Center, University of Louisville, Louisville, KY, United States of America
| | - Dwijesh C. Mishra
- ICAR-Indian Agricultural Statistics Research Institute, PUSA, New Delhi, India
| | - Xiao-An Fu
- Department of Chemical Engineering, University of Louisville, Louisville, KY, United States of America
| |
Collapse
|
9
|
Hasankhani A, Bahrami A, Mackie S, Maghsoodi S, Alawamleh HSK, Sheybani N, Safarpoor Dehkordi F, Rajabi F, Javanmard G, Khadem H, Barkema HW, De Donato M. In-depth systems biological evaluation of bovine alveolar macrophages suggests novel insights into molecular mechanisms underlying Mycobacterium bovis infection. Front Microbiol 2022; 13:1041314. [PMID: 36532492 PMCID: PMC9748370 DOI: 10.3389/fmicb.2022.1041314] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2022] [Accepted: 11/04/2022] [Indexed: 08/26/2023] Open
Abstract
Objective Bovine tuberculosis (bTB) is a chronic respiratory infectious disease of domestic livestock caused by intracellular Mycobacterium bovis infection, which causes ~$3 billion in annual losses to global agriculture. Providing novel tools for bTB managements requires a comprehensive understanding of the molecular regulatory mechanisms underlying the M. bovis infection. Nevertheless, a combination of different bioinformatics and systems biology methods was used in this study in order to clearly understand the molecular regulatory mechanisms of bTB, especially the immunomodulatory mechanisms of M. bovis infection. Methods RNA-seq data were retrieved and processed from 78 (39 non-infected control vs. 39 M. bovis-infected samples) bovine alveolar macrophages (bAMs). Next, weighted gene co-expression network analysis (WGCNA) was performed to identify the co-expression modules in non-infected control bAMs as reference set. The WGCNA module preservation approach was then used to identify non-preserved modules between non-infected controls and M. bovis-infected samples (test set). Additionally, functional enrichment analysis was used to investigate the biological behavior of the non-preserved modules and to identify bTB-specific non-preserved modules. Co-expressed hub genes were identified based on module membership (MM) criteria of WGCNA in the non-preserved modules and then integrated with protein-protein interaction (PPI) networks to identify co-expressed hub genes/transcription factors (TFs) with the highest maximal clique centrality (MCC) score (hub-central genes). Results As result, WGCNA analysis led to the identification of 21 modules in the non-infected control bAMs (reference set), among which the topological properties of 14 modules were altered in the M. bovis-infected bAMs (test set). Interestingly, 7 of the 14 non-preserved modules were directly related to the molecular mechanisms underlying the host immune response, immunosuppressive mechanisms of M. bovis, and bTB development. Moreover, among the co-expressed hub genes and TFs of the bTB-specific non-preserved modules, 260 genes/TFs had double centrality in both co-expression and PPI networks and played a crucial role in bAMs-M. bovis interactions. Some of these hub-central genes/TFs, including PSMC4, SRC, BCL2L1, VPS11, MDM2, IRF1, CDKN1A, NLRP3, TLR2, MMP9, ZAP70, LCK, TNF, CCL4, MMP1, CTLA4, ITK, IL6, IL1A, IL1B, CCL20, CD3E, NFKB1, EDN1, STAT1, TIMP1, PTGS2, TNFAIP3, BIRC3, MAPK8, VEGFA, VPS18, ICAM1, TBK1, CTSS, IL10, ACAA1, VPS33B, and HIF1A, had potential targets for inducing immunomodulatory mechanisms by M. bovis to evade the host defense response. Conclusion The present study provides an in-depth insight into the molecular regulatory mechanisms behind M. bovis infection through biological investigation of the candidate non-preserved modules directly related to bTB development. Furthermore, several hub-central genes/TFs were identified that were significant in determining the fate of M. bovis infection and could be promising targets for developing novel anti-bTB therapies and diagnosis strategies.
Collapse
Affiliation(s)
- Aliakbar Hasankhani
- Department of Animal Science, College of Agriculture and Natural Resources, University of Tehran, Karaj, Iran
| | - Abolfazl Bahrami
- Department of Animal Science, College of Agriculture and Natural Resources, University of Tehran, Karaj, Iran
- Biomedical Center for Systems Biology Science Munich, Ludwig-Maximilians-University, Munich, Germany
| | - Shayan Mackie
- Faculty of Science, Earth Sciences Building, University of British Columbia, Vancouver, BC, Canada
| | - Sairan Maghsoodi
- Faculty of Paramedical Sciences, Kurdistan University of Medical Sciences, Kurdistan, Iran
| | - Heba Saed Kariem Alawamleh
- Department of Basic Scientific Sciences, AL-Balqa Applied University, AL-Huson University College, AL-Huson, Jordan
| | - Negin Sheybani
- Department of Animal and Poultry Science, College of Aburaihan, University of Tehran, Tehran, Iran
| | - Farhad Safarpoor Dehkordi
- Halal Research Center of IRI, FDA, Tehran, Iran
- Department of Food Hygiene and Quality Control, Faculty of Veterinary Medicine, University of Tehran, Tehran, Iran
| | - Fatemeh Rajabi
- Department of Agronomy and Plant Breeding, College of Agriculture and Natural Resources, University of Tehran, Karaj, Iran
| | - Ghazaleh Javanmard
- Department of Animal Science, College of Agriculture and Natural Resources, University of Tehran, Karaj, Iran
| | - Hosein Khadem
- Department of Agronomy and Plant Breeding, College of Agriculture and Natural Resources, University of Tehran, Karaj, Iran
| | - Herman W. Barkema
- Department of Production Animal Health, Faculty of Veterinary Medicine, University of Calgary, Calgary, AB, Canada
| | - Marcos De Donato
- Regional Department of Bioengineering, Tecnológico de Monterrey, Monterrey, Mexico
| |
Collapse
|
10
|
Zeng J, Hua S, Liu J, Mungur R, He Y, Feng J. Identification of core genes as potential biomarkers for predicting progression and prognosis in glioblastoma. Front Genet 2022; 13:928407. [PMID: 36238156 PMCID: PMC9552700 DOI: 10.3389/fgene.2022.928407] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Accepted: 09/01/2022] [Indexed: 11/17/2022] Open
Abstract
Background: Glioblastoma is a common malignant neuroepithelial neoplasm with poor clinical outcomes and limited treatment options. It is extremely important to search and confirm diverse hub genes that are effective in the advance and prediction of glioblastoma. Methods: We analyzed GSE50161, GSE4290, and GSE68848, the three microarray datasets retrieved from the GEO database. GO function and KEGG pathway enrichment analyses for differentially expressed genes (DEGs) were performed using DAVID. The PPI network of the DEGs was analyzed using the Search Tool for the Retrieval of Interacting Genes database and visualized by Cytoscape software. Hub genes were identified through the PPI network and a robust rank aggregation method. The Cancer Genome Atlas (TCGA) and the Oncomine database were used to validate the hub genes. In addition, a survival curve analysis was conducted to verify the correlation between the expression of hub genes and patient prognosis. Human glioblastoma cells and normal cells were collected, and then RT-PCR, Western blot, and immunofluorescence were conducted to validate the expression of the NDC80 gene. A cell proliferation assay was used to detect the proliferation of glioma cells. The effects of NDC80 expression on migration and invasion of GBM cell lines were evaluated by conducting scratch and transwell assays. Results: A total of 716 DEGs were common to all three microarray datasets, which included 188 upregulated DEGs and 528 downregulated DEGs. Furthermore, we found that among the common DEGs, 10 hub genes showed a high degree of connectivity. The expression of the 10 hub genes in TCGA and the Oncomine database was significantly overexpressed in glioblastoma compared with normal genes. Additionally, the survival analysis showed that the patients with low expression of six genes (BIR5C, CDC20, NDC80, CDK1, TOP2A, and MELK) had a significantly favorable prognosis (p < 0.01). We discovered that NDC80, which has been shown to be important in other cancers, also has an important role in malignant gliomas. The RT-PCR, Western blot, and immunofluorescence results showed that the expression level of NDC80 was significantly higher in human glioblastoma cells than in normal cells. Moreover, we identified that NDC80 increased the proliferation and invasion abilities of human glioblastoma cells. Conclusion: The six genes identified here may be utilized to form a panel of disease progression and predictive biomarkers of glioblastoma for clinical purposes. NDC80, one of the six genes, was discovered to have a potentially important role in GBM, a finding that needs to be further studied.
Collapse
Affiliation(s)
- Jianping Zeng
- Department of Neurosurgery, The First Affiliated Hospital of Nanchang University, Nanchang, China
- *Correspondence: Jianping Zeng,
| | - Shushan Hua
- Department of Neurobiology, Zhejiang University School of Medicine, Hangzhou, China
| | - Jing Liu
- Department of Pharmacy, The First Affiliated Hospital of Nanchang University, Nanchang, China
| | - Rajneesh Mungur
- Department of Neurosurgery, The First Affiliated Hospital of Zhejiang University, Hangzhou, China
| | - Yongsheng He
- Department of Neurosurgery, The First Affiliated Hospital of Nanchang University, Nanchang, China
| | - Jiugeng Feng
- Department of Neurosurgery, The First Affiliated Hospital of Nanchang University, Nanchang, China
| |
Collapse
|
11
|
Shahriari AG, Soltani Z, Tahmasebi A, Poczai P. Integrative System Biology Analysis of Transcriptomic Responses to Drought Stress in Soybean ( Glycine max L.). Genes (Basel) 2022; 13:1732. [PMID: 36292617 PMCID: PMC9602024 DOI: 10.3390/genes13101732] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2022] [Revised: 08/21/2022] [Accepted: 09/20/2022] [Indexed: 11/17/2022] Open
Abstract
Drought is a major abiotic stressor that causes yield losses and limits the growing area for most crops. Soybeans are an important legume crop that is sensitive to water-deficit conditions and suffers heavy yield losses from drought stress. To improve drought-tolerant soybean cultivars through breeding, it is necessary to understand the mechanisms of drought tolerance in soybeans. In this study, we applied several transcriptome datasets obtained from soybean plants under drought stress in comparison to those grown under normal conditions to identify novel drought-responsive genes and their underlying molecular mechanisms. We found 2168 significant up/downregulated differentially expressed genes (DEGs) and 8 core modules using gene co-expression analysis to predict their biological roles in drought tolerance. Gene Ontology and KEGG analyses revealed key biological processes and metabolic pathways involved in drought tolerance, such as photosynthesis, glyceraldehyde-3-phosphate dehydrogenase and cytokinin dehydrogenase activity, and regulation of systemic acquired resistance. Genome-wide analysis of plants' cis-acting regulatory elements (CREs) and transcription factors (TFs) was performed for all of the identified DEG promoters in soybeans. Furthermore, the PPI network analysis revealed significant hub genes and the main transcription factors regulating the expression of drought-responsive genes in each module. Among the four modules associated with responses to drought stress, the results indicated that GLYMA_04G209700, GLYMA_02G204700, GLYMA_06G030500, GLYMA_01G215400, and GLYMA_09G225400 have high degrees of interconnection and, thus, could be considered as potential candidates for improving drought tolerance in soybeans. Taken together, these findings could lead to a better understanding of the mechanisms underlying drought responses in soybeans, which may useful for engineering drought tolerance in plants.
Collapse
Affiliation(s)
- Amir Ghaffar Shahriari
- Department of Agriculture and Natural Resources, Higher Education Center of Eghlid, Eghlid 7381943885, Iran
| | - Zahra Soltani
- Institute of Biotechnology, Shiraz University, Shiraz 7144113131, Iran
| | - Aminallah Tahmasebi
- Department of Agriculture, Minab Higher Education Center, University of Hormozgan, Bandar Abbas 7916193145, Iran
- Plant Protection Research Group, University of Hormozgan, Bandar Abbas 7916193145, Iran
| | - Péter Poczai
- Finnish Museum of Natural History, University of Helsinki, P.O. Box 7, FI-00014 Helsinki, Finland
- Faculty of Biological and Environmental Sciences, University of Helsinki, P.O. Box 65, FI-00065 Helsinki, Finland
- Institute of Advanced Studies Kőszeg (iASK), P.O. Box 4, H-9731 Kőszeg, Hungary
| |
Collapse
|
12
|
Saikia M, Bhattacharyya DK, Kalita JK. CBDCEM: An effective centrality based differential co-expression method for critical gene finding. GENE REPORTS 2022. [DOI: 10.1016/j.genrep.2022.101688] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/09/2022]
|
13
|
Zhang Q, Huang H, Zhang M, Fang C, Wang N, Jing X, Guo J, Sun W, Yang X, Xu Z. Exome Sequencing Reveals Genetic Variability and Identifies Chronic Prognostic Loci in Chinese Sarcoidosis Patients. Front Oncol 2022; 12:910227. [PMID: 35860586 PMCID: PMC9289133 DOI: 10.3389/fonc.2022.910227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Accepted: 05/30/2022] [Indexed: 11/13/2022] Open
Abstract
Background Sarcoidosis is an inflammatory disease characterized by non-caseating granuloma formation in various organs, with several recognized genetic and environmental risk factors. Despite substantial progress, the genetic determinants associated with its prognosis remain largely unknown. Objectives This study aimed to identify the genetic changes involved in sarcoidosis and evaluate their clinical relevance. Methods We performed whole-exome sequencing (WES) in 116 sporadic sarcoidosis patients (acute sarcoidosis patients, n=58; chronic sarcoidosis patients, n=58). In addition, 208 healthy controls were selected from 1000 G East Asian population data. To identify genes enriched in sarcoidosis, Fisher exact tests were performed. The identified genes were included for further pathway analysis using Gene Ontology (GO) and the Kyoto Encyclopedia of Genes and Genomes (KEGG). Additionally, we used the STRING database to construct a protein network of rare variants and Cytoscape to identify hub genes of signaling pathways. Results WES and Fisher’s exact test identified 1,311 variants in 439 protein-coding genes. A total of 135 single nucleotide polymorphisms (SNPs) on 30 protein-coding genes involved in the immunological process based on the GO and KEGG enrichment analysis. Pathway enrichment analysis showed osteoclast differentiation and cytokine–cytokine receptor interactions. Three missense mutations (rs76740888, rs149664918, and rs78251590) in two genes (PRSS3 and CNN2) of immune-related genes showed significantly different mutation frequencies between the disease group and healthy controls. The correlation of genetic abnormalities with clinical outcomes using multivariate analysis of the clinical features and mutation loci showed that the missense variant (rs76740888, Chr9:33796673 G>A) of PRSS3 [p=0.04, odds ratio (OR) = 2.49] was significantly associated with chronic disease prognosis. Additionally, the top two hub genes were CCL4 and CXCR4 based on protein–protein interaction (PPI) network analysis. Conclusion Our study provides new insights into the molecular pathogenesis of sarcoidosis and identifies novel genetic alterations in this disease, especially PRSS3, which may be promising targets for future therapeutic strategies for chronic sarcoidosis.
Collapse
Affiliation(s)
- Qian Zhang
- Department of Respiratory Medicine, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Hui Huang
- Department of Respiratory Medicine, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | | | - Chuling Fang
- Department of Respiratory Medicine, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Na Wang
- Department of Respiratory Medicine, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Xiaoyan Jing
- Department of Respiratory Medicine, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Jian Guo
- Department of Respiratory Medicine, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Wei Sun
- Department of Respiratory Medicine, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Xiaoyu Yang
- Department of Respiratory Medicine, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Zuojun Xu
- Department of Respiratory Medicine, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
- *Correspondence: Zuojun Xu,
| |
Collapse
|
14
|
Liu YN, Chen RM, Pu QT, Nneji LM, Sun YB. Expression Plasticity of Transposable Elements is Highly Associated with Organismal Re-adaptation to Ancestral Environments. Genome Biol Evol 2022; 14:6596371. [PMID: 35642321 PMCID: PMC9174648 DOI: 10.1093/gbe/evac084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/25/2022] [Indexed: 11/20/2022] Open
Abstract
Understanding the roles of phenotypic plasticity in adaptive evolution has gained recognition for decades. Studies involving multiple taxa have shown that gene expression plasticity serves as “long-term memory” to facilitate re-adaptations to ancestral environments. Nevertheless, the general pattern and the underlying genetic basis of expression plasticity remain unclear. The transposable elements (TEs) play crucial roles in gene expression regulation and are widely distributed within the genome. Given this, we re-analyzed the transcriptomic data of chicken (Gallus gallus) generated from a reciprocal transplant experiment to examine whether expression shifts of TEs are involved in the re-adaptation process. Similar to the protein-coding genes, the plastic changes of TEs overwhelmingly exceed the genetic changes in the re-adaptation process. Further, the associated TEs co-expressed with diverse genes to perform a regulatory activity. Thus, our study supports the general function of phenotypic plasticity in adaptive evolution, and suggests a regulatory functions of TEs in this process.
Collapse
Affiliation(s)
- Yan-Nan Liu
- Yunnan Key Laboratory of Plant Reproductive Adaptation and Evolutionary Ecology, School of Ecology and Environmental Sciences, Yunnan University, Kunming 650500, China
| | - Rong-Mei Chen
- Yunnan Key Laboratory of Plant Reproductive Adaptation and Evolutionary Ecology, School of Ecology and Environmental Sciences, Yunnan University, Kunming 650500, China
| | - Qi-Ting Pu
- Yunnan Key Laboratory of Plant Reproductive Adaptation and Evolutionary Ecology, School of Ecology and Environmental Sciences, Yunnan University, Kunming 650500, China
| | - Lotanna M Nneji
- Department of Ecology and Evolutionary Biology, Princeton University, NJ, 08544, USA
| | - Yan-Bo Sun
- Yunnan Key Laboratory of Plant Reproductive Adaptation and Evolutionary Ecology, School of Ecology and Environmental Sciences, Yunnan University, Kunming 650500, China.,Laboratory for Conservation and Utilization of Bio-resources, Yunnan University, Kunming 650091, China
| |
Collapse
|
15
|
Li J, Zhang Z, Guo K, Wu S, Guo C, Zhang X, Wang Z. Identification of a key glioblastoma candidate gene, FUBP3, based on weighted gene co-expression network analysis. BMC Neurol 2022; 22:139. [PMID: 35413821 PMCID: PMC9004042 DOI: 10.1186/s12883-022-02661-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Accepted: 03/16/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Glioblastoma multiforme (GBM) is the most common aggressive malignant brain tumor. However, the molecular mechanism of glioblastoma formation is still poorly understood. To identify candidate genes that may be connected to glioma growth and development, weighted gene co-expression network analysis (WGCNA) was performed to construct a gene co-expression network between gene sets and clinical characteristics. We also explored the function of the key candidate gene. METHODS Two GBM datasets were selected from GEO Datasets. The R language was used to identify differentially expressed genes. WGCNA was performed to construct a gene co-expression network in the GEO glioblastoma samples. A custom Venn diagram website was used to find the intersecting genes. The GEPIA website was applied for survival analysis to determine the significant gene, FUBP3. OS, DSS, and PFI analyses, based on the UCSC Cancer Genomics Browser, were performed to verify the significance of FUBP3. Immunohistochemistry was performed to evaluate the expression of FUBP3 in glioblastoma and adjacent normal tissue. KEGG and GO enrichment analyses were used to reveal possible functions of FUBP3. Microenvironment analysis was used to explore the relationship between FUBP3 and immune infiltration. Immunohistochemistry was performed to verify the results of the microenvironment analysis. RESULTS GSE70231 and GSE108474 were selected from GEO Datasets, then 715 and 694 differentially expressed genes (DEGs) from GSE70231 and GSE108474, respectively, were identified. We then performed weighted gene co-expression network analysis (WGCNA) and identified the most downregulated gene modules of GSE70231 and GSE108474, and 659 and 3915 module genes from GSE70231 and GSE108474, respectively, were selected. Five intersection genes (FUBP3, DAD1, CLIC1, ABR, and DNM1) were calculated by Venn diagram. FUBP3 was then identified as the only significant gene by survival analysis using the GEPIA website. OS, DSS, and PFI analyses verified the significance of FUBP3. Immunohistochemical analysis revealed FUBP3 expression in GBM and adjacent normal tissue. KEGG and GO analyses uncovered the possible function of FUBP3 in GBM. Tumor microenvironment analysis showed that FUBP3 may be connected to immune infiltration, and immunohistochemistry identified a positive correlation between immune cells (CD4 + T cells, CD8 + T cells, and macrophages) and FUBP3. CONCLUSION FUBP3 is associated with immune surveillance in GBM, indicating that it has a great impact on GBM development and progression. Therefore, interventions involving FUBP3 and its regulatory pathway may be a new approach for GBM treatment.
Collapse
Affiliation(s)
- Jianmin Li
- Department of Neurosurgery, Binzhou Medical University Hospital, Binzhou, Shandong Province, People's Republic of China.
| | - Zhao Zhang
- Department of Neurosurgery, Binzhou Medical University Hospital, Binzhou, Shandong Province, People's Republic of China
| | - Ke Guo
- Department of Neurosurgery, Binzhou Medical University Hospital, Binzhou, Shandong Province, People's Republic of China
| | - Shuhua Wu
- Department of Pathology, Binzhou Medical University Hospital, Binzhou, Shandong Province, China
| | - Chong Guo
- Department of Neurosurgery, Binzhou Medical University Hospital, Binzhou, Shandong Province, People's Republic of China
| | - Xinfan Zhang
- Department of Neurosurgery, Binzhou Medical University Hospital, Binzhou, Shandong Province, People's Republic of China
| | - Zi Wang
- Department of Neurosurgery, Binzhou Medical University Hospital, Binzhou, Shandong Province, People's Republic of China
| |
Collapse
|
16
|
Mishra DC, Arora D, Budhlakoti N, Solanke AU, Mithra SVACR, Kumar A, Pandey PS, Srivastava S, Kumar S, Farooqi MS, Lal SB, Rai A, Chaturvedi KK. Identification of Potential Cytokinin Responsive Key Genes in Rice Treated With Trans-Zeatin Through Systems Biology Approach. Front Genet 2022; 12:780599. [PMID: 35198001 PMCID: PMC8859635 DOI: 10.3389/fgene.2021.780599] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Accepted: 11/18/2021] [Indexed: 02/04/2023] Open
Abstract
Rice is an important staple food grain consumed by most of the population around the world. With climate and environmental changes, rice has undergone a tremendous stress state which has impacted crop production and productivity. Plant growth hormones are essential component that controls the overall outcome of the growth and development of the plant. Cytokinin is a hormone that plays an important role in plant immunity and defense systems. Trans-zeatin is an active form of cytokinin that can affect plant growth which is mediated by a multi-step two-component phosphorelay system that has different roles in various developmental stages. Systems biology is an approach for pathway analysis to trans-zeatin treated rice that could provide a deep understanding of different molecules associated with them. In this study, we have used a weighted gene co-expression network analysis method to identify the functional modules and hub genes involved in the cytokinin pathway. We have identified nine functional modules comprising of different hub genes which contribute to the cytokinin signaling route. The biological significance of these identified hub genes has been tested by applying well-proven statistical techniques to establish the association with the experimentally validated QTLs and annotated by the DAVID server. The establishment of key genes in different pathways has been confirmed. These results will be useful to design new stress-resistant cultivars which can provide sustainable yield in stress-specific conditions.
Collapse
Affiliation(s)
- Dwijesh Chandra Mishra
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Devender Arora
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
- National Institute of Animal Science, Rural Development Administration, Jeonju, South Korea
| | - Neeraj Budhlakoti
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | | | | | - Anuj Kumar
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - P. S. Pandey
- Agricultural Education Division, Indian Council of Agricultural Research, New Delhi, India
| | - Sudhir Srivastava
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Sanjeev Kumar
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - M. S. Farooqi
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - S. B. Lal
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Anil Rai
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - K. K. Chaturvedi
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
- *Correspondence: K. K. Chaturvedi,
| |
Collapse
|
17
|
Huang Y, Wei J, Cooper A, Morris MJ. Parkinson's Disease: From Genetics to Molecular Dysfunction and Targeted Therapeutic Approaches. Genes Dis 2022. [DOI: 10.1016/j.gendis.2021.12.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022] Open
|
18
|
Abstract
Gene co-expression analysis is a data analysis technique that helps identify groups of genes with similar expression patterns across several different conditions. By means of these techniques, different groups have been able to assign putative metabolic pathways and functions to understudied genes and to identify novel metabolic regulation networks for different metabolites. Some groups have even used network comparative studies to understand the evolution of these networks from green algae to land plants. In this chapter, we will go over the basic definitions required to understand network topology and gene module identification. Additionally, we offer the reader a walk-through a standard analysis pipeline as implemented in the package WGCNA that takes as input raw fastq files and obtains co-expressed gene clusters and representative gene expression patterns from each module for downstream applications.
Collapse
Affiliation(s)
- Juan D Montenegro
- Department of Neuroscience and Developmental Biology, University of Vienna, Vienna, Austria.
| |
Collapse
|
19
|
Shi Y, Wu M, Liu Y, Hu L, Wu H, Xie L, Liu Z, Wu A, Chen L, Xu C. ITGA5 Predicts Dual-Drug Resistance to Temozolomide and Bevacizumab in Glioma. Front Oncol 2021; 11:769592. [PMID: 34976814 PMCID: PMC8719456 DOI: 10.3389/fonc.2021.769592] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2021] [Accepted: 11/24/2021] [Indexed: 12/12/2022] Open
Abstract
AIMS Anti-angiotherapy (Bevacizumab) is currently regarded as a promising option for glioma patients who are resistant to temozolomide (TMZ) treatment. But ongoing clinical research failed to meet therapeutic expectations. This study aimed to explore the pivotal genetic feature responsible for TMZ and Bevacizumab resistance in glioma patients. METHODS We downloaded the transcriptomic and methylation data of glioma patients from The Cancer Genome Atlas (TCGA), Chinese Glioma Genome Atlas (CGGA), and Gene Expression Omnibus (GEO) databases and grouped these patients into resistant and non-resistant groups based on their clinical profiles. Differentially expressed genes and pathways were identified and exhibited with software in R platform. A TMZ-resistant cell line was constructed for validating the expression change of the candidate gene, ITGA5. An ITGA5-overexpressing cell line was also constructed to investigate its biological function using the CCK8 assay, Western blot, periodic acid-Schiff (PAS) staining, and transcriptional sequencing. RESULTS Change of the cell morphology and polarity was closely associated with TMZ mono-resistance and TMZ/Bevacizumab dual resistance in glioma patients. The expression level of ITGA5 was effective in determining drug resistance and the outcome of glioma patients, which is regulated by methylation on two distinct sites. ITGA5 was augmented in TMZ-resistant glioma cells, while overexpressing ITGA5 altered the cell-promoted TMZ resistance through enhancing vascular mimicry (VM) formation correspondingly. CONCLUSIONS Both the epigenetic and transcriptional levels of ITGA5 are effective in predicting TMZ and Bevacizumab resistance, indicating that ITGA5 may serve as a predictor of the treatment outcomes of glioma patients.
Collapse
Affiliation(s)
- Ying Shi
- Integrative Cancer Center & Cancer Clinical Research Center, Sichuan Cancer Hospital & Institute, Sichuan Cancer Center, School of Medicine, University of Electronic Science and Technology of China, Chengdu, China
| | - Mengwan Wu
- Integrative Cancer Center & Cancer Clinical Research Center, Sichuan Cancer Hospital & Institute, Sichuan Cancer Center, School of Medicine, University of Electronic Science and Technology of China, Chengdu, China
- Department of Radiation Oncology, Sichuan Cancer Hospital, Chengdu, China
| | - Yuyang Liu
- Chinese People’s Liberation Army (PLA) Institute of Neurosurgery, Chinese PLA General Hospital and PLA Medical College, Beijing, China
| | - Lanlin Hu
- Integrative Cancer Center & Cancer Clinical Research Center, Sichuan Cancer Hospital & Institute, Sichuan Cancer Center, School of Medicine, University of Electronic Science and Technology of China, Chengdu, China
- Department of Radiation Oncology, Sichuan Cancer Hospital, Chengdu, China
| | - Hong Wu
- Integrative Cancer Center & Cancer Clinical Research Center, Sichuan Cancer Hospital & Institute, Sichuan Cancer Center, School of Medicine, University of Electronic Science and Technology of China, Chengdu, China
- Department of Radiation Oncology, Sichuan Cancer Hospital, Chengdu, China
| | - Lei Xie
- Integrative Cancer Center & Cancer Clinical Research Center, Sichuan Cancer Hospital & Institute, Sichuan Cancer Center, School of Medicine, University of Electronic Science and Technology of China, Chengdu, China
| | - Zhiwei Liu
- The Center for Advanced Semiconductor & Integrated Micro-System, University of Electronic Science and Technology of China, Chengdu, China
| | - Anhua Wu
- Department of Neurosurgery, The First Affiliated Hospital of China Medical University, Shenyang, China
| | - Ling Chen
- Chinese People’s Liberation Army (PLA) Institute of Neurosurgery, Chinese PLA General Hospital and PLA Medical College, Beijing, China
| | - Chuan Xu
- Integrative Cancer Center & Cancer Clinical Research Center, Sichuan Cancer Hospital & Institute, Sichuan Cancer Center, School of Medicine, University of Electronic Science and Technology of China, Chengdu, China
- Department of Radiation Oncology, Sichuan Cancer Hospital, Chengdu, China
| |
Collapse
|
20
|
Schlosser P, Knaus J, Schmutz M, Dohner K, Plass C, Bullinger L, Claus R, Binder H, Lubbert M, Schumacher M. Netboost: Boosting-Supported Network Analysis Improves High-Dimensional Omics Prediction in Acute Myeloid Leukemia and Huntington's Disease. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:2635-2648. [PMID: 32365034 DOI: 10.1109/tcbb.2020.2983010] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
State-of-the art selection methods fail to identify weak but cumulative effects of features found in many high-dimensional omics datasets. Nevertheless, these features play an important role in certain diseases. We present Netboost, a three-step dimension reduction technique. First, a boosting-based filter is combined with the topological overlap measure to identify the essential edges of the network. Second, sparse hierarchical clustering is applied on the selected edges to identify modules and finally module information is aggregated by the first principal components. We demonstrate the application of the newly developed Netboost in combination with CoxBoost for survival prediction of DNA methylation and gene expression data from 180 acute myeloid leukemia (AML) patients and show, based on cross-validated prediction error curve estimates, its prediction superiority over variable selection on the full dataset as well as over an alternative clustering approach. The identified signature related to chromatin modifying enzymes was replicated in an independent dataset, the phase II AMLSG 12-09 study. In a second application we combine Netboost with Random Forest classification and improve the disease classification error in RNA-sequencing data of Huntington's disease mice. Netboost is a freely available Bioconductor R package for dimension reduction and hypothesis generation in high-dimensional omics applications.
Collapse
|
21
|
Sharmin RA, Karikari B, Chang F, Al Amin GM, Bhuiyan MR, Hina A, Lv W, Chunting Z, Begum N, Zhao T. Genome-wide association study uncovers major genetic loci associated with seed flooding tolerance in soybean. BMC PLANT BIOLOGY 2021; 21:497. [PMID: 34715792 PMCID: PMC8555181 DOI: 10.1186/s12870-021-03268-z] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/15/2021] [Accepted: 09/29/2021] [Indexed: 06/01/2023]
Abstract
BACKGROUND Seed flooding stress is one of the threatening environmental stressors that adversely limits soybean at the germination stage across the globe. The knowledge on the genetic basis underlying seed-flooding tolerance is limited. Therefore, we performed a genome-wide association study (GWAS) using 34,718 single nucleotide polymorphism (SNPs) in a panel of 243 worldwide soybean collections to identify genetic loci linked to soybean seed flooding tolerance at the germination stage. RESULTS In the present study, GWAS was performed with two contrasting models, Mixed Linear Model (MLM) and Multi-Locus Random-SNP-Effect Mixed Linear Model (mrMLM) to identify significant SNPs associated with electrical conductivity (EC), germination rate (GR), shoot length (ShL), and root length (RL) traits at germination stage in soybean. With MLM, a total of 20, 40, 4, and 9 SNPs associated with EC, GR, ShL and RL, respectively, whereas in the same order mrMLM detected 27, 17, 13, and 18 SNPs. Among these SNPs, two major SNPs, Gm_08_11971416, and Gm_08_46239716 were found to be consistently connected with seed-flooding tolerance related traits, namely EC and GR across two environments. We also detected two SNPs, Gm_05_1000479 and Gm_01_53535790 linked to ShL and RL, respectively. Based on Gene Ontology enrichment analysis, gene functional annotations, and protein-protein interaction network analysis, we predicted eight candidate genes and three hub genes within the regions of the four SNPs with Cis-elements in promoter regions which may be involved in seed-flooding tolerance in soybeans and these warrant further screening and functional validation. CONCLUSIONS Our findings demonstrate that GWAS based on high-density SNP markers is an efficient approach to dissect the genetic basis of complex traits and identify candidate genes in soybean. The trait associated SNPs could be used for genetic improvement in soybean breeding programs. The candidate genes could help researchers better understand the molecular mechanisms underlying seed-flooding stress tolerance in soybean.
Collapse
Affiliation(s)
- Ripa Akter Sharmin
- National Center for Soybean Improvement, Key Laboratory of Biology and Genetics and Breeding for Soybean, Ministry of Agriculture, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095, China
- Jagannath University, Dhaka, 1100, Bangladesh
| | - Benjamin Karikari
- Department of Crop Science, Faculty of Agriculture, Food and Consumer Sciences, University for Development Studies, Tamale, Ghana
| | - Fangguo Chang
- National Center for Soybean Improvement, Key Laboratory of Biology and Genetics and Breeding for Soybean, Ministry of Agriculture, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095, China
| | - G M Al Amin
- National Center for Soybean Improvement, Key Laboratory of Biology and Genetics and Breeding for Soybean, Ministry of Agriculture, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095, China
| | - Mashiur Rahman Bhuiyan
- National Center for Soybean Improvement, Key Laboratory of Biology and Genetics and Breeding for Soybean, Ministry of Agriculture, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095, China
| | - Aiman Hina
- National Center for Soybean Improvement, Key Laboratory of Biology and Genetics and Breeding for Soybean, Ministry of Agriculture, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095, China
| | - Wenhuan Lv
- National Center for Soybean Improvement, Key Laboratory of Biology and Genetics and Breeding for Soybean, Ministry of Agriculture, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095, China
| | - Zhang Chunting
- National Center for Soybean Improvement, Key Laboratory of Biology and Genetics and Breeding for Soybean, Ministry of Agriculture, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095, China
| | - Naheeda Begum
- National Center for Soybean Improvement, Key Laboratory of Biology and Genetics and Breeding for Soybean, Ministry of Agriculture, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095, China
| | - Tuanjie Zhao
- National Center for Soybean Improvement, Key Laboratory of Biology and Genetics and Breeding for Soybean, Ministry of Agriculture, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095, China.
| |
Collapse
|
22
|
Huang S, Pang L, Wei C. Identification of a Four-Gene Signature With Prognostic Significance in Endometrial Cancer Using Weighted-Gene Correlation Network Analysis. Front Genet 2021; 12:678780. [PMID: 34616422 PMCID: PMC8488359 DOI: 10.3389/fgene.2021.678780] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2021] [Accepted: 08/30/2021] [Indexed: 01/01/2023] Open
Abstract
Endometrial hyperplasia (EH) is a precursor for endometrial cancer (EC). However, biomarkers for the progression from EH to EC and standard prognostic biomarkers for EC have not been identified. In this study, we aimed to identify key genes with prognostic significance for the progression from EH to EC. Weighted-gene correlation network analysis (WGCNA) was used to identify hub genes utilizing microarray data (GSE106191) downloaded from the Gene Expression Omnibus database. Differentially expressed genes (DEGs) were identified from the Uterine Corpus Endometrial Carcinoma (UCEC) dataset of The Cancer Genome Atlas database. The Limma-Voom R package was applied to detect differentially expressed genes (DEGs; mRNAs) between cancer and normal samples. Genes with |log2 (fold change [FC])| > 1.0 and p < 0.05 were considered as DEGs. Univariate and multivariate Cox regression and survival analyses were performed to identify potential prognostic genes using hub genes overlapping in the two datasets. All analyses were conducted using R Bioconductor and related packages. Through WGCNA and overlapping genes in hub modules with DEGs in the UCEC dataset, we identified 42 hub genes. The results of the univariate and multivariate Cox regression analyses revealed that four hub genes, BUB1B, NDC80, TPX2, and TTK, were independently associated with the prognosis of EC (Hazard ratio [95% confidence interval]: 0.591 [0.382–0.912], p = 0.017; 0.605 [0.371–0.986], p = 0.044; 1.678 [1.132–2.488], p = 0.01; 2.428 [1.372–4.29], p = 0.02, respectively). A nomogram was established with a risk score calculated using the four genes’ coefficients in the multivariate analysis, and tumor grade and stage had a favorable predictive value for the prognosis of EC. The survival analysis showed that the high-risk group had an unfavorable prognosis compared with the low-risk group (p < 0.0001). The receiver operating characteristic curves also indicated that the risk model had a potential predictive value of prognosis with area under the curve 0.807 at 2 years, 0.783 at 3 years, and 0.786 at 5 years. We established a four-gene signature with prognostic significance in EC using WGCNA and established a nomogram to predict the prognosis of EC.
Collapse
Affiliation(s)
- Shijin Huang
- Department of Obstetrics and Gynecology, The First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Lihong Pang
- Department of Obstetrics and Gynecology, The First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Changqiang Wei
- Department of Obstetrics and Gynecology, The First Affiliated Hospital of Guangxi Medical University, Nanning, China
| |
Collapse
|
23
|
Kumar G, Kumar R, Pal MK, Pramanik N, Lahiri T, Gupta A, Pandey S. APT: An Automated Probe Tracker From Gene Expression Data. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:1864-1874. [PMID: 31825870 DOI: 10.1109/tcbb.2019.2958345] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Out of currently available semi-automatic tools for detecting diagnostic probes relevant to a pathophysiological condition, ArrayMining and GEO2R of NCBI are most popular. The shortcomings of ArrayMining and GEO2R are that both tools list the probes ordering them on the basis of their individual statistical level of significances with only difference of statistical methods used by them. While the latest tool GEO2R outputs either top 250 or all genes following its own ranking mechanism, ArrayMining requires number of probes to be inputted by the user. This study provided a way for automatic selection of probe-set that can be obtained from the voting of outputs resulted from statistical methods, t-Test, Mann-Whitney Test and Empirical Bayes Moderated t-test. It was also intriguing to find that the parameters of these statistical methods can be represented as a mathematical function of group fisher's discriminant ratio of a disease-control expression data-pair. Result of this fully automatic method, APT shows 88.97 percent success in comparison to 80.40 and 87.60 percent successes of ArrayMining and GEO2R respectively to include reported probes. Furthermore, out of 10 fold cross validation and 5 new test cases, APT shows a better performance than both ArrayMining and GEO2R in regards to sensitivity and specificity.
Collapse
|
24
|
Mahapatra S, Bhuyan R, Das J, Swarnkar T. Integrated multiplex network based approach for hub gene identification in oral cancer. Heliyon 2021; 7:e07418. [PMID: 34258466 PMCID: PMC8258848 DOI: 10.1016/j.heliyon.2021.e07418] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2020] [Revised: 01/27/2021] [Accepted: 06/23/2021] [Indexed: 02/01/2023] Open
Abstract
Background: The incidence of Oral Cancer (OC) is high in Asian countries, which goes undetected at its early stage. The study of genetics, especially genetic networks holds great promise in this endeavor. Hub genes in a genetic network are prominent in regulating the whole network structure of genes. Thus identification of such genes related to specific cancer types can help in reducing the gap in OC prognosis. Methods: Traditional study of network biology is unable to decipher the inter-dependencies within and across diverse biological networks. Multiplex network provides a powerful representation of such systems and encodes much richer information than isolated networks. In this work, we focused on the entire multiplex structure of the genetic network integrating the gene expression profile and DNA methylation profile for OC. Further, hub genes were identified by considering their connectivity in the multiplex structure and the respective protein-protein interaction (PPI) network as well. Results: 46 hub genes were inferred in our approach with a high prediction accuracy (96%), outstanding Matthews coefficient correlation value (93%) and significant biological implications. Among them, genes PIK3CG, PIK3R5, MYH7, CDC20 and CCL4 were differentially expressed and predominantly enriched in molecular cascades specific to OC. Conclusions: The identified hub genes in this work carry ontological signatures specific to cancer, which may further facilitate improved understanding of the tumorigenesis process and the underlying molecular events. Result indicates the effectiveness of our integrated multiplex network approach for hub gene identification. This work puts an innovative research route for multi-omics biological data analysis.
Collapse
Affiliation(s)
- S. Mahapatra
- Department of Computer Application, Siksha O Anusandhan Deemed to be University, Bhubaneswar, India
| | - R. Bhuyan
- Department of Oral Pathology & Microbiology, Siksha O Anusandhan Deemed to be University, Bhubaneswar, India
| | - J. Das
- Centre for Genomics & Biomedical Informatics, Siksha O Anusandhan Deemed to be University, Bhubaneswar, India
| | - T. Swarnkar
- Department of Computer Application, Siksha O Anusandhan Deemed to be University, Bhubaneswar, India
| |
Collapse
|
25
|
Wang Y, Wang X, Huang X, Zhang J, Hu J, Qi Y, Xiang B, Wang Q. Integrated Genomic and Transcriptomic Analysis reveals key genes for predicting dual-phenotype Hepatocellular Carcinoma Prognosis. J Cancer 2021; 12:2993-3010. [PMID: 33854600 PMCID: PMC8040886 DOI: 10.7150/jca.56005] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2020] [Accepted: 02/20/2021] [Indexed: 12/24/2022] Open
Abstract
Dual-phenotype hepatocellular carcinoma (DPHCC) expresses both hepatocyte and cholangiocyte markers, and is characterized by high recurrence and low survival rates. The underlying molecular mechanisms of DPHCC pathogenesis are unclear. We performed whole exome sequencing and RNA sequencing of three subtypes of HCC (10 DPHCC, 10 CK19-positive HCC, and 14 CK19-negative HCC), followed by integrated bioinformatics analysis, including somatic mutation analysis, mutation signal analysis, differential gene expression analysis, and pathway enrichment analysis. Cox proportional hazard regression analyses were applied for exploring survival related characteristics. We found that mutated genes in DPHCC patients were associated with carcinogenesis and immunity, and the up-regulated genes were mainly enriched in transcription-related and cancer-related pathways, and the down-regulated genes were mainly enriched in immune-related pathways. CXCL9 was selected as the hub gene, which is associated with immune cells and survival prognosis. Our results showed that low CXCL9 expression was significantly associated with poor prognosis, and its expression was significantly reduced in DPHCC samples. In conclusion, we explored the molecular mechanisms governing DPHCC development and progression and identified CXCL9, which influences the immune microenvironment and prognosis of DPHCC and might be new clinically significant biomarkers for predicting prognosis.
Collapse
Affiliation(s)
- Yaobang Wang
- Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, Guangxi Zhuang Autonomous Region, China.,Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, Guangxi Zhuang Autonomous Region, China.,Department of Clinical Laboratory. First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi Zhuang Autonomous Region, China
| | - Xi Wang
- Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, Guangxi Zhuang Autonomous Region, China.,Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, Guangxi Zhuang Autonomous Region, China
| | - Xiaoliang Huang
- Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, Guangxi Zhuang Autonomous Region, China.,Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, Guangxi Zhuang Autonomous Region, China
| | - Jie Zhang
- Department of Hepatobiliary Surgery, Guangxi Medical University Cancer Hospital, Guangxi Zhuang Autonomous Region, China
| | - Junwen Hu
- Department of Hepatobiliary Surgery, Guangxi Medical University Cancer Hospital, Guangxi Zhuang Autonomous Region, China
| | - Yapeng Qi
- Department of Hepatobiliary Surgery, Guangxi Medical University Cancer Hospital, Guangxi Zhuang Autonomous Region, China
| | - Bangde Xiang
- Department of Hepatobiliary Surgery, Guangxi Medical University Cancer Hospital, Guangxi Zhuang Autonomous Region, China
| | - Qiuyan Wang
- Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, Guangxi Zhuang Autonomous Region, China.,Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, Guangxi Zhuang Autonomous Region, China
| |
Collapse
|
26
|
Modern Approaches for Transcriptome Analyses in Plants. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2021; 1346:11-50. [DOI: 10.1007/978-3-030-80352-0_2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
27
|
Sadeghi M, Barzegar A. Precision medicine insight into primary prostate tumor through transcriptomic data and an integrated systems biology approach. Meta Gene 2020. [DOI: 10.1016/j.mgene.2020.100787] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022] Open
|
28
|
Das S, Rai SN. Statistical Approach for Biologically Relevant Gene Selection from High-Throughput Gene Expression Data. ENTROPY (BASEL, SWITZERLAND) 2020; 22:E1205. [PMID: 33286973 PMCID: PMC7712650 DOI: 10.3390/e22111205] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/08/2020] [Revised: 10/19/2020] [Accepted: 10/21/2020] [Indexed: 12/16/2022]
Abstract
Selection of biologically relevant genes from high-dimensional expression data is a key research problem in gene expression genomics. Most of the available gene selection methods are either based on relevancy or redundancy measure, which are usually adjudged through post selection classification accuracy. Through these methods the ranking of genes was conducted on a single high-dimensional expression data, which led to the selection of spuriously associated and redundant genes. Hence, we developed a statistical approach through combining a support vector machine with Maximum Relevance and Minimum Redundancy under a sound statistical setup for the selection of biologically relevant genes. Here, the genes were selected through statistical significance values and computed using a nonparametric test statistic under a bootstrap-based subject sampling model. Further, a systematic and rigorous evaluation of the proposed approach with nine existing competitive methods was carried on six different real crop gene expression datasets. This performance analysis was carried out under three comparison settings, i.e., subject classification, biological relevant criteria based on quantitative trait loci and gene ontology. Our analytical results showed that the proposed approach selects genes which are more biologically relevant as compared to the existing methods. Moreover, the proposed approach was also found to be better with respect to the competitive existing methods. The proposed statistical approach provides a framework for combining filter and wrapper methods of gene selection.
Collapse
Affiliation(s)
- Samarendra Das
- Division of Statistical Genetics, Indian Council of Agricultural Research (ICAR)-Indian Agricultural Statistics Research Institute, PUSA, New Delhi 110012, India;
- Netaji Subhas-Indian Council of Agricultural Research (ICAR) International Fellow, Indian Council of Agricultural Research, Krishi Bhawan, New Delhi 110001, India
- Biostatistics and Bioinformatics Facility, JG Brown Cancer Center, University of Louisville, Louisville, KY 40292, USA
- School of Interdisciplinary and Graduate Studies, University of Louisville, Louisville, KY 40292, USA
| | - Shesh N. Rai
- Biostatistics and Bioinformatics Facility, JG Brown Cancer Center, University of Louisville, Louisville, KY 40292, USA
- School of Interdisciplinary and Graduate Studies, University of Louisville, Louisville, KY 40292, USA
- Alcohol Research Center, University of Louisville, Louisville, KY 40292, USA
- Department of Hepatobiology and Toxicology, University of Louisville, Louisville, KY 40292, USA
- Department of Bioinformatics and Biostatistics, University of Louisville, Louisville, KY 40292, USA
- Wendell Cherry Chair in Clinical Trial Research, University of Louisville, Louisville, KY 40292, USA
| |
Collapse
|
29
|
Genome-Wide Identification and Coexpression Network Analysis of DNA Methylation Pathway Genes and Their Differentiated Functions in Ginkgo biloba L. FORESTS 2020. [DOI: 10.3390/f11101076] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
DNA methylation plays a vital role in diverse biological processes. DNA methyltransferases (DNMTs) genes and RNA-directed DNA methylation (RdDM)-related genes are key genes responsible for establishing and maintaining genome DNA methylation in plants. In the present study, we systematically identified nine GbDNMTs in Ginkgo biloba, including the three common families of GbMET1a/1b, GbCMT2, and GbDRMa/b/2a/2b/2c, and a fourth family—GbDNMT3—which is absent in most angiosperms. We also identified twenty RdDM-related genes, including four GbDCLs, six GbAGOs, and ten GbRDRs. Expression analysis of the genes showed the different patterns of individual genes, and 15 of 29 genes displayed expression change under five types of abiotic stress. Gene coexpression analysis and weighted gene co-expression network analysis (WGCNA) using 126 public transcriptomic datasets revealed that these genes were clustered into two groups. In group I, genes covered members from all six families which were preferentially expressed in the ovulate strobile and fruit. A gene ontology (GO) enrichment analysis of WGCNA modules indicated that group I genes were most correlated with the biological process of cell proliferation. Group II only consisted of RdDM-related genes, including GbDRMs, GbAGOs, and GbRDRs, but no GbDCLs, and these genes were specifically expressed in the cambium, suggesting that they may function in a dicer-like (DCL)-independent RdDM pathway in specific tissues. The gene module related to group II was most enriched in signal transduction, cell communication, and the response to the stimulus. These results demonstrate that gene family members could be conserved or diverged across species, and multi-member families in the same pathway may cluster into different modules to function differentially. The study provides insight into the DNA methylation genes and their possible functions in G. biloba, laying a foundation for the further study of DNA methylation in gymnosperms.
Collapse
|
30
|
Zhao B, Qu X, Lv X, Wang Q, Bian D, Yang F, Zhao X, Ji Z, Ni J, Fu Y, Xin G, Yu H. Construction and Characterization of a Synergistic lncRNA-miRNA Network Reveals a Crucial and Prognostic Role of lncRNAs in Colon Cancer. Front Genet 2020; 11:572983. [PMID: 33101392 PMCID: PMC7522580 DOI: 10.3389/fgene.2020.572983] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2020] [Accepted: 08/24/2020] [Indexed: 12/18/2022] Open
Abstract
Non-coding RNAs such as long non-coding RNAs (lncRNAs) and microRNAs (miRNAs) have been found to be indispensable factors in carcinogenesis and cancer development. Numerous studies have explored the regulatory functions of these molecules and identified the synergistic interactions among lncRNAs or miRNAs, while those between lncRNAs and miRNAs remain to be investigated. In this study, we constructed and characterized an lncRNA–miRNA synergistic network following a four-step approach by integrating the regulatory pairs and expression profiles. The synergistic interactions with more shared regulatory mRNAs were found to have higher interactional intensity. Through the analysis of nodes in the network, we found that lncRNAs played roles that are more central and had similar synergistic interactions with their neighbors when compared with miRNAs. In addition, known colon adenocarcinoma (COAD)-related RNAs were found to be enriched in this synergistic network, with higher degrees, betweenness, and closeness. Finally, we proposed a risk score model to predict the clinical outcome for COAD patients based on two prognostic hub lncRNAs, MEG3 and ZEB1-AS1. Moreover, the hierarchical networks of these two lncRNAs could contribute to the understanding of the biological mechanism of tumorigenesis. For each lncRNA–miRNA interaction in the hub-related subnetwork and two hierarchical networks, we performed RNAup method to evaluate their binding energy. Our results identified two important lncRNAs with prognostic roles in colon cancer and dissected their regulatory mechanism involving synergistic interaction with miRNAs.
Collapse
Affiliation(s)
- Bin Zhao
- Department of Proctology, First Affiliated Hospital of Jiamusi University, Jiamusi, China
| | - Xiusheng Qu
- Department of Chemoradiotherapy, First Affiliated Hospital of Jiamusi University, Jiamusi, China
| | - Xin Lv
- Department of General Surgery, Samii Medical Center, Shenzhen, China
| | - Qingdong Wang
- Department of Anesthesiology, First Affiliated Hospital of Jiamusi University, Jiamusi, China
| | - Deqiang Bian
- Scientific Research Departments, First Affiliated Hospital of Jiamusi University, Jiamusi, China
| | - Fan Yang
- Department of Proctology, First Affiliated Hospital of Jiamusi University, Jiamusi, China
| | - Xingwang Zhao
- Department of Proctology, First Affiliated Hospital of Jiamusi University, Jiamusi, China
| | - Zhiwu Ji
- Department of Proctology, First Affiliated Hospital of Jiamusi University, Jiamusi, China
| | - Jian Ni
- Department of Proctology, First Affiliated Hospital of Jiamusi University, Jiamusi, China
| | - Yan Fu
- Department of Proctology, First Affiliated Hospital of Jiamusi University, Jiamusi, China
| | - Guorong Xin
- Department of Proctology, First Affiliated Hospital of Jiamusi University, Jiamusi, China
| | - Haitao Yu
- Department of Proctology, First Affiliated Hospital of Jiamusi University, Jiamusi, China
| |
Collapse
|
31
|
Karikari B, Wang Z, Zhou Y, Yan W, Feng J, Zhao T. Identification of quantitative trait nucleotides and candidate genes for soybean seed weight by multiple models of genome-wide association study. BMC PLANT BIOLOGY 2020; 20:404. [PMID: 32873245 PMCID: PMC7466808 DOI: 10.1186/s12870-020-02604-z] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/07/2020] [Accepted: 08/16/2020] [Indexed: 05/15/2023]
Abstract
BACKGROUND Seed weight is a complex yield-related trait with a lot of quantitative trait loci (QTL) reported through linkage mapping studies. Integration of QTL from linkage mapping into breeding program is challenging due to numerous limitations, therefore, Genome-wide association study (GWAS) provides more precise location of QTL due to higher resolution and diverse genetic diversity in un-related individuals. RESULTS The present study utilized 573 breeding lines population with 61,166 single nucleotide polymorphisms (SNPs) to identify quantitative trait nucleotides (QTNs) and candidate genes for seed weight in Chinese summer-sowing soybean. GWAS was conducted with two single-locus models (SLMs) and six multi-locus models (MLMs). Thirty-nine SNPs were detected by the two SLMs while 209 SNPs were detected by the six MLMs. In all, two hundred and thirty-one QTNs were found to be associated with seed weight in YHSBLP with various effects. Out of these, seventy SNPs were concurrently detected by both SLMs and MLMs on 8 chromosomes. Ninety-four QTNs co-localized with previously reported QTL/QTN by linkage/association mapping studies. A total of 36 candidate genes were predicted. Out of these candidate genes, four hub genes (Glyma06g44510, Glyma08g06420, Glyma12g33280 and Glyma19g28070) were identified by the integration of co-expression network. Among them, three were relatively expressed higher in the high HSW genotypes at R5 stage compared with low HSW genotypes except Glyma12g33280. Our results show that using more models especially MLMs are effective to find important QTNs, and the identified HSW QTNs/genes could be utilized in molecular breeding work for soybean seed weight and yield. CONCLUSION Application of two single-locus plus six multi-locus models of GWAS identified 231 QTNs. Four hub genes (Glyma06g44510, Glyma08g06420, Glyma12g33280 & Glyma19g28070) detected via integration of co-expression network among the predicted candidate genes.
Collapse
Affiliation(s)
- Benjamin Karikari
- National Center for Soybean Improvement, Key Laboratory of Biology and Genetic Improvement of Soybean (Ministry of Agriculture), State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095, People's Republic of China
| | - Zili Wang
- National Center for Soybean Improvement, Key Laboratory of Biology and Genetic Improvement of Soybean (Ministry of Agriculture), State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095, People's Republic of China
| | - Yilan Zhou
- National Center for Soybean Improvement, Key Laboratory of Biology and Genetic Improvement of Soybean (Ministry of Agriculture), State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095, People's Republic of China
| | - Wenliang Yan
- National Center for Soybean Improvement, Key Laboratory of Biology and Genetic Improvement of Soybean (Ministry of Agriculture), State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095, People's Republic of China
| | - Jianying Feng
- National Center for Soybean Improvement, Key Laboratory of Biology and Genetic Improvement of Soybean (Ministry of Agriculture), State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095, People's Republic of China.
| | - Tuanjie Zhao
- National Center for Soybean Improvement, Key Laboratory of Biology and Genetic Improvement of Soybean (Ministry of Agriculture), State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095, People's Republic of China.
| |
Collapse
|
32
|
Chowdhury HA, Bhattacharyya DK, Kalita JK. (Differential) Co-Expression Analysis of Gene Expression: A Survey of Best Practices. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2020; 17:1154-1173. [PMID: 30668502 DOI: 10.1109/tcbb.2019.2893170] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Analysis of gene expression data is widely used in transcriptomic studies to understand functions of molecules inside a cell and interactions among molecules. Differential co-expression analysis studies diseases and phenotypic variations by finding modules of genes whose co-expression patterns vary across conditions. We review the best practices in gene expression data analysis in terms of analysis of (differential) co-expression, co-expression network, differential networking, and differential connectivity considering both microarray and RNA-seq data along with comparisons. We highlight hurdles in RNA-seq data analysis using methods developed for microarrays. We include discussion of necessary tools for gene expression analysis throughout the paper. In addition, we shed light on scRNA-seq data analysis by including preprocessing and scRNA-seq in co-expression analysis along with useful tools specific to scRNA-seq. To get insights, biological interpretation and functional profiling is included. Finally, we provide guidelines for the analyst, along with research issues and challenges which should be addressed.
Collapse
|
33
|
Fifteen Years of Gene Set Analysis for High-Throughput Genomic Data: A Review of Statistical Approaches and Future Challenges. ENTROPY 2020; 22:e22040427. [PMID: 33286201 PMCID: PMC7516904 DOI: 10.3390/e22040427] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/24/2020] [Revised: 03/18/2020] [Accepted: 04/03/2020] [Indexed: 12/22/2022]
Abstract
Over the last decade, gene set analysis has become the first choice for gaining insights into underlying complex biology of diseases through gene expression and gene association studies. It also reduces the complexity of statistical analysis and enhances the explanatory power of the obtained results. Although gene set analysis approaches are extensively used in gene expression and genome wide association data analysis, the statistical structure and steps common to these approaches have not yet been comprehensively discussed, which limits their utility. In this article, we provide a comprehensive overview, statistical structure and steps of gene set analysis approaches used for microarrays, RNA-sequencing and genome wide association data analysis. Further, we also classify the gene set analysis approaches and tools by the type of genomic study, null hypothesis, sampling model and nature of the test statistic, etc. Rather than reviewing the gene set analysis approaches individually, we provide the generation-wise evolution of such approaches for microarrays, RNA-sequencing and genome wide association studies and discuss their relative merits and limitations. Here, we identify the key biological and statistical challenges in current gene set analysis, which will be addressed by statisticians and biologists collectively in order to develop the next generation of gene set analysis approaches. Further, this study will serve as a catalog and provide guidelines to genome researchers and experimental biologists for choosing the proper gene set analysis approach based on several factors.
Collapse
|
34
|
Comprehensive expression-based isoform biomarkers predictive of drug responses based on isoform co-expression networks and clinical data. Genomics 2020; 112:647-658. [DOI: 10.1016/j.ygeno.2019.04.017] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2018] [Revised: 03/21/2019] [Accepted: 04/23/2019] [Indexed: 11/19/2022]
|
35
|
Alanni R, Hou J, Azzawi H, Xiang Y. Deep gene selection method to select genes from microarray datasets for cancer classification. BMC Bioinformatics 2019; 20:608. [PMID: 31775613 PMCID: PMC6880643 DOI: 10.1186/s12859-019-3161-2] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2019] [Accepted: 10/15/2019] [Indexed: 12/15/2022] Open
Abstract
Background Microarray datasets consist of complex and high-dimensional samples and genes, and generally the number of samples is much smaller than the number of genes. Due to this data imbalance, gene selection is a demanding task for microarray expression data analysis. Results The gene set selected by DGS has shown its superior performances in cancer classification. DGS has a high capability of reducing the number of genes in the original microarray datasets. The experimental comparisons with other representative and state-of-the-art gene selection methods also showed that DGS achieved the best performance in terms of the number of selected genes, classification accuracy, and computational cost. Conclusions We provide an efficient gene selection algorithm can select relevant genes which are significantly sensitive to the samples’ classes. With the few discriminative genes and less cost time by the proposed algorithm achieved much high prediction accuracy on several public microarray data, which in turn verifies the efficiency and effectiveness of the proposed gene selection method.
Collapse
Affiliation(s)
- Russul Alanni
- School of Information Technology, Deakin University, Geelong, Victoria, Australia.
| | - Jingyu Hou
- School of Information Technology, Deakin University, Geelong, Victoria, Australia
| | - Hasseeb Azzawi
- School of Information Technology, Deakin University, Geelong, Victoria, Australia
| | - Yong Xiang
- School of Information Technology, Deakin University, Geelong, Victoria, Australia
| |
Collapse
|
36
|
Liang R, Xie J, Zhang C, Zhang M, Huang H, Huo H, Cao X, Niu B. Identifying Cancer Targets Based on Machine Learning Methods via Chou's 5-steps Rule and General Pseudo Components. Curr Top Med Chem 2019; 19:2301-2317. [PMID: 31622219 DOI: 10.2174/1568026619666191016155543] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2019] [Revised: 07/19/2019] [Accepted: 08/26/2019] [Indexed: 01/09/2023]
Abstract
In recent years, the successful implementation of human genome project has made people realize that genetic, environmental and lifestyle factors should be combined together to study cancer due to the complexity and various forms of the disease. The increasing availability and growth rate of 'big data' derived from various omics, opens a new window for study and therapy of cancer. In this paper, we will introduce the application of machine learning methods in handling cancer big data including the use of artificial neural networks, support vector machines, ensemble learning and naïve Bayes classifiers.
Collapse
Affiliation(s)
- Ruirui Liang
- School of Life Sciences, Shanghai University, Shanghai, 200444, China
| | - Jiayang Xie
- School of Life Sciences, Shanghai University, Shanghai, 200444, China
| | - Chi Zhang
- Foshan Huaxia Eye Hospital, Huaxia Eye Hospital Group, Foshan 528000, China
| | - Mengying Zhang
- School of Life Sciences, Shanghai University, Shanghai, 200444, China
| | - Hai Huang
- School of Life Sciences, Shanghai University, Shanghai, 200444, China
| | - Haizhong Huo
- Department of General Surgery, Shanghai Ninth People's Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai 200011, China
| | - Xin Cao
- Zhongshan Hospital, Institute of Clinical Science, Shanghai Medical College, Fudan University, Shanghai 200032, China
| | - Bing Niu
- School of Life Sciences, Shanghai University, Shanghai, 200444, China
| |
Collapse
|
37
|
Rosenthal SB, Bush KT, Nigam SK. A Network of SLC and ABC Transporter and DME Genes Involved in Remote Sensing and Signaling in the Gut-Liver-Kidney Axis. Sci Rep 2019; 9:11879. [PMID: 31417100 PMCID: PMC6695406 DOI: 10.1038/s41598-019-47798-x] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2019] [Accepted: 07/23/2019] [Indexed: 02/07/2023] Open
Abstract
Genes central to drug absorption, distribution, metabolism and elimination (ADME) also regulate numerous endogenous molecules. The Remote Sensing and Signaling Hypothesis argues that an ADME gene-centered network-including SLC and ABC "drug" transporters, "drug" metabolizing enzymes (DMEs), and regulatory genes-is essential for inter-organ communication via metabolites, signaling molecules, antioxidants, gut microbiome products, uremic solutes, and uremic toxins. By cross-tissue co-expression network analysis, the gut, liver, and kidney (GLK) formed highly connected tissue-specific clusters of SLC transporters, ABC transporters, and DMEs. SLC22, SLC25 and SLC35 families were network hubs, having more inter-organ and intra-organ connections than other families. Analysis of the GLK network revealed key physiological pathways (e.g., involving bile acids and uric acid). A search for additional genes interacting with the network identified HNF4α, HNF1α, and PXR. Knockout gene expression data confirmed ~60-70% of predictions of ADME gene regulation by these transcription factors. Using the GLK network and known ADME genes, we built a tentative gut-liver-kidney "remote sensing and signaling network" consisting of SLC and ABC transporters, as well as DMEs and regulatory proteins. Together with protein-protein interactions to prioritize likely functional connections, this network suggests how multi-specificity combines with oligo-specificity and mono-specificity to regulate homeostasis of numerous endogenous small molecules.
Collapse
Affiliation(s)
- Sara Brin Rosenthal
- Center for Computational Biology and Bioinformatics, University of California at San Diego, La Jolla, CA, 92093-0693, USA
| | - Kevin T Bush
- Departments of Pediatrics, University of California at San Diego, La Jolla, CA, 92093-0693, USA
| | - Sanjay K Nigam
- Departments of Pediatrics, University of California at San Diego, La Jolla, CA, 92093-0693, USA.
- Departments of Medicine, University of California at San Diego, La Jolla, CA, 92093-0693, USA.
| |
Collapse
|
38
|
Lin F, Zhou L, He B, Zhang X, Dai H, Qian Y, Ruan L, Zhao H. QTL mapping for maize starch content and candidate gene prediction combined with co-expression network analysis. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2019; 132:1931-1941. [PMID: 30887095 DOI: 10.1007/s00122-019-03326-z] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/26/2018] [Accepted: 03/11/2019] [Indexed: 05/28/2023]
Abstract
A major QTL Qsta9.1 was identified on chromosome 9, combined with GWAS, and co-expression network analysis showed that GRMZM2G110929 and GRMZM5G852704 are the potential candidates for association with maize kernel starch content. Increasing maize kernel starch content may not only lead to higher maize kernel yields and qualities, but also help meet industry demands. By using the intermated B73 × Mo17 population, QTLs were mapped for starch content in this study. A major QTL Qsta9.1 was detected in a 1.7 Mb interval on chromosome 9 and validated by allele frequency analysis in extreme tails of a newly constructed segregating population. According to genome-wide association study (GWAS) based on genotyping of a natural population, we identified a significant SNP for starch content within the ORF region of GRMZM5G852704_T01 colocalized with QTL Qsta9.1. Co-expression network analysis was also conducted, and 28 modules were constructed during six seed developmental stages. Functional enrichment was performed for each module, and one module showed the most possibility for the association with carbohydrate-related processes. In this module, one transcripts GRMZM2G110929_T01 located in the Qsta9.1 assigned 1.7 Mb interval encoding GLABRA2 expression modulator. Its expression level in B73 was lower than that in Mo17 across all seed developmental stages, implying the possibility for the candidate gene of Qsta9.1. Our studies combined GWAS, mRNA profiling, and traditional QTL analyses to identify a major locus for controlling seed starch content in maize.
Collapse
Affiliation(s)
- Feng Lin
- Provincial Key Laboratory of Agrobiology, Institute of Crop Germplasm and Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing, China
| | - Ling Zhou
- Provincial Key Laboratory of Agrobiology, Institute of Crop Germplasm and Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing, China
| | - Bing He
- Provincial Key Laboratory of Agrobiology, Institute of Crop Germplasm and Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing, China
| | - Xiaolin Zhang
- Provincial Key Laboratory of Agrobiology, Institute of Crop Germplasm and Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing, China
| | - Huixue Dai
- Nanjing Institute of Vegetable Sciences, Nanjing, China
| | - Yiliang Qian
- Anhui Academy of Agricultural Sciences, Hefei, China
| | - Long Ruan
- Anhui Academy of Agricultural Sciences, Hefei, China
| | - Han Zhao
- Provincial Key Laboratory of Agrobiology, Institute of Crop Germplasm and Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing, China.
| |
Collapse
|
39
|
Meng Y, Cai XH, Wang L. Potential Genes and Pathways of Neonatal Sepsis Based on Functional Gene Set Enrichment Analyses. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2018; 2018:6708520. [PMID: 30154914 PMCID: PMC6091373 DOI: 10.1155/2018/6708520] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/18/2018] [Revised: 06/04/2018] [Accepted: 06/27/2018] [Indexed: 12/16/2022]
Abstract
BACKGROUND Neonatal sepsis (NS) is considered as the most common cause of neonatal deaths that newborns suffer from. Although numerous studies focus on gene biomarkers of NS, the predictive value of the gene biomarkers is low. NS pathogenesis is still needed to be investigated. METHODS After data preprocessing, we used KEGG enrichment method to identify the differentially expressed pathways between NS and normal controls. Then, functional principal component analysis (FPCA) was adopted to calculate gene values in NS. In order to further study the key signaling pathway of the NS, elastic-net regression model, Mann-Whitney U test, and coexpression network were used to estimate the weights of signaling pathway and hub genes. RESULTS A total of 115 different pathways between NS and controls were first identified. FPCA made full use of time-series gene expression information and estimated F values of genes in the different pathways. The top 1000 genes were considered as the different genes and were further analyzed by elastic-net regression and MWU test. There were 7 key signaling pathways between the NS and controls, according to different sources. Among those genes involved in key pathways, 7 hub genes, PIK3CA, TGFBR2, CDKN1B, KRAS, E2F3, TRAF6, and CHUK, were determined based on the coexpression network. Most of them were cancer-related genes. PIK3CA was considered as the common marker, which is highly expressed in the lymphocyte group. Little was known about the correlation of PIK3CA with NS, which gives us a new enlightenment for NS study. CONCLUSION This research might provide the perspective information to explore the potential novel genes and pathways as NS therapy targets.
Collapse
Affiliation(s)
- YuXiu Meng
- Department of Neonatology, First People's Hospital of Jining, Jining, Shandong 272000, China
| | - Xue Hong Cai
- Department of Pediatrics, Traditional Chinese Medicine Hospital of Yanzhou, Jining, Shandong 272100, China
| | - LiPei Wang
- Department of Neonatology, First People's Hospital of Jining, Jining, Shandong 272000, China
| |
Collapse
|
40
|
Statistical approach for selection of biologically informative genes. Gene 2018; 655:71-83. [PMID: 29458166 DOI: 10.1016/j.gene.2018.02.044] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2017] [Revised: 11/26/2017] [Accepted: 02/14/2018] [Indexed: 11/23/2022]
Abstract
Selection of informative genes from high dimensional gene expression data has emerged as an important research area in genomics. Many gene selection techniques have been proposed so far are either based on relevancy or redundancy measure. Further, the performance of these techniques has been adjudged through post selection classification accuracy computed through a classifier using the selected genes. This performance metric may be statistically sound but may not be biologically relevant. A statistical approach, i.e. Boot-MRMR, was proposed based on a composite measure of maximum relevance and minimum redundancy, which is both statistically sound and biologically relevant for informative gene selection. For comparative evaluation of the proposed approach, we developed two biological sufficient criteria, i.e. Gene Set Enrichment with QTL (GSEQ) and biological similarity score based on Gene Ontology (GO). Further, a systematic and rigorous evaluation of the proposed technique with 12 existing gene selection techniques was carried out using five gene expression datasets. This evaluation was based on a broad spectrum of statistically sound (e.g. subject classification) and biological relevant (based on QTL and GO) criteria under a multiple criteria decision-making framework. The performance analysis showed that the proposed technique selects informative genes which are more biologically relevant. The proposed technique is also found to be quite competitive with the existing techniques with respect to subject classification and computational time. Our results also showed that under the multiple criteria decision-making setup, the proposed technique is best for informative gene selection over the available alternatives. Based on the proposed approach, an R Package, i.e. BootMRMR has been developed and available at https://cran.r-project.org/web/packages/BootMRMR. This study will provide a practical guide to select statistical techniques for selecting informative genes from high dimensional expression data for breeding and system biology studies.
Collapse
|
41
|
Statistical Approach for Gene Set Analysis with Trait Specific Quantitative Trait Loci. Sci Rep 2018; 8:2391. [PMID: 29402907 PMCID: PMC5799309 DOI: 10.1038/s41598-018-19736-w] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2017] [Accepted: 12/06/2017] [Indexed: 11/20/2022] Open
Abstract
The analysis of gene sets is usually carried out based on gene ontology terms and known biological pathways. These approaches may not establish any formal relation between genotype and trait specific phenotype. In plant biology and breeding, analysis of gene sets with trait specific Quantitative Trait Loci (QTL) data are considered as great source for biological knowledge discovery. Therefore, we proposed an innovative statistical approach called Gene Set Analysis with QTLs (GSAQ) for interpreting gene expression data in context of gene sets with traits. The utility of GSAQ was studied on five different complex abiotic and biotic stress scenarios in rice, which yields specific trait/stress enriched gene sets. Further, the GSAQ approach was more innovative and effective in performing gene set analysis with underlying QTLs and identifying QTL candidate genes than the existing approach. The GSAQ approach also provided two potential biological relevant criteria for performance analysis of gene selection methods. Based on this proposed approach, an R package, i.e., GSAQ (https://cran.r-project.org/web/packages/GSAQ) has been developed. The GSAQ approach provides a valuable platform for integrating the gene expression data with genetically rich QTL data.
Collapse
|