1
|
Sahoo S, Rakshit R. The pattern of coding sequences in the chloroplast genome of Atropa belladonna and a comparative analysis with other related genomes in the nightshade family. Genomics Inform 2022; 20:e43. [PMID: 36617650 PMCID: PMC9847383 DOI: 10.5808/gi.22045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Accepted: 12/12/2022] [Indexed: 12/31/2022] Open
Abstract
Atropa belladonna is a valuable medicinal plant and a commercial source of tropane alkaloids, which are frequently utilized in therapeutic practice. In this study, bioinformaticmethodologies were used to examine the pattern of coding sequences and the factors thatmight influence codon usage bias in the chloroplast genome of Atropa belladonna andother nightshade genomes. The chloroplast engineering being a promising field in modernbiotechnology, the characterization of chloroplast genome is very important. The resultsrevealed that the chloroplast genomes of Nicotiana tabacum, Solanum lycopersicum, Capsicum frutescens, Datura stramonium, Lyciumbarbarum, Solanum melongena, and Solanumtuberosum exhibited comparable codon usage patterns. In these chloroplast genomes, weobserved a weak codon usage bias. According to the correspondence analysis, the genesisof the codon use bias in these chloroplast genes might be explained by natural selection,directed mutational pressure, and other factors. GC12 and GC3S were shown to have nomeaningful relationship. Further research revealed that natural selection primarily shapedthe codon usage in A. belladonna and other nightshade genomes for translational efficiency. The sequencing properties of these chloroplast genomes were also investigated by investing the occurrences of palindromes and inverted repeats, which would be useful forfuture research on medicinal plants.
Collapse
Affiliation(s)
- Satyabrata Sahoo
- Department of Physics, Dhruba Chand Halder College, Dakshin Barasat 743372, India,*Corresponding author E-mail:
| | - Ria Rakshit
- Department of Botany, Baruipur College, Baruipur 743610, India
| |
Collapse
|
2
|
Sahoo S, Das SS, Rakshit R. Codon usage pattern and predicted gene expression in Arabidopsis thaliana. Gene 2019; 721S:100012. [PMID: 32550546 PMCID: PMC7286098 DOI: 10.1016/j.gene.2019.100012] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2018] [Revised: 01/30/2019] [Accepted: 02/21/2019] [Indexed: 01/20/2023]
Abstract
The extensive research for predicting highly expressed genes in plant genome sequences has been going on for decades. The codon usage pattern of genes in Arabidopsis thaliana genome is a classical topic for plant biologists for its significance in the understanding of molecular plant biology. Here we have used a gene expression profiling methodology based on the score of modified relative codon bias (MRCBS) to elucidate expression pattern of genes in Arabidopsis thaliana. MRCBS relies exclusively on sequence features for identifying the highly expressed genes. In this study, a critical analysis of predicted highly expressed (PHE) genes in Arabidopsis thaliana has been performed using MRCBS as a numerical estimator of gene expression level. Consistent with previous other results, our study indicates that codon composition plays an important role in the regulation of gene expression. We found a systematic strong correlation between MRCBS and CAI (codon adaptation index) or other expression-measures. Additionally, MRCBS correlates well with experimental gene expression data. Our study highlights the relationship between gene expression and compositional signature in relation to codon usage bias and sets the ground for the further investigation of the evolution of the protein-coding genes in the plant genome.
Collapse
Key Words
- Arabidopsis thaliana
- CAI
- CAI, Codon adaptation index
- CP, Chloroplast Pltd CP
- Codon usage bias
- GC content
- GEO, Gene Expression Omnibus
- Gene expression
- MADS, Minichromosome maintenance1, Agamous, Deficiens and Serum response factor
- MBP, Megabase pair
- MRCBS, Score of Modified relative codon bias
- MT, Mitochondrion
- PHE genes
- PHE, Predicted Highly Expressed
- RCA, Relative Codon Adaptation
- RCB, Relative codon bias
- RCBS, Relative Codon Bias Strength
- RMA, Relative Molecular Abundance
- RP, Ribosomal protein
- SAGE, Serial Analysis of Gene Expression
- TAIR, The Arabidopsis Information Resourses
Collapse
Affiliation(s)
- Satyabrata Sahoo
- Department of Physics, Dhruba Chand Halder College, Dakshin Barasat, South 24 Parganas, W.B., India
| | - Shib Sankar Das
- Department of Mathematics, Uluberia College, Uluberia, Howrah, W.B., India
| | - Ria Rakshit
- Department of Botany, Baruipur College, South 24 Parganas, W.B., India
| |
Collapse
|
3
|
Das S, Chottopadhyay B, Sahoo S. Comparative Analysis of Predicted Gene Expression among Crenarchaeal Genomes. Genomics Inform 2017; 15:38-47. [PMID: 28416948 PMCID: PMC5389947 DOI: 10.5808/gi.2017.15.1.38] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2016] [Revised: 11/28/2016] [Accepted: 01/26/2017] [Indexed: 12/13/2022] Open
Abstract
Research into new methods for identifying highly expressed genes in anonymous genome sequences has been going on for more than 15 years. We presented here an alternative approach based on modified score of relative codon usage bias to identify highly expressed genes in crenarchaeal genomes. The proposed algorithm relies exclusively on sequence features for identifying the highly expressed genes. In this study, a comparative analysis of predicted highly expressed genes in five crenarchaeal genomes was performed using the score of Modified Relative Codon Bias Strength (MRCBS) as a numerical estimator of gene expression level. We found a systematic strong correlation between Codon Adaptation Index and MRCBS. Additionally, MRCBS correlated well with other expression measures. Our study indicates that MRCBS can consistently capture the highly expressed genes.
Collapse
Affiliation(s)
- Shibsankar Das
- Department of Mathematics, Uluberia College, Uluberia 711315, India
| | | | | |
Collapse
|
4
|
Ling MHT, Poh CL. A predictor for predicting Escherichia coli transcriptome and the effects of gene perturbations. BMC Bioinformatics 2014; 15:140. [PMID: 24884349 PMCID: PMC4038595 DOI: 10.1186/1471-2105-15-140] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2013] [Accepted: 05/09/2014] [Indexed: 11/24/2022] Open
Abstract
Background A means to predict the effects of gene over-expression, knockouts, and environmental stimuli in silico is useful for system biologists to develop and test hypotheses. Several studies had predicted the expression of all Escherichia coli genes from sequences and reported a correlation of 0.301 between predicted and actual expression. However, these do not allow biologists to study the effects of gene perturbations on the native transcriptome. Results We developed a predictor to predict transcriptome-scale gene expression from a small number (n = 59) of known gene expressions using gene co-expression network, which can be used to predict the effects of over-expressions and knockdowns on E. coli transcriptome. In terms of transcriptome prediction, our results show that the correlation between predicted and actual expression value is 0.467, which is similar to the microarray intra-array variation (p-value = 0.348), suggesting that intra-array variation accounts for a substantial portion of the transcriptome prediction error. In terms of predicting the effects of gene perturbation(s), our results suggest that the expression of 83% of the genes affected by perturbation can be predicted within 40% of error and the correlation between predicted and actual expression values among the affected genes to be 0.698. With the ability to predict the effects of gene perturbations, we demonstrated that our predictor has the potential to estimate the effects of varying gene expression level on the native transcriptome. Conclusion We present a potential means to predict an entire transcriptome and a tool to estimate the effects of gene perturbations for E. coli, which will aid biologists in hypothesis development. This study forms the baseline for future work in using gene co-expression network for gene expression prediction.
Collapse
Affiliation(s)
- Maurice H T Ling
- School of Chemical and Biomedical Engineering, Nanyang Technological University, Nanyang Ave, Singapore, Singapore.
| | | |
Collapse
|
5
|
Wang J, Chen L, Huang S, Liu J, Ren X, Tian X, Qiao J, Zhang W. RNA-seq based identification and mutant validation of gene targets related to ethanol resistance in cyanobacterial Synechocystis sp. PCC 6803. BIOTECHNOLOGY FOR BIOFUELS 2012; 5:89. [PMID: 23259593 PMCID: PMC3564720 DOI: 10.1186/1754-6834-5-89] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/21/2012] [Accepted: 12/04/2012] [Indexed: 05/03/2023]
Abstract
BACKGROUND Fermentation production of biofuel ethanol consumes agricultural crops, which will compete directly with the food supply. As an alternative, photosynthetic cyanobacteria have been proposed as microbial factories to produce ethanol directly from solar energy and CO2. However, the ethanol productivity from photoautotrophic cyanobacteria is still very low, mostly due to the low tolerance of cyanobacterial systems to ethanol stress. RESULTS To build a foundation necessary to engineer robust ethanol-producing cyanobacterial hosts, in this study we applied a quantitative transcriptomics approach with a next-generation sequencing technology, combined with quantitative reverse-transcript PCR (RT-PCR) analysis, to reveal the global metabolic responses to ethanol in model cyanobacterial Synechocystis sp. PCC 6803. The results showed that ethanol exposure induced genes involved in common stress responses, transporting and cell envelope modification. In addition, the cells can also utilize enhanced polyhydroxyalkanoates (PHA) accumulation and glyoxalase detoxication pathway as means against ethanol stress. The up-regulation of photosynthesis by ethanol was also further confirmed at transcriptional level. Finally, we used gene knockout strains to validate the potential target genes related to ethanol tolerance. CONCLUSION RNA-Seq based global transcriptomic analysis provided a comprehensive view of cellular response to ethanol exposure. The analysis provided a list of gene targets for engineering ethanol tolerance in cyanobacterium Synechocystis.
Collapse
Affiliation(s)
- Jiangxin Wang
- School of Chemical Engineering & Technology, Tianjin University, Tianjin, 300072, People's Republic of China
- Key Laboratory of Systems Bioengineering, Ministry of Education, Tianjin, 300072, People's Republic of China
| | - Lei Chen
- School of Chemical Engineering & Technology, Tianjin University, Tianjin, 300072, People's Republic of China
- Key Laboratory of Systems Bioengineering, Ministry of Education, Tianjin, 300072, People's Republic of China
| | - Siqiang Huang
- School of Chemical Engineering & Technology, Tianjin University, Tianjin, 300072, People's Republic of China
- Key Laboratory of Systems Bioengineering, Ministry of Education, Tianjin, 300072, People's Republic of China
| | - Jie Liu
- School of Chemical Engineering & Technology, Tianjin University, Tianjin, 300072, People's Republic of China
- Key Laboratory of Systems Bioengineering, Ministry of Education, Tianjin, 300072, People's Republic of China
| | - Xiaoyue Ren
- School of Chemical Engineering & Technology, Tianjin University, Tianjin, 300072, People's Republic of China
- Key Laboratory of Systems Bioengineering, Ministry of Education, Tianjin, 300072, People's Republic of China
| | - Xiaoxu Tian
- School of Chemical Engineering & Technology, Tianjin University, Tianjin, 300072, People's Republic of China
- Key Laboratory of Systems Bioengineering, Ministry of Education, Tianjin, 300072, People's Republic of China
| | - Jianjun Qiao
- School of Chemical Engineering & Technology, Tianjin University, Tianjin, 300072, People's Republic of China
- Key Laboratory of Systems Bioengineering, Ministry of Education, Tianjin, 300072, People's Republic of China
| | - Weiwen Zhang
- School of Chemical Engineering & Technology, Tianjin University, Tianjin, 300072, People's Republic of China
- Key Laboratory of Systems Bioengineering, Ministry of Education, Tianjin, 300072, People's Republic of China
| |
Collapse
|
6
|
Guo FB, Ye YN, Zhao HL, Lin D, Wei W. Universal pattern and diverse strengths of successive synonymous codon bias in three domains of life, particularly among prokaryotic genomes. DNA Res 2012; 19:477-85. [PMID: 23132389 PMCID: PMC3514858 DOI: 10.1093/dnares/dss027] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open
Abstract
There has been significant progress in understanding the process of protein translation in recent years. One of the best examples is the discovery of usage bias in successive synonymous codons and its role in eukaryotic translation efficiency. We observed here a similar type of bias in the other two life domains, bacteria and archaea, although the bias strength was much smaller than in eukaryotes. Among 136 prokaryotic genomes, 98 were found to have significant bias from random use of successive synonymous codons with Z scores larger than three. Furthermore, significantly different bias strengths were found between prokaryotes grouped by various genomic or biochemical characteristics. Interestingly, the bias strength measured by a general Z score could be fitted well (R = 0.83, P < 10−15) by three genomic variables: genome size, G + C content, and tRNA gene number based on multiple linear regression. A different distribution of synonymous codon pairs between protein-coding genes and intergenic sequences suggests that bias is caused by translation selection. The present results indicate that protein translation is tuned by codon (pair) usage, and the intensity of the regulation is associated with genome size, tRNA gene number, and G + C content.
Collapse
Affiliation(s)
- Feng-Biao Guo
- Center of Bioinformatics and Key Laboratory for NeuroInformation of Ministry of Education, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 610054, China.
| | | | | | | | | |
Collapse
|
7
|
Pourmir A, Johannes TW. Directed evolution: selection of the host organism. Comput Struct Biotechnol J 2012; 2:e201209012. [PMID: 24688653 PMCID: PMC3962113 DOI: 10.5936/csbj.201209012] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2012] [Revised: 10/06/2012] [Accepted: 10/12/2012] [Indexed: 11/29/2022] Open
Abstract
Directed evolution has become a well-established tool for improving proteins and biological systems. A critical aspect of directed evolution is the selection of a suitable host organism for achieving functional expression of the target gene. To date, most directed evolution studies have used either Escherichia coli or Saccharomyces cerevisiae as a host; however, other bacterial and yeast species, as well as mammalian and insect cell lines, have also been successfully used. Recent advances in synthetic biology and genomics have opened the possibility of expanding the use of directed evolution to new host organisms such as microalgae. This review focuses on the different host organisms used in directed evolution and highlights some of the recent directed evolution strategies used in these organisms.
Collapse
Affiliation(s)
- Azadeh Pourmir
- Department of Chemical Engineering, The University of Tulsa, 800 S. Tucker Dr, Tulsa, OK 74104, United States
| | - Tyler W Johannes
- Department of Chemical Engineering, The University of Tulsa, 800 S. Tucker Dr, Tulsa, OK 74104, United States
| |
Collapse
|