1
Amatore Z, Gunn S, Harris LK. An Educational Bioinformatics Project to Improve Genome Annotation. Front Microbiol 2020; 11:577497. [PMID: 33365016 PMCID: PMC7750189 DOI: 10.3389/fmicb.2020.577497] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Received: 06/30/2020] [Accepted: 10/27/2020] [Indexed: 01/28/2023]
Abstract
Scientific advancement is hindered without proper genome annotation because biologists lack a complete understanding of cellular protein functions. In bacterial cells, hypothetical proteins (HPs) are open reading frames with unknown functions. HPs result from either an outdated database or insufficient experimental evidence (i.e., indeterminate annotation). While automated annotation reviews help keep genome annotation up to date, often manual reviews are needed to verify proper annotation. Students can provide the manual review necessary to improve genome annotation. This paper outlines an innovative classroom project that determines if HPs have outdated or indeterminate annotation. The Hypothetical Protein Characterization Project uses multiple well-documented, freely available, web-based, bioinformatics resources that analyze an amino acid sequence to (1) detect sequence similarities to other proteins, (2) identify domains, (3) predict tertiary structure including active site characterization and potential binding ligands, and (4) determine cellular location. Enough evidence can be generated from these analyses to support re-annotation of HPs or prioritize HPs for experimental examinations such as structural determination via X-ray crystallography. Additionally, this paper details several approaches for selecting HPs to characterize using the Hypothetical Protein Characterization Project. These approaches include student- and instructor-directed random selection, selection using differential gene expression from mRNA expression data, and selection based on phylogenetic relations. This paper also provides additional resources to support instructional use of the Hypothetical Protein Characterization Project, such as example assignment instructions with grading rubrics, links to training videos on YouTube, and several step-by-step example projects to demonstrate and interpret the range of achievable results that students might encounter.
Educational use of the Hypothetical Protein Characterization Project provides students with an opportunity to learn and apply knowledge of bioinformatic programs to address scientific questions. The project is highly customizable in that HP selection and analysis can be specifically formulated based on the scope and purpose of each student's investigations. Programs used for HP analysis can be easily adapted to course learning objectives. The project can be used in both online and in-seat instruction for a wide variety of undergraduate and graduate classes as well as undergraduate capstone, honors, and experiential learning projects.
Affiliation(s)
- Zoie Amatore
  - Science Department, Harris Interdisciplinary Research, Davenport University, Lansing, MI, United States
- Susan Gunn
  - College of Urban Education, Davenport University, Grand Rapids, MI, United States
- Laura K. Harris
  - Science Department, Harris Interdisciplinary Research, Davenport University, Lansing, MI, United States
2
Zhao J, Zhu J, Guo J, Zhu T, Zhong J, Liu M, Ruan Y, Liao S, Li F. Genetic variability and functional implication of HPV16 from cervical intraepithelial neoplasia in Shanghai women. J Med Virol 2019; 92:372-381. [PMID: 31670402 DOI: 10.1002/jmv.25618] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Received: 08/16/2019] [Accepted: 10/26/2019] [Indexed: 12/23/2022]
Abstract
Human papillomavirus 16 (HPV16) gene mutations are usually associated with persistent HPV infection and cervical intraepithelial neoplasia (CIN). However, the functional implications of HPV16 mutations remain poorly understood. A total of 145 LCR/E6/E7 sequences of HPV16 isolates were amplified and sequenced, and HPV16 integration status was detected. In total, 89 SNPs (68 in the LCR, 13 in E6, 8 in E7) were discovered, 11 of which were nonsynonymous mutations (8 in E6, 3 in E7). The H85Y and E120D variants in E6 were significantly less frequent in the high-grade squamous intraepithelial lesion (HSIL) group than in the <HSIL group (P = .046 and .005), whereas N29S in E7 showed the opposite trend (P = .01). Amino acid substitutions (D32N/E, E36Q, H85Y, and E120D in E6 and N29H/S and R77C in E7) were predicted to have an effect on conserved structural and functional residues, and five amino acid substitutions (H85Y, E36Q, I34L, and D32E in E6; R77C in E7) would potentially change the secondary structure. "6329G>T," located at a potential binding site for TATA-binding protein, was the most common LCR variant. A4 (Asian) was associated with an increased risk of HSIL compared to A1-3 (P = .009). H85/E120 in E6 and N29 in E7 of HPV16 might play a critical role in carcinogenesis by affecting the interactions with p53 and Rb, respectively, and thereby disrupting their degradation. In summary, the findings of this study may inform preventive and therapeutic interventions for HPV16-related cervical lesions/cancer.
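The substitution shorthand used in this abstract (for example H85Y, or D32N/E for two alternative replacements at one position) can be expanded mechanically. The following is a minimal illustrative sketch and is not part of the cited study; the helper name `expand_substitution` and the tuple layout it returns are assumptions made for this example.

```python
import re

# Shorthand such as "H85Y" or "D32N/E": reference residue, 1-based position,
# then one or more alternative residues separated by "/".
_SUBSTITUTION = re.compile(r"^([A-Z])(\d+)([A-Z](?:/[A-Z])*)$")


def expand_substitution(code):
    """Expand one shorthand code into (reference, position, variant) tuples.

    Hypothetical helper for illustration only.
    """
    match = _SUBSTITUTION.match(code.strip())
    if match is None:
        raise ValueError(f"unrecognized substitution code: {code!r}")
    reference, position, variants = match.groups()
    return [(reference, int(position), variant) for variant in variants.split("/")]


# E6 substitutions that the abstract lists as affecting conserved residues.
e6_codes = ["D32N/E", "E36Q", "H85Y", "E120D"]
e6_variants = [v for code in e6_codes for v in expand_substitution(code)]
print(e6_variants)
```

Nucleotide-level variants such as "6329G>T" follow a different notation and are deliberately rejected by this pattern.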
Affiliation(s)
- Junwei Zhao
  - Department of Obstetrics and Gynecology, East Hospital, Tongji University School of Medicine, Shanghai, China
- Jiacheng Zhu
  - BGI Education Center, University of Chinese Academy of Sciences, Shenzhen, China; Maternal and Child Health Research Institute, BGI-Shenzhen, Shenzhen, China
- Junhan Guo
  - Department of Obstetrics and Gynecology, East Hospital, Tongji University School of Medicine, Shanghai, China
- Tailin Zhu
  - School of Physics, HH Wills Physics Laboratory, University of Bristol, Bristol, UK
- Jixing Zhong
  - BGI Education Center, University of Chinese Academy of Sciences, Shenzhen, China; School of Biological Sciences and Medical Engineering, Southeast University, Nanjing, China
- Min Liu
  - Department of Obstetrics and Gynecology, East Hospital, Tongji University School of Medicine, Shanghai, China
- Yetian Ruan
  - Department of Obstetrics and Gynecology, East Hospital, Tongji University School of Medicine, Shanghai, China
- Shujie Liao
  - Cancer Biology Research Center, Department of Obstetrics and Gynecology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
- Fang Li
  - Department of Obstetrics and Gynecology, East Hospital, Tongji University School of Medicine, Shanghai, China
3
Freitas RCD, Odisi EJ, Kato C, da Silva MAC, Lima AODS. Draft Genome Sequence of the Deep-Sea Bacterium Moritella sp. JT01 and Identification of Biotechnologically Relevant Genes. Marine Biotechnology (New York, N.Y.) 2017; 19:480-487. [PMID: 28733934 DOI: 10.1007/s10126-017-9767-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Received: 03/02/2017] [Accepted: 06/19/2017] [Indexed: 06/07/2023]
Abstract
Deep-sea bacteria can produce various biotechnologically relevant enzymes due to their adaptations to high pressures and low temperatures. To identify such enzymes, we have sequenced the genome of the polycaprolactone-degrading bacterium Moritella sp. JT01, isolated from sediment samples from the Japan Trench (6957 m depth), using an Illumina HiSeq2000 sequencer (12.1 million paired-end reads) and CLC Genomics Workbench (version 6.5.1) for the assembly, resulting in a 4.83-Mb genome (42 scaffolds). The genome was annotated using Rapid Annotation using Subsystem Technology (RAST), Protein Homology/analogY Recognition Engine V 2.0 (PHYRE2), and BLAST2Go, revealing 4439 protein coding sequences and 101 RNAs. Gene products with industrial relevance, such as lipases (three) and esterases (four), were identified and are related to the bacterium's ability to degrade polycaprolactone. The annotation revealed proteins related to deep-sea survival, such as cold-shock proteins (six) and desaturases (three). The presence of secondary metabolite biosynthetic gene clusters suggests that this bacterium could produce nonribosomal peptides, polyunsaturated fatty acids, and bacteriocins. To demonstrate the potential of this genome, a lipase was cloned and introduced into Escherichia coli. The lipase was purified and characterized, showing activity over a wide temperature range (over 50% at 20-60 °C) and pH range (over 80% at pH 6.3 to 9). This enzyme has tolerance to the surfactant action of sodium dodecyl sulfate and shows 30% increased activity when subjected to a working pressure of 200 MPa. The genomic characterization of Moritella sp. JT01 reveals traits associated with survival in the deep sea and their potential uses in biotechnology, as exemplified by the characterized lipase.
Affiliation(s)
- Robert Cardoso de Freitas
  - Technological Science Center of Earth and Sea, UNIVALI, R Uruguai 458, Itajai, SC, 88302-202, Brazil
- Estácio Jussie Odisi
  - Technological Science Center of Earth and Sea, UNIVALI, R Uruguai 458, Itajai, SC, 88302-202, Brazil
- Chiaki Kato
  - Department of Marine Biodiversity Research, JAMSTEC, Natsushima-cho 2-15, Yokosuka, Kanagawa, 237-0061, Japan