151
|
Chaturvedi A, Raeymaekers JAM, Volckaert FAM. Computational identification of miRNAs, their targets and functions in three-spined stickleback (Gasterosteus aculeatus). Mol Ecol Resour 2014; 14:768-77. [DOI: 10.1111/1755-0998.12223] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2013] [Revised: 12/27/2013] [Accepted: 01/03/2014] [Indexed: 12/12/2022]
Affiliation(s)
- Anurag Chaturvedi
- Laboratory of Biodiversity and Evolutionary Genomics; University of Leuven; Ch. Deberiotstraat 32 Leuven B-3000 Belgium
| | - Joost A. M. Raeymaekers
- Laboratory of Biodiversity and Evolutionary Genomics; University of Leuven; Ch. Deberiotstraat 32 Leuven B-3000 Belgium
- Zoological Institute; University of Basel; Vesalgasse 1 Basel CH-4051 Switzerland
| | - Filip A. M. Volckaert
- Laboratory of Biodiversity and Evolutionary Genomics; University of Leuven; Ch. Deberiotstraat 32 Leuven B-3000 Belgium
| |
Collapse
|
152
|
Danilova TV, Friebe B, Gill BS. Development of a wheat single gene FISH map for analyzing homoeologous relationship and chromosomal rearrangements within the Triticeae. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2014. [PMID: 24408375 DOI: 10.1007/s00122‐013‐2253‐z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
A cytogenetic map of wheat was constructed using FISH with cDNA probes. FISH markers detected homoeology and chromosomal rearrangements of wild relatives, an important source of genes for wheat improvement. To transfer agronomically important genes from wild relatives to bread wheat (Triticum aestivum L., 2n = 6 x = 42, AABBDD) by induced homoeologous recombination, it is important to know the chromosomal relationships of the species involved. Fluorescence in situ hybridization (FISH) can be used to study chromosome structure. The genomes of allohexaploid bread wheat and other species from the Triticeae tribe are colinear to some extent, i.e., composed of homoeoloci at similar positions along the chromosomes, and with genic regions being highly conserved. To develop cytogenetic markers specific for genic regions of wheat homoeologs, we selected more than 60 full-length wheat cDNAs using BLAST against mapped expressed sequence tags and used them as FISH probes. Most probes produced signals on all three homoeologous chromosomes at the expected positions. We developed a wheat physical map with several cDNA markers located on each of the 14 homoeologous chromosome arms. The FISH markers confirmed chromosome rearrangements within wheat genomes and were successfully used to study chromosome structure and homoeology in wild Triticeae species. FISH analysis detected 1 U-6 U chromosome translocation in the genome of Aegilops umbellulata, showed colinearity between chromosome A of Ae. caudata and group-1 wheat chromosomes, and between chromosome arm 7S#3 L of Thinopyrum intermedium and the long arm of the group-7 wheat chromosomes.
Collapse
Affiliation(s)
- Tatiana V Danilova
- Department of Plant Pathology, Wheat Genetics Resource Center, Kansas State University, Manhattan, KS, 66506, USA
| | | | | |
Collapse
|
153
|
VH1-44 gene usage defines a subset of canine B-cell lymphomas associated with better patient survival. Vet Immunol Immunopathol 2013; 157:125-30. [PMID: 24332568 DOI: 10.1016/j.vetimm.2013.10.020] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2013] [Revised: 10/24/2013] [Accepted: 10/31/2013] [Indexed: 11/21/2022]
Abstract
The use of specific immunoglobulin heavy chain variable region (VH) genes has been associated with increased patient survival in human B-cell lymphomas (hBCL). Given the similarity of human and canine BCL (cBCL) in morphology and clinical treatment, we examined the choice of VH in cBCL and determined whether VH gene selection was a distinct feature associated with survival time in dogs. VH gene selection and mutational status in 52 cBCL, including 29 diffuse large B-cell lymphomas (cDLBCL, the most common subtype of cBCL), were analyzed by comparison with the 80 published canine germline VH gene sequences. We further examined the prognostic impact of the subgroups defined by these features on canine survival. We found that VH1-44 was preferentially expressed in the majority of the 52 cBCLs (60%) as well as in the majority of the cDLBCL subset (59%). VH1-44 gene expression was associated with a statistically better overall survival (p=0.039) in cBCL patients, as well as in the cDLBCL subset of patients (p=0.038). These findings suggest that VH gene selection in cBCL is not random and may therefore have functional implications for cBCL lymphomagenesis, in addition to being a useful prognostic biomarker.
Collapse
|
154
|
Sangal V, Fineran PC, Hoskisson PA. Novel configurations of type I and II CRISPR–Cas systems in Corynebacterium diphtheriae. Microbiology (Reading) 2013; 159:2118-2126. [DOI: 10.1099/mic.0.070235-0] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Affiliation(s)
- Vartul Sangal
- Strathclyde Institute of Pharmacy and Biomedical Sciences, University of Strathclyde, 161 Cathedral Street, Glasgow G4 0RE, UK
| | - Peter C. Fineran
- Department of Microbiology and Immunology, University of Otago, PO Box 56, Dunedin 9054, New Zealand
| | - Paul A. Hoskisson
- Strathclyde Institute of Pharmacy and Biomedical Sciences, University of Strathclyde, 161 Cathedral Street, Glasgow G4 0RE, UK
| |
Collapse
|
155
|
He C, Tsend-Ayush E, Myers MA, Forbes BE, Grützner F. Changes in the ghrelin hormone pathway maybe part of an unusual gastric system in monotremes. Gen Comp Endocrinol 2013; 191:74-82. [PMID: 23770219 DOI: 10.1016/j.ygcen.2013.06.003] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/31/2013] [Revised: 05/31/2013] [Accepted: 06/01/2013] [Indexed: 12/14/2022]
Abstract
Ghrelin is a growth hormone (GH)-releasing and appetite-regulating peptide predominately released from the stomach. Ghrelin is evolutionarily highly conserved and known to have a wide range of functions including the regulation of metabolism by maintaining an insulin-glucose balance. The peptide is produced as a single proprotein, which is later proteolytically cleaved. Ghrelin exerts its biological function after O-n-octanoylation at residue serine 3, which is catalyzed by ghrelin O-acyl transferase (GOAT) and allows binding to the growth hormone secretagogue receptor (GHS-R 1a). Genes involved in the ghrelin pathway have been identified in a broad range of vertebrate species, however, little is known about this pathway in the basal mammalian lineage of monotremes (platypus and echidna). Monotremes are particularly interesting in this context, as they have undergone massive changes in stomach anatomy and physiology, accompanied by a striking loss of genes involved in gastric function. In this study, we investigated genes in the ghrelin pathway in monotremes. Using degenerate PCR, database searches and synteny analysis we found that genes encoding ghrelin and GOAT are missing in the platypus genome, whilst, as has been reported in other species, the GHSR is present and expressed in brain, pancreas, kidney, intestine, heart and stomach. This is the first report suggesting the loss of ghrelin in a mammal. The loss of this gene may be related to changes to the platypus digestive system and raises questions about the control of blood glucose levels and insulin response in monotreme mammals. In addition, the conservation of the ghrelin receptor gene in platypus indicates that another ligand(s) maybe acting via this receptor in monotremes.
Collapse
Affiliation(s)
- Chuan He
- School of Molecular and Biomedical Science, The University of Adelaide, SA 5005, Australia
| | | | | | | | | |
Collapse
|
156
|
Two independent pathways for self-recognition in Proteus mirabilis are linked by type VI-dependent export. mBio 2013; 4:mBio.00374-13. [PMID: 23882014 PMCID: PMC3735182 DOI: 10.1128/mbio.00374-13] [Citation(s) in RCA: 85] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
UNLABELLED Swarming colonies of the bacterium Proteus mirabilis are capable of self-recognition and territorial behavior. Swarms of independent P. mirabilis isolates can recognize each other as foreign and establish a visible boundary where they meet; in contrast, genetically identical swarms merge. The ids genes, which encode self-identity proteins, are necessary but not sufficient for this territorial behavior. Here we have identified two new gene clusters: one (idr) encodes rhs-related products, and another (tss) encodes a putative type VI secretion (T6S) apparatus. The Ids and Idr proteins function independently of each other in extracellular transport and in territorial behaviors; however, these self-recognition systems are linked via this type VI secretion system. The T6S system is required for export of select Ids and Idr proteins. Our results provide a mechanistic and physiological basis for the fundamental behaviors of self-recognition and territoriality in a bacterial model system. IMPORTANCE Our results support a model in which self-recognition in P. mirabilis is achieved by the combined action of two independent pathways linked by a shared machinery for export of encoded self-recognition elements. These proteins together form a mechanistic network for self-recognition that can serve as a foundation for examining the prevalent biological phenomena of territorial behaviors and self-recognition in a simple, bacterial model system.
Collapse
|
157
|
Ebbert MTW, Mallory MA, Wilson AR, Dooley SK, Hillyard DR. Application of a new informatics tool for contamination screening in the HIV sequencing laboratory. J Clin Virol 2013; 57:249-53. [PMID: 23583427 DOI: 10.1016/j.jcv.2013.03.013] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2012] [Revised: 03/14/2013] [Accepted: 03/16/2013] [Indexed: 11/17/2022]
Abstract
BACKGROUND Current HIV-1 sequencing-based methods for detecting drug resistance-associated mutations are open and susceptible to contamination. Informatic identification of clinical sequences that are nearly identical to one another may indicate specimen-to-specimen contamination or another laboratory-associated issue. OBJECTIVES To design an informatic tool to rapidly identify potential contamination in the clinical laboratory using sequence analysis and to establish reference ranges for sequence variation in the HIV-1 protease and reverse transcriptase regions among a U.S. patient population. STUDY DESIGN We developed an open-source tool named HIV Contamination Detection (HIVCD). HIVCD was utilized to make pairwise comparisons of nearly 8000 partial HIV-1 pol gene sequences from patients across the United States and to calculate percent identities (PIDs) for each pair. ROC analysis and standard deviations of PID data were used to determine reference ranges for between-patient and within-patient comparisons and to guide selection of a threshold for identifying abnormally high PID between two unrelated sequences. RESULTS The PID reference range for between-patient comparisons ranged from 83.8 to 95.7% while within-patient comparisons ranged from 96 to 100%. Interestingly, 48% of between-patient sequence pairs with a PID>96.5 were geographically related. The selected threshold for abnormally high PIDs was 96 (AUC=0.993, sensitivity=0.980, specificity=0.999). During routine use, HIVCD identified a specimen mix-up and the source of contamination of a negative control. CONCLUSIONS In our experience, HIVCD is easily incorporated into laboratory workflow, useful for identifying potential laboratory errors, and contributes to quality testing. This type of analysis should be incorporated into routine laboratory practice.
Collapse
Affiliation(s)
- Mark T W Ebbert
- ARUP Institute for Clinical and Experimental Pathology, ARUP Laboratories, 500 Chipeta Way, Salt Lake City, UT 84108, USA.
| | | | | | | | | |
Collapse
|
158
|
Shavrukov Y, Bovill J, Afzal I, Hayes JE, Roy SJ, Tester M, Collins NC. HVP10 encoding V-PPase is a prime candidate for the barley HvNax3 sodium exclusion gene: evidence from fine mapping and expression analysis. PLANTA 2013; 237:1111-22. [PMID: 23277165 DOI: 10.1007/s00425-012-1827-3] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/16/2012] [Accepted: 12/05/2012] [Indexed: 05/20/2023]
Abstract
In cereals, a common salinity tolerance mechanism is to limit accumulation of Na(+) in the shoot. In a cross between the barley variety Barque-73 (Hordeum vulgare ssp. vulgare) and the accession CPI-71284 of wild barley (H. vulgare ssp. spontaneum), the HvNax3 locus on chromosome 7H was found to determine a ~10-25 % difference in leaf Na(+) accumulation in seedlings grown in saline hydroponics, with the beneficial exclusion trait originating from the wild parent. The Na(+) exclusion allele was also associated with a 13-21 % increase in shoot fresh weight. The HvNax3 locus was delimited to a 0.4 cM genetic interval, where it cosegregated with the HVP10 gene for vacuolar H(+)-pyrophosphatase (V-PPase). Sequencing revealed that the mapping parents encoded identical HVP10 proteins, but salinity-induced mRNA expression of HVP10 was higher in CPI-71284 than in Barque-73, in both roots and shoots. By contrast, the expression of several other genes predicted by comparative mapping to be located in the HvNax3 interval was similar in the two parent lines. Previous work demonstrated roles for V-PPase in ion transport and salinity tolerance. We therefore considered transcription levels of HVP10 to be a possible basis for variation in shoot Na(+) accumulation and biomass production controlled by the HvNax3 locus under saline conditions. Potential mechanisms linking HVP10 expression patterns to the observed phenotypes are discussed.
Collapse
Affiliation(s)
- Yuri Shavrukov
- Australian Centre for Plant Functional Genomics, School of Agriculture, Food and Wine, University of Adelaide, Urrbrae, SA, 5064, Australia.
| | | | | | | | | | | | | |
Collapse
|
159
|
Barvkar VT, Pardeshi VC, Kale SM, Qiu S, Rollins M, Datla R, Gupta VS, Kadoo NY. Genome-wide identification and characterization of microRNA genes and their targets in flax (Linum usitatissimum): Characterization of flax miRNA genes. PLANTA 2013; 237:1149-61. [PMID: 23291876 DOI: 10.1007/s00425-012-1833-5] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/20/2012] [Accepted: 12/18/2012] [Indexed: 05/11/2023]
Abstract
MicroRNAs (miRNAs) are small (20-24 nucleotide long) endogenous regulatory RNAs that play important roles in plant growth and development. They regulate gene expression at the post-transcriptional level by translational repression or target degradation and gene silencing. In this study, we identified 116 conserved miRNAs belonging to 23 families from the flax (Linum usitatissimum L.) genome using a computational approach. The precursor miRNAs varied in length; while most of the mature miRNAs were 21 nucleotide long, intergenic and showed conserved signatures of RNA polymerase II transcripts in their upstream regions. Promoter region analysis of the flax miRNA genes indicated prevalence of MYB transcription factor binding sites. Four miRNA gene clusters containing members of three phylogenetic groups were identified. Further, 142 target genes were predicted for these miRNAs and most of these represent transcriptional regulators. The miRNA encoding genes were expressed in diverse tissues as determined by digital expression analysis as well as real-time PCR. The expression of fourteen miRNAs and nine target genes was independently validated using the quantitative reverse transcription PCR (qRT-PCR). This study suggests that a large number of conserved plant miRNAs are also found in flax and these may play important roles in growth and development of flax.
Collapse
Affiliation(s)
- Vitthal T Barvkar
- Biochemical Sciences Division, CSIR-National Chemical Laboratory, Pune, 411008, India
| | | | | | | | | | | | | | | |
Collapse
|
160
|
Sun J, Chen Q, Lun JCY, Xu J, Qiu JW. PcarnBase: development of a transcriptomic database for the brain coral Platygyra carnosus. MARINE BIOTECHNOLOGY (NEW YORK, N.Y.) 2013; 15:244-251. [PMID: 22875536 DOI: 10.1007/s10126-012-9482-z] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/30/2012] [Accepted: 07/20/2012] [Indexed: 06/01/2023]
Abstract
The aims of this study were to sequence the transcriptome and organize the sequence data into a searchable database for the brain coral Platygyra carnosus, a structure-forming dominant species along the coast of southern China. We collected healthy and tumorous coral tissues from two locations, extracted RNA from each tissue sample, pooled the RNA from all tissue samples, generated a cDNA library from the pooled samples, and conducted paired-end sequencing of the cDNA library using the Illumina platform to produce 59.6 M clean sequences with a read length of 90 bp. De novo assembly of the sequence data resulted in 162,468 unigenes with an average length of 606 bp (range, 201 to 23,923 bp). This is the largest transcriptome dataset for a species of coral whose genome has not been sequenced. A BLASTx search against the NCBI protein database showed that 55,355 of the unigenes matched at least a sequence with an E-value of < 0.00001; 59 % of the matched sequences are from Metazoa, 13 % are from Alveolata to which the symbiont Symbiodinium belongs, and 7 % are from bacteria. A database (PcarnBase) was constructed to provide easy access to the unigenes with attributes such as NCBI protein annotation, GO annotation, and KEGG pathway. It will facilitate functional genomic studies of P. carnosus, such as biomarker discovery for bleaching, tumor formation, and disease development at the gene or protein level, involvement of coral symbiotic algae in the host coral's stress responses, and genetic basis of stress resistance.
Collapse
Affiliation(s)
- Jin Sun
- Department of Biology, Hong Kong Baptist University, Hong Kong, China
| | | | | | | | | |
Collapse
|
161
|
prfectBLAST: a platform-independent portable front end for the command terminal BLAST+ stand-alone suite. Biotechniques 2013; 53:299-300. [PMID: 23148880 DOI: 10.2144/000113953] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2011] [Accepted: 10/22/2012] [Indexed: 11/23/2022] Open
Abstract
prfectBLAST is a multiplatform graphical user interface (GUI) for the stand-alone BLAST+ suite of applications. It allows researchers to do nucleotide or amino acid sequence similarity searches against public (or user-customized) databases that are locally stored. It does not require any dependencies or installation and can be used from a portable flash drive. prfectBLAST is implemented in Java version 6 (SUN) and runs on all platforms that support Java and for which National Center for Biotechnology Information has made available stand-alone BLAST executables, including MS Windows, Mac OS X, and Linux. It is free and open source software, made available under the GNU General Public License version 3 (GPLv3) and can be downloaded at www.cicy.mx/sitios/jramirez or http://code.google.com/p/prfectblast/.
Collapse
|
162
|
Colmsee C, Mascher M, Czauderna T, Hartmann A, Schlüter U, Zellerhoff N, Schmitz J, Bräutigam A, Pick TR, Alter P, Gahrtz M, Witt S, Fernie AR, Börnke F, Fahnenstich H, Bucher M, Dresselhaus T, Weber APM, Schreiber F, Scholz U, Sonnewald U. OPTIMAS-DW: a comprehensive transcriptomics, metabolomics, ionomics, proteomics and phenomics data resource for maize. BMC PLANT BIOLOGY 2012; 12:245. [PMID: 23272737 PMCID: PMC3577462 DOI: 10.1186/1471-2229-12-245] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/14/2012] [Accepted: 12/12/2012] [Indexed: 05/05/2023]
Abstract
BACKGROUND Maize is a major crop plant, grown for human and animal nutrition, as well as a renewable resource for bioenergy. When looking at the problems of limited fossil fuels, the growth of the world's population or the world's climate change, it is important to find ways to increase the yield and biomass of maize and to study how it reacts to specific abiotic and biotic stress situations. Within the OPTIMAS systems biology project maize plants were grown under a large set of controlled stress conditions, phenotypically characterised and plant material was harvested to analyse the effect of specific environmental conditions or developmental stages. Transcriptomic, metabolomic, ionomic and proteomic parameters were measured from the same plant material allowing the comparison of results across different omics domains. A data warehouse was developed to store experimental data as well as analysis results of the performed experiments. DESCRIPTION The OPTIMAS Data Warehouse (OPTIMAS-DW) is a comprehensive data collection for maize and integrates data from different data domains such as transcriptomics, metabolomics, ionomics, proteomics and phenomics. Within the OPTIMAS project, a 44K oligo chip was designed and annotated to describe the functions of the selected unigenes. Several treatment- and plant growth stage experiments were performed and measured data were filled into data templates and imported into the data warehouse by a Java based import tool. A web interface allows users to browse through all stored experiment data in OPTIMAS-DW including all data domains. Furthermore, the user can filter the data to extract information of particular interest. All data can be exported into different file formats for further data analysis and visualisation. The data analysis integrates data from different data domains and enables the user to find answers to different systems biology questions. Finally, maize specific pathway information is provided. CONCLUSIONS With OPTIMAS-DW a data warehouse for maize was established, which is able to handle different data domains, comprises several analysis results that will support researchers within their work and supports systems biological research in particular. The system is available at http://www.optimas-bioenergy.org/optimas_dw.
Collapse
Affiliation(s)
- Christian Colmsee
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466 Stadt Seeland, Corrensstr. 3
| | - Martin Mascher
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466 Stadt Seeland, Corrensstr. 3
| | - Tobias Czauderna
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466 Stadt Seeland, Corrensstr. 3
| | - Anja Hartmann
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466 Stadt Seeland, Corrensstr. 3
| | - Urte Schlüter
- Department of Biology, Friedrich-Alexander University of Erlangen-Nuremberg, 91054 Erlangen, Staudtstr. 5, Germany
| | - Nina Zellerhoff
- University of Cologne, Botanical Institute, 50923 Köln, Albertus-Magnus-Platz, Germany
| | - Jessica Schmitz
- University of Cologne, Botanical Institute, 50923 Köln, Albertus-Magnus-Platz, Germany
| | - Andrea Bräutigam
- Plant Biochemistry, Heinrich-Heine-University, Universitätsstr. 1, 40225 Düsseldorf, Germany
| | - Thea R Pick
- Plant Biochemistry, Heinrich-Heine-University, Universitätsstr. 1, 40225 Düsseldorf, Germany
- International Graduate Program for Plant Science (iGrad-plant), Heinrich Heine University Düsseldorf, 40225 Düsseldorf, Germany
| | - Philipp Alter
- Cell Biology and Plant Biochemistry, University of Regensburg, Universitätsstr. 31, 93040 Regensburg, Germany
| | - Manfred Gahrtz
- Cell Biology and Plant Biochemistry, University of Regensburg, Universitätsstr. 31, 93040 Regensburg, Germany
| | - Sandra Witt
- Department of Molecular Physiology, Max Planck Institute of Molecular Plant Physiology, 14476 Potsdam-Golm, Am Mühlenberg 1, Germany
| | - Alisdair R Fernie
- Department of Molecular Physiology, Max Planck Institute of Molecular Plant Physiology, 14476 Potsdam-Golm, Am Mühlenberg 1, Germany
| | - Frederik Börnke
- Department of Biology, Friedrich-Alexander University of Erlangen-Nuremberg, 91054 Erlangen, Staudtstr. 5, Germany
| | | | - Marcel Bucher
- University of Cologne, Botanical Institute, 50923 Köln, Albertus-Magnus-Platz, Germany
| | - Thomas Dresselhaus
- Cell Biology and Plant Biochemistry, University of Regensburg, Universitätsstr. 31, 93040 Regensburg, Germany
| | - Andreas PM Weber
- Plant Biochemistry, Heinrich-Heine-University, Universitätsstr. 1, 40225 Düsseldorf, Germany
| | - Falk Schreiber
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466 Stadt Seeland, Corrensstr. 3
- Martin Luther University Halle-Wittenberg, Institute of Computer Science, 06120 Halle, Von-Seckendorff-Platz 1, Germany
| | - Uwe Scholz
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466 Stadt Seeland, Corrensstr. 3
| | - Uwe Sonnewald
- Department of Biology, Friedrich-Alexander University of Erlangen-Nuremberg, 91054 Erlangen, Staudtstr. 5, Germany
| |
Collapse
|
163
|
Kale SM, Pardeshi VC, Barvkar VT, Gupta VS, Kadoo NY. Genome-wide identification and characterization of nucleotide binding site leucine-rich repeat genes in linseed reveal distinct patterns of gene structure. Genome 2012; 56:91-9. [PMID: 23517318 DOI: 10.1139/gen-2012-0135] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Plants employ different disease-resistance genes to detect pathogens and to induce defense responses. The largest class of these genes encodes proteins with nucleotide binding site (NBS) and leucine-rich repeat (LRR) domains. To identify the putative NBS-LRR encoding genes from linseed, we analyzed the recently published linseed genome sequence and identified 147 NBS-LRR genes. The NBS domain was used for phylogeny construction and these genes were classified into two well-known families, non-TIR (CNL) and TIR related (TNL), and formed eight clades in the neighbor-joining bootstrap tree. Eight different gene structures were observed among these genes. An unusual domain arrangement was observed in the TNL family members, predominantly in the TNL-5 clade members belonging to class D. About 12% of the genes observed were linseed specific. The study indicated that the linseed genes probably have an ancient origin with few progenitor genes. Quantitative expression analysis of five genes showed inducible expression. The in silico expression evidence was obtained for a few of these genes, and the expression was not correlated with the presence of any particular regulatory element or with unusual domain arrangement in those genes. This study will help in understanding the evolution of these genes, the development of disease resistant varieties, and the mechanism of disease resistance in linseed.
Collapse
Affiliation(s)
- Sandip M Kale
- Biochemical Sciences Division, National Chemical Laboratory, Pune 411008, India
| | | | | | | | | |
Collapse
|
164
|
Barvkar VT, Pardeshi VC, Kale SM, Kadoo NY, Giri AP, Gupta VS. Proteome profiling of flax (Linum usitatissimum) seed: characterization of functional metabolic pathways operating during seed development. J Proteome Res 2012; 11:6264-76. [PMID: 23153172 DOI: 10.1021/pr300984r] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Flax (Linum usitatissimum L.) seeds are an important source of food and feed due to the presence of various health promoting compounds, making it a nutritionally and economically important plant. An in-depth analysis of the proteome of developing flax seed is expected to provide significant information with respect to the regulation and accumulation of such storage compounds. Therefore, a proteomic analysis of seven seed developmental stages (4, 8, 12, 16, 22, 30, and 48 days after anthesis) in a flax variety, NL-97 was carried out using a combination of 1D-SDS-PAGE and LC-MSE methods. A total 1716 proteins were identified and their functional annotation revealed that a majority of them were involved in primary metabolism, protein destination, storage and energy. Three carbon assimilatory pathways appeared to operate in flax seeds. Reverse transcription quantitative PCR of selected 19 genes was carried out to understand their roles during seed development. Besides storage proteins, methionine synthase, RuBisCO and S-adenosylmethionine synthetase were highly expressed transcripts, highlighting their importance in flax seed development. Further, the identified proteins were mapped onto developmental seed specific expressed sequence tag (EST) libraries of flax to obtain transcriptional evidence and 81% of them had detectable expression at the mRNA level. This study provides new insights into the complex seed developmental processes operating in flax.
Collapse
Affiliation(s)
- Vitthal T Barvkar
- Biochemical Sciences Division, CSIR-National Chemical Laboratory, Pune 411008, India
| | | | | | | | | | | |
Collapse
|
165
|
Single-copy gene fluorescence in situ hybridization and genome analysis: Acc-2 loci mark evolutionary chromosomal rearrangements in wheat. Chromosoma 2012; 121:597-611. [DOI: 10.1007/s00412-012-0384-7] [Citation(s) in RCA: 60] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2012] [Revised: 09/12/2012] [Accepted: 09/17/2012] [Indexed: 12/30/2022]
|
166
|
Abstract
OBJECTIVES Despite prevention efforts, new HIV diagnoses continue in the southern United States, where the epidemic is characterized by significant racial/ethnic disparities. We integrated phylogenetic analyses with clinical data to reveal trends in local HIV transmission. DESIGN Cross-sectional analysis of 1671 HIV-infected individuals each with one B-subtype pol sequence obtained during chronic (82%; UNC Center for AIDS Research Clinical Cohort) or acute/recent (18%; Duke/UNC Acute HIV Consortium) infection. METHODS Phylogenies were inferred using neighbor joining to select related sequences then confirmed with Bayesian methods. We characterized transmission clusters (clades n ≥ 3 sequences supported by posterior probabilities = 1) by factors including race/ethnicity and transmission risk. Factors associated with cluster membership were evaluated for newly diagnosed patients. RESULTS Overall, 72% were men, 59% black and 39% men who have sex with men (MSM). A total of 557 (33%) sequences grouped in either 108 pairs (n = 216) or 67 clusters (n = 341). Clusters ranged from three to 36 (median 4) members. Composition was delineated primarily by race, with 28% exclusively black, and to a lesser extent by risk group. Both MSM and heterosexuals formed discrete clusters, although substantial mixing was observed. In multivariable analysis, patients with age 30 years or less (P = 0.009), acute infection (P = 0.02), local residence (P = 0.002) and transmitted drug resistance (P = 0.02) were more likely to be cluster members, whereas Latinos were less likely (P < 0.001). CONCLUSION Integration of molecular, clinical and demographic data offers a unique view into the structure of local transmission networks. Clustering by black race, youth and transmitted drug resistance and inability to identify Latino clusters will inform prevention, testing and linkage to care strategies.
Collapse
|
167
|
Lo R, Matthews J. High-resolution genome-wide mapping of AHR and ARNT binding sites by ChIP-Seq. Toxicol Sci 2012; 130:349-61. [PMID: 22903824 DOI: 10.1093/toxsci/kfs253] [Citation(s) in RCA: 98] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
The aryl hydrocarbon receptor (AHR) and AHR nuclear translocator (ARNT) activated complex regulates genes in response to the environmental contaminant 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD). AHR has also emerged as a potential therapeutic target for the treatment of human diseases and different cancers, including breast cancer. To better understand AHR and ARNT signaling in breast cancer cells, we used chromatin immunoprecipitation linked to high-throughput sequencing to identify AHR- and ARNT-binding sites across the genome in TCDD-treated MCF-7 cells. We identified 2594 AHR-bound, 1352 ARNT-bound, and 882 AHR/ARNT cobound regions. No significant differences in the genomic distribution of AHR and ARNT were observed. Approximately 60% of the cobound regions contained at least one core an aryl hydrocarbon response element (AHRE), 5'-GCGTG-3'. AHR/ARNT peak density was the highest within 1 kb of transcription start sites (TSS); however, a number of AHR/ARNT cobound regions were located as far as 100 kb from TSS. De novo motif discovery identified a symmetrical variation of the AHRE (5'-GTGCGTG-3'), as well as FOXA1 and SP1 binding motifs. Microarray analysis identified 104 TCDD-responsive genes where 98 genes were upregulated by TCDD. Of the 104 regulated genes, 69 (66.3%) were associated with an AHR- or ARNT-bound region within 100 kb of their TSS. Overall our study identified AHR/ARNT cobound regions across the genome, revealed the importance but not absolute requirement for an AHRE in AHR/ARNT interactions with DNA, and identified a modified AHRE motif, thereby increasing our understanding of AHR/ARNT signaling pathway.
Collapse
Affiliation(s)
- Raymond Lo
- Department of Pharmacology and Toxicology, University of Toronto, Toronto, Ontario M5S 1A8, Canada
| | | |
Collapse
|
168
|
Hazard B, Zhang X, Colasuonno P, Uauy C, Beckles DM, Dubcovsky J. Induced mutations in the starch branching enzyme II ( SBEII) genes increase amylose and resistant starch content in durum wheat. CROP SCIENCE 2012; 52:1754-1766. [PMID: 26924849 PMCID: PMC4768815 DOI: 10.2135/cropsci2012.02.0126] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
Starch is the largest component of the wheat (Triticum aestivum L.) grain and consists of approximately 70-80% amylopectin and 20-30% amylose. Amylopectin is a highly-branched, readily digested polysaccharide, whereas amylose has few branches and forms complexes that resist digestion and mimic dietary fiber (resistant starch). Down-regulation of the starch branching enzyme II (SBEII) gene by RNA interference (RNAi) was previously shown to increase amylose content in both hexaploid and tetraploid wheat. We generated ethyl methane sulphonate (EMS) mutants for the SBEIIa-A and SBEIIa-B homoeologs in the tetraploid durum wheat variety Kronos (T. turgidum ssp. durum L.). Single-gene mutants showed non-significant increases in amylose and resistant starch content, but a double mutant combining a SBEIIa-A knock-out mutation with a SBEIIa-B splice-site mutation showed a 22% increase in amylose content (P<0.0001) and a 115% increase in resistant starch content (P<0.0001). In addition, we obtained mutants for the A and B genome copies of the paralogous SBEIIb gene, mapped them 1-2 cM from SBEIIa, and generated double SBEIIa-SBEIIb mutants to study the effect of the SBEIIb gene in the absence of SBEIIa. These mutants are available to those interested in increasing amylose content and resistant starch in durum wheat.
Collapse
Affiliation(s)
- Brittany Hazard
- Dept. of Plant Sciences, University of California, Davis, CA 95616, USA
| | - Xiaoqin Zhang
- Dept. of Plant Sciences, University of California, Davis, CA 95616, USA
| | | | - Cristobal Uauy
- Dept. of Plant Sciences, University of California, Davis, CA 95616, USA
| | - Diane M. Beckles
- Dept. of Plant Sciences, University of California, Davis, CA 95616, USA
| | | |
Collapse
|
169
|
Gruszka D, Marzec M, Szarejko I. The barley EST DNA Replication and Repair Database (bEST-DRRD) as a tool for the identification of the genes involved in DNA replication and repair. BMC PLANT BIOLOGY 2012; 12:88. [PMID: 22697361 PMCID: PMC3410793 DOI: 10.1186/1471-2229-12-88] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2012] [Accepted: 06/14/2012] [Indexed: 05/29/2023]
Abstract
BACKGROUND The high level of conservation of genes that regulate DNA replication and repair indicates that they may serve as a source of information on the origin and evolution of the species and makes them a reliable system for the identification of cross-species homologs. Studies that had been conducted to date shed light on the processes of DNA replication and repair in bacteria, yeast and mammals. However, there is still much to be learned about the process of DNA damage repair in plants. DESCRIPTION These studies, which were conducted mainly using bioinformatics tools, enabled the list of genes that participate in various pathways of DNA repair in Arabidopsis thaliana (L.) Heynh to be outlined; however, information regarding these mechanisms in crop plants is still very limited. A similar, functional approach is particularly difficult for a species whose complete genomic sequences are still unavailable. One of the solutions is to apply ESTs (Expressed Sequence Tags) as the basis for gene identification. For the construction of the barley EST DNA Replication and Repair Database (bEST-DRRD), presented here, the Arabidopsis nucleotide and protein sequences involved in DNA replication and repair were used to browse for and retrieve the deposited sequences, derived from four barley (Hordeum vulgare L.) sequence databases, including the "Barley Genome version 0.05" database (encompassing ca. 90% of barley coding sequences) and from two databases covering the complete genomes of two monocot models: Oryza sativa L. and Brachypodium distachyon L. in order to identify homologous genes. Sequences of the categorised Arabidopsis queries are used for browsing the repositories, which are located on the ViroBLAST platform. The bEST-DRRD is currently used in our project during the identification and validation of the barley genes involved in DNA repair. CONCLUSIONS The presented database provides information about the Arabidopsis genes involved in DNA replication and repair, their expression patterns and models of protein interactions. It was designed and established to provide an open-access tool for the identification of monocot homologs of known Arabidopsis genes that are responsible for DNA-related processes. The barley genes identified in the project are currently being analysed to validate their function.
Collapse
Affiliation(s)
- Damian Gruszka
- Department of Genetics, Faculty of Biology and Environment Protection, University of Silesia, Jagiellonska 28, 40-032, Katowice, Poland
| | - Marek Marzec
- Department of Genetics, Faculty of Biology and Environment Protection, University of Silesia, Jagiellonska 28, 40-032, Katowice, Poland
| | - Iwona Szarejko
- Department of Genetics, Faculty of Biology and Environment Protection, University of Silesia, Jagiellonska 28, 40-032, Katowice, Poland
| |
Collapse
|
170
|
Abstract
BACKGROUND Renal cell carcinoma or RCC is one of the common and most lethal urological cancers, with 40% of the patients succumbing to death because of metastatic progression of the disease. Treatment of metastatic RCC remains highly challenging because of its resistance to chemotherapy as well as radiotherapy, besides surgical resection. Whereas RCC comprises tumors with differing histological types, clear cell RCC remains the most common. A major problem in the clinical management of patients presenting with localized ccRCC is the inability to determine tumor aggressiveness and accurately predict the risk of metastasis following surgery. As a measure to improve the diagnosis and prognosis of RCC, researchers have identified several molecular markers through a number of techniques. However the wealth of information available is scattered in literature and not easily amenable to data-mining. To reduce this gap, this work describes a comprehensive repository called Renal Cancer Gene Database, as an integrated gateway to study renal cancer related data. FINDINGS Renal Cancer Gene Database is a manually curated compendium of 240 protein-coding and 269 miRNA genes contributing to the etiology and pathogenesis of various forms of renal cell carcinomas. The protein coding genes have been classified according to the kind of gene alteration observed in RCC. RCDB also includes the miRNAsdysregulated in RCC, along with the corresponding information regarding the type of RCC and/or metastatic or prognostic significance. While some of the miRNA genes showed an association with other types of cancers few were unique to RCC. Users can query the database using keywords, category and chromosomal location of the genes. The knowledgebase can be freely accessed via a user-friendly web interface at http://www.juit.ac.in/attachments/jsr/rcdb/homenew.html. CONCLUSIONS It is hoped that this database would serve as a useful complement to the existing public resources and as a good starting point for researchers and physicians interested in RCC genetics.
Collapse
Affiliation(s)
- Jayashree Ramana
- Department of Biotechnology and Bioinformatics, Jaypee University of Information Technology, 173234, Waknaghat, Solan, Himachal Pradesh, India.
| |
Collapse
|
171
|
Novais SC, Arrais J, Lopes P, Vandenbrouck T, De Coen W, Roelofs D, Soares AMVM, Amorim MJB. Enchytraeus albidus microarray: enrichment, design, annotation and database (EnchyBASE). PLoS One 2012; 7:e34266. [PMID: 22558086 PMCID: PMC3338728 DOI: 10.1371/journal.pone.0034266] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2011] [Accepted: 02/24/2012] [Indexed: 12/27/2022] Open
Abstract
Enchytraeus albidus (Oligochaeta) is an ecologically relevant species used as standard test organisms for risk assessment. Effects of stressors in this species are commonly determined at the population level using reproduction and survival as endpoints. The assessment of transcriptomic responses can be very useful e.g. to understand underlying mechanisms of toxicity with gene expression fingerprinting. In the present paper the following is being addressed: 1) development of suppressive subtractive hybridization (SSH) libraries enriched for differentially expressed genes after metal and pesticide exposures; 2) sequencing and characterization of all generated cDNA inserts; 3) development of a publicly available genomic database on E. albidus. A total of 2100 Expressed Sequence Tags (ESTs) were isolated, sequenced and assembled into 1124 clusters (947 singletons and 177 contigs). From these sequences, 41% matched known proteins in GenBank (BLASTX, e-value ≤ 10(-5)) and 37% had at least one Gene Ontology (GO) term assigned. In total, 5.5% of the sequences were assigned to a metabolic pathway, based on KEGG. With this new sequencing information, an Agilent custom oligonucleotide microarray was designed, representing a potential tool for transcriptomic studies. EnchyBASE (http://bioinformatics.ua.pt/enchybase/) was developed as a web freely available database containing genomic information on E. albidus and will be further extended in the near future for other enchytraeid species. The database so far includes all ESTs generated for E. albidus from three cDNA libraries. This information can be downloaded and applied in functional genomics and transcription studies.
Collapse
Affiliation(s)
- Sara C. Novais
- Department of Biology & CESAM, University of Aveiro, Aveiro, Portugal
- * E-mail:
| | - Joel Arrais
- Department of Electronics, Telecommunications and Informatics (DETI), Institute of Electronics and Telematics Engineering of Aveiro (IEETA), University of Aveiro, Aveiro, Portugal
| | - Pedro Lopes
- Department of Electronics, Telecommunications and Informatics (DETI), Institute of Electronics and Telematics Engineering of Aveiro (IEETA), University of Aveiro, Aveiro, Portugal
| | - Tine Vandenbrouck
- University of Antwerp, Department of Biology - E.B.T., Groenenborgerlaan, Antwerp, Belgium
| | - Wim De Coen
- University of Antwerp, Department of Biology - E.B.T., Groenenborgerlaan, Antwerp, Belgium
| | - Dick Roelofs
- VU University Amsterdam, Institute of Ecological Sciences, De Boelelaan, The Netherlands
| | | | | |
Collapse
|
172
|
Serwanga J, Mugaba S, Pimego E, Nanteza B, Lyagoba F, Nakubulwa S, Heath L, Nsubuga RN, Ndembi N, Gotch F, Kaleebu P. Profile of T cell recognition of HIV type 1 consensus group M Gag and Nef peptides in a clade A1- and D-infected Ugandan population. AIDS Res Hum Retroviruses 2012; 28:384-92. [PMID: 21867408 PMCID: PMC3316116 DOI: 10.1089/aid.2011.0175] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Reagents for evaluating non-clade B HIV-specific T cell responses are uncommon. Peptides based on highly conserved HIV-1 consensus group M sequences that are phylogenetically closer to most circulating strains may provide potential alternative reagents in populations with diverse infections, and may be relevant for vaccine design. Recognition of such reagents in clade A1-and D-infected populations has not been previously evaluated. Interferon (IFN)-γ ELISpot assay was used to evaluate T cell recognition of Gag and Nef peptides based on consensus group M sequences in 50 treatment-naive adults predominantly infected with HIV-1 clades A1 and D. Gag-induced T cell responses were correlated with gag sequence diversity. Infecting clades were determined from gag sequences for 45 of the 50 subjects as 40% clade A1 (18/45), 45% clade D (20/45), 2% clade C (1/45), 2% A1/C recombinant (1/45), 2% A1/D (1/45), 7% CRF10_CD (3/45), and 2% U (unclassifiable) (1/45). The mean genetic divergence and diversity of clade A and D gag region compared to group M consensus sequences at synonymous and nonsynonymous nucleotide and amino acid levels were not always significant. Gag peptides were targeted at significantly higher frequency [88% (44/50)] than Nef [64% (32/50)]; p=0.014, although their mean IFN-γ magnitudes were comparable ([3703 (95% CI 2567-4839)] vs. [2120 (95% CI 478-3762)]), respectively. Measurable virus-induced IFN-γ responses were detected in 96% (48/50) individuals, primarily targeting the more conserved Gag p24 and Nef central core regions. Use of these reagents to screen for HIV-specific IFN-γ responses may mitigate the challenge of viral diversity; although this targeting is apparently biased toward a few highly conserved epitopes.
Collapse
|
173
|
Wang Q, Arighi CN, King BL, Polson SW, Vincent J, Chen C, Huang H, Kingham BF, Page ST, Farnum Rendino M, Thomas WK, Udwary DW, Wu CH, the North East Bioinformatics Collaborative Curation Team. Community annotation and bioinformatics workforce development in concert--Little Skate Genome Annotation Workshops and Jamborees. Database (Oxford) 2012; 2012:bar064. [PMID: 22434832 PMCID: PMC3308154 DOI: 10.1093/database/bar064] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2011] [Revised: 12/07/2011] [Accepted: 12/08/2011] [Indexed: 11/14/2022]
Abstract
Recent advances in high-throughput DNA sequencing technologies have equipped biologists with a powerful new set of tools for advancing research goals. The resulting flood of sequence data has made it critically important to train the next generation of scientists to handle the inherent bioinformatic challenges. The North East Bioinformatics Collaborative (NEBC) is undertaking the genome sequencing and annotation of the little skate (Leucoraja erinacea) to promote advancement of bioinformatics infrastructure in our region, with an emphasis on practical education to create a critical mass of informatically savvy life scientists. In support of the Little Skate Genome Project, the NEBC members have developed several annotation workshops and jamborees to provide training in genome sequencing, annotation and analysis. Acting as a nexus for both curation activities and dissemination of project data, a project web portal, SkateBase (http://skatebase.org) has been developed. As a case study to illustrate effective coupling of community annotation with workforce development, we report the results of the Mitochondrial Genome Annotation Jamborees organized to annotate the first completely assembled element of the Little Skate Genome Project, as a culminating experience for participants from our three prior annotation workshops. We are applying the physical/virtual infrastructure and lessons learned from these activities to enhance and streamline the genome annotation workflow, as we look toward our continuing efforts for larger-scale functional and structural community annotation of the L. erinacea genome.
Collapse
Affiliation(s)
- Qinghua Wang
- Department of Computer and Information Sciences, Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE 19711, Mount Dessert Island Biological Laboratory, Salisbury Cove, ME 04672, Vermont Genetics Network, University of Vermont, Burlington, VT 05405, Sequencing and Genotyping Center, University of Delaware, Newark, DE 19711, Department of Environmental and Biological Sciences, University of Maine at Machias, Machias, ME 04654, Department of Molecular Cellular and Biomedical Sciences, University of New Hampshire, Durham, NH 03824 and Department of Biomedical and Pharmaceutical Sciences, University of Rhode Island, Kingston, RI 02881, USA
| | - Cecilia N. Arighi
- Department of Computer and Information Sciences, Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE 19711, Mount Dessert Island Biological Laboratory, Salisbury Cove, ME 04672, Vermont Genetics Network, University of Vermont, Burlington, VT 05405, Sequencing and Genotyping Center, University of Delaware, Newark, DE 19711, Department of Environmental and Biological Sciences, University of Maine at Machias, Machias, ME 04654, Department of Molecular Cellular and Biomedical Sciences, University of New Hampshire, Durham, NH 03824 and Department of Biomedical and Pharmaceutical Sciences, University of Rhode Island, Kingston, RI 02881, USA
| | - Benjamin L. King
- Department of Computer and Information Sciences, Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE 19711, Mount Dessert Island Biological Laboratory, Salisbury Cove, ME 04672, Vermont Genetics Network, University of Vermont, Burlington, VT 05405, Sequencing and Genotyping Center, University of Delaware, Newark, DE 19711, Department of Environmental and Biological Sciences, University of Maine at Machias, Machias, ME 04654, Department of Molecular Cellular and Biomedical Sciences, University of New Hampshire, Durham, NH 03824 and Department of Biomedical and Pharmaceutical Sciences, University of Rhode Island, Kingston, RI 02881, USA
| | - Shawn W. Polson
- Department of Computer and Information Sciences, Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE 19711, Mount Dessert Island Biological Laboratory, Salisbury Cove, ME 04672, Vermont Genetics Network, University of Vermont, Burlington, VT 05405, Sequencing and Genotyping Center, University of Delaware, Newark, DE 19711, Department of Environmental and Biological Sciences, University of Maine at Machias, Machias, ME 04654, Department of Molecular Cellular and Biomedical Sciences, University of New Hampshire, Durham, NH 03824 and Department of Biomedical and Pharmaceutical Sciences, University of Rhode Island, Kingston, RI 02881, USA
| | - James Vincent
- Department of Computer and Information Sciences, Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE 19711, Mount Dessert Island Biological Laboratory, Salisbury Cove, ME 04672, Vermont Genetics Network, University of Vermont, Burlington, VT 05405, Sequencing and Genotyping Center, University of Delaware, Newark, DE 19711, Department of Environmental and Biological Sciences, University of Maine at Machias, Machias, ME 04654, Department of Molecular Cellular and Biomedical Sciences, University of New Hampshire, Durham, NH 03824 and Department of Biomedical and Pharmaceutical Sciences, University of Rhode Island, Kingston, RI 02881, USA
| | - Chuming Chen
- Department of Computer and Information Sciences, Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE 19711, Mount Dessert Island Biological Laboratory, Salisbury Cove, ME 04672, Vermont Genetics Network, University of Vermont, Burlington, VT 05405, Sequencing and Genotyping Center, University of Delaware, Newark, DE 19711, Department of Environmental and Biological Sciences, University of Maine at Machias, Machias, ME 04654, Department of Molecular Cellular and Biomedical Sciences, University of New Hampshire, Durham, NH 03824 and Department of Biomedical and Pharmaceutical Sciences, University of Rhode Island, Kingston, RI 02881, USA
| | - Hongzhan Huang
- Department of Computer and Information Sciences, Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE 19711, Mount Dessert Island Biological Laboratory, Salisbury Cove, ME 04672, Vermont Genetics Network, University of Vermont, Burlington, VT 05405, Sequencing and Genotyping Center, University of Delaware, Newark, DE 19711, Department of Environmental and Biological Sciences, University of Maine at Machias, Machias, ME 04654, Department of Molecular Cellular and Biomedical Sciences, University of New Hampshire, Durham, NH 03824 and Department of Biomedical and Pharmaceutical Sciences, University of Rhode Island, Kingston, RI 02881, USA
| | - Brewster F. Kingham
- Department of Computer and Information Sciences, Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE 19711, Mount Dessert Island Biological Laboratory, Salisbury Cove, ME 04672, Vermont Genetics Network, University of Vermont, Burlington, VT 05405, Sequencing and Genotyping Center, University of Delaware, Newark, DE 19711, Department of Environmental and Biological Sciences, University of Maine at Machias, Machias, ME 04654, Department of Molecular Cellular and Biomedical Sciences, University of New Hampshire, Durham, NH 03824 and Department of Biomedical and Pharmaceutical Sciences, University of Rhode Island, Kingston, RI 02881, USA
| | - Shallee T. Page
- Department of Computer and Information Sciences, Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE 19711, Mount Dessert Island Biological Laboratory, Salisbury Cove, ME 04672, Vermont Genetics Network, University of Vermont, Burlington, VT 05405, Sequencing and Genotyping Center, University of Delaware, Newark, DE 19711, Department of Environmental and Biological Sciences, University of Maine at Machias, Machias, ME 04654, Department of Molecular Cellular and Biomedical Sciences, University of New Hampshire, Durham, NH 03824 and Department of Biomedical and Pharmaceutical Sciences, University of Rhode Island, Kingston, RI 02881, USA
| | - Marc Farnum Rendino
- Department of Computer and Information Sciences, Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE 19711, Mount Dessert Island Biological Laboratory, Salisbury Cove, ME 04672, Vermont Genetics Network, University of Vermont, Burlington, VT 05405, Sequencing and Genotyping Center, University of Delaware, Newark, DE 19711, Department of Environmental and Biological Sciences, University of Maine at Machias, Machias, ME 04654, Department of Molecular Cellular and Biomedical Sciences, University of New Hampshire, Durham, NH 03824 and Department of Biomedical and Pharmaceutical Sciences, University of Rhode Island, Kingston, RI 02881, USA
| | - William Kelley Thomas
- Department of Computer and Information Sciences, Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE 19711, Mount Dessert Island Biological Laboratory, Salisbury Cove, ME 04672, Vermont Genetics Network, University of Vermont, Burlington, VT 05405, Sequencing and Genotyping Center, University of Delaware, Newark, DE 19711, Department of Environmental and Biological Sciences, University of Maine at Machias, Machias, ME 04654, Department of Molecular Cellular and Biomedical Sciences, University of New Hampshire, Durham, NH 03824 and Department of Biomedical and Pharmaceutical Sciences, University of Rhode Island, Kingston, RI 02881, USA
| | - Daniel W. Udwary
- Department of Computer and Information Sciences, Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE 19711, Mount Dessert Island Biological Laboratory, Salisbury Cove, ME 04672, Vermont Genetics Network, University of Vermont, Burlington, VT 05405, Sequencing and Genotyping Center, University of Delaware, Newark, DE 19711, Department of Environmental and Biological Sciences, University of Maine at Machias, Machias, ME 04654, Department of Molecular Cellular and Biomedical Sciences, University of New Hampshire, Durham, NH 03824 and Department of Biomedical and Pharmaceutical Sciences, University of Rhode Island, Kingston, RI 02881, USA
| | - Cathy H. Wu
- Department of Computer and Information Sciences, Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE 19711, Mount Dessert Island Biological Laboratory, Salisbury Cove, ME 04672, Vermont Genetics Network, University of Vermont, Burlington, VT 05405, Sequencing and Genotyping Center, University of Delaware, Newark, DE 19711, Department of Environmental and Biological Sciences, University of Maine at Machias, Machias, ME 04654, Department of Molecular Cellular and Biomedical Sciences, University of New Hampshire, Durham, NH 03824 and Department of Biomedical and Pharmaceutical Sciences, University of Rhode Island, Kingston, RI 02881, USA
| | | |
Collapse
|
174
|
Viral linkage in HIV-1 seroconverters and their partners in an HIV-1 prevention clinical trial. PLoS One 2011; 6:e16986. [PMID: 21399681 PMCID: PMC3047537 DOI: 10.1371/journal.pone.0016986] [Citation(s) in RCA: 77] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2010] [Accepted: 01/18/2011] [Indexed: 11/25/2022] Open
Abstract
BACKGROUND Characterization of viruses in HIV-1 transmission pairs will help identify biological determinants of infectiousness and evaluate candidate interventions to reduce transmission. Although HIV-1 sequencing is frequently used to substantiate linkage between newly HIV-1 infected individuals and their sexual partners in epidemiologic and forensic studies, viral sequencing is seldom applied in HIV-1 prevention trials. The Partners in Prevention HSV/HIV Transmission Study (ClinicalTrials.gov #NCT00194519) was a prospective randomized placebo-controlled trial that enrolled serodiscordant heterosexual couples to determine the efficacy of genital herpes suppression in reducing HIV-1 transmission; as part of the study analysis, HIV-1 sequences were examined for genetic linkage between seroconverters and their enrolled partners. METHODOLOGY/PRINCIPAL FINDINGS We obtained partial consensus HIV-1 env and gag sequences from blood plasma for 151 transmission pairs and performed deep sequencing of env in some cases. We analyzed sequences with phylogenetic techniques and developed a Bayesian algorithm to evaluate the probability of linkage. For linkage, we required monophyletic clustering between enrolled partners' sequences and a Bayesian posterior probability of ≥ 50%. Adjudicators classified each seroconversion, finding 108 (71.5%) linked, 40 (26.5%) unlinked, and 3 (2.0%) indeterminate transmissions, with linkage determined by consensus env sequencing in 91 (84%). Male seroconverters had a higher frequency of unlinked transmissions than female seroconverters. The likelihood of transmission from the enrolled partner was related to time on study, with increasing numbers of unlinked transmissions occurring after longer observation periods. Finally, baseline viral load was found to be significantly higher among linked transmitters. CONCLUSIONS/SIGNIFICANCE In this first use of HIV-1 sequencing to establish endpoints in a large clinical trial, more than one-fourth of transmissions were unlinked to the enrolled partner, illustrating the relevance of these methods in the design of future HIV-1 prevention trials in serodiscordant couples. A hierarchy of sequencing techniques, analysis methods, and expert adjudication contributed to the linkage determination process.
Collapse
|
175
|
Heath L, Frenkel LM, Foley BT, Mullins JI. Comment on "The origins of sexually transmitted HIV among men who have sex with men". Sci Transl Med 2011; 2:50le1; author reply 50lr1. [PMID: 20861507 DOI: 10.1126/scitranslmed.3001416] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
Whether HIV from seminal cells or free HIV in semen is the origin of transmitted virus has important implications for the design of transmission prevention strategies. We found that a recent claim that HIV originates from seminal plasma and not from seminal cells was erroneous, because it was based on biological specimens that had been mislabeled, mixed-up, or contaminated. The origin of transmitted virus from semen therefore remains an open question.
Collapse
|
176
|
Mullins JI, Heath L, Hughes JP, Kicha J, Styrchak S, Wong KG, Rao U, Hansen A, Harris KS, Laurent JP, Li D, Simpson JH, Essigmann JM, Loeb LA, Parkins J. Mutation of HIV-1 genomes in a clinical population treated with the mutagenic nucleoside KP1461. PLoS One 2011; 6:e15135. [PMID: 21264288 PMCID: PMC3021505 DOI: 10.1371/journal.pone.0015135] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2010] [Accepted: 10/23/2010] [Indexed: 12/12/2022] Open
Abstract
The deoxycytidine analog KP1212, and its prodrug KP1461, are prototypes of a new class of antiretroviral drugs designed to increase viral mutation rates, with the goal of eventually causing the collapse of the viral population. Here we present an extensive analysis of viral sequences from HIV-1 infected volunteers from the first “mechanism validation” phase II clinical trial of a mutagenic base analog in which individuals previously treated with antiviral drugs received 1600 mg of KP1461 twice per day for 124 days. Plasma viral loads were not reduced, and overall levels of viral mutation were not increased during this short-term study, however, the mutation spectrum of HIV was altered. A large number (N = 105 per sample) of sequences were analyzed, each derived from individual HIV-1 RNA templates, after 0, 56 and 124 days of therapy from 10 treated and 10 untreated control individuals (>7.1 million base pairs of unique viral templates were sequenced). We found that private mutations, those not found in more than one viral sequence and likely to have occurred in the most recent rounds of replication, increased in treated individuals relative to controls after 56 (p = 0.038) and 124 (p = 0.002) days of drug treatment. The spectrum of mutations observed in the treated group showed an excess of A to G and G to A mutations (p = 0.01), and to a lesser extent T to C and C to T mutations (p = 0.09), as predicted by the mechanism of action of the drug. These results validate the proposed mechanism of action in humans and should spur development of this novel antiretroviral approach.
Collapse
Affiliation(s)
- James I Mullins
- Department of Microbiology, University of Washington, School of Medicine, Seattle, Washington, United States of America.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
177
|
Mittler T, Levy M, Chad F, Karen S. MULTBLAST: A web application for multiple BLAST searches. Bioinformation 2010; 5:224-6. [PMID: 21364803 PMCID: PMC3040504 DOI: 10.6026/97320630005224] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2010] [Accepted: 09/28/2010] [Indexed: 12/05/2022] Open
Abstract
Basic Local Alignment Search Tool, (BLAST) allows the comparison of a query sequence/s
to a database of sequences and identifies those sequences that are similar to the query above a
user-defined threshold. We have developed a user friendly web application, MULTBLAST that runs a
series of BLAST searches on a user-supplied list of proteins against one or more target protein or
nucleotide databases. The application pre-processes the data, launches each individual BLAST search
on the University of Nevada, Reno's-TimeLogic DeCypher® system (available from
Active Motif, Inc.) and retrieves and combines all the results into a simple, easy to read output file.
The output file presents the list of the query proteins, followed by the BLAST results for the matching
sequences from each target database in consecutive columns. This format is especially useful for
either comparing the results from the different target databases, or analyzing the results while keeping
the identification of each target database separate.
Collapse
Affiliation(s)
- Taliah Mittler
- Department of Biochemistry and Molecular Biology, University of Nevada, Reno
| | - Marcel Levy
- Center for Bioinformatics, University of Nevada, Reno
| | - Feller Chad
- Center for Bioinformatics, University of Nevada, Reno
| | - Schlauch Karen
- Department of Biochemistry and Molecular Biology, University of Nevada, Reno
- Karen SchlauchPhone: 775-784-6236; Fax: 775-784-1312
| |
Collapse
|
178
|
Chu Y, Yuan X, Guo Y, Zhang Y, Wu Y, Liu H, Wu D, Bao H, Guan L, Jin X. YeastWeb: a workset-centric web resource for gene family analysis in yeast. BMC Genomics 2010; 11:429. [PMID: 20624324 PMCID: PMC2996957 DOI: 10.1186/1471-2164-11-429] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2010] [Accepted: 07/13/2010] [Indexed: 11/10/2022] Open
Abstract
Background Currently, a number of yeast genomes with different physiological features have been sequenced and annotated, which provides invaluable information to investigate yeast genetics, evolutionary mechanism, structure and function of gene families. Description YeastWeb is a novel database created to provide access to gene families derived from the available yeast genomes by assigning the genes into putative families. It has many useful features that complement existing databases, such as SGD, CYGD and Génolevures: 1) Detailed computational annotation was conducted with each entry with InterProScan, EMBOSS and functional/pathway databases, such as GO, COG and KEGG; 2) A well established user-friendly environment was created to allow users to retrieve the annotated genes and gene families using functional classification browser, keyword search or similarity-based search; 3) Workset offers users many powerful functions to manage the retrieved data efficiently, associate the individual items easily and save the intermediate results conveniently; 4) A series of comparative genomics and molecular evolution analysis tools are neatly implemented to allow users to view multiple sequence alignments and phylogenetic tree of gene families. At present, YeastWeb holds the gene families clustered from various MCL inflation values from a total of 13 available yeast genomes. Conclusions Given the great interest in yeast research, YeastWeb has the potential to become a useful resource for the scientific community of yeast biologists and related researchers investigating the evolutionary relationship of yeast gene families. YeastWeb is available at http://centre.bioinformatics.zj.cn/Yeast/.
Collapse
Affiliation(s)
- Yanhui Chu
- Heilongjiang Key Laboratory of Anti-fibrosis Biotherapy, Mudanjiang Medical University, Heilongjiang 157011, China.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
179
|
Restriction of HIV-1 genotypes in breast milk does not account for the population transmission genetic bottleneck that occurs following transmission. PLoS One 2010; 5:e10213. [PMID: 20422033 PMCID: PMC2857876 DOI: 10.1371/journal.pone.0010213] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2010] [Accepted: 03/17/2010] [Indexed: 12/27/2022] Open
Abstract
Background Breast milk transmission of HIV-1 remains a major route of pediatric infection. Defining the characteristics of viral variants to which breastfeeding infants are exposed is important for understanding the genetic bottleneck that occurs in the majority of mother-to-child transmissions. The blood-milk epithelial barrier markedly restricts the quantity of HIV-1 in breast milk, even in the absence of antiretroviral drugs. The basis of this restriction and the genetic relationship between breast milk and blood variants are not well established. Methodology/Principal Findings We compared 356 HIV-1 subtype C gp160 envelope (env) gene sequences from the plasma and breast milk of 13 breastfeeding women. A trend towards lower viral population diversity and divergence in breast milk was observed, potentially indicative of clonal expansion within the breast. No differences in potential N-linked glycosylation site numbers or in gp160 variable loop amino acid lengths were identified. Genetic compartmentalization was evident in only one out of six subjects in whom contemporaneously obtained samples were studied. However, in samples that were collected 10 or more days apart, six of seven subjects were classified as having compartmentalized viral populations, highlighting the necessity of contemporaneous sampling for genetic compartmentalization studies. We found evidence of CXCR4 co-receptor using viruses in breast milk and blood in nine out of the thirteen subjects, but no evidence of preferential localization of these variants in either tissue. Conclusions/Significance Despite marked restriction of HIV-1 quantities in milk, our data indicate intermixing of virus between blood and breast milk. Thus, we found no evidence that a restriction in viral genotype diversity in breast milk accounts for the genetic bottleneck observed following transmission. In addition, our results highlight the rapidity of HIV-1 env evolution and the importance of sample timing in analyses of gene flow.
Collapse
|
180
|
Nakamura KJ, Gach JS, Jones L, Semrau K, Walter J, Bibollet-Ruche F, Decker JM, Heath L, Decker WD, Sinkala M, Kankasa C, Thea D, Mullins J, Kuhn L, Zwick MB, Aldrovandi GM. 4E10-resistant HIV-1 isolated from four subjects with rare membrane-proximal external region polymorphisms. PLoS One 2010; 5:e9786. [PMID: 20352106 PMCID: PMC2843716 DOI: 10.1371/journal.pone.0009786] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2010] [Accepted: 02/10/2010] [Indexed: 11/19/2022] Open
Abstract
Human antibody 4E10 targets the highly conserved membrane-proximal external region (MPER) of the HIV-1 transmembrane glycoprotein, gp41, and has extraordinarily broad neutralizing activity. It is considered by many to be a prototype for vaccine development. In this study, we describe four subjects infected with viruses carrying rare MPER polymorphisms associated with resistance to 4E10 neutralization. In one case resistant virus carrying a W680G substitution was transmitted from mother to infant. We used site-directed mutagenesis to demonstrate that the W680G substitution is necessary for conferring the 4E10-resistant phenotype, but that it is not sufficient to transfer the phenotype to a 4E10-sensitive Env. Our third subject carried Envs with a W680R substitution causing variable resistance to 4E10, indicating that residues outside the MPER are required to confer the phenotype. A fourth subject possessed a F673L substitution previously associated with 4E10 resistance. For all three subjects with W680 polymorphisms, we observed additional residues in the MPER that co-varied with position 680 and preserved charged distributions across this region. Our data provide important caveats for vaccine development targeting the MPER. Naturally occurring Env variants described in our study also represent unique tools for probing the structure-function of HIV-1 envelope.
Collapse
Affiliation(s)
- Kyle J. Nakamura
- Department of Pediatrics, Childrens Hospital of Los Angeles, Los Angeles, California, United States of America
- Systems Biology and Disease Program, University of Southern California Keck School of Medicine, Los Angeles, California, United States of America
| | - Johannes S. Gach
- Department of Immunology and Microbial Science, The Scripps Research Institute, La Jolla, California, United States of America
| | - Laura Jones
- Department of Pediatrics, Childrens Hospital of Los Angeles, Los Angeles, California, United States of America
| | - Katherine Semrau
- Center for International Health and Development, Boston University School of Public Health, Boston, Massachusetts, United States of America
| | - Jan Walter
- Department of Pediatrics, Childrens Hospital of Los Angeles, Los Angeles, California, United States of America
| | - Frederic Bibollet-Ruche
- Department of Medicine, University of Alabama at Birmingham, Birmingham, Alabama, United States of America
| | - Julie M. Decker
- Department of Medicine, University of Alabama at Birmingham, Birmingham, Alabama, United States of America
| | - Laura Heath
- Department of Microbiology, University of Washington, Seattle, Washington, United States of America
| | - William D. Decker
- Department of Pediatrics, Childrens Hospital of Los Angeles, Los Angeles, California, United States of America
| | - Moses Sinkala
- Lusaka District Health Management Team, Lusaka, Zambia
| | - Chipepo Kankasa
- University Teaching Hospital, University of Zambia, Lusaka, Zambia
| | - Donald Thea
- Center for International Health and Development, Boston University School of Public Health, Boston, Massachusetts, United States of America
| | - James Mullins
- Department of Microbiology, University of Washington, Seattle, Washington, United States of America
- Department of Medicine, University of Washington, Seattle, Washington, United States of America
| | - Louise Kuhn
- Department of Epidemiology, Columbia University, New York, New York, United States of America
| | - Michael B. Zwick
- Department of Immunology and Microbial Science, The Scripps Research Institute, La Jolla, California, United States of America
| | - Grace M. Aldrovandi
- Department of Pediatrics, Childrens Hospital of Los Angeles, Los Angeles, California, United States of America
| |
Collapse
|
181
|
LipocalinPred: a SVM-based method for prediction of lipocalins. BMC Bioinformatics 2009; 10:445. [PMID: 20030857 PMCID: PMC2813246 DOI: 10.1186/1471-2105-10-445] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2009] [Accepted: 12/24/2009] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Functional annotation of rapidly amassing nucleotide and protein sequences presents a challenging task for modern bioinformatics. This is particularly true for protein families sharing extremely low sequence identity, as for lipocalins, a family of proteins with varied functions and great diversity at the sequence level, yet conserved structures. RESULTS In the present study we propose a SVM based method for identification of lipocalin protein sequences. The SVM models were trained with the input features generated using amino acid, dipeptide and secondary structure compositions as well as PSSM profiles. The model derived using both PSSM and secondary structure emerged as the best model in the study. Apart from achieving a high prediction accuracy (>90% in leave-one-out), lipocalinpred correctly differentiates closely related fatty acid-binding proteins and triabins as non-lipocalins. CONCLUSION The method offers a promising approach as a lipocalin prediction tool, complementing PROSITE, Pfam and homology modelling methods.
Collapse
|
182
|
Heath L, Fox A, McClure J, Diem K, van 't Wout AB, Zhao H, Park DR, Schouten JT, Twigg HL, Corey L, Mullins JI, Mittler JE. Evidence for limited genetic compartmentalization of HIV-1 between lung and blood. PLoS One 2009; 4:e6949. [PMID: 19759830 PMCID: PMC2736399 DOI: 10.1371/journal.pone.0006949] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2009] [Accepted: 07/29/2009] [Indexed: 01/04/2023] Open
Abstract
Background HIV-1 is frequently detected in the lungs of infected individuals and is likely important in the development of pulmonary opportunistic infections. The unique environment of the lung, rich in alveolar macrophages and with specialized local immune responses, may contribute to differential evolution or selection of HIV-1. Methodology and Findings We characterized HIV-1 in the lung in relation to contemporaneous viral populations in the blood. The C2-V5 region of HIV-1 env was sequenced from paired lung (induced sputum or bronchoalveolar lavage) and blood (plasma RNA and proviral DNA from sorted or unsorted PBMC) from 18 subjects. Compartmentalization between tissue pairs was assessed using 5 established tree or distance-based methods, including permutation tests to determine statistical significance. We found statistical evidence of compartmentalization between lung and blood in 10/18 subjects, although lung and blood sequences were intermingled on phylogenetic trees in all subjects. The subject showing the greatest compartmentalization contained many nearly identical sequences in BAL sample, suggesting clonal expansion may contribute to reduced viral diversity in the lung in some cases. However, HIV-1 sequences in lung were not more homogeneous overall, nor were we able to find a lung-specific genotype associated with macrophage tropism in V3. In all four subjects in whom predicted X4 genotypes were found in blood, predicted X4 genotypes were also found in lung. Conclusions Our results support a picture of continuous migration of HIV-1 between circulating blood and lung tissue, with perhaps a very limited degree of localized evolution or clonal replication.
Collapse
Affiliation(s)
- Laura Heath
- Department of Microbiology, University of Washington, Seattle, Washington, United States of America
| | - Alan Fox
- Department of Laboratory Medicine, University of Washington, Seattle, Washington, United States of America
| | - Jan McClure
- Department of Microbiology, University of Washington, Seattle, Washington, United States of America
| | - Kurt Diem
- Department of Laboratory Medicine, University of Washington, Seattle, Washington, United States of America
| | - Angélique B. van 't Wout
- Department of Experimental Immunology, Center for Infection and Immunity Amsterdam (CINIMA) at the Academic Medical Center of the University of Amsterdam, Amsterdam, The Netherlands
| | - Hong Zhao
- Department of Microbiology, University of Washington, Seattle, Washington, United States of America
| | - David R. Park
- Pulmonary and Critical Care Medicine, University of Washington, Seattle, Washington, United States of America
| | - Jeffrey T. Schouten
- General Surgery, University of Washington, Seattle, Washington, United States of America
| | - Homer L. Twigg
- Division of Pulmonary and Critical Care, Indiana University Medical Center, Indianapolis, Indiana, United States of America
| | - Lawrence Corey
- Department of Laboratory Medicine, University of Washington, Seattle, Washington, United States of America
| | - James I. Mullins
- Department of Microbiology, University of Washington, Seattle, Washington, United States of America
- Department of Laboratory Medicine, University of Washington, Seattle, Washington, United States of America
| | - John E. Mittler
- Department of Microbiology, University of Washington, Seattle, Washington, United States of America
- * E-mail:
| |
Collapse
|
183
|
Tong JC, Ren EC. Immunoinformatics: current trends and future directions. Drug Discov Today 2009; 14:684-9. [PMID: 19379830 PMCID: PMC7108239 DOI: 10.1016/j.drudis.2009.04.001] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2008] [Revised: 03/30/2009] [Accepted: 04/06/2009] [Indexed: 01/28/2023]
Abstract
Immunoinformatics has recently emerged as a critical field for accelerating immunology research. Although still an evolving process, computational models now play instrumental roles, not only in directing the selection of key experiments, but also in the formulation of new testable hypotheses through detailed analysis of complex immunologic data that could not be achieved using traditional approaches alone. Immunomics, which combines traditional immunology with computer science, mathematics, chemistry, biochemistry, genomics and proteomics for the large-scale analysis of immune system function, offers new opportunities for future bench-to-bedside research. In this article, we review the latest trends and future directions of the field.
Collapse
Affiliation(s)
- Joo Chuan Tong
- Institute for Infocomm Research, 1 Fusionopolis Way, #21-01 Connexis, South Tower, Singapore 138632, Singapore.
| | | |
Collapse
|
184
|
Sjödin A, Street NR, Sandberg G, Gustafsson P, Jansson S. The Populus Genome Integrative Explorer (PopGenIE): a new resource for exploring the Populus genome. THE NEW PHYTOLOGIST 2009; 182:1013-1025. [PMID: 19383103 DOI: 10.1111/j.1469-8137.2009.02807.x] [Citation(s) in RCA: 108] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
Populus has become an important model plant system. However, utilization of the increasingly extensive collection of genetics and genomics data created by the community is currently hindered by the lack of a central resource, such as a model organism database (MOD). Such MODs offer a single entry point to the collection of resources available within a model system, typically including tools for exploring and querying those resources. As a starting point to overcoming the lack of such an MOD for Populus, we present the Populus Genome Integrative Explorer (PopGenIE), an integrated set of tools for exploring the Populus genome and transcriptome. The resource includes genome, synteny and quantitative trait locus (QTL) browsers for exploring genetic data. Expression tools include an electronic fluorescent pictograph (eFP) browser, expression profile plots, co-regulation within collated transcriptomics data sets, and identification of over-represented functional categories and genomic hotspot locations. A number of collated transcriptomics data sets are made available in the eFP browser to facilitate functional exploration of gene function. Additional homology and data extraction tools are provided. PopGenIE significantly increases accessibility to Populus genomics resources and allows exploration of transcriptomics data without the need to learn or understand complex statistical analysis methods. PopGenIE is available at www.popgenie.org or via www.populusgenome.info.
Collapse
Affiliation(s)
- Andreas Sjödin
- Umeå Plant Science Centre, Department of Plant Physiology, University of Umeå, SE-901-87 Umeå, Sweden
- CBRN Security and Defence, Swedish Defence Research Agency, SE-90182 Umeå, Sweden
| | - Nathaniel Robert Street
- Umeå Plant Science Centre, Department of Plant Physiology, University of Umeå, SE-901-87 Umeå, Sweden
| | - Göran Sandberg
- Umeå Plant Science Centre, Department of Plant Physiology, University of Umeå, SE-901-87 Umeå, Sweden
| | - Petter Gustafsson
- Umeå Plant Science Centre, Department of Plant Physiology, University of Umeå, SE-901-87 Umeå, Sweden
| | - Stefan Jansson
- Umeå Plant Science Centre, Department of Plant Physiology, University of Umeå, SE-901-87 Umeå, Sweden
| |
Collapse
|
185
|
Wu J, Xu X, Xiao J, Xu L, Yi H, Gao S, Liu J, Bao Q, Zhao F, Li X. FlyPhy: a phylogenomic analysis platform for Drosophila genes and gene families. BMC Bioinformatics 2009; 10:123. [PMID: 19393099 PMCID: PMC2680407 DOI: 10.1186/1471-2105-10-123] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2009] [Accepted: 04/25/2009] [Indexed: 12/22/2022] Open
Abstract
Background The availability of 12 fully sequenced Drosophila species genomes provides an excellent opportunity to explore the evolutionary mechanism, structure and function of gene families in Drosophila. Currently, several important resources, such as FlyBase, FlyMine and DroSpeGe, have been devoted to integrating genetic, genomic, and functional data of Drosophila into a well-organized form. However, all of these resources are gene-centric and lack the information of the gene families in Drosophila. Description FlyPhy is a comprehensive phylogenomic analysis platform devoted to analyzing the genes and gene families in Drosophila. Genes were classified into families using a graph-based Markov Clustering algorithm and extensively annotated by a number of bioinformatic tools, such as basic sequence features, functional category, gene ontology terms, domain organization and sequence homolog to other databases. FlyPhy provides a simple and user-friendly web interface to allow users to browse and retrieve the information at multiple levels. An outstanding feature of the FlyPhy is that all the retrieved results can be added to a workset for further data manipulation. For the data stored in the workset, multiple sequence alignment, phylogenetic tree construction and visualization can be easily performed to investigate the sequence variation of each given family and to explore its evolutionary mechanism. Conclusion With the above functionalities, FlyPhy will be a useful resource and convenient platform for the Drosophila research community. The FlyPhy is available at .
Collapse
Affiliation(s)
- Jinyu Wu
- Institute of Biomedical Informatics/Zhejiang Provincial Key Laboratory of Medical Genetics, Wenzhou Medical College, Wenzhou, PR China.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
186
|
|
187
|
Xu X, Wu J, Xiao J, Tan Y, Bao Q, Zhao F, Li X. PlasmoGF: an integrated system for comparative genomics and phylogenetic analysis of Plasmodium gene families. Bioinformatics 2008; 24:1217-20. [PMID: 18337260 DOI: 10.1093/bioinformatics/btn092] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
UNLABELLED Malaria, one of the world's most common diseases, is caused by the intracellular protozoan parasite known as Plasmodium. Recently, with the arrival of several malaria parasite genomes, we established an integrated system named PlasmoGF for comparative genomics and phylogenetic analysis of Plasmodium gene families. Gene families were clustered using the Markov Cluster algorithm implemented in TribeMCL program and could be searched using keywords, gene-family information, domain composition, Gene Ontology and BLAST. Moreover, a number of useful bioinformatics tools were implemented to facilitate the analysis of these putative Plasmodium gene families, including gene retrieval, annotation, sequence alignment, phylogeny construction and visualization. In the current version, PlasmoGF contained 8980 sets of gene families derived from six malaria parasite genomes: Plasmodium. falciparum, P. berghei, P. knowlesi, P. chabaudi, P. vivax and P. yoelii. The availability of such a highly integrated system would be of great interest for the community of researchers working on malaria parasite phylogenomics. AVAILABILITY PlasmoGF is freely available at http://bioinformatics.zj.cn/pgf/
Collapse
Affiliation(s)
- Xiang Xu
- School of Pharmaceutical Science/Zhejiang Provincial Key Laboratory of Biotechnology Pharmaceutical Engineering, Wenzhou Medical College, Wenzhou 325035, PR China
| | | | | | | | | | | | | |
Collapse
|
188
|
Givan SA, Sullivan CM, Carrington JC. The Personal Sequence Database: a suite of tools to create and maintain web-accessible sequence databases. BMC Bioinformatics 2007; 8:479. [PMID: 18088438 PMCID: PMC2225426 DOI: 10.1186/1471-2105-8-479] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2007] [Accepted: 12/18/2007] [Indexed: 11/18/2022] Open
Abstract
Background Large molecular sequence databases are fundamental resources for modern bioscientists. Whether for project-specific purposes or sharing data with colleagues, it is often advantageous to maintain smaller sequence databases. However, this is usually not an easy task for the average bench scientist. Results We present the Personal Sequence Database (PSD), a suite of tools to create and maintain small- to medium-sized web-accessible sequence databases. All interactions with PSD tools occur via the internet with a web browser. Users may define sequence groups within their database that can be maintained privately or published to the web for public use. A sequence group can be downloaded, browsed, searched by keyword or searched for sequence similarities using BLAST. Publishing a sequence group extends these capabilities to colleagues and collaborators. In addition to being able to manage their own sequence databases, users can enroll sequences in BLASTAgent, a BLAST hit tracking system, to monitor NCBI databases for new entries displaying a specified level of nucleotide or amino acid similarity. Conclusion The PSD offers a valuable set of resources unavailable elsewhere. In addition to managing sequence data and BLAST search results, it facilitates data sharing with colleagues, collaborators and public users. The PSD is hosted by the authors and is available at .
Collapse
Affiliation(s)
- Scott A Givan
- Center for Genome Research and Biocomputing, Oregon State University, Corvallis, Oregon, USA.
| | | | | |
Collapse
|
189
|
Rolland M, Nickle DC, Deng W, Frahm N, Brander C, Learn GH, Heckerman D, Jojic N, Jojic V, Walker BD, Mullins JI. Recognition of HIV-1 peptides by host CTL is related to HIV-1 similarity to human proteins. PLoS One 2007; 2:e823. [PMID: 17786195 PMCID: PMC1952107 DOI: 10.1371/journal.pone.0000823] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2007] [Accepted: 05/18/2007] [Indexed: 12/03/2022] Open
Abstract
Background While human immunodeficiency virus type 1 (HIV-1)-specific cytotoxic T lymphocytes preferentially target specific regions of the viral proteome, HIV-1 features that contribute to immune recognition are not well understood. One hypothesis is that similarities between HIV and human proteins influence the host immune response, i.e., resemblance between viral and host peptides could preclude reactivity against certain HIV epitopes. Methodology/Principal Findings We analyzed the extent of similarity between HIV-1 and the human proteome. Proteins from the HIV-1 B consensus sequence from 2001 were dissected into overlapping k-mers, which were then probed against a non-redundant database of the human proteome in order to identify segments of high similarity. We tested the relationship between HIV-1 similarity to host encoded peptides and immune recognition in HIV-infected individuals, and found that HIV immunogenicity could be partially modulated by the sequence similarity to the host proteome. ELISpot responses to peptides spanning the entire viral proteome evaluated in 314 individuals showed a trend indicating an inverse relationship between the similarity to the host proteome and the frequency of recognition. In addition, analysis of responses by a group of 30 HIV-infected individuals against 944 overlapping peptides representing a broad range of individual HIV-1B Nef variants, affirmed that the degree of similarity to the host was significantly lower for peptides with reactive epitopes than for those that were not recognized. Conclusions/Significance Our results suggest that antigenic motifs that are scarcely represented in human proteins might represent more immunogenic CTL targets not selected against in the host. This observation could provide guidance in the design of more effective HIV immunogens, as sequences devoid of host-like features might afford superior immune reactivity.
Collapse
Affiliation(s)
- Morgane Rolland
- Department of Microbiology, University of Washington, Seattle, Washington, United States of America
| | - David C. Nickle
- Department of Microbiology, University of Washington, Seattle, Washington, United States of America
| | - Wenjie Deng
- Department of Microbiology, University of Washington, Seattle, Washington, United States of America
| | - Nicole Frahm
- Partners AIDS Research Center, Massachusetts General Hospital, Harvard Medical School, Charlestown, Massachusetts, United States of America
| | - Christian Brander
- Partners AIDS Research Center, Massachusetts General Hospital, Harvard Medical School, Charlestown, Massachusetts, United States of America
| | - Gerald H. Learn
- Department of Microbiology, University of Washington, Seattle, Washington, United States of America
| | - David Heckerman
- Machine Learning and Applied Statistics Group, Microsoft Research, Redmond, Washington, United States of America
| | - Nebosja Jojic
- Machine Learning and Applied Statistics Group, Microsoft Research, Redmond, Washington, United States of America
| | - Vladimir Jojic
- Machine Learning and Applied Statistics Group, Microsoft Research, Redmond, Washington, United States of America
| | - Bruce D. Walker
- Partners AIDS Research Center, Massachusetts General Hospital, Harvard Medical School, Charlestown, Massachusetts, United States of America
| | - James I. Mullins
- Department of Microbiology, University of Washington, Seattle, Washington, United States of America
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|