1
|
Heslop R, Gao M, Brito Lira A, Sternlieb T, Loock M, Sanghi SR, Cestari I. Genome-Wide Libraries for Protozoan Pathogen Drug Target Screening Using Yeast Surface Display. ACS Infect Dis 2023; 9:1078-1091. [PMID: 37083339 PMCID: PMC10187560 DOI: 10.1021/acsinfecdis.2c00568] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2022] [Indexed: 04/22/2023]
Abstract
The lack of genetic tools to manipulate protozoan pathogens has limited the use of genome-wide approaches to identify drug or vaccine targets and understand these organisms' biology. We have developed an efficient method to construct genome-wide libraries for yeast surface display (YSD) and developed a YSD fitness screen (YSD-FS) to identify drug targets. We show the efficacy of our method by generating genome-wide libraries for Trypanosoma brucei, Trypanosoma cruzi, and Giardia lamblia parasites. Each library has a diversity of ∼105 to 106 clones, representing ∼6- to 30-fold of the parasite's genome. Nanopore sequencing confirmed the libraries' genome coverage with multiple clones for each parasite gene. Western blot and imaging analysis confirmed surface expression of the G. lamblia library proteins in yeast. Using the YSD-FS assay, we identified bonafide interactors of metronidazole, a drug used to treat protozoan and bacterial infections. We also found enrichment in nucleotide-binding domain sequences associated with yeast increased fitness to metronidazole, indicating that this drug might target multiple enzymes containing nucleotide-binding domains. The libraries are valuable biological resources for discovering drug or vaccine targets, ligand receptors, protein-protein interactions, and pathogen-host interactions. The library assembly approach can be applied to other organisms or expression systems, and the YSD-FS assay might help identify new drug targets in protozoan pathogens.
Collapse
Affiliation(s)
- Rhiannon Heslop
- Institute
of Parasitology, McGill University, Ste Anne de Bellevue, Montreal, QC H9X 3V9, Canada
- Faculté
de Pharmacie de Tours, 31, Avenue Monge, 37200 Tours, France
| | - Mengjin Gao
- Institute
of Parasitology, McGill University, Ste Anne de Bellevue, Montreal, QC H9X 3V9, Canada
| | - Andressa Brito Lira
- Institute
of Parasitology, McGill University, Ste Anne de Bellevue, Montreal, QC H9X 3V9, Canada
| | - Tamara Sternlieb
- Institute
of Parasitology, McGill University, Ste Anne de Bellevue, Montreal, QC H9X 3V9, Canada
| | - Mira Loock
- Institute
of Parasitology, McGill University, Ste Anne de Bellevue, Montreal, QC H9X 3V9, Canada
| | - Sahil Rao Sanghi
- Institute
of Parasitology, McGill University, Ste Anne de Bellevue, Montreal, QC H9X 3V9, Canada
| | - Igor Cestari
- Institute
of Parasitology, McGill University, Ste Anne de Bellevue, Montreal, QC H9X 3V9, Canada
- Division
of Experimental Medicine, McGill University, Montreal, QC H4A 3J1, Canada
| |
Collapse
|
2
|
Identification and motif analyses of candidate nonreceptor olfactory genes of Dendroctonus adjunctus Blandford (Coleoptera: Curculionidae) from the head transcriptome. Sci Rep 2020; 10:20695. [PMID: 33244016 PMCID: PMC7691339 DOI: 10.1038/s41598-020-77144-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2020] [Accepted: 11/06/2020] [Indexed: 01/05/2023] Open
Abstract
The round-headed pine beetle Dendroctonus adjunctus, whose dispersion and colonization behaviors are linked to a communication system mediated by semiochemicals, is one of the five most critical primary pests in forest ecosystems in Mexico. This study provides the first head transcriptome analysis of D. adjunctus and the identification of the nonreceptor olfactory genes involved in the perception of odors. De novo assembly yielded 44,420 unigenes, and GO annotations were similar to those of antennal transcriptomes of other beetle species, which reflect metabolic processes related to smell and signal transduction. A total of 36 new transcripts of nonreceptor olfactory genes were identified, of which 27 encode OBPs, 7 encode CSPs, and 2 encode SNMP candidates, which were subsequently compared to homologous proteins from other bark beetles and Coleoptera species by searching for sequence motifs and performing phylogenetic analyses. Our study provides information on genes encoding nonreceptor proteins in D. adjunctus and broadens the knowledge of olfactory genes in Coleoptera and bark beetle species, and will help to understand colonization and aggregation behaviors for the development of tools that complement management strategies.
Collapse
|
3
|
Experimental support for the evolution of symmetric protein architecture from a simple peptide motif. Proc Natl Acad Sci U S A 2010; 108:126-30. [PMID: 21173271 DOI: 10.1073/pnas.1015032108] [Citation(s) in RCA: 88] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
The majority of protein architectures exhibit elements of structural symmetry, and "gene duplication and fusion" is the evolutionary mechanism generally hypothesized to be responsible for their emergence from simple peptide motifs. Despite the central importance of the gene duplication and fusion hypothesis, experimental support for a plausible evolutionary pathway for a specific protein architecture has yet to be effectively demonstrated. To address this question, a unique "top-down symmetric deconstruction" strategy was utilized to successfully identify a simple peptide motif capable of recapitulating, via gene duplication and fusion processes, a symmetric protein architecture (the threefold symmetric β-trefoil fold). The folding properties of intermediary forms in this deconstruction agree precisely with a previously proposed "conserved architecture" model for symmetric protein evolution. Furthermore, a route through foldable sequence-space between the simple peptide motif and extant protein fold is demonstrated. These results provide compelling experimental support for a plausible evolutionary pathway of symmetric protein architecture via gene duplication and fusion processes.
Collapse
|
4
|
Katoh K, Kuma KI, Toh H, Miyata T. MAFFT version 5: improvement in accuracy of multiple sequence alignment. Nucleic Acids Res 2005; 33:511-8. [PMID: 15661851 PMCID: PMC548345 DOI: 10.1093/nar/gki198] [Citation(s) in RCA: 3333] [Impact Index Per Article: 175.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
The accuracy of multiple sequence alignment program MAFFT has been improved. The new version (5.3) of MAFFT offers new iterative refinement options, H-INS-i, F-INS-i and G-INS-i, in which pairwise alignment information are incorporated into objective function. These new options of MAFFT showed higher accuracy than currently available methods including TCoffee version 2 and CLUSTAL W in benchmark tests consisting of alignments of >50 sequences. Like the previously available options, the new options of MAFFT can handle hundreds of sequences on a standard desktop computer. We also examined the effect of the number of homologues included in an alignment. For a multiple alignment consisting of ∼8 sequences with low similarity, the accuracy was improved (2–10 percentage points) when the sequences were aligned together with dozens of their close homologues (E-value < 10−5–10−20) collected from a database. Such improvement was generally observed for most methods, but remarkably large for the new options of MAFFT proposed here. Thus, we made a Ruby script, mafftE.rb, which aligns the input sequences together with their close homologues collected from SwissProt using NCBI-BLAST.
Collapse
Affiliation(s)
- Kazutaka Katoh
- Bioinformatics Center, Institute for Chemical Research, Kyoto University Uji, Kyoto 611-0011, Japan.
| | | | | | | |
Collapse
|
5
|
Basu A, Chaudhuri P, Majumder PP. Identification of polymorphic motifs using probabilistic search algorithms. Genome Res 2005; 15:67-77. [PMID: 15632091 PMCID: PMC540278 DOI: 10.1101/gr.2358005] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2004] [Accepted: 10/21/2004] [Indexed: 01/12/2023]
Abstract
The problem of identifying motifs comprising nucleotides at a set of polymorphic DNA sites, not necessarily contiguous, arises in many human genetic problems. However, when the sites are not contiguous, no efficient algorithm exists for polymorphic motif identification. A search based on complete enumeration is computationally inefficient. We have developed probabilistic search algorithms to discover motifs of known or unknown lengths. We have developed statistical tests of significance for assessing a motif discovery, and a statistical criterion for simultaneously estimating motif length and discovering it. We have tested these algorithms on various synthetic data sets and have shown that they are very efficient, in the sense that the "true" motifs can be detected in the vast majority of replications and in a small number of iterations. Additionally, we have applied them to some real data sets and have shown that they are able to identify known motifs. In certain applications, it is pertinent to find motifs that contain contrasting nucleotides at the sites included in the motif (e.g., motifs identified in case-control association studies). For this, we have suggested appropriate modifications. Using simulations, we have discovered that the success rate of identification of the correct motif is high in case-control studies except when relative risks are small. Our analyses of evolutionary data sets resulted in the identification of some motifs that appear to have important implications on human evolutionary inference. These algorithms can easily be implemented to discover motifs from multilocus genotype data by simple numerical recoding of genotypes.
Collapse
Affiliation(s)
- Analabha Basu
- Human Genetics Unit, Indian Statistical Institute, Kolkata, 700108 India
| | | | | |
Collapse
|
6
|
Katoh K, Misawa K, Kuma KI, Miyata T. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res 2002; 30:3059-66. [PMID: 12136088 PMCID: PMC135756 DOI: 10.1093/nar/gkf436] [Citation(s) in RCA: 9337] [Impact Index Per Article: 424.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
A multiple sequence alignment program, MAFFT, has been developed. The CPU time is drastically reduced as compared with existing methods. MAFFT includes two novel techniques. (i) Homo logous regions are rapidly identified by the fast Fourier transform (FFT), in which an amino acid sequence is converted to a sequence composed of volume and polarity values of each amino acid residue. (ii) We propose a simplified scoring system that performs well for reducing CPU time and increasing the accuracy of alignments even for sequences having large insertions or extensions as well as distantly related sequences of similar length. Two different heuristics, the progressive method (FFT-NS-2) and the iterative refinement method (FFT-NS-i), are implemented in MAFFT. The performances of FFT-NS-2 and FFT-NS-i were compared with other methods by computer simulations and benchmark tests; the CPU time of FFT-NS-2 is drastically reduced as compared with CLUSTALW with comparable accuracy. FFT-NS-i is over 100 times faster than T-COFFEE, when the number of input sequences exceeds 60, without sacrificing the accuracy.
Collapse
Affiliation(s)
- Kazutaka Katoh
- Department of Biophysics, Graduate School of Science, Kyoto University, Kyoto 606-8502, Japan
| | | | | | | |
Collapse
|
7
|
Kunihiro S, Kawanishi Y, Sano M, Naito K, Matsuura Y, Tateno Y, Gojobori T, Yamagata Y, Abe K, Machida M. A polymerase chain reaction-based method for cloning novel members of a gene family using a combination of degenerate and inhibitory primers. Gene 2002; 289:177-84. [PMID: 12036596 DOI: 10.1016/s0378-1119(02)00547-4] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
We have developed a novel method for cloning gene family members by using a polymerase chain reaction technique. The method is based on the amplification of a broad range of homologous genes in combination with the specific inhibition of already cloned genes. To accomplish this, we designed degenerate primers to highly conserved regions among the gene family members, and inhibitory primers to the divergent region at the 3'-margin of each degenerate primer. The 5'-end of the inhibitory primer, the 3'-end of which was aminated, had 3-4 bases overlapping the 3'-end of the degenerate primer. The potential of this method was demonstrated by the successful cloning of a novel member of the yeast MKC7/YAP3 gene family homologue from a filamentous fungus, Aspergillus oryzae, by inhibiting amplification of an already cloned homologue, opsB.
Collapse
Affiliation(s)
- Sumiko Kunihiro
- Institute of Molecular and Cell Biology, National Institute of Advanced Industrial Science and Technology, Central 6, 1-1, Higashi, Tsukuba, Ibaraki 305-8566, Japan
| | | | | | | | | | | | | | | | | | | |
Collapse
|
8
|
Endo T, Imanishi T, Gojobori T, Inoko H. Evolutionary significance of intra-genome duplications on human chromosomes. Gene 1997; 205:19-27. [PMID: 9461376 DOI: 10.1016/s0378-1119(97)00478-2] [Citation(s) in RCA: 40] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Phylogenetic analyses indicated that a series of paralogous gene pairs, found in two extensive regions on human chromosomal bands 6p21.3 and 9q33-34, were created by at least two independent duplications. The duplicated genes on chromosomal band 6p21.3 include the genes for type 11 collagen alpha2 subunit (COL11A2), NOTCH4 (mouse int-3 homologue), 70 kDa heat shock protein (HSPA1A, HSPA1B, and HSPA1L), valyl-tRNA synthetase 2 (VARS2), complement components (C2 and C4), pre-B cell leukemia transcription factor 2 (PBX2), retinoid X receptor beta (RXRB), NAT/RING3, and four other proteins. Their paralogous genes on chromosomal band 9q33-34 are genes for type 5 collagen alpha1 subunit (COL5A1), NOTCH1, 78 kDa glucose-regulated protein (HSPA5), valyl-tRNA synthetase 1 (VARS1), complement component V (C5), PBX3, retinoid X receptor alpha (RXRA), ORFX/RING3L, and others. Among these, the genes for collagen, complement components, NAT/RING3, PBX, and RXR appear to have been duplicated around the time of vertebrate emergence, supporting the idea that they were duplicated simultaneously at that time. Another group of genes that includes NOTCH and HSP appear to have diverged long before that time. A comparison of the physical maps of these two regions revealed that the genes which duplicated in the same period were arranged in almost the same order in the two regions, with the assumption of a few chromosomal rearrangements. We propose a possible model for the evolution of these regions, taking into account the molecular mechanisms of regional duplication, gene duplication, translocation, and inversion. We also propose that a comparative mapping of paralogous genes within the human genome would be useful for identifying new genes.
Collapse
Affiliation(s)
- T Endo
- Center for Information Biology, National Institute of Genetics, Mishima, Shizuoka, Japan
| | | | | | | |
Collapse
|