Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Joseph JM, Durand D. Family classification without domain chaining. Bioinformatics 2009;25:i45-53. [PMID: 19478015 PMCID: PMC2687961 DOI: 10.1093/bioinformatics/btp207] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

For:	Joseph JM, Durand D. Family classification without domain chaining. Bioinformatics 2009;25:i45-53. [PMID: 19478015 PMCID: PMC2687961 DOI: 10.1093/bioinformatics/btp207] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Number

Cited by Other Article(s)

Ijaz AZ, Ali RH, Sarwar A, Ali Khan T, Baig MM. Importance of Synteny in Homology Inference. 2022 17TH INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES (ICET) 2022. [DOI: 10.1109/icet56601.2022.10004649] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/01/2023]

Gauthier CH, Cresawn SG, Hatfull GF. PhaMMseqs: a new pipeline for constructing phage gene phamilies using MMseqs2. G3 (BETHESDA, MD.) 2022;12:6717792. [PMID: 36161315 PMCID: PMC9635663 DOI: 10.1093/g3journal/jkac233] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/01/2022] [Accepted: 08/30/2022] [Indexed: 06/09/2023]

In-Silico Evaluation of a New Gene From Wheat Reveals the Divergent Evolution of the CAP160 Homologous Genes Into Monocots. J Mol Evol 2019;88:151-163. [PMID: 31820048 DOI: 10.1007/s00239-019-09920-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2019] [Accepted: 11/19/2019] [Indexed: 10/25/2022]

Li L, Bansal MS. An Integrated Reconciliation Framework for Domain, Gene, and Species Level Evolution. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2019;16:63-76. [PMID: 29994126 DOI: 10.1109/tcbb.2018.2846253] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]

GenFamClust: an accurate, synteny-aware and reliable homology inference algorithm. BMC Evol Biol 2016;16:120. [PMID: 27260514 PMCID: PMC4893229 DOI: 10.1186/s12862-016-0684-2] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2015] [Accepted: 05/12/2016] [Indexed: 11/24/2022] Open

Bitard-Feildel T, Kemena C, Greenwood JM, Bornberg-Bauer E. Domain similarity based orthology detection. BMC Bioinformatics 2015;16:154. [PMID: 25968113 PMCID: PMC4443542 DOI: 10.1186/s12859-015-0570-8] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2014] [Accepted: 04/10/2015] [Indexed: 11/10/2022] Open

Abstract

Background

Orthologous protein detection software mostly uses pairwise comparisons of amino-acid sequences to assert whether two proteins are orthologous or not. Accordingly, when the number of sequences for comparison increases, the number of comparisons to compute grows in a quadratic order. A current challenge of bioinformatic research, especially when taking into account the increasing number of sequenced organisms available, is to make this ever-growing number of comparisons computationally feasible in a reasonable amount of time. We propose to speed up the detection of orthologous proteins by using strings of domains to characterize the proteins.

Results

We present two new protein similarity measures, a cosine and a maximal weight matching score based on domain content similarity, and new software, named porthoDom. The qualities of the cosine and the maximal weight matching similarity measures are compared against curated datasets. The measures show that domain content similarities are able to correctly group proteins into their families. Accordingly, the cosine similarity measure is used inside porthoDom, the wrapper developed for proteinortho. porthoDom makes use of domain content similarity measures to group proteins together before searching for orthologs. By using domains instead of amino acid sequences, the reduction of the search space decreases the computational complexity of an all-against-all sequence comparison.

Conclusion

We demonstrate that representing and comparing proteins as strings of discrete domains, i.e. as a concatenation of their unique identifiers, allows a drastic simplification of search space. porthoDom has the advantage of speeding up orthology detection while maintaining a degree of accuracy similar to proteinortho. The implementation of porthoDom is released using python and C++ languages and is available under the GNU GPL licence 3 at http://www.bornberglab.org/pages/porthoda.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-015-0570-8) contains supplementary material, which is available to authorized users.

Collapse

Doerr D, Stoye J, Böcker S, Jahn K. Identifying gene clusters by discovering common intervals in indeterminate strings. BMC Genomics 2015;15 Suppl 6:S2. [PMID: 25571793 PMCID: PMC4274641 DOI: 10.1186/1471-2164-15-s6-s2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open

Massive fungal biodiversity data re-annotation with multi-level clustering. Sci Rep 2014;4:6837. [PMID: 25355642 PMCID: PMC4213798 DOI: 10.1038/srep06837] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2014] [Accepted: 10/10/2014] [Indexed: 11/08/2022] Open

Zheng C, Kononenko A, Leebens-Mack J, Lyons E, Sankoff D. Gene families as soft cliques with backbones: Amborella contrasted with other flowering plants. BMC Genomics 2014;15 Suppl 6:S8. [PMID: 25572777 PMCID: PMC4240082 DOI: 10.1186/1471-2164-15-s6-s8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open

Ali RH, Muhammad S, Khan M, Arvestad L. Quantitative synteny scoring improves homology inference and partitioning of gene families. BMC Bioinformatics 2014;14 Suppl 15:S12. [PMID: 24564516 PMCID: PMC3852004 DOI: 10.1186/1471-2105-14-s15-s12] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Automatic identification of highly conserved family regions and relationships in genome wide datasets including remote protein sequences. PLoS One 2013;8:e75458. [PMID: 24069417 PMCID: PMC3771926 DOI: 10.1371/journal.pone.0075458] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2013] [Accepted: 08/19/2013] [Indexed: 11/19/2022] Open

Ramos-Silva P, Kaandorp J, Huisman L, Marie B, Zanella-Cléon I, Guichard N, Miller DJ, Marin F. The skeletal proteome of the coral Acropora millepora: the evolution of calcification by co-option and domain shuffling. Mol Biol Evol 2013;30:2099-112. [PMID: 23765379 PMCID: PMC3748352 DOI: 10.1093/molbev/mst109] [Citation(s) in RCA: 122] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open

Lopez FJ, Bernabeu M, Fernandez-Becerra C, del Portillo HA. A new computational approach redefines the subtelomeric vir superfamily of Plasmodium vivax. BMC Genomics 2013;14:8. [PMID: 23324551 PMCID: PMC3566924 DOI: 10.1186/1471-2164-14-8] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2012] [Accepted: 01/02/2013] [Indexed: 01/20/2023] Open

Miele V, Penel S, Duret L. Ultra-fast sequence clustering from similarity networks with SiLiX. BMC Bioinformatics 2011;12:116. [PMID: 21513511 PMCID: PMC3095554 DOI: 10.1186/1471-2105-12-116] [Citation(s) in RCA: 212] [Impact Index Per Article: 15.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2010] [Accepted: 04/22/2011] [Indexed: 01/04/2023] Open

Baumbach J. On the power and limits of evolutionary conservation--unraveling bacterial gene regulatory networks. Nucleic Acids Res 2010;38:7877-84. [PMID: 20699275 PMCID: PMC3001071 DOI: 10.1093/nar/gkq699] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open