Daoud M. A note on the distance distribution paradigm for Mosaab-metric to process segmented genomes of influenza virus.
Genomics Inform 2020;
18:e7. [PMID:
32224840 PMCID:
PMC7120345 DOI:
10.5808/gi.2020.18.1.e7]
[Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2020] [Revised: 02/27/2020] [Accepted: 02/27/2020] [Indexed: 11/20/2022] Open
Abstract
In this paper, we present few technical notes about the distance distribution paradigm for Mosaab-metric using 1, 2, and 3 grams feature extraction techniques to analyze composite data points in high dimensional feature spaces. This technical analysis will help the specialist in bioinformatics and biotechnology to deeply explore the biodiversity of influenza virus genome as a composite data point. Various technical examples are presented in this paper, in addition, the integrated statistical learning pipeline to process segmented genomes of influenza virus is illustrated as sequential-parallel computational pipeline.
Collapse