1
|
Ieger-Raittz R, De Pierri CR, Perico CP, Costa FDF, Bana EG, Vicenzi L, Machado DDJS, Marchaukoski JN, Raittz RT. What are we learning with Yoga? Mapping the scientific literature on Yoga using a vector-text-mining approach. PLoS One 2025; 20:e0322791. [PMID: 40440353 PMCID: PMC12121831 DOI: 10.1371/journal.pone.0322791] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2024] [Accepted: 03/27/2025] [Indexed: 06/02/2025] Open
Abstract
The techniques used in yoga have roots in traditions that precede modern science. Research shows that yoga enhances quality of life and well-being, positively impacting physical and mental health. As yoga gains acceptance in Western countries, scientific studies on the subject increase exponentially. However, many of these studies are considered inconsistent due to the diverse methodologies and focuses in the field, which creates challenges for researchers and hampers progress. This study aims to develop a comprehensive framework for existing literature on yoga, facilitating multidisciplinary collaboration and bringing new light to relevant aspects. Given the complexity of the subject, advanced modeling techniques are necessary. Contemporary artificial intelligence methods have advanced Bioinformatics, including text mining (TM), allowing us to employ vector representations of texts to derive semantic insights and organize literature effectively. Based on TM resources, we provided a better general understanding of yoga and highlighted the relationships between yoga practice and various domains, including biochemical parameters and neuroscience. It also reveals that practitioners can learn to engage with their bodies and environments actively, enhancing their quality of life. However, there is a lack of research exploring the mechanisms behind this learning and its potential for further enhancement. Vector TM has made it possible to bolster and improve human analysis. The set of resources developed allowed us to determine the mapping of the literature, the analysis of which revealed 4 dimensions (exercise, physiology, theory and therapeutic) divided into 9 cohesive groups, representing the trends in the literature. The resulting platforms are available to Yoga researchers to evaluate our findings and make their forays into the existing literature.
Collapse
Affiliation(s)
- Rosangela Ieger-Raittz
- Graduate Program in Physical Exercise Medicine in Health Promotion, Health Sciences Sector, Federal University of Paraná, Curitiba, Paraná, Brazil
| | - Camilla Reginatto De Pierri
- Laboratory of Artificial Intelligence Applied to Bioinformatics, SEPT, Federal University of Paraná, Curitiba, Paraná, Brazil
- Department of Biochemistry and Molecular Biology, Federal University of Paraná, Curitiba, Paraná, Brazil
| | - Camila Pereira Perico
- Laboratory of Artificial Intelligence Applied to Bioinformatics, SEPT, Federal University of Paraná, Curitiba, Paraná, Brazil
- Graduate Program in Bioinformatics, SEPT, Federal University of Paraná, Curitiba, Paraná, Brazil
- Associate Graduate Program in Bioinformatics, SEPT, Federal University of Paraná, Curitiba, Paraná, Brazil
| | - Flavia de Fatima Costa
- Laboratory of Artificial Intelligence Applied to Bioinformatics, SEPT, Federal University of Paraná, Curitiba, Paraná, Brazil
- Graduate Program in Bioinformatics, SEPT, Federal University of Paraná, Curitiba, Paraná, Brazil
- Associate Graduate Program in Bioinformatics, SEPT, Federal University of Paraná, Curitiba, Paraná, Brazil
| | - Elisa Garbin Bana
- Laboratory of Artificial Intelligence Applied to Bioinformatics, SEPT, Federal University of Paraná, Curitiba, Paraná, Brazil
- Graduate Program in Bioinformatics, SEPT, Federal University of Paraná, Curitiba, Paraná, Brazil
- Associate Graduate Program in Bioinformatics, SEPT, Federal University of Paraná, Curitiba, Paraná, Brazil
| | - Leonardo Vicenzi
- Laboratory of Artificial Intelligence Applied to Bioinformatics, SEPT, Federal University of Paraná, Curitiba, Paraná, Brazil
- Graduate Program in Bioinformatics, SEPT, Federal University of Paraná, Curitiba, Paraná, Brazil
- Associate Graduate Program in Bioinformatics, SEPT, Federal University of Paraná, Curitiba, Paraná, Brazil
| | - Diogo de Jesus Soares Machado
- Laboratory of Artificial Intelligence Applied to Bioinformatics, SEPT, Federal University of Paraná, Curitiba, Paraná, Brazil
- Graduate Program in Bioinformatics, SEPT, Federal University of Paraná, Curitiba, Paraná, Brazil
- Associate Graduate Program in Bioinformatics, SEPT, Federal University of Paraná, Curitiba, Paraná, Brazil
| | - Jeroniza Nunes Marchaukoski
- Laboratory of Artificial Intelligence Applied to Bioinformatics, SEPT, Federal University of Paraná, Curitiba, Paraná, Brazil
- Graduate Program in Bioinformatics, SEPT, Federal University of Paraná, Curitiba, Paraná, Brazil
- Associate Graduate Program in Bioinformatics, SEPT, Federal University of Paraná, Curitiba, Paraná, Brazil
| | - Roberto Tadeu Raittz
- Laboratory of Artificial Intelligence Applied to Bioinformatics, SEPT, Federal University of Paraná, Curitiba, Paraná, Brazil
- Graduate Program in Bioinformatics, SEPT, Federal University of Paraná, Curitiba, Paraná, Brazil
- Associate Graduate Program in Bioinformatics, SEPT, Federal University of Paraná, Curitiba, Paraná, Brazil
| |
Collapse
|
2
|
Pimenta-Zanon MH, Kashiwabara AY, Vanzela ALL, Lopes FM. GRAMEP: an alignment-free method based on the maximum entropy principle for identifying SNPs. BMC Bioinformatics 2025; 26:66. [PMID: 40000933 PMCID: PMC11863517 DOI: 10.1186/s12859-025-06037-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2024] [Accepted: 01/06/2025] [Indexed: 02/27/2025] Open
Abstract
BACKGROUND Advances in high throughput sequencing technologies provide a huge number of genomes to be analyzed. Thus, computational methods play a crucial role in analyzing and extracting knowledge from the data generated. Investigating genomic mutations is critical because of their impact on chromosomal evolution, genetic disorders, and diseases. It is common to adopt aligning sequences for analyzing genomic variations. However, this approach can be computationally expensive and restrictive in scenarios with large datasets. RESULTS We present a novel method for identifying single nucleotide polymorphisms (SNPs) in DNA sequences from assembled genomes. This study proposes GRAMEP, an alignment-free approach that adopts the principle of maximum entropy to discover the most informative k-mers specific to a genome or set of sequences under investigation. The informative k-mers enable the detection of variant-specific mutations in comparison to a reference genome or other set of sequences. In addition, our method offers the possibility of classifying novel sequences with no need for organism-specific information. GRAMEP demonstrated high accuracy in both in silico simulations and analyses of viral genomes, including Dengue, HIV, and SARS-CoV-2. Our approach maintained accurate SARS-CoV-2 variant identification while demonstrating a lower computational cost compared to methods with the same purpose. CONCLUSIONS GRAMEP is an open and user-friendly software based on maximum entropy that provides an efficient alignment-free approach to identifying and classifying unique genomic subsequences and SNPs with high accuracy, offering advantages over comparative methods. The instructions for use, applicability, and usability of GRAMEP are open access at https://github.com/omatheuspimenta/GRAMEP .
Collapse
Affiliation(s)
- Matheus Henrique Pimenta-Zanon
- Computer Science Department, Universidade Tecnológica Federal do Paraná (UTFPR), Alberto Carazzai, 1640, Cornélio Procópio, Paraná, 86300-000, Brazil
| | - André Yoshiaki Kashiwabara
- Computer Science Department, Universidade Tecnológica Federal do Paraná (UTFPR), Alberto Carazzai, 1640, Cornélio Procópio, Paraná, 86300-000, Brazil
| | - André Luís Laforga Vanzela
- Laboratory of Cytogenetics and Plant Diversity, Department of General Biology, Universidade Estadual de Londrina (UEL), Rodovia Celso Garcia Cid, PR-445, Km 380, Londrina, Paraná, 86057-970, Brazil
| | - Fabricio Martins Lopes
- Computer Science Department, Universidade Tecnológica Federal do Paraná (UTFPR), Alberto Carazzai, 1640, Cornélio Procópio, Paraná, 86300-000, Brazil.
| |
Collapse
|
3
|
Nichio BTDL, Chaves RBR, Pedrosa FDO, Raittz RT. Exploring diazotrophic diversity: unveiling Nif core distribution and evolutionary patterns in nitrogen-fixing organisms. BMC Genomics 2025; 26:81. [PMID: 39871141 PMCID: PMC11773926 DOI: 10.1186/s12864-024-10994-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2024] [Accepted: 11/05/2024] [Indexed: 01/29/2025] Open
Abstract
BACKGROUND Diazotrophs carry out biological nitrogen fixation (BNF) using the nitrogenase enzyme complex (NEC), which relies on nitrogenase encoded by nif genes. Horizontal gene transfer (HGT) and gene duplications have created significant diversity among these genes, making it challenging to identify potential diazotrophs. Previous studies have established a minimal set of Nif proteins, known as the Nif core, which includes NifH, NifD, NifK, NifE, NifN, and NifB. This study aimed to identify potential diazotroph groups based on the Nif core and to analyze the inheritance patterns of accessory Nif proteins related to Mo-nitrogenase, along with their impact on N2 fixation maintenance. RESULTS In a systematic study, 118 diazotrophs were identified, resulting in a database of 2,156 Nif protein sequences obtained with RAFTS³G. Using this Nif database and a data mining strategy, we extended our analysis to 711 species and found that 544 contain the Nif core. A partial Nif core set was observed in eight species in this study. Finally, we cataloged 662 species with Nif core, of which 52 were novel. Our analysis generated 10,076 Nif proteins from these species and revealed some Nif core duplications. Additionally, we determined the optimal cluster value (k = 10) for analyzing diazotrophic diversity. Combining synteny and phylogenetic analyses revealed distinct syntenies in the nif gene composition across ten groups. CONCLUSIONS This study advances our understanding of the distribution of nif genes, aiding in the prediction and classification of N₂-fixing organisms. Furthermore, we present a comprehensive overview of the diversity, distribution, and evolutionary relationships among diazotrophic organisms associated with the Nif core. The analysis revealed the phylogenetic and functional organization of different groups, identifying synteny patterns and new nif gene arrangements across various bacterial and archaeal species.The identified groups serve as a valuable framework for further exploration of the molecular mechanisms underlying biological nitrogen fixation and its evolutionary significance across different bacterial lineages.
Collapse
Affiliation(s)
- Bruno Thiago de Lima Nichio
- Laboratory of Artificial Intelligence Applied to Bioinformatics, Professional and Technical Education Sector - SEPT, UFPR, Curitiba, Paraná, Brazil
- Department of Biochemistry, Biological Sciences Sector, Federal University of Paraná (UFPR), Curitiba, Paraná, Brazil
| | - Roxana Beatriz Ribeiro Chaves
- Department of Biochemistry, Biological Sciences Sector, Federal University of Paraná (UFPR), Curitiba, Paraná, Brazil
| | - Fábio de Oliveira Pedrosa
- Laboratory of Artificial Intelligence Applied to Bioinformatics, Professional and Technical Education Sector - SEPT, UFPR, Curitiba, Paraná, Brazil
- Department of Biochemistry, Biological Sciences Sector, Federal University of Paraná (UFPR), Curitiba, Paraná, Brazil
| | - Roberto Tadeu Raittz
- Laboratory of Artificial Intelligence Applied to Bioinformatics, Professional and Technical Education Sector - SEPT, UFPR, Curitiba, Paraná, Brazil.
- Department of Biochemistry, Biological Sciences Sector, Federal University of Paraná (UFPR), Curitiba, Paraná, Brazil.
| |
Collapse
|