1
|
Si Y, Li H, Li X. Difference Analysis Among Six Kinds of Acceptor Splicing Sequences by the Dispersion Features of 6-mer Subsets in Human Genes. BIOLOGY 2025; 14:206. [PMID: 40001974 PMCID: PMC11853274 DOI: 10.3390/biology14020206] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/08/2025] [Revised: 02/07/2025] [Accepted: 02/13/2025] [Indexed: 02/27/2025]
Abstract
Identifying the sequence composition of different splicing modes is a challenge in current research. This study explored the dispersion distributions of 6-mer subsets in human acceptor splicing regions. Without differentiating acceptor splicing modes, obvious differences were observed across the upstream, core, and downstream regions of splicing sites for 16 dispersion distributions. These findings indicate that the dispersion value of each subset can effectively characterize the compositional properties of splicing sequences. When acceptor splicing sequences were classified into common, constitutive, and alternative modes, the differences in dispersion distributions for most of the XY1 6-mer subsets were significant among the three splicing modes. Furthermore, the alternative splicing mode was classified into normal, exonic, and intronic sub-modes, the differences in dispersion distributions for most of the XY1 6-mer subsets were also significant among the three splicing sub-modes. Our results indicate that dispersion values of XY1 6-mer subsets not only revealed the sequence composition patterns of acceptor splicing regions but also effectively identified the differences in base correlation among various acceptor splicing modes. Our research provides new insights into revealing and predicting different splicing modes.
Collapse
Affiliation(s)
| | - Hong Li
- Inner Mongolia Autonomous Region Key Laboratory of Biophysics and Bioinformatics, School of Physical Science and Technology, Inner Mongolia University, Hohhot 010021, China; (Y.S.); (X.L.)
| | | |
Collapse
|
2
|
Li X, Li H, Yang Z, Wang L. Distribution rules of 8-mer spectra and characterization of evolution state in animal genome sequences. BMC Genomics 2024; 25:855. [PMID: 39266973 PMCID: PMC11391722 DOI: 10.1186/s12864-024-10786-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2024] [Accepted: 09/09/2024] [Indexed: 09/14/2024] Open
Abstract
BACKGROUND Studying the composition rules and evolution mechanisms of genome sequences are core issues in the post-genomic era, and k-mer spectrum analysis of genome sequences is an effective means to solve this problem. RESULT We divided total 8-mers of genome sequences into 16 kinds of XY-type due to XY dinucleotides number in 8-mers. Previous works explored that the independent unimodal distributions observed only in three CG-type 8-mer spectra, while non-CG type 8-mer spectra have not the universal phenomenon from prokaryotes to eukaryotes. On this basis, we analyzed the distribution variation of non-CG type 8-mer spectra across 889 animal genome sequences. Following the evolutionary order of animals from primitive to more complex, we found that the spectrum distributions gradually transition from unimodal to tri-modal. The relative distance from the average frequency of each non-CG type 8-mers to the center frequency is different within a species and among different species. For the 8-mers contain CG dinucleotides, we further divided these into 16 subsets, where each 8-mer contains both CG and XY dinucleotides, called XY1_CG1 subsets. We found that the separability values of XY1_CG1 spectra are closely related to the evolution and specificity of animals. Considering the constraint of Chargaff's second parity rule, we finally obtained 10 separability values as the feature set to characterize the evolution state of genome sequences. In order to verify the rationality of the feature set, we used 14 common classification algorithms to perform binary classification tests. The results showed that the accuracy (Acc) ranged between 98.70% and 83.88% among birds, other vertebrates and mammals. CONCLUSION We proposed a credible feature set to characterizes the evolution state of genomes and obtained satisfied results by the feature set on large scale classification of animals.
Collapse
Affiliation(s)
- Xiaolong Li
- Laboratory of Theoretical Biophysics, School of Physical Science and Technology, Inner Mongolia University, Hohhot, 010021, China
| | - Hong Li
- Laboratory of Theoretical Biophysics, School of Physical Science and Technology, Inner Mongolia University, Hohhot, 010021, China.
| | - Zhenhua Yang
- School of Economics and Management, Inner Mongolia University of Science and Technology, Baotou, 014010, China
| | - Lu Wang
- Laboratory of Theoretical Biophysics, School of Physical Science and Technology, Inner Mongolia University, Hohhot, 010021, China
| |
Collapse
|
3
|
Ma B, Khan R, Raza SHA, Gao Z, Hou S, Ullah F, Hassan MM, Hassan MM, AlGabbani Q, Alotaibi MA, Shah MA, Gui L. Determination of the relationship between class IV sirtuin genes and growth traits in Chinese black Tibetan sheep. Anim Biotechnol 2021:1-7. [PMID: 34918617 DOI: 10.1080/10495398.2021.2016434] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Class IV sirtuin (SIRT6 and SIRT7) played essential roles in biometabolism processes via deacetylating specific transcription factors. The present study was conducted to search for mutations in SIRT6/7 and determine their associations with growth traits in black Tibetan sheep. Via DNA sequencing methods, three single-nucleotide polymorphisms (SNPs) were identified in 427 ewes, including a mutation (g.3724C > T) in the intron 1 of SIRT6 and two mutations (g.3668G > T and g.4223C > G) in SIRT7 intron 6 and 8, respectively. Based on the χ2 test, both g.3724C > T and g.4223C > G loci fitted with Hardy-Weinberg equilibrium (p > 0.05). Compared with animals with genotype TT, the CC genotype at g.3724C > T locus (SIRT6) exhibited the highest mean for body weight (p < 0.05) and heart girth (p < 0.05). At g.3668G > T locus (SIRT7), individuals carrying the GG genotype tended to have heavier body weight than those of TT genotype (p < 0.05). With the exception of body weight, body measurement traits not affected by combinative genotype (p > 0.05). Our results could be used as genetic markers for marker-assisted selection and maybe guide sheep breeding in economic traits.
Collapse
Affiliation(s)
- Boyan Ma
- College of Agriculture and Animal Husbandry, Qinghai University, Xining, China
| | - Rajwali Khan
- Department of Livestock Management, Breeding and Genetics The University of Agriculture Peshawar, Peshawar, Pakistan
| | | | - Zhanhong Gao
- College of Agriculture and Animal Husbandry, Qinghai University, Xining, China
| | - Shengzhen Hou
- College of Agriculture and Animal Husbandry, Qinghai University, Xining, China
| | - Farman Ullah
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Education, College of Animal Science and Technology, Huazhong Agricultural University, Wuhan, China
| | - Montaser M Hassan
- Department of Biology, College of Science, Taif University, Taif, Saudi Arabia
| | - Mohamed M Hassan
- Department of Biology, College of Science, Taif University, Taif, Saudi Arabia
| | - Qwait AlGabbani
- Department of Biology, College of Sciences and Humanities, Prince Sattam Bin Abdulaziz University, Al-Kharj, Saudi Arabia
| | | | - Mujahid Ali Shah
- Faculty of Fisheries and Protection of Water, University of South Bohemia in Ceske Budejovice, Czech Republic
| | - Linsheng Gui
- College of Agriculture and Animal Husbandry, Qinghai University, Xining, China
| |
Collapse
|
4
|
Yang Z, Li H, Jia Y, Zheng Y, Meng H, Bao T, Li X, Luo L. Intrinsic laws of k-mer spectra of genome sequences and evolution mechanism of genomes. BMC Evol Biol 2020; 20:157. [PMID: 33228538 PMCID: PMC7684957 DOI: 10.1186/s12862-020-01723-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2020] [Accepted: 11/10/2020] [Indexed: 11/17/2022] Open
Abstract
Background K-mer spectra of DNA sequences contain important information about sequence composition and sequence evolution. We want to reveal the evolution rules of genome sequences by studying the k-mer spectra of genome sequences. Results The intrinsic laws of k-mer spectra of 920 genome sequences from primate to prokaryote were analyzed. We found that there are two types of evolution selection modes in genome sequences, named as CG Independent Selection and TA Independent Selection. There is a mutual inhibition relationship between CG and TA independent selections. We found that the intensity of CG and TA independent selections correlates closely with genome evolution and G + C content of genome sequences. The living habits of species are related closely to the independent selection modes adopted by species genomes. Consequently, we proposed an evolution mechanism of genomes in which the genome evolution is determined by the intensities of the CG and TA independent selections and the mutual inhibition relationship. Besides, by the evolution mechanism of genomes, we speculated the evolution modes of prokaryotes in mild and extreme environments in the anaerobic age and the evolving process of prokaryotes from anaerobic to aerobic environment on earth as well as the originations of different eukaryotes. Conclusion We found that there are two independent selection modes in genome sequences. The evolution of genome sequence is determined by the two independent selection modes and the mutual inhibition relationship between them.
Collapse
Affiliation(s)
- Zhenhua Yang
- Laboratory of Theoretical Biophysics, School of Physical Science & Technology, Inner Mongolia University, Hohhot, 010021, China.,School of Economics and Management, Inner Mongolia University of Science & Technology, Baotou, 014010, China
| | - Hong Li
- Laboratory of Theoretical Biophysics, School of Physical Science & Technology, Inner Mongolia University, Hohhot, 010021, China.
| | - Yun Jia
- College of Science, Inner Mongolia University of Technology, Hohhot, 010051, China
| | - Yan Zheng
- Baotou Medical College, Inner Mongolia University of Science & Technology, Baotou, 014040, China
| | - Hu Meng
- School of Life Science & Technology, Inner Mongolia University of Science & Technology, Baotou, 014010, China
| | - Tonglaga Bao
- Laboratory of Theoretical Biophysics, School of Physical Science & Technology, Inner Mongolia University, Hohhot, 010021, China
| | - Xiaolong Li
- Laboratory of Theoretical Biophysics, School of Physical Science & Technology, Inner Mongolia University, Hohhot, 010021, China
| | - Liaofu Luo
- Laboratory of Theoretical Biophysics, School of Physical Science & Technology, Inner Mongolia University, Hohhot, 010021, China
| |
Collapse
|