1
|
Lima RAT, Garay AV, Frederico TD, de Oliveira GM, Quirino BF, Barbosa JARG, Freitas SMD, Krüger RH. Biochemical and structural characterization of a family-9 glycoside hydrolase bioprospected from the termite Syntermes wheeleri gut bacteria metagenome. Enzyme Microb Technol 2025; 189:110654. [PMID: 40262434 DOI: 10.1016/j.enzmictec.2025.110654] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2024] [Revised: 03/24/2025] [Accepted: 04/12/2025] [Indexed: 04/24/2025]
Abstract
Glycosyl hydrolases (GH) are enzymes involved in the degradation of plant biomass. They are important for biorefineries that aim at the sustainable utilization of lignocellulosic residues to generate value-added products. The termite Syntermes wheeleri gut microbiota showed an abundance of bacteria from the phylum Firmicutes, a phylum with enzymes capable of breaking down cellulose and degrading lignin, facilitating the use of plant materials as a food source for termites. Using bioinformatics techniques, cellobiohydrolases were searched for in the gut metagenome of the termite Syntermes wheeleri, endemic to the Cerrado. After selecting sequences of the target enzymes, termite gut microbiome metatranscriptome data were used as the criteria to choose the GH9 enzyme sequence Exo8574. Here we present the biochemical and structural characterization of Exo8574, a GH9 enzyme that showed activity with the substrate p-nitrophenyl-D-cellobioside (pNPC), consistent with cellobiohydrolase activity. Bioinformatics tools were used to perform phylogeny studies of Exo8574 and to identify conserved families and domains. Exo8574 showed 48.8 % homology to a protein from a bacterium belonging to the phylum Firmicutes. The high-quality three-dimensional (3D) model of Exo8574 was obtained by protein structure prediction AlphaFold 2, a neural network-based method. After the heterologous expression of Exo8574 and its purification, biochemical experiments showed that the optimal activity of the enzyme was at a temperature of 55 ºC and pH 6.0, which was enhanced in the presence of metal ions, especially Fe2 +. The estimated kinetic parameters of Exo8574 using the synthetic substrate p-nithrophenyl-beta-D-cellobioside (pNPC) were: Vmax = 9.14 ± 0.2 x10-5 μmol/min and Km = 248.27 ± 26.35 μmol/L. The thermostability test showed a 50 % loss of activity after 1 h incubation at 55 °C. The secondary structure contents of Exo8574 evaluated by Circular Dichroism were pH dependent, with greater structuring of protein in β-antiparallel and α-helices at pH 6.0. The similarity between the CD results and the Ramachandran plot of the 3D model suggests that a reliable model has been obtained. Altogether, the results of the biochemical and structural characterization showed that Exo8574 is capable of acting on p-nithrophenyl-beta-D-cellobioside (pNPC), a substrate that mimics bonds cleaved by cellobiohydrolases. These findings have significant implications for advancing in the field of biomass conversion while also contributing to efforts aimed at overcoming challenges in developing more efficient cellulase cocktails.
Collapse
Affiliation(s)
| | - Aisel Valle Garay
- Department of Cell Biology, Darcy Ribeiro Campus, Universidade de Brasília, Brasília, DF 70910-900, Brazil
| | - Tayná Diniz Frederico
- Department of Cell Biology, Darcy Ribeiro Campus, Universidade de Brasília, Brasília, DF 70910-900, Brazil
| | - Gideane Mendes de Oliveira
- Department of Cell Biology, Darcy Ribeiro Campus, Universidade de Brasília, Brasília, DF 70910-900, Brazil
| | - Betania Ferraz Quirino
- Embrapa-Agroenegy, Genetics and Biotechnology Laboratory, Brasília, DF 70770-901, Brazil
| | | | - Sonia Maria de Freitas
- Department of Cell Biology, Darcy Ribeiro Campus, Universidade de Brasília, Brasília, DF 70910-900, Brazil
| | - Ricardo Henrique Krüger
- Department of Cell Biology, Darcy Ribeiro Campus, Universidade de Brasília, Brasília, DF 70910-900, Brazil.
| |
Collapse
|
2
|
Cham AK, Adams AK, Wadl PA, Ojeda-Zacarías MDC, Rutter WB, Jackson DM, Shoemaker DD, Yencho GC, Olukolu BA. Metagenome-enabled models improve genomic predictive ability and identification of herbivory-limiting genes in sweetpotato. HORTICULTURE RESEARCH 2024; 11:uhae135. [PMID: 38974189 PMCID: PMC11226878 DOI: 10.1093/hr/uhae135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/16/2024] [Accepted: 04/27/2024] [Indexed: 07/09/2024]
Abstract
Plant-insect interactions are often influenced by host- or insect-associated metagenomic community members. The relative abundance of insects and the microbes that modulate their interactions were obtained from sweetpotato (Ipomoea batatas) leaf-associated metagenomes using quantitative reduced representation sequencing and strain/species-level profiling with the Qmatey software. Positive correlations were found between whitefly (Bemisia tabaci) and its endosymbionts (Candidatus Hamiltonella defensa, Candidatus Portiera aleyrodidarum, and Rickettsia spp.) and negative correlations with nitrogen-fixing bacteria that implicate nitric oxide in sweetpotato-whitefly interaction. Genome-wide associations using 252 975 dosage-based markers, and metagenomes as a covariate to reduce false positive rates, implicated ethylene and cell wall modification in sweetpotato-whitefly interaction. The predictive abilities (PA) for whitefly and Ocypus olens abundance were high in both populations (68%-69% and 33.3%-35.8%, respectively) and 69.9% for Frankliniella occidentalis. The metagBLUP (gBLUP) prediction model, which fits the background metagenome-based Cao dissimilarity matrix instead of the marker-based relationship matrix (G-matrix), revealed moderate PA (35.3%-49.1%) except for O. olens (3%-10.1%). A significant gain in PA after modeling the metagenome as a covariate (gGBLUP, ≤11%) confirms quantification accuracy and that the metagenome modulates phenotypic expression and might account for the missing heritability problem. Significant gains in PA were also revealed after fitting allele dosage (≤17.4%) and dominance effects (≤4.6%). Pseudo-diploidized genotype data underperformed for dominance models. Including segregation-distorted loci (SDL) increased PA by 6%-17.1%, suggesting that traits associated with fitness cost might benefit from the inclusion of SDL. Our findings confirm the holobiont theory of host-metagenome co-evolution and underscore its potential for breeding within the context of G × G × E interactions.
Collapse
Affiliation(s)
- Alhagie K Cham
- Department of Entomology and Plant Pathology, University of Tennessee, Knoxville, TN 37996, USA
| | - Alison K Adams
- Department of Entomology and Plant Pathology, University of Tennessee, Knoxville, TN 37996, USA
- Genome Science and Technology, University of Tennessee, Knoxville, TN 37916, USA
- Department of Plant Pathology, University of Georgia, Griffin, GA 30223, USA
| | - Phillip A Wadl
- US Vegetable Laboratory, United States Department of Agriculture, Agriculture Research Service, Charleston, SC 29414, USA
| | - Ma del Carmen Ojeda-Zacarías
- Faculty of Agronomy, Autonomous University of Nuevo León, Francisco Villa s/n, Col. Ex Hacienda El Canadá, 66050, General Escobedo, Nuevo León, México
| | - William B Rutter
- US Vegetable Laboratory, United States Department of Agriculture, Agriculture Research Service, Charleston, SC 29414, USA
| | - D Michael Jackson
- US Vegetable Laboratory, United States Department of Agriculture, Agriculture Research Service, Charleston, SC 29414, USA
| | - D Dewayne Shoemaker
- Department of Entomology and Plant Pathology, University of Tennessee, Knoxville, TN 37996, USA
| | - G Craig Yencho
- Department of Horticultural Science, North Carolina State University, Raleigh, NC 27695, USA
| | - Bode A Olukolu
- Department of Entomology and Plant Pathology, University of Tennessee, Knoxville, TN 37996, USA
- Genome Science and Technology, University of Tennessee, Knoxville, TN 37916, USA
| |
Collapse
|
3
|
Nguyen NH. Genetics and Genomics of Infectious Diseases in Key Aquaculture Species. BIOLOGY 2024; 13:29. [PMID: 38248460 PMCID: PMC10813283 DOI: 10.3390/biology13010029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/05/2023] [Revised: 01/02/2024] [Accepted: 01/02/2024] [Indexed: 01/23/2024]
Abstract
Diseases pose a significant and pressing concern for the sustainable development of the aquaculture sector, particularly as their impact continues to grow due to climatic shifts such as rising water temperatures. While various approaches, ranging from biosecurity measures to vaccines, have been devised to combat infectious diseases, their efficacy is disease and species specific and contingent upon a multitude of factors. The fields of genetics and genomics offer effective tools to control and prevent disease outbreaks in aquatic animal species. In this study, we present the key findings from our recent research, focusing on the genetic resistance to three specific diseases: White Spot Syndrome Virus (WSSV) in white shrimp, Bacterial Necrotic Pancreatitis (BNP) in striped catfish, and skin fluke (a parasitic ailment) in yellowtail kingfish. Our investigations reveal that all three species possess substantial heritable genetic components for disease-resistant traits, indicating their potential responsiveness to artificial selection in genetic improvement programs tailored to combat these diseases. Also, we observed a high genetic association between disease traits and survival rates. Through selective breeding aimed at enhancing resistance to these pathogens, we achieved substantial genetic gains, averaging 10% per generation. These selection programs also contributed positively to the overall production performance and productivity of these species. Although the effects of selection on immunological traits or immune responses were not significant in white shrimp, they yielded favorable results in striped catfish. Furthermore, our genomic analyses, including shallow genome sequencing of pedigreed populations, enriched our understanding of the genomic architecture underlying disease resistance traits. These traits are primarily governed by a polygenic nature, with numerous genes or genetic variants, each with small effects. Leveraging a range of advanced statistical methods, from mixed models to machine and deep learning, we developed prediction models that demonstrated moderate-to-high levels of accuracy in forecasting these disease-related traits. In addition to genomics, our RNA-seq experiments identified several genes that undergo upregulation in response to infection or viral loads within the populations. Preliminary microbiome data, while offering limited predictive accuracy for disease traits in one of our studied species, underscore the potential for combining such data with genome sequence information to enhance predictive power for disease traits in our populations. Lastly, this paper briefly discusses the roles of precision agriculture systems and AI algorithms and outlines the path for future research to expedite the development of disease-resistant genetic lines tailored to our target species. In conclusion, our study underscores the critical role of genetics and genomics in fortifying the aquaculture sector against the threats posed by diseases, paving the way for more sustainable and resilient aquaculture development.
Collapse
Affiliation(s)
- Nguyen Hong Nguyen
- School of Science, Technology and Engineering, University of the Sunshine Coast, Maroochydore, QLD 4558, Australia
| |
Collapse
|
4
|
Berry DP, Spangler ML. Animal board invited review: Practical applications of genomic information in livestock. Animal 2023; 17:100996. [PMID: 37820404 DOI: 10.1016/j.animal.2023.100996] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Revised: 09/08/2023] [Accepted: 09/11/2023] [Indexed: 10/13/2023] Open
Abstract
Access to high-dimensional genomic information in many livestock species is accelerating. This has been greatly aided not only by continual reductions in genotyping costs but also an expansion in the services available that leverage genomic information to create a greater return-on-investment. Genomic information on individual animals has many uses including (1) parentage verification and discovery, (2) traceability, (3) karyotyping, (4) sex determination, (5) reporting and monitoring of mutations conferring major effects or congenital defects, (6) better estimating inbreeding of individuals and coancestry among individuals, (7) mating advice, (8) determining breed composition, (9) enabling precision management, and (10) genomic evaluations; genomic evaluations exploit genome-wide genotype information to improve the accuracy of predicting an animal's (and by extension its progeny's) genetic merit. Genomic data also provide a huge resource for research, albeit the outcome from this research, if successful, should eventually be realised through one of the ten applications already mentioned. The process for generating a genotype all the way from sample procurement to identifying erroneous genotypes is described, as are the steps that should be considered when developing a bespoke genotyping panel for practical application.
Collapse
Affiliation(s)
- D P Berry
- Animal & Grassland Research and Innovation Centre, Teagasc, Moorepark, Fermoy, Cork, Ireland.
| | - M L Spangler
- Department of Animal Science, University of Nebraska-Lincoln, Lincoln, NE, United States
| |
Collapse
|
5
|
Hess MK, Hodgkinson HE, Hess AS, Zetouni L, Budel JCC, Henry H, Donaldson A, Bilton TP, van Stijn TC, Kirk MR, Dodds KG, Brauning R, McCulloch AF, Hickey SM, Johnson PL, Jonker A, Morton N, Hendy S, Oddy VH, Janssen PH, McEwan JC, Rowe SJ. Large-scale analysis of sheep rumen metagenome profiles captured by reduced representation sequencing reveals individual profiles are influenced by the environment and genetics of the host. BMC Genomics 2023; 24:551. [PMID: 37723422 PMCID: PMC10506323 DOI: 10.1186/s12864-023-09660-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Accepted: 09/07/2023] [Indexed: 09/20/2023] Open
Abstract
BACKGROUND Producing animal protein while reducing the animal's impact on the environment, e.g., through improved feed efficiency and lowered methane emissions, has gained interest in recent years. Genetic selection is one possible path to reduce the environmental impact of livestock production, but these traits are difficult and expensive to measure on many animals. The rumen microbiome may serve as a proxy for these traits due to its role in feed digestion. Restriction enzyme-reduced representation sequencing (RE-RRS) is a high-throughput and cost-effective approach to rumen metagenome profiling, but the systematic (e.g., sequencing) and biological factors influencing the resulting reference based (RB) and reference free (RF) profiles need to be explored before widespread industry adoption is possible. RESULTS Metagenome profiles were generated by RE-RRS of 4,479 rumen samples collected from 1,708 sheep, and assigned to eight groups based on diet, age, time off feed, and country (New Zealand or Australia) at the time of sample collection. Systematic effects were found to have minimal influence on metagenome profiles. Diet was a major driver of differences between samples, followed by time off feed, then age of the sheep. The RF approach resulted in more reads being assigned per sample and afforded greater resolution when distinguishing between groups than the RB approach. Normalizing relative abundances within the sampling Cohort abolished structures related to age, diet, and time off feed, allowing a clear signal based on methane emissions to be elucidated. Genus-level abundances of rumen microbes showed low-to-moderate heritability and repeatability and were consistent between diets. CONCLUSIONS Variation in rumen metagenomic profiles was influenced by diet, age, time off feed and genetics. Not accounting for environmental factors may limit the ability to associate the profile with traits of interest. However, these differences can be accounted for by adjusting for Cohort effects, revealing robust biological signals. The abundances of some genera were consistently heritable and repeatable across different environments, suggesting that metagenomic profiles could be used to predict an individual's future performance, or performance of its offspring, in a range of environments. These results highlight the potential of using rumen metagenomic profiles for selection purposes in a practical, agricultural setting.
Collapse
Affiliation(s)
- Melanie K Hess
- AgResearch Ltd., Invermay Agricultural Centre, Private Bag 50034, Mosgiel, 9053, New Zealand.
| | - Hannah E Hodgkinson
- AgResearch Ltd., Invermay Agricultural Centre, Private Bag 50034, Mosgiel, 9053, New Zealand
| | - Andrew S Hess
- AgResearch Ltd., Invermay Agricultural Centre, Private Bag 50034, Mosgiel, 9053, New Zealand
- Agriculture, Veterinary & Rangeland Sciences, University of Nevada-Reno, 1664 N. Virginia St. Mail stop 202, Reno, NV, 89557, USA
| | - Larissa Zetouni
- AgResearch Ltd., Invermay Agricultural Centre, Private Bag 50034, Mosgiel, 9053, New Zealand
- Wageningen University & Research, P.O. Box 338, 6700, AH, Wageningen, The Netherlands
| | - Juliana C C Budel
- AgResearch Ltd., Invermay Agricultural Centre, Private Bag 50034, Mosgiel, 9053, New Zealand
- Graduate Program in Animal Science, Universidade Federal do Pará (UFPa), Castanhal, Brazil
| | - Hannah Henry
- AgResearch Ltd., Invermay Agricultural Centre, Private Bag 50034, Mosgiel, 9053, New Zealand
| | - Alistair Donaldson
- NSW Department of Primary Industries, University of New England, Armidale, 2351, Australia
| | - Timothy P Bilton
- AgResearch Ltd., Invermay Agricultural Centre, Private Bag 50034, Mosgiel, 9053, New Zealand
| | - Tracey C van Stijn
- AgResearch Ltd., Invermay Agricultural Centre, Private Bag 50034, Mosgiel, 9053, New Zealand
| | - Michelle R Kirk
- AgResearch Ltd., Grasslands Research Centre, Private Bag 11,008, Palmerston North, 4410, New Zealand
| | - Ken G Dodds
- AgResearch Ltd., Invermay Agricultural Centre, Private Bag 50034, Mosgiel, 9053, New Zealand
| | - Rudiger Brauning
- AgResearch Ltd., Invermay Agricultural Centre, Private Bag 50034, Mosgiel, 9053, New Zealand
| | - Alan F McCulloch
- AgResearch Ltd., Invermay Agricultural Centre, Private Bag 50034, Mosgiel, 9053, New Zealand
| | - Sharon M Hickey
- AgResearch Ltd., Ruakura Research Centre, Private Bag 3115, Hamilton, 3214, New Zealand
| | - Patricia L Johnson
- AgResearch Ltd., Invermay Agricultural Centre, Private Bag 50034, Mosgiel, 9053, New Zealand
| | - Arjan Jonker
- AgResearch Ltd., Grasslands Research Centre, Private Bag 11,008, Palmerston North, 4410, New Zealand
| | - Nickolas Morton
- Te Pūnaha Matatini, University of Auckland, Auckland, 1010, New Zealand
| | - Shaun Hendy
- Te Pūnaha Matatini, University of Auckland, Auckland, 1010, New Zealand
| | - V Hutton Oddy
- NSW Department of Primary Industries, University of New England, Armidale, 2351, Australia
| | - Peter H Janssen
- AgResearch Ltd., Grasslands Research Centre, Private Bag 11,008, Palmerston North, 4410, New Zealand
| | - John C McEwan
- AgResearch Ltd., Invermay Agricultural Centre, Private Bag 50034, Mosgiel, 9053, New Zealand
| | - Suzanne J Rowe
- AgResearch Ltd., Invermay Agricultural Centre, Private Bag 50034, Mosgiel, 9053, New Zealand
| |
Collapse
|
6
|
Hess MK, Zetouni L, Hess AS, Budel J, Dodds KG, Henry HM, Brauning R, McCulloch AF, Hickey SM, Johnson PL, Elmes S, Wing J, Bryson B, Knowler K, Hyndman D, Baird H, McRae KM, Jonker A, Janssen PH, McEwan JC, Rowe SJ. Combining host and rumen metagenome profiling for selection in sheep: prediction of methane, feed efficiency, production, and health traits. Genet Sel Evol 2023; 55:53. [PMID: 37491204 PMCID: PMC10367317 DOI: 10.1186/s12711-023-00822-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Accepted: 07/03/2023] [Indexed: 07/27/2023] Open
Abstract
BACKGROUND Rumen microbes break down complex dietary carbohydrates into energy sources for the host and are increasingly shown to be a key aspect of animal performance. Host genotypes can be combined with microbial DNA sequencing to predict performance traits or traits related to environmental impact, such as enteric methane emissions. Metagenome profiles were generated from 3139 rumen samples, collected from 1200 dual purpose ewes, using restriction enzyme-reduced representation sequencing (RE-RRS). Phenotypes were available for methane (CH4) and carbon dioxide (CO2) emissions, the ratio of CH4 to CH4 plus CO2 (CH4Ratio), feed efficiency (residual feed intake: RFI), liveweight at the time of methane collection (LW), liveweight at 8 months (LW8), fleece weight at 12 months (FW12) and parasite resistance measured by faecal egg count (FEC1). We estimated the proportion of phenotypic variance explained by host genetics and the rumen microbiome, as well as prediction accuracies for each of these traits. RESULTS Incorporating metagenome profiles increased the variance explained and prediction accuracy compared to fitting only genomics for all traits except for CO2 emissions when animals were on a grass diet. Combining the metagenome profile with host genotype from lambs explained more than 70% of the variation in methane emissions and residual feed intake. Predictions were generally more accurate when incorporating metagenome profiles compared to genetics alone, even when considering profiles collected at different ages (lamb vs adult), or on different feeds (grass vs lucerne pellet). A reference-free approach to metagenome profiling performed better than metagenome profiles that were restricted to capturing genera from a reference database. We hypothesise that our reference-free approach is likely to outperform other reference-based approaches such as 16S rRNA gene sequencing for use in prediction of individual animal performance. CONCLUSIONS This paper shows the potential of using RE-RRS as a low-cost, high-throughput approach for generating metagenome profiles on thousands of animals for improved prediction of economically and environmentally important traits. A reference-free approach using a microbial relationship matrix from log10 proportions of each tag normalized within cohort (i.e., the group of animals sampled at the same time) is recommended for future predictions using RE-RRS metagenome profiles.
Collapse
Affiliation(s)
- Melanie K Hess
- University Invermay Agricultural Centre, AgResearch Ltd., Private Bag 50034, Mosgiel, 9053, New Zealand.
- University of Nebraska-Lincoln, Institute of Agriculture and Natural Resources, 300 Agricultural Hall, University of Nebraska-Lincoln, Lincoln, NE, 68583, USA.
| | - Larissa Zetouni
- University Invermay Agricultural Centre, AgResearch Ltd., Private Bag 50034, Mosgiel, 9053, New Zealand
- Wageningen University & Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands
| | - Andrew S Hess
- University Invermay Agricultural Centre, AgResearch Ltd., Private Bag 50034, Mosgiel, 9053, New Zealand
- University of Nevada, Reno, Agriculture, Veterinary & Rangeland Sciences, 1664 N. Virginia St., Mail Stop 202, Reno, NV, 89557, USA
| | - Juliana Budel
- University Invermay Agricultural Centre, AgResearch Ltd., Private Bag 50034, Mosgiel, 9053, New Zealand
- Graduate Program in Animal Science, Universidade Federal do Pará (UFPa), Castanhal, Brazil
| | - Ken G Dodds
- University Invermay Agricultural Centre, AgResearch Ltd., Private Bag 50034, Mosgiel, 9053, New Zealand
| | - Hannah M Henry
- University Invermay Agricultural Centre, AgResearch Ltd., Private Bag 50034, Mosgiel, 9053, New Zealand
| | - Rudiger Brauning
- University Invermay Agricultural Centre, AgResearch Ltd., Private Bag 50034, Mosgiel, 9053, New Zealand
| | - Alan F McCulloch
- University Invermay Agricultural Centre, AgResearch Ltd., Private Bag 50034, Mosgiel, 9053, New Zealand
| | - Sharon M Hickey
- Ruakura Research Centre, AgResearch Ltd., Private Bag 3115, Hamilton, 3240, New Zealand
| | - Patricia L Johnson
- University Invermay Agricultural Centre, AgResearch Ltd., Private Bag 50034, Mosgiel, 9053, New Zealand
| | - Sara Elmes
- Deer Industry New Zealand, PO Box 10702, Wellington, 6140, New Zealand
| | - Janine Wing
- Pāmu, Landcorp Farming Ltd, PO Box 5349, Wellington, 6011, New Zealand
| | - Brooke Bryson
- Woodlands Research Farm, AgResearch Ltd., 204 Woodlands-Morton Mains Road, Woodlands, 9871, New Zealand
| | - Kevin Knowler
- University Invermay Agricultural Centre, AgResearch Ltd., Private Bag 50034, Mosgiel, 9053, New Zealand
| | - Dianne Hyndman
- University Invermay Agricultural Centre, AgResearch Ltd., Private Bag 50034, Mosgiel, 9053, New Zealand
| | - Hayley Baird
- University Invermay Agricultural Centre, AgResearch Ltd., Private Bag 50034, Mosgiel, 9053, New Zealand
| | - Kathryn M McRae
- University Invermay Agricultural Centre, AgResearch Ltd., Private Bag 50034, Mosgiel, 9053, New Zealand
| | - Arjan Jonker
- Grasslands Research Centre, AgResearch Ltd., Private Bag 11008, Palmerston North, 4410, New Zealand
| | - Peter H Janssen
- Grasslands Research Centre, AgResearch Ltd., Private Bag 11008, Palmerston North, 4410, New Zealand
| | - John C McEwan
- University Invermay Agricultural Centre, AgResearch Ltd., Private Bag 50034, Mosgiel, 9053, New Zealand
| | - Suzanne J Rowe
- University Invermay Agricultural Centre, AgResearch Ltd., Private Bag 50034, Mosgiel, 9053, New Zealand
| |
Collapse
|
7
|
Zhao W, Qadri QR, Zhang Z, Wang Z, Pan Y, Wang Q, Zhang Z. PyAGH: a python package to fast construct kinship matrices based on different levels of omic data. BMC Bioinformatics 2023; 24:153. [PMID: 37072709 PMCID: PMC10111838 DOI: 10.1186/s12859-023-05280-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2022] [Accepted: 04/10/2023] [Indexed: 04/20/2023] Open
Abstract
BACKGROUND Construction of kinship matrices among individuals is an important step for both association studies and prediction studies based on different levels of omic data. Methods for constructing kinship matrices are becoming diverse and different methods have their specific appropriate scenes. However, software that can comprehensively calculate kinship matrices for a variety of scenarios is still in an urgent demand. RESULTS In this study, we developed an efficient and user-friendly python module, PyAGH, that can accomplish (1) conventional additive kinship matrces construction based on pedigree, genotypes, abundance data from transcriptome or microbiome; (2) genomic kinship matrices construction in combined population; (3) dominant and epistatic effects kinship matrices construction; (4) pedigree selection, tracing, detection and visualization; (5) visualization of cluster, heatmap and PCA analysis based on kinship matrices. The output from PyAGH can be easily integrated in other mainstream software based on users' purposes. Compared with other softwares, PyAGH integrates multiple methods for calculating the kinship matrix and has advantages in terms of speed and data size compared to other software. PyAGH is developed in python and C + + and can be easily installed by pip tool. Installation instructions and a manual document can be freely available from https://github.com/zhaow-01/PyAGH . CONCLUSION PyAGH is a fast and user-friendly Python package for calculating kinship matrices using pedigree, genotype, microbiome and transcriptome data as well as processing, analyzing and visualizing data and results. This package makes it easier to perform predictions and association studies processes based on different levels of omic data.
Collapse
Affiliation(s)
- Wei Zhao
- Department of Animal Science, School of Agriculture and Biology, Shanghai Jiao Tong University, 800# Dongchuan Road, Shanghai, China
| | - Qamar Raza Qadri
- Department of Animal Science, School of Agriculture and Biology, Shanghai Jiao Tong University, 800# Dongchuan Road, Shanghai, China
| | - Zhenyang Zhang
- Department of Animal Science, College of Animal Sciences, Zhejiang University, 866# Yuhangtang Road, Hangzhou, 310058, China
| | - Zhen Wang
- Department of Animal Science, College of Animal Sciences, Zhejiang University, 866# Yuhangtang Road, Hangzhou, 310058, China
| | - Yuchun Pan
- Department of Animal Science, College of Animal Sciences, Zhejiang University, 866# Yuhangtang Road, Hangzhou, 310058, China
- Hainan Research Institute, Zhejiang University, 11# Yonyou Industrial Park, Yazhou Bay Science and Technology City, Sanya, 572025, China
| | - Qishan Wang
- Department of Animal Science, College of Animal Sciences, Zhejiang University, 866# Yuhangtang Road, Hangzhou, 310058, China.
| | - Zhe Zhang
- Department of Animal Science, College of Animal Sciences, Zhejiang University, 866# Yuhangtang Road, Hangzhou, 310058, China.
| |
Collapse
|