1
|
Yan Q, Li S, Yan Q, Huo X, Wang C, Wang X, Sun Y, Zhao W, Yu Z, Zhang Y, Guo R, Lv Q, He X, Yao C, Li Z, Chen F, Ji Q, Zhang A, Jin H, Wang G, Feng X, Feng L, Wu F, Ning J, Deng S, An Y, Guo DA, Martin FM, Ma X. A genomic compendium of cultivated human gut fungi characterizes the gut mycobiome and its relevance to common diseases. Cell 2024; 187:2969-2989.e24. [PMID: 38776919 DOI: 10.1016/j.cell.2024.04.043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2023] [Revised: 02/17/2024] [Accepted: 04/29/2024] [Indexed: 05/25/2024]
Abstract
The gut fungal community represents an essential element of human health, yet its functional and metabolic potential remains insufficiently elucidated, largely due to the limited availability of reference genomes. To address this gap, we presented the cultivated gut fungi (CGF) catalog, encompassing 760 fungal genomes derived from the feces of healthy individuals. This catalog comprises 206 species spanning 48 families, including 69 species previously unidentified. We explored the functional and metabolic attributes of the CGF species and utilized this catalog to construct a phylogenetic representation of the gut mycobiome by analyzing over 11,000 fecal metagenomes from Chinese and non-Chinese populations. Moreover, we identified significant common disease-related variations in gut mycobiome composition and corroborated the associations between fungal signatures and inflammatory bowel disease (IBD) through animal experimentation. These resources and findings substantially enrich our understanding of the biological diversity and disease relevance of the human gut mycobiome.
Collapse
Affiliation(s)
- Qiulong Yan
- Second Affiliated Hospital, Dalian Medical University, Dalian 116044, China; Dalian Key Laboratory of Metabolic Target Characterization and Traditional Chinese Medicine Intervention, School of Pharmacy, Dalian Medical University, Dalian 116044, China; College of Basic Medical Sciences, Dalian Medical University, Dalian 116044, China
| | - Shenghui Li
- Puensum Genetech Institute, Wuhan 430076, China; Key Laboratory of Precision Nutrition and Food Quality, Department of Nutrition and Health, China Agricultural University, Beijing 100091, China
| | - Qingsong Yan
- Second Affiliated Hospital, Dalian Medical University, Dalian 116044, China
| | - Xiaokui Huo
- Second Affiliated Hospital, Dalian Medical University, Dalian 116044, China
| | - Chao Wang
- Second Affiliated Hospital, Dalian Medical University, Dalian 116044, China; Dalian Key Laboratory of Metabolic Target Characterization and Traditional Chinese Medicine Intervention, School of Pharmacy, Dalian Medical University, Dalian 116044, China; First Affiliated Hospital, Dalian Medical University, Dalian 116044, China.
| | - Xifan Wang
- Key Laboratory of Precision Nutrition and Food Quality, Department of Nutrition and Health, China Agricultural University, Beijing 100091, China; Department of Obstetrics and Gynecology, Columbia University, New York, NY 10027, USA
| | - Yan Sun
- Second Affiliated Hospital, Dalian Medical University, Dalian 116044, China
| | - Wenyu Zhao
- Dalian Key Laboratory of Metabolic Target Characterization and Traditional Chinese Medicine Intervention, School of Pharmacy, Dalian Medical University, Dalian 116044, China
| | - Zhenlong Yu
- Dalian Key Laboratory of Metabolic Target Characterization and Traditional Chinese Medicine Intervention, School of Pharmacy, Dalian Medical University, Dalian 116044, China
| | - Yue Zhang
- Puensum Genetech Institute, Wuhan 430076, China
| | - Ruochun Guo
- Puensum Genetech Institute, Wuhan 430076, China
| | - Qingbo Lv
- Puensum Genetech Institute, Wuhan 430076, China
| | - Xin He
- Dalian Key Laboratory of Metabolic Target Characterization and Traditional Chinese Medicine Intervention, School of Pharmacy, Dalian Medical University, Dalian 116044, China; Shanghai Research Center for Modernization of Traditional Chinese Medicine, National Engineering Laboratory for TCM Standardization Technology, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201210, China
| | - Changliang Yao
- Shanghai Research Center for Modernization of Traditional Chinese Medicine, National Engineering Laboratory for TCM Standardization Technology, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201210, China
| | | | - Fang Chen
- College of Basic Medical Sciences, Dalian Medical University, Dalian 116044, China
| | - Qianru Ji
- Puensum Genetech Institute, Wuhan 430076, China
| | - Aiqin Zhang
- Puensum Genetech Institute, Wuhan 430076, China
| | - Hao Jin
- Puensum Genetech Institute, Wuhan 430076, China
| | - Guangyang Wang
- College of Basic Medical Sciences, Dalian Medical University, Dalian 116044, China
| | - Xiaoying Feng
- Second Affiliated Hospital, Dalian Medical University, Dalian 116044, China
| | - Lei Feng
- Second Affiliated Hospital, Dalian Medical University, Dalian 116044, China
| | - Fan Wu
- Second Affiliated Hospital, Dalian Medical University, Dalian 116044, China
| | - Jing Ning
- Dalian Key Laboratory of Metabolic Target Characterization and Traditional Chinese Medicine Intervention, School of Pharmacy, Dalian Medical University, Dalian 116044, China
| | - Sa Deng
- Dalian Key Laboratory of Metabolic Target Characterization and Traditional Chinese Medicine Intervention, School of Pharmacy, Dalian Medical University, Dalian 116044, China
| | - Yue An
- Second Affiliated Hospital, Dalian Medical University, Dalian 116044, China
| | - De-An Guo
- Shanghai Research Center for Modernization of Traditional Chinese Medicine, National Engineering Laboratory for TCM Standardization Technology, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201210, China.
| | - Francis M Martin
- Université de Lorraine, Institut national de recherche pour l'agriculture, l'alimentation et l'environnement, UMR Interactions Arbres/Microorganismes, Centre INRAE Grand Est-Nancy, Champenoux 54280, France; Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, Beijing Forestry University, Beijing 100091, China.
| | - Xiaochi Ma
- Second Affiliated Hospital, Dalian Medical University, Dalian 116044, China; Dalian Key Laboratory of Metabolic Target Characterization and Traditional Chinese Medicine Intervention, School of Pharmacy, Dalian Medical University, Dalian 116044, China.
| |
Collapse
|
2
|
Hosseini SY, Mallick R, Mäkinen P, Ylä-Herttuala S. Insights into Prime Editing Technology: A Deep Dive into Fundamentals, Potentials and Challenges. Hum Gene Ther 2024. [PMID: 38832869 DOI: 10.1089/hum.2024.043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/06/2024] Open
Abstract
As the most versatile and precise gene editing technology, prime editing (PE) can establish a durable cure for most human genetic disorders. Several generations of PE have been developed based on an editor machine or pegRNA to achieve any kind of genetic correction. However, due to the early stage of development, PE complex elements need to be optimized for more efficient editing. Smart optimization of editor proteins as well as pegRNA has been contemplated by many researchers but the universal PE machine's current shortcomings remain to be solved. The modification of PE elements, fine-tuning of the host genes, manipulation of epigenetics, and blockage of immune responses could be employed to reach more efficient prime editing. Moreover, the host factors involved in the PE process, such as repair and innate immune system genes have not been determined, and PE cell context-dependency is still poorly understood. Regarding the large size of the PE elements, delivery is a significant challenge and the development of a universal viral or non-viral platform is still far from complete. PE versions with shortened variants of RT are still too large to fit in common viral vectors. Immune reactions against PE elements and delivery vectors should be considered in the new versions. The prediction of the prime editing process remains to be improved for screening and validation purposes. In this review, the fundamentals of PE, including generations, potential, optimization, delivery, in vivo barriers, and the future landscape of the technology will be discussed.
Collapse
Affiliation(s)
- Seyed Younes Hosseini
- University of Eastern Finland A I Virtanen Institute for Molecular Sciences, Kuopio, Pohjois-Savo, Finland
- Shiraz University of Medical Sciences, 2. Bacteriology and Virology Department, Shiraz, Fars, Iran (the Islamic Republic of);
| | - Rahul Mallick
- University of Eastern Finland A I Virtanen Institute for Molecular Sciences, P.O Box 1627 (Street address: Neulaniementie 2), Kuopio, Finland, 70211;
| | - Petri Mäkinen
- University of Eastern Finland A I Virtanen Institute for Molecular Sciences, Yliopistonranta 1E, Kuopio, Finland, 70210;
| | - Seppo Ylä-Herttuala
- University of Eastern Finland, A I Virtanen Institute for Molecular Sciences, Kuopio, Finland;
| |
Collapse
|
3
|
He X, Qi Z, Liu Z, Chang X, Zhang X, Li J, Wang M. Pangenome analysis reveals transposon-driven genome evolution in cotton. BMC Biol 2024; 22:92. [PMID: 38654264 DOI: 10.1186/s12915-024-01893-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2023] [Accepted: 04/18/2024] [Indexed: 04/25/2024] Open
Abstract
BACKGROUND Transposable elements (TEs) have a profound influence on the trajectory of plant evolution, driving genome expansion and catalyzing phenotypic diversification. The pangenome, a comprehensive genetic pool encompassing all variations within a species, serves as an invaluable tool, unaffected by the confounding factors of intraspecific diversity. This allows for a more nuanced exploration of plant TE evolution. RESULTS Here, we constructed a pangenome for diploid A-genome cotton using 344 accessions from representative geographical regions, including 223 from China as the main component. We found 511 Mb of non-reference sequences (NRSs) and revealed the presence of 5479 previously undiscovered protein-coding genes. Our comprehensive approach enabled us to decipher the genetic underpinnings of the distinct geographic distributions of cotton. Notably, we identified 3301 presence-absence variations (PAVs) that are closely tied to gene expression patterns within the pangenome, among which 2342 novel expression quantitative trait loci (eQTLs) were found residing in NRSs. Our investigation also unveiled contrasting patterns of transposon proliferation between diploid and tetraploid cotton, with long terminal repeat (LTR) retrotransposons exhibiting a synchronized surge in polyploids. Furthermore, the invasion of LTR retrotransposons from the A subgenome to the D subgenome triggered a substantial expansion of the latter following polyploidization. In addition, we found that TE insertions were responsible for the loss of 36.2% of species-specific genes, as well as the generation of entirely new species-specific genes. CONCLUSIONS Our pangenome analyses provide new insights into cotton genomics and subgenome dynamics after polyploidization and demonstrate the power of pangenome approaches for elucidating transposon impacts and genome evolution.
Collapse
Affiliation(s)
- Xin He
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, China
| | - Zhengyang Qi
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, China
| | - Zhenping Liu
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, China
| | - Xing Chang
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, China
| | - Xianlong Zhang
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, China
| | - Jianying Li
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, China.
| | - Maojun Wang
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, China.
| |
Collapse
|
4
|
Zhang Y, Nair S, Zhang Z, Zhao J, Zhao H, Lu L, Chang L, Jiao N. Adverse Environmental Perturbations May Threaten Kelp Farming Sustainability by Exacerbating Enterobacterales Diseases. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2024; 58:5796-5810. [PMID: 38507562 DOI: 10.1021/acs.est.3c09921] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/22/2024]
Abstract
Globally kelp farming is gaining attention to mitigate land-use pressures and achieve carbon neutrality. However, the influence of environmental perturbations on kelp farming remains largely unknown. Recently, a severe disease outbreak caused extensive kelp mortality in Sanggou Bay, China, one of the world's largest high-density kelp farming areas. Here, through in situ investigations and simulation experiments, we find indications that an anomalously dramatic increase in elevated coastal seawater light penetration may have contributed to dysbiosis in the kelp Saccharina japonica's microbiome. This dysbiosis promoted the proliferation of opportunistic pathogenic Enterobacterales, mainly including the genera Colwellia and Pseudoalteromonas. Using transcriptomic analyses, we revealed that high-light conditions likely induced oxidative stress in kelp, potentially facilitating opportunistic bacterial Enterobacterales attack that activates a terrestrial plant-like pattern recognition receptor system in kelp. Furthermore, we uncover crucial genotypic determinants of Enterobacterales dominance and pathogenicity within kelp tissue, including pathogen-associated molecular patterns, potential membrane-damaging toxins, and alginate and mannitol lysis capability. Finally, through analysis of kelp-associated microbiome data sets under the influence of ocean warming and acidification, we conclude that such Enterobacterales favoring microbiome shifts are likely to become more prevalent in future environmental conditions. Our study highlights the need for understanding complex environmental influences on kelp health and associated microbiomes for the sustainable development of seaweed farming.
Collapse
Affiliation(s)
- Yongyu Zhang
- Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences, No. 189 Songling Road, Qingdao 266101, Shandong, China
- Shandong Energy Institute, No. 189 Songling Road, Qingdao 266101, Shandong, China
- Qingdao New Energy Shandong Laboratory, Qingdao 266101, Shandong, China
| | - Shailesh Nair
- Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences, No. 189 Songling Road, Qingdao 266101, Shandong, China
- Shandong Energy Institute, No. 189 Songling Road, Qingdao 266101, Shandong, China
- Qingdao New Energy Shandong Laboratory, Qingdao 266101, Shandong, China
| | - Zenghu Zhang
- Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences, No. 189 Songling Road, Qingdao 266101, Shandong, China
- Shandong Energy Institute, No. 189 Songling Road, Qingdao 266101, Shandong, China
- Qingdao New Energy Shandong Laboratory, Qingdao 266101, Shandong, China
| | - Jiulong Zhao
- Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences, No. 189 Songling Road, Qingdao 266101, Shandong, China
- Shandong Energy Institute, No. 189 Songling Road, Qingdao 266101, Shandong, China
- Qingdao New Energy Shandong Laboratory, Qingdao 266101, Shandong, China
| | - Hanshuang Zhao
- Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences, No. 189 Songling Road, Qingdao 266101, Shandong, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Longfei Lu
- Weihai Changqing Ocean Science Technology Co., Ltd., Rongcheng 264300, China
| | - Lirong Chang
- Weihai Changqing Ocean Science Technology Co., Ltd., Rongcheng 264300, China
| | - Nianzhi Jiao
- Institute of Marine Microbes and Ecospheres, State Key Laboratory of Marine Environmental Science, Xiamen University, Xiamen 361100, China
| |
Collapse
|
5
|
Macey MC. Genome-resolved metagenomics identifies novel active microbes in biogeochemical cycling within methanol-enriched soil. ENVIRONMENTAL MICROBIOLOGY REPORTS 2024; 16:e13246. [PMID: 38575138 PMCID: PMC10994693 DOI: 10.1111/1758-2229.13246] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Accepted: 03/15/2024] [Indexed: 04/06/2024]
Abstract
Metagenome assembled genomes (MAGs), generated from sequenced 13C-labelled DNA from 13C-methanol enriched soils, were binned using an ensemble approach. This method produced a significantly larger number of higher-quality MAGs compared to direct binning approaches. These MAGs represent both the primary methanol utilizers and the secondary utilizers labelled via cross-feeding and predation on the labelled methylotrophs, including numerous uncultivated taxa. Analysis of these MAGs enabled the identification of multiple metabolic pathways within these active taxa that have climatic relevance relating to nitrogen, sulfur and trace gas metabolism. This includes denitrification, dissimilatory nitrate reduction to ammonium, ammonia oxidation and metabolism of organic sulfur species. The binning of viral sequence data also yielded extensive viral MAGs, identifying active viral replication by both lytic and lysogenic phages within the methanol-enriched soils. These MAGs represent a valuable resource for characterizing biogeochemical cycling within terrestrial environments.
Collapse
Affiliation(s)
- Michael C. Macey
- AstrobiologyOU, Earth, Environment and Ecosystem SciencesThe Open UniversityMilton KeynesUK
| |
Collapse
|
6
|
Xu S, Huang H, Chen S, Muhammad ZUA, Wei W, Xie W, Jiang H, Hou S. Recovery of 1887 metagenome-assembled genomes from the South China Sea. Sci Data 2024; 11:197. [PMID: 38351104 PMCID: PMC10864278 DOI: 10.1038/s41597-024-03050-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Accepted: 02/05/2024] [Indexed: 02/16/2024] Open
Abstract
The South China Sea (SCS) is a marginal sea characterized by strong land-sea biogeochemical interactions. SCS has a distinctive landscape with a multitude of seamounts in its basin. Seamounts create "seamount effects" that influence the diversity and distribution of planktonic microorganisms in the surrounding oligotrophic waters. Although the vertical distribution and community structure of marine microorganisms have been explored in certain regions of the global ocean, there is a lack of comprehensive microbial genomic surveys for uncultured microorganisms in SCS, particularly in the seamount regions. Here, we employed a metagenomic approach to study the uncultured microbial communities sampled from the Xianbei seamount region to the North Coast waters of SCS. A total of 1887 non-redundant prokaryotic metagenome-assembled genomes (MAGs) were reconstructed, of which, 153 MAGs were classified as high-quality MAGs based on the MIMAG standards. The community structure and genomic information provided by this dataset could be used to analyze microbial distribution and metabolism in the SCS.
Collapse
Affiliation(s)
- Shuaishuai Xu
- Department of Ocean Science & Engineering, Southern University of Science and Technology, Shenzhen, 518055, China
- College of Life Science and Technology, Jinan University, Guangzhou, 510632, China
| | - Hailong Huang
- School of Marine Sciences, Ningbo University, Ningbo, 315211, China
| | - Songze Chen
- Department of Ocean Science & Engineering, Southern University of Science and Technology, Shenzhen, 518055, China
- Shenzhen Ecological and Environmental Monitoring Center of Guangdong Province, Shenzhen, 518049, China
| | - Zain Ul Arifeen Muhammad
- Department of Ocean Science & Engineering, Southern University of Science and Technology, Shenzhen, 518055, China
| | - Wenya Wei
- School of Marine Sciences, Sun Yat-sen University, Guangzhou, 510632, China
- Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai, 519000, China
| | - Wei Xie
- School of Marine Sciences, Sun Yat-sen University, Guangzhou, 510632, China
- Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai, 519000, China
| | - Haibo Jiang
- School of Marine Sciences, Ningbo University, Ningbo, 315211, China.
- Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai, 519000, China.
| | - Shengwei Hou
- Department of Ocean Science & Engineering, Southern University of Science and Technology, Shenzhen, 518055, China.
| |
Collapse
|
7
|
Wang K, Hua G, Li J, Yang Y, Zhang C, Yang L, Hu X, Scheben A, Wu Y, Gong P, Zhang S, Fan Y, Zeng T, Lu L, Gong Y, Jiang R, Sun G, Tian Y, Kang X, Hu H, Li W. Duck pan-genome reveals two transposon insertions caused bodyweight enlarging and white plumage phenotype formation during evolution. IMETA 2024; 3:e154. [PMID: 38868520 PMCID: PMC10989122 DOI: 10.1002/imt2.154] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Accepted: 11/07/2023] [Indexed: 06/14/2024]
Abstract
Structural variations (SVs) are a major source of domestication and improvement traits. We present the first duck pan-genome constructed using five genome assemblies capturing ∼40.98 Mb new sequences. This pan-genome together with high-depth sequencing data (∼46.5×) identified 101,041 SVs, of which substantial proportions were derived from transposable element (TE) activity. Many TE-derived SVs anchoring in a gene body or regulatory region are linked to duck's domestication and improvement. By combining quantitative genetics with molecular experiments, we, for the first time, unraveled a 6945 bp Gypsy insertion as a functional mutation of the major gene IGF2BP1 associated with duck bodyweight. This Gypsy insertion, to our knowledge, explains the largest effect on bodyweight among avian species (27.61% of phenotypic variation). In addition, we also examined another 6634 bp Gypsy insertion in MITF intron, which triggers a novel transcript of MITF, thereby contributing to the development of white plumage. Our findings highlight the importance of using a pan-genome as a reference in genomics studies and illuminate the impact of transposons in trait formation and livestock breeding.
Collapse
Affiliation(s)
- Kejun Wang
- Henan Key Laboratory for Innovation and Utilization of Chicken Germplasm Resources, Department of Animal Genetic and Breeding, College of Animal Science and TechnologyHenan Agricultural UniversityZhengzhouChina
- The Shennong LaboratoryZhengzhouChina
| | - Guoying Hua
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Agricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhenChina
| | - Jingyi Li
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Intelligent Husbandry Department, College of Animal Science and TechnologyHuazhong Agricultural UniversityWuhanChina
| | - Yu Yang
- Wuhan Academy of Agricultural ScienceWuhanChina
| | - Chenxi Zhang
- Henan Key Laboratory for Innovation and Utilization of Chicken Germplasm Resources, Department of Animal Genetic and Breeding, College of Animal Science and TechnologyHenan Agricultural UniversityZhengzhouChina
- The Shennong LaboratoryZhengzhouChina
| | - Lan Yang
- Henan Key Laboratory for Innovation and Utilization of Chicken Germplasm Resources, Department of Animal Genetic and Breeding, College of Animal Science and TechnologyHenan Agricultural UniversityZhengzhouChina
- The Shennong LaboratoryZhengzhouChina
| | - Xiaoyu Hu
- Henan Key Laboratory for Innovation and Utilization of Chicken Germplasm Resources, Department of Animal Genetic and Breeding, College of Animal Science and TechnologyHenan Agricultural UniversityZhengzhouChina
- The Shennong LaboratoryZhengzhouChina
| | - Armin Scheben
- Simons Center for Quantitative BiologyCold Spring Harbor LaboratoryCold Spring HarborNew YorkUSA
| | - Yanan Wu
- Department of preventive veterinary medicine, College of Veterinary MedicineHenan Agricultural UniversityZhengzhouChina
- International Joint Research Center for National Animal ImmunologyZhengzhouHenanChina
| | - Ping Gong
- Wuhan Academy of Agricultural ScienceWuhanChina
| | - Shuangjie Zhang
- Quality Safety and Processing LaboratoryJiangsu Institute of Poultry SciencesYangzhouChina
| | - Yanfeng Fan
- Quality Safety and Processing LaboratoryJiangsu Institute of Poultry SciencesYangzhouChina
| | - Tao Zeng
- State Key Laboratory for Managing Biotic and Chemical Threats to the Quality and Safety of Agro‐Products, Institute of Animal Husbandry and Veterinary ScienceZhejiang Academy of Agricultural SciencesHangzhouChina
| | - Lizhi Lu
- State Key Laboratory for Managing Biotic and Chemical Threats to the Quality and Safety of Agro‐Products, Institute of Animal Husbandry and Veterinary ScienceZhejiang Academy of Agricultural SciencesHangzhouChina
| | - Yanzhang Gong
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Intelligent Husbandry Department, College of Animal Science and TechnologyHuazhong Agricultural UniversityWuhanChina
| | - Ruirui Jiang
- Henan Key Laboratory for Innovation and Utilization of Chicken Germplasm Resources, Department of Animal Genetic and Breeding, College of Animal Science and TechnologyHenan Agricultural UniversityZhengzhouChina
- The Shennong LaboratoryZhengzhouChina
| | - Guirong Sun
- Henan Key Laboratory for Innovation and Utilization of Chicken Germplasm Resources, Department of Animal Genetic and Breeding, College of Animal Science and TechnologyHenan Agricultural UniversityZhengzhouChina
- The Shennong LaboratoryZhengzhouChina
| | - Yadong Tian
- Henan Key Laboratory for Innovation and Utilization of Chicken Germplasm Resources, Department of Animal Genetic and Breeding, College of Animal Science and TechnologyHenan Agricultural UniversityZhengzhouChina
- The Shennong LaboratoryZhengzhouChina
| | - Xiangtao Kang
- Henan Key Laboratory for Innovation and Utilization of Chicken Germplasm Resources, Department of Animal Genetic and Breeding, College of Animal Science and TechnologyHenan Agricultural UniversityZhengzhouChina
- The Shennong LaboratoryZhengzhouChina
| | - Haifei Hu
- Rice Research Institute, Guangdong Key Laboratory of New Technology in Rice Breeding and Guangdong Rice Engineering LaboratoryGuangdong Academy of Agricultural SciencesGuangzhouChina
| | - Wenting Li
- Henan Key Laboratory for Innovation and Utilization of Chicken Germplasm Resources, Department of Animal Genetic and Breeding, College of Animal Science and TechnologyHenan Agricultural UniversityZhengzhouChina
- The Shennong LaboratoryZhengzhouChina
| |
Collapse
|
8
|
Wang Z, You R, Han H, Liu W, Sun F, Zhu S. Effective binning of metagenomic contigs using contrastive multi-view representation learning. Nat Commun 2024; 15:585. [PMID: 38233391 PMCID: PMC10794208 DOI: 10.1038/s41467-023-44290-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Accepted: 12/07/2023] [Indexed: 01/19/2024] Open
Abstract
Contig binning plays a crucial role in metagenomic data analysis by grouping contigs from the same or closely related genomes. However, existing binning methods face challenges in practical applications due to the diversity of data types and the difficulties in efficiently integrating heterogeneous information. Here, we introduce COMEBin, a binning method based on contrastive multi-view representation learning. COMEBin utilizes data augmentation to generate multiple fragments (views) of each contig and obtains high-quality embeddings of heterogeneous features (sequence coverage and k-mer distribution) through contrastive learning. Experimental results on multiple simulated and real datasets demonstrate that COMEBin outperforms state-of-the-art binning methods, particularly in recovering near-complete genomes from real environmental samples. COMEBin outperforms other binning methods remarkably when integrated into metagenomic analysis pipelines, including the recovery of potentially pathogenic antibiotic-resistant bacteria (PARB) and moderate or higher quality bins containing potential biosynthetic gene clusters (BGCs).
Collapse
Affiliation(s)
- Ziye Wang
- Institute of Science and Technology for Brain-Inspired Intelligence and MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China
| | - Ronghui You
- Institute of Science and Technology for Brain-Inspired Intelligence and MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China
| | - Haitao Han
- Institute of Science and Technology for Brain-Inspired Intelligence and MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China
| | - Wei Liu
- Institute of Science and Technology for Brain-Inspired Intelligence and MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China
| | - Fengzhu Sun
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Shanfeng Zhu
- Institute of Science and Technology for Brain-Inspired Intelligence and MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China.
- Shanghai Qi Zhi Institute, Shanghai, China.
- Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence (Fudan University), Ministry of Education, Shanghai, China.
- Shanghai Key Lab of Intelligent Information Processing and Shanghai Institute of Artificial Intelligence Algorithm, Fudan University, Shanghai, China.
- Zhangjiang Fudan International Innovation Center, Shanghai, China.
| |
Collapse
|
9
|
Abakarova M, Marquet C, Rera M, Rost B, Laine E. Alignment-based Protein Mutational Landscape Prediction: Doing More with Less. Genome Biol Evol 2023; 15:evad201. [PMID: 37936309 PMCID: PMC10653582 DOI: 10.1093/gbe/evad201] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2023] [Revised: 10/27/2023] [Accepted: 11/01/2023] [Indexed: 11/09/2023] Open
Abstract
The wealth of genomic data has boosted the development of computational methods predicting the phenotypic outcomes of missense variants. The most accurate ones exploit multiple sequence alignments, which can be costly to generate. Recent efforts for democratizing protein structure prediction have overcome this bottleneck by leveraging the fast homology search of MMseqs2. Here, we show the usefulness of this strategy for mutational outcome prediction through a large-scale assessment of 1.5M missense variants across 72 protein families. Our study demonstrates the feasibility of producing alignment-based mutational landscape predictions that are both high-quality and compute-efficient for entire proteomes. We provide the community with the whole human proteome mutational landscape and simplified access to our predictive pipeline.
Collapse
Affiliation(s)
- Marina Abakarova
- CNRS, IBPS, Laboratory of Computational and Quantitative Biology (LCQB), Sorbonne Université, UMR 7238, Paris 75005, France
- Université Paris Cité, INSERM UMR U1284, 75004 Paris, France
| | - Céline Marquet
- Department of Informatics, Bioinformatics and Computational Biology - i12, TUM-Technical University of Munich, Boltzmannstr. 3, Garching, 85748 Munich, Germany
- TUM Graduate School, Center of Doctoral Studies in Informatics and its Applications (CeDoSIA), Boltzmannstr. 11, 85748 Garching, Germany
| | - Michael Rera
- Université Paris Cité, INSERM UMR U1284, 75004 Paris, France
| | - Burkhard Rost
- Department of Informatics, Bioinformatics and Computational Biology - i12, TUM-Technical University of Munich, Boltzmannstr. 3, Garching, 85748 Munich, Germany
- Institute for Advanced Study (TUM-IAS), Lichtenbergstr. 2a, Garching, 85748 Munich, Germany
- TUM School of Life Sciences Weihenstephan (TUM-WZW), Alte Akademie 8, Freising, Germany
| | - Elodie Laine
- CNRS, IBPS, Laboratory of Computational and Quantitative Biology (LCQB), Sorbonne Université, UMR 7238, Paris 75005, France
- Institut universitaire de France (IUF)
| |
Collapse
|
10
|
Gu L, Li X, Zhu W, Shen Y, Wang Q, Liu W, Zhang J, Zhang H, Li J, Li Z, Liu Z, Li C, Wang H. Ultrasensitive proteomics depicted an in-depth landscape for the very early stage of mouse maternal-to-zygotic transition. J Pharm Anal 2023; 13:942-954. [PMID: 37719194 PMCID: PMC10499587 DOI: 10.1016/j.jpha.2023.05.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2022] [Revised: 05/05/2023] [Accepted: 05/08/2023] [Indexed: 09/19/2023] Open
Abstract
Single-cell or low-input multi-omics techniques have revolutionized the study of pre-implantation embryo development. However, the single-cell or low-input proteomic research in this field is relatively underdeveloped because of the higher threshold of the starting material for mammalian embryo samples and the lack of hypersensitive proteome technology. In this study, a comprehensive solution of ultrasensitive proteome technology (CS-UPT) was developed for single-cell or low-input mouse oocyte/embryo samples. The deep coverage and high-throughput routes significantly reduced the starting material and were selected by investigators based on their demands. Using the deep coverage route, we provided the first large-scale snapshot of the very early stage of mouse maternal-to-zygotic transition, including almost 5,500 protein groups from 20 mouse oocytes or zygotes for each sample. Moreover, significant protein regulatory networks centered on transcription factors and kinases between the MII oocyte and 1-cell embryo provided rich insights into minor zygotic genome activation.
Collapse
Affiliation(s)
- Lei Gu
- Center for Single-Cell Omics, School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
| | - Xumiao Li
- Center for Single-Cell Omics, School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
| | - Wencheng Zhu
- Institute of Neuroscience, CAS Center for Excellence in Brain Science and Intelligence Technology, CAS Key Laboratory of Primate Neurobiology, State Key Laboratory of Neuroscience, Chinese Academy of Sciences, Shanghai, 200031, China
- Shanghai Center for Brain Science and Brain-Inspired Intelligence Technology, Shanghai, 200031, China
| | - Yi Shen
- Shanghai Applied Protein Technology Co., Ltd., Shanghai, 201100, China
| | - Qinqin Wang
- Center for Single-Cell Omics, School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
| | - Wenjun Liu
- Institute of Neuroscience, CAS Center for Excellence in Brain Science and Intelligence Technology, CAS Key Laboratory of Primate Neurobiology, State Key Laboratory of Neuroscience, Chinese Academy of Sciences, Shanghai, 200031, China
- University of Chinese Academy of Sciences, Beijing, 101408, China
| | - Junfeng Zhang
- Shanghai Applied Protein Technology Co., Ltd., Shanghai, 201100, China
| | - Huiping Zhang
- Shanghai Applied Protein Technology Co., Ltd., Shanghai, 201100, China
| | - Jingquan Li
- Center for Single-Cell Omics, School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
| | - Ziyi Li
- Shanghai Applied Protein Technology Co., Ltd., Shanghai, 201100, China
| | - Zhen Liu
- Institute of Neuroscience, CAS Center for Excellence in Brain Science and Intelligence Technology, CAS Key Laboratory of Primate Neurobiology, State Key Laboratory of Neuroscience, Chinese Academy of Sciences, Shanghai, 200031, China
- Shanghai Center for Brain Science and Brain-Inspired Intelligence Technology, Shanghai, 200031, China
| | - Chen Li
- Center for Single-Cell Omics, School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
| | - Hui Wang
- Center for Single-Cell Omics, School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
| |
Collapse
|
11
|
Pan S, Zhao XM, Coelho LP. SemiBin2: self-supervised contrastive learning leads to better MAGs for short- and long-read sequencing. Bioinformatics 2023; 39:i21-i29. [PMID: 37387171 PMCID: PMC10311329 DOI: 10.1093/bioinformatics/btad209] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/01/2023] Open
Abstract
MOTIVATION Metagenomic binning methods to reconstruct metagenome-assembled genomes (MAGs) from environmental samples have been widely used in large-scale metagenomic studies. The recently proposed semi-supervised binning method, SemiBin, achieved state-of-the-art binning results in several environments. However, this required annotating contigs, a computationally costly and potentially biased process. RESULTS We propose SemiBin2, which uses self-supervised learning to learn feature embeddings from the contigs. In simulated and real datasets, we show that self-supervised learning achieves better results than the semi-supervised learning used in SemiBin1 and that SemiBin2 outperforms other state-of-the-art binners. Compared to SemiBin1, SemiBin2 can reconstruct 8.3-21.5% more high-quality bins and requires only 25% of the running time and 11% of peak memory usage in real short-read sequencing samples. To extend SemiBin2 to long-read data, we also propose ensemble-based DBSCAN clustering algorithm, resulting in 13.1-26.3% more high-quality genomes than the second best binner for long-read data. AVAILABILITY AND IMPLEMENTATION SemiBin2 is available as open source software at https://github.com/BigDataBiology/SemiBin/ and the analysis scripts used in the study can be found at https://github.com/BigDataBiology/SemiBin2_benchmark.
Collapse
Affiliation(s)
- Shaojun Pan
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai 200433, China
- Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, Ministry of Education, Shanghai 200433, China
| | - Xing-Ming Zhao
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai 200433, China
- Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, Ministry of Education, Shanghai 200433, China
- MOE Frontiers Center for Brain Science, Fudan University, Shanghai 200433, China
- Zhangjiang Fudan International Innovation Center, Shanghai 201203, China
| | - Luis Pedro Coelho
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai 200433, China
- Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, Ministry of Education, Shanghai 200433, China
| |
Collapse
|
12
|
Jia L, Wu Y, Dong Y, Chen J, Chen WH, Zhao XM. A survey on computational strategies for genome-resolved gut metagenomics. Brief Bioinform 2023; 24:7145904. [PMID: 37114640 DOI: 10.1093/bib/bbad162] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Revised: 03/20/2023] [Accepted: 04/04/2023] [Indexed: 04/29/2023] Open
Abstract
Recovering high-quality metagenome-assembled genomes (HQ-MAGs) is critical for exploring microbial compositions and microbe-phenotype associations. However, multiple sequencing platforms and computational tools for this purpose may confuse researchers and thus call for extensive evaluation. Here, we systematically evaluated a total of 40 combinations of popular computational tools and sequencing platforms (i.e. strategies), involving eight assemblers, eight metagenomic binners and four sequencing technologies, including short-, long-read and metaHiC sequencing. We identified the best tools for the individual tasks (e.g. the assembly and binning) and combinations (e.g. generating more HQ-MAGs) depending on the availability of the sequencing data. We found that the combination of the hybrid assemblies and metaHiC-based binning performed best, followed by the hybrid and long-read assemblies. More importantly, both long-read and metaHiC sequencings link more mobile elements and antibiotic resistance genes to bacterial hosts and improve the quality of public human gut reference genomes with 32% (34/105) HQ-MAGs that were either of better quality than those in the Unified Human Gastrointestinal Genome catalog version 2 or novel.
Collapse
Affiliation(s)
- Longhao Jia
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai 200433, China
| | - Yingjian Wu
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, Hubei, China
| | - Yanqi Dong
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai 200433, China
| | - Jingchao Chen
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, Hubei, China
| | - Wei-Hua Chen
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, Hubei, China
- Institution of Medical Artificial Intelligence, Binzhou Medical University, Yantai 264003, China
| | - Xing-Ming Zhao
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai 200433, China
- Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, Ministry of Education, Ministry of Education, Shanghai 200433, China
- MOE Frontiers Center for Brain Science, Fudan University, Shanghai 200433, China
- State Key Laboratory of Medical Neurobiology, Institutes of Brain Science, Fudan University, Shanghai, China
| |
Collapse
|
13
|
Sereika M, Petriglieri F, Jensen TBN, Sannikov A, Hoppe M, Nielsen PH, Marshall IPG, Schramm A, Albertsen M. Closed genomes uncover a saltwater species of Candidatus Electronema and shed new light on the boundary between marine and freshwater cable bacteria. THE ISME JOURNAL 2023; 17:561-569. [PMID: 36697964 PMCID: PMC10030654 DOI: 10.1038/s41396-023-01372-6] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Revised: 01/11/2023] [Accepted: 01/13/2023] [Indexed: 01/26/2023]
Abstract
Cable bacteria of the Desulfobulbaceae family are centimeter-long filamentous bacteria, which are capable of conducting long-distance electron transfer. Currently, all cable bacteria are classified into two candidate genera: Candidatus Electronema, typically found in freshwater environments, and Candidatus Electrothrix, typically found in saltwater environments. This taxonomic framework is based on both 16S rRNA gene sequences and metagenome-assembled genome (MAG) phylogenies. However, most of the currently available MAGs are highly fragmented, incomplete, and thus likely miss key genes essential for deciphering the physiology of cable bacteria. Also, a closed, circular genome of cable bacteria has not been published yet. To address this, we performed Nanopore long-read and Illumina short-read shotgun sequencing of selected environmental samples and a single-strain enrichment of Ca. Electronema aureum. We recovered multiple cable bacteria MAGs, including two circular and one single-contig. Phylogenomic analysis, also confirmed by 16S rRNA gene-based phylogeny, classified one circular MAG and the single-contig MAG as novel species of cable bacteria, which we propose to name Ca. Electronema halotolerans and Ca. Electrothrix laxa, respectively. The Ca. Electronema halotolerans, despite belonging to the previously recognized freshwater genus of cable bacteria, was retrieved from brackish-water sediment. Metabolic predictions showed several adaptations to a high salinity environment, similar to the "saltwater" Ca. Electrothrix species, indicating how Ca. Electronema halotolerans may be the evolutionary link between marine and freshwater cable bacteria lineages.
Collapse
Affiliation(s)
- Mantas Sereika
- Center for Microbial Communities, Aalborg University, Aalborg, Denmark
| | | | | | - Artur Sannikov
- Center for Electromicrobiology, Aarhus University, Aarhus, Denmark
| | - Morten Hoppe
- Center for Electromicrobiology, Aarhus University, Aarhus, Denmark
| | | | - Ian P G Marshall
- Center for Electromicrobiology, Aarhus University, Aarhus, Denmark
| | - Andreas Schramm
- Center for Electromicrobiology, Aarhus University, Aarhus, Denmark
| | - Mads Albertsen
- Center for Microbial Communities, Aalborg University, Aalborg, Denmark.
| |
Collapse
|