1
|
Liu Z, Xie Z, Li M. Comprehensive and deep evaluation of structural variation detection pipelines with third-generation sequencing data. Genome Biol 2024; 25:188. [PMID: 39010145 PMCID: PMC11247875 DOI: 10.1186/s13059-024-03324-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Accepted: 06/26/2024] [Indexed: 07/17/2024] Open
Abstract
BACKGROUND Structural variation (SV) detection methods using third-generation sequencing data are widely employed, yet accurately detecting SVs remains challenging. Different methods often yield inconsistent results for certain SV types, complicating tool selection and revealing biases in detection. RESULTS This study comprehensively evaluates 53 SV detection pipelines using simulated and real data from PacBio (CLR: Continuous Long Read, CCS: Circular Consensus Sequencing) and Nanopore (ONT) platforms. We assess their performance in detecting various sizes and types of SVs, breakpoint biases, and genotyping accuracy with various sequencing depths. Notably, pipelines such as Minimap2-cuteSV2, NGMLR-SVIM, PBMM2-pbsv, Winnowmap-Sniffles2, and Winnowmap-SVision exhibit comparatively higher recall and precision. Our findings also show that combining multiple pipelines with the same aligner, like pbmm2 or winnowmap, can significantly enhance performance. The individual pipelines' detailed ranking and performance metrics can be viewed in a dynamic table: http://pmglab.top/SVPipelinesRanking . CONCLUSIONS This study comprehensively characterizes the strengths and weaknesses of numerous pipelines, providing valuable insights that can improve SV detection in third-generation sequencing data and inform SV annotation and function prediction.
Collapse
Affiliation(s)
- Zhi Liu
- Program in Bioinformatics, Zhongshan School of Medicine, The Fifth Affiliated Hospital, Sun Yat-Sen University, Guangzhou, China
- Key Laboratory of Tropical Disease Control (Sun Yat-Sen University), Ministry of Education, Guangzhou, China
| | - Zhi Xie
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-Sen University, Guangzhou, China
| | - Miaoxin Li
- Program in Bioinformatics, Zhongshan School of Medicine, The Fifth Affiliated Hospital, Sun Yat-Sen University, Guangzhou, China.
- Key Laboratory of Tropical Disease Control (Sun Yat-Sen University), Ministry of Education, Guangzhou, China.
- Center for Precision Medicine, Sun Yat-Sen University, Guangzhou, China.
- Department of Psychiatry, The University of Hong Kong, Hong Kong, SAR, China.
- Guangdong Provincial Key Laboratory of Biomedical Imaging and Guangdong Provincial Engineering Research Center of Molecular Imaging, The Fifth Affiliated Hospital, Sun Yat-Sen University, Zhuhai, China.
| |
Collapse
|
2
|
Aoki YI, Taguchi K, Anzawa H, Kawashima J, Ishida N, Otsuki A, Hasegawa A, Baird L, Suzuki T, Motoike IN, Ohneda K, Kumada K, Katsuoka F, Kinoshita K, Yamamoto M. Whole blood transcriptome analysis for age- and gender-specific gene expression profiling in Japanese individuals. J Biochem 2024; 175:611-627. [PMID: 38268329 PMCID: PMC11756699 DOI: 10.1093/jb/mvae008] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 12/29/2023] [Accepted: 01/12/2024] [Indexed: 01/26/2024] Open
Abstract
Whole blood transcriptome analysis is a valuable approachin medical research, primarily due to the ease of sample collection and the richness of the information obtained. Since the expression profile of individual genes in the analysis is influenced by medical traits and demographic attributes such as age and gender, there has been a growing demand for a comprehensive database for blood transcriptome analysis. Here, we performed whole blood RNA sequencing (RNA-seq) analysis on 576 participants stratified by age (20-30s and 60-70s) and gender from cohorts of the Tohoku Medical Megabank (TMM). A part of female segment included pregnant women. We did not exclude the globin gene family in our RNA-seq study, which enabled us to identify instances of hereditary persistence of fetal hemoglobin based on the HBG1 and HBG2 expression information. Comparing stratified populations allowed us to identify groups of genes associated with age-related changes and gender differences. We also found that the immune response status, particularly measured by neutrophil-to-lymphocyte ratio (NLR), strongly influences the diversity of individual gene expression profiles in whole blood transcriptome analysis. This stratification has resulted in a data set that will be highly beneficial for future whole blood transcriptome analysis in the Japanese population.
Collapse
Affiliation(s)
- Yu-ichi Aoki
- Tohoku Medical Megabank Organization, Tohoku University,
Sendai, Miyagi, Japan
- Systems Bioinformatics, Graduate School of Information Sciences, Tohoku
University, Aramaki aza Aoba 6-3-09, Aoba-ku, Sendai, Miyagi, 980-8579,
Japan
| | - Keiko Taguchi
- Tohoku Medical Megabank Organization, Tohoku University,
Sendai, Miyagi, Japan
- Advanced Research Center for Innovations in Next-Generation Medicine,
Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai,
Miyagi, 980-8573, Japan
| | - Hayato Anzawa
- Tohoku Medical Megabank Organization, Tohoku University,
Sendai, Miyagi, Japan
- Systems Bioinformatics, Graduate School of Information Sciences, Tohoku
University, Aramaki aza Aoba 6-3-09, Aoba-ku, Sendai, Miyagi, 980-8579,
Japan
| | - Junko Kawashima
- Tohoku Medical Megabank Organization, Tohoku University,
Sendai, Miyagi, Japan
| | - Noriko Ishida
- Tohoku Medical Megabank Organization, Tohoku University,
Sendai, Miyagi, Japan
| | - Akihito Otsuki
- Tohoku Medical Megabank Organization, Tohoku University,
Sendai, Miyagi, Japan
| | - Atsushi Hasegawa
- Tohoku Medical Megabank Organization, Tohoku University,
Sendai, Miyagi, Japan
| | - Liam Baird
- Tohoku Medical Megabank Organization, Tohoku University,
Sendai, Miyagi, Japan
- Advanced Research Center for Innovations in Next-Generation Medicine,
Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai,
Miyagi, 980-8573, Japan
| | - Takafumi Suzuki
- Tohoku Medical Megabank Organization, Tohoku University,
Sendai, Miyagi, Japan
| | - Ikuko N Motoike
- Tohoku Medical Megabank Organization, Tohoku University,
Sendai, Miyagi, Japan
- Systems Bioinformatics, Graduate School of Information Sciences, Tohoku
University, Aramaki aza Aoba 6-3-09, Aoba-ku, Sendai, Miyagi, 980-8579,
Japan
| | - Kinuko Ohneda
- Tohoku Medical Megabank Organization, Tohoku University,
Sendai, Miyagi, Japan
| | - Kazuki Kumada
- Tohoku Medical Megabank Organization, Tohoku University,
Sendai, Miyagi, Japan
- Advanced Research Center for Innovations in Next-Generation Medicine,
Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai,
Miyagi, 980-8573, Japan
| | - Fumiki Katsuoka
- Tohoku Medical Megabank Organization, Tohoku University,
Sendai, Miyagi, Japan
- Advanced Research Center for Innovations in Next-Generation Medicine,
Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai,
Miyagi, 980-8573, Japan
| | - Kengo Kinoshita
- Tohoku Medical Megabank Organization, Tohoku University,
Sendai, Miyagi, Japan
- Systems Bioinformatics, Graduate School of Information Sciences, Tohoku
University, Aramaki aza Aoba 6-3-09, Aoba-ku, Sendai, Miyagi, 980-8579,
Japan
- Advanced Research Center for Innovations in Next-Generation Medicine,
Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai,
Miyagi, 980-8573, Japan
| | - Masayuki Yamamoto
- Tohoku Medical Megabank Organization, Tohoku University,
Sendai, Miyagi, Japan
| |
Collapse
|
3
|
Suga A, Mizobuchi K, Inooka T, Yoshitake K, Minematsu N, Tsunoda K, Kuniyoshi K, Kawai Y, Omae Y, Tokunaga K, Hayashi T, Ueno S, Iwata T. A homozygous structural variant of RPGRIP1 is frequently associated with achromatopsia in Japanese patients with IRD. GENETICS IN MEDICINE OPEN 2024; 2:101843. [PMID: 39669618 PMCID: PMC11613597 DOI: 10.1016/j.gimo.2024.101843] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Revised: 03/21/2024] [Accepted: 03/21/2024] [Indexed: 12/14/2024]
Abstract
Purpose Achromatopsia (ACHM) is an early-onset cone dysfunction caused by 5 genes with cone-specific functions (CNGA3, CNGB3, GNAT2, PDE6C, and PDE6H) and by ATF6, a transcription factor with ubiquitous expression. To improve the relatively low variant detection ratio in these genes in a cohort of exome-sequenced Japanese patients with inherited retinal diseases (IRD), we performed genome sequencing to detect structural variants and intronic variants in patients with ACHM. Methods Genome sequencing of 10 ACHM pedigrees was performed after exome sequencing. Structural, non-coding, and coding variants were filtered based on segregation between the affected and unaffected in each pedigree. Variant frequency and predicted damage scores were considered in identifying pathogenic variants. Results A homozygous deletion involving exon 18 of RPGRIP1 was detected in 5 of 10 ACHM probands, and variant inheritance from each parent was confirmed. This deletion was relatively frequent (minor allele frequency = 0.0023) in the Japanese population but was only homozygous in patients with ACHM among the 199 Japanese IRD probands analyzed by the same genome sequencing pipeline. Conclusion The deletion involving exon 18 of RPGRIP1 is a prevalent cause of ACHM in Japanese patients and contributes to the wide spectrum of RPGRIP1-associated IRD phenotypes, from Leber congenital amaurosis to ACHM.
Collapse
Affiliation(s)
- Akiko Suga
- Division of Molecular and Cellular Biology, National Institute of Sensory Organs, NHO Tokyo Medical Center, Tokyo, Japan
| | - Kei Mizobuchi
- Department of Ophthalmology, The Jikei University School of Medicine, Tokyo, Japan
| | - Taiga Inooka
- Department of Ophthalmology, Nagoya University Graduate School of Medicine, Aichi, Japan
| | - Kazutoshi Yoshitake
- Laboratory of Aquatic Molecular Biology and Biotechnology, Aquatic Bioscience, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan
| | - Naoko Minematsu
- Division of Molecular and Cellular Biology, National Institute of Sensory Organs, NHO Tokyo Medical Center, Tokyo, Japan
| | - Kazushige Tsunoda
- Division of Vision Research, National Institute of Sensory Organs, NHO Tokyo Medical Center, Tokyo, Japan
| | - Kazuki Kuniyoshi
- Department of Ophthalmology, Kindai University Faculty of Medicine, Osaka, Japan
| | - Yosuke Kawai
- Genome Medical Science Project, National Center for Global Health and Medicine, Tokyo, Japan
| | - Yosuke Omae
- Genome Medical Science Project, National Center for Global Health and Medicine, Tokyo, Japan
| | - Katsushi Tokunaga
- Genome Medical Science Project, National Center for Global Health and Medicine, Tokyo, Japan
| | - Takaaki Hayashi
- Department of Ophthalmology, The Jikei University School of Medicine, Tokyo, Japan
| | - Shinji Ueno
- Department of Ophthalmology, Hirosaki University Graduate School of Medicine, Aomori, Japan
| | - Takeshi Iwata
- Division of Molecular and Cellular Biology, National Institute of Sensory Organs, NHO Tokyo Medical Center, Tokyo, Japan
| |
Collapse
|
4
|
Reis ALM, Rapadas M, Hammond JM, Gamaarachchi H, Stevanovski I, Ayuputeri Kumaheri M, Chintalaphani SR, Dissanayake DSB, Siggs OM, Hewitt AW, Llamas B, Brown A, Baynam G, Mann GJ, McMorran BJ, Easteal S, Hermes A, Jenkins MR, Patel HR, Deveson IW. The landscape of genomic structural variation in Indigenous Australians. Nature 2023; 624:602-610. [PMID: 38093003 PMCID: PMC10733147 DOI: 10.1038/s41586-023-06842-7] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2023] [Accepted: 11/07/2023] [Indexed: 12/20/2023]
Abstract
Indigenous Australians harbour rich and unique genomic diversity. However, Aboriginal and Torres Strait Islander ancestries are historically under-represented in genomics research and almost completely missing from reference datasets1-3. Addressing this representation gap is critical, both to advance our understanding of global human genomic diversity and as a prerequisite for ensuring equitable outcomes in genomic medicine. Here we apply population-scale whole-genome long-read sequencing4 to profile genomic structural variation across four remote Indigenous communities. We uncover an abundance of large insertion-deletion variants (20-49 bp; n = 136,797), structural variants (50 b-50 kb; n = 159,912) and regions of variable copy number (>50 kb; n = 156). The majority of variants are composed of tandem repeat or interspersed mobile element sequences (up to 90%) and have not been previously annotated (up to 62%). A large fraction of structural variants appear to be exclusive to Indigenous Australians (12% lower-bound estimate) and most of these are found in only a single community, underscoring the need for broad and deep sampling to achieve a comprehensive catalogue of genomic structural variation across the Australian continent. Finally, we explore short tandem repeats throughout the genome to characterize allelic diversity at 50 known disease loci5, uncover hundreds of novel repeat expansion sites within protein-coding genes, and identify unique patterns of diversity and constraint among short tandem repeat sequences. Our study sheds new light on the dimensions and dynamics of genomic structural variation within and beyond Australia.
Collapse
Affiliation(s)
- Andre L M Reis
- Genomics and Inherited Disease Program, Garvan Institute of Medical Research, Sydney, New South Wales, Australia
- Centre for Population Genomics, Garvan Institute of Medical Research and Murdoch Children's Research Institute, Darlinghurst, New South Wales, Australia
- Faculty of Medicine, University of New South Wales, Sydney, New South Wales, Australia
| | - Melissa Rapadas
- Genomics and Inherited Disease Program, Garvan Institute of Medical Research, Sydney, New South Wales, Australia
- Centre for Population Genomics, Garvan Institute of Medical Research and Murdoch Children's Research Institute, Darlinghurst, New South Wales, Australia
| | - Jillian M Hammond
- Genomics and Inherited Disease Program, Garvan Institute of Medical Research, Sydney, New South Wales, Australia
- Centre for Population Genomics, Garvan Institute of Medical Research and Murdoch Children's Research Institute, Darlinghurst, New South Wales, Australia
| | - Hasindu Gamaarachchi
- Genomics and Inherited Disease Program, Garvan Institute of Medical Research, Sydney, New South Wales, Australia
- Centre for Population Genomics, Garvan Institute of Medical Research and Murdoch Children's Research Institute, Darlinghurst, New South Wales, Australia
- School of Computer Science and Engineering, University of New South Wales, Sydney, New South Wales, Australia
| | - Igor Stevanovski
- Genomics and Inherited Disease Program, Garvan Institute of Medical Research, Sydney, New South Wales, Australia
- Centre for Population Genomics, Garvan Institute of Medical Research and Murdoch Children's Research Institute, Darlinghurst, New South Wales, Australia
| | - Meutia Ayuputeri Kumaheri
- Genomics and Inherited Disease Program, Garvan Institute of Medical Research, Sydney, New South Wales, Australia
- Centre for Population Genomics, Garvan Institute of Medical Research and Murdoch Children's Research Institute, Darlinghurst, New South Wales, Australia
| | - Sanjog R Chintalaphani
- Genomics and Inherited Disease Program, Garvan Institute of Medical Research, Sydney, New South Wales, Australia
- Centre for Population Genomics, Garvan Institute of Medical Research and Murdoch Children's Research Institute, Darlinghurst, New South Wales, Australia
- Faculty of Medicine, University of New South Wales, Sydney, New South Wales, Australia
| | - Duminda S B Dissanayake
- National Centre for Indigenous Genomics, John Curtin School of Medical Research, Australian National University, Canberra, Australian Capital Territory, Australia
- Institute for Applied Ecology, University of Canberra, Canberra, Australian Capital Territory, Australia
| | - Owen M Siggs
- Genomics and Inherited Disease Program, Garvan Institute of Medical Research, Sydney, New South Wales, Australia
- Centre for Population Genomics, Garvan Institute of Medical Research and Murdoch Children's Research Institute, Darlinghurst, New South Wales, Australia
- Department of Ophthalmology, Flinders University, Bedford Park, South Australia, Australia
| | - Alex W Hewitt
- Menzies Institute for Medical Research, University of Tasmania, Hobart, Tasmania, Australia
| | - Bastien Llamas
- National Centre for Indigenous Genomics, John Curtin School of Medical Research, Australian National University, Canberra, Australian Capital Territory, Australia
- Australian Centre for Ancient DNA, School of Biological Sciences and Environment Institute, University of Adelaide, Adelaide, South Australia, Australia
- ARC Centre of Excellence for Australian Biodiversity and Heritage, University of Adelaide, Adelaide, South Australia, Australia
- Indigenous Genomics, Telethon Kids Institute, Adelaide, South Australia, Australia
| | - Alex Brown
- National Centre for Indigenous Genomics, John Curtin School of Medical Research, Australian National University, Canberra, Australian Capital Territory, Australia
- Indigenous Genomics, Telethon Kids Institute, Adelaide, South Australia, Australia
| | - Gareth Baynam
- Telethon Kids Institute and Division of Paediatrics, Faculty of Health and Medical Sciences, University of Western Australia, Perth, Western Australia, Australia
- Genetic Services of Western Australia, Western Australian Department of Health, Perth, Western Australia, Australia
- Western Australian Register of Developmental Anomalies, Western Australian Department of Health, Perth, Western Australia, Australia
| | - Graham J Mann
- National Centre for Indigenous Genomics, John Curtin School of Medical Research, Australian National University, Canberra, Australian Capital Territory, Australia
| | - Brendan J McMorran
- National Centre for Indigenous Genomics, John Curtin School of Medical Research, Australian National University, Canberra, Australian Capital Territory, Australia
| | - Simon Easteal
- National Centre for Indigenous Genomics, John Curtin School of Medical Research, Australian National University, Canberra, Australian Capital Territory, Australia
| | - Azure Hermes
- National Centre for Indigenous Genomics, John Curtin School of Medical Research, Australian National University, Canberra, Australian Capital Territory, Australia
| | - Misty R Jenkins
- Immunology Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, Australia
| | - Hardip R Patel
- National Centre for Indigenous Genomics, John Curtin School of Medical Research, Australian National University, Canberra, Australian Capital Territory, Australia.
| | - Ira W Deveson
- Genomics and Inherited Disease Program, Garvan Institute of Medical Research, Sydney, New South Wales, Australia.
- Centre for Population Genomics, Garvan Institute of Medical Research and Murdoch Children's Research Institute, Darlinghurst, New South Wales, Australia.
- Faculty of Medicine, University of New South Wales, Sydney, New South Wales, Australia.
| |
Collapse
|
5
|
Taira M, Mugikura S, Mori N, Hozawa A, Saito T, Nakamura T, Kiyomoto H, Kobayashi T, Ogishima S, Nagami F, Uruno A, Shimizu R, Kobayashi T, Yasuda J, Kure S, Sakurai M, Motoike IN, Kumada K, Nakaya N, Obara T, Oba K, Sekiguchi A, Thyreau B, Mutoh T, Takano Y, Abe M, Maikusa N, Tatewaki Y, Taki Y, Yaegashi N, Tomita H, Kinoshita K, Kuriyama S, Fuse N, Yamamoto M. Tohoku Medical Megabank Brain Magnetic Resonance Imaging Study: Rationale, Design, and Background. JMA J 2023; 6:246-264. [PMID: 37560377 PMCID: PMC10407421 DOI: 10.31662/jmaj.2022-0220] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2022] [Accepted: 04/24/2023] [Indexed: 08/11/2023] Open
Abstract
The Tohoku Medical Megabank Brain Magnetic Resonance Imaging Study (TMM Brain MRI Study) was established to collect multimodal information through neuroimaging and neuropsychological assessments to evaluate the cognitive function and mental health of residents who experienced the Great East Japan Earthquake (GEJE) and associated tsunami. The study also aimed to promote advances in personalized healthcare and medicine related to mental health and cognitive function among the general population. We recruited participants for the first (baseline) survey starting in July 2014, enrolling individuals who were participating in either the TMM Community-Based Cohort Study (TMM CommCohort Study) or the TMM Birth and Three-Generation Cohort Study (TMM BirThree Cohort Study). We collected multiple magnetic resonance imaging (MRI) sequences, including 3D T1-weighted sequences, magnetic resonance angiography (MRA), diffusion tensor imaging (DTI), pseudo-continuous arterial spin labeling (pCASL), and three-dimensional fluid-attenuated inversion recovery (FLAIR) sequences. To assess neuropsychological status, we used both questionnaire- and interview-based rating scales. The former assessments included the Tri-axial Coping Scale, Impact of Event Scale in Japanese, Profile of Mood States, and 15-item Depression, Anxiety, and Stress Scale, whereas the latter assessments included the Mini-Mental State Examination, Japanese version. A total of 12,164 individuals were recruited for the first (baseline) survey, including those unable to complete all assessments. In parallel, we returned the MRI results to the participants and subsequently shared the MRI data through the TMM Biobank. At present, the second (first follow-up) survey of the study started in October 2019 is underway. In this study, we established a large and comprehensive database that included robust neuroimaging data as well as psychological and cognitive assessment data. In combination with genomic and omics data already contained in the TMM Biobank database, these data could provide new insights into the relationships of pathological processes with neuropsychological disorders, including age-related cognitive impairment.
Collapse
Affiliation(s)
- Makiko Taira
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
- Graduate School of Medicine, Tohoku University, Sendai, Japan
- Tohoku University Hospital, Tohoku University, Sendai, Japan
| | - Shunji Mugikura
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
- Graduate School of Medicine, Tohoku University, Sendai, Japan
- Tohoku University Hospital, Tohoku University, Sendai, Japan
| | - Naoko Mori
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
- Graduate School of Medicine, Tohoku University, Sendai, Japan
- Tohoku University Hospital, Tohoku University, Sendai, Japan
| | - Atsushi Hozawa
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
| | - Tomo Saito
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
- Advanced Research Center for Innovations in Next-Generation Medicine, Tohoku University, Sendai, Japan
| | - Tomohiro Nakamura
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
| | - Hideyasu Kiyomoto
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
| | - Tadao Kobayashi
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
| | - Soichi Ogishima
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
- Advanced Research Center for Innovations in Next-Generation Medicine, Tohoku University, Sendai, Japan
| | - Fuji Nagami
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
- Graduate School of Medicine, Tohoku University, Sendai, Japan
| | - Akira Uruno
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
- Graduate School of Medicine, Tohoku University, Sendai, Japan
| | - Ritsuko Shimizu
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
- Graduate School of Medicine, Tohoku University, Sendai, Japan
| | - Tomoko Kobayashi
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
- Graduate School of Medicine, Tohoku University, Sendai, Japan
- Tohoku University Hospital, Tohoku University, Sendai, Japan
| | - Jun Yasuda
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
- Miyagi Cancer Center, Natori, Japan
| | - Shigeo Kure
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
- Graduate School of Medicine, Tohoku University, Sendai, Japan
- Tohoku University Hospital, Tohoku University, Sendai, Japan
- Miyagi Children's Hospital, Sendai, Japan
| | - Miyuki Sakurai
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
| | - Ikuko N Motoike
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
- Graduate School of Information Sciences, Tohoku University, Sendai, Japan
| | - Kazuki Kumada
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
| | - Naoki Nakaya
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
| | - Taku Obara
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
| | - Kentaro Oba
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
- Graduate School of Medicine, Tohoku University, Sendai, Japan
- Institute of Development, Aging and Cancer, Tohoku University, Sendai, Japan
| | - Atsushi Sekiguchi
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
- Integrative Brain Imaging Center, National Center of Neurology and Psychiatry, Tokyo, Japan
| | - Benjamin Thyreau
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
- Graduate School of Medicine, Tohoku University, Sendai, Japan
- Institute of Development, Aging and Cancer, Tohoku University, Sendai, Japan
| | - Tatsushi Mutoh
- Graduate School of Medicine, Tohoku University, Sendai, Japan
- Institute of Development, Aging and Cancer, Tohoku University, Sendai, Japan
| | - Yuji Takano
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
- Institute of Development, Aging and Cancer, Tohoku University, Sendai, Japan
- University of Human Environments, Matsuyama, Japan
| | - Mitsunari Abe
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
- Integrative Brain Imaging Center, National Center of Neurology and Psychiatry, Tokyo, Japan
- Graduate School of Medicine, Fukushima Medical University, Fukushima, Japan
| | - Norihide Maikusa
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
- Integrative Brain Imaging Center, National Center of Neurology and Psychiatry, Tokyo, Japan
- Graduate School of Art and Science, University of Tokyo, Tokyo, Japan
| | - Yasuko Tatewaki
- Graduate School of Medicine, Tohoku University, Sendai, Japan
- Institute of Development, Aging and Cancer, Tohoku University, Sendai, Japan
| | - Yasuyuki Taki
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
- Graduate School of Medicine, Tohoku University, Sendai, Japan
- Institute of Development, Aging and Cancer, Tohoku University, Sendai, Japan
| | - Nobuo Yaegashi
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
- Graduate School of Medicine, Tohoku University, Sendai, Japan
- Tohoku University Hospital, Tohoku University, Sendai, Japan
- Advanced Research Center for Innovations in Next-Generation Medicine, Tohoku University, Sendai, Japan
| | - Hiroaki Tomita
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
- Graduate School of Medicine, Tohoku University, Sendai, Japan
- Tohoku University Hospital, Tohoku University, Sendai, Japan
| | - Kengo Kinoshita
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
- Advanced Research Center for Innovations in Next-Generation Medicine, Tohoku University, Sendai, Japan
- Graduate School of Information Sciences, Tohoku University, Sendai, Japan
| | - Shinichi Kuriyama
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
- Graduate School of Medicine, Tohoku University, Sendai, Japan
- International Research Institute of Disaster Science, Tohoku University, Sendai, Japan
- The United Centers for Advanced Research and Translational Medicine, Tohoku University, Sendai, Japan
| | - Nobuo Fuse
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
- Graduate School of Medicine, Tohoku University, Sendai, Japan
- Advanced Research Center for Innovations in Next-Generation Medicine, Tohoku University, Sendai, Japan
| | - Masayuki Yamamoto
- Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
- Graduate School of Medicine, Tohoku University, Sendai, Japan
- Advanced Research Center for Innovations in Next-Generation Medicine, Tohoku University, Sendai, Japan
| |
Collapse
|
6
|
Samarakoon H, Ferguson JM, Gamaarachchi H, Deveson IW. Accelerated nanopore basecalling with SLOW5 data format. Bioinformatics 2023; 39:btad352. [PMID: 37252813 PMCID: PMC10261880 DOI: 10.1093/bioinformatics/btad352] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 04/12/2023] [Accepted: 05/29/2023] [Indexed: 06/01/2023] Open
Abstract
MOTIVATION Nanopore sequencing is emerging as a key pillar in the genomic technology landscape but computational constraints limiting its scalability remain to be overcome. The translation of raw current signal data into DNA or RNA sequence reads, known as 'basecalling', is a major friction in any nanopore sequencing workflow. Here, we exploit the advantages of the recently developed signal data format 'SLOW5' to streamline and accelerate nanopore basecalling on high-performance computing (HPC) and cloud environments. RESULTS SLOW5 permits highly efficient sequential data access, eliminating a potential analysis bottleneck. To take advantage of this, we introduce Buttery-eel, an open-source wrapper for Oxford Nanopore's Guppy basecaller that enables SLOW5 data access, resulting in performance improvements that are essential for scalable, affordable basecalling. AVAILABILITY AND IMPLEMENTATION Buttery-eel is available at https://github.com/Psy-Fer/buttery-eel.
Collapse
Affiliation(s)
- Hiruna Samarakoon
- Genomics Pillar, Garvan Institute of Medical Research, Sydney, NSW 2010, Australia
- Centre for Population Genomics, Garvan Institute of Medical Research and Murdoch Children’s Research Institute, Australia
- School of Computer Science and Engineering, University of New South Wales, Sydney, NSW 2052, Australia
| | - James M Ferguson
- Genomics Pillar, Garvan Institute of Medical Research, Sydney, NSW 2010, Australia
- Centre for Population Genomics, Garvan Institute of Medical Research and Murdoch Children’s Research Institute, Australia
| | - Hasindu Gamaarachchi
- Genomics Pillar, Garvan Institute of Medical Research, Sydney, NSW 2010, Australia
- Centre for Population Genomics, Garvan Institute of Medical Research and Murdoch Children’s Research Institute, Australia
- School of Computer Science and Engineering, University of New South Wales, Sydney, NSW 2052, Australia
| | - Ira W Deveson
- Genomics Pillar, Garvan Institute of Medical Research, Sydney, NSW 2010, Australia
- Centre for Population Genomics, Garvan Institute of Medical Research and Murdoch Children’s Research Institute, Australia
- Faculty of Medicine, University of New South Wales, Sydney, NSW 2052, Australia
| |
Collapse
|