1
|
Samra H, Li A, Soh B. GENE2D: A NoSQL Integrated Data Repository of Genetic Disorders Data. Healthcare (Basel) 2020; 8:healthcare8030257. [PMID: 32781728 PMCID: PMC7551627 DOI: 10.3390/healthcare8030257] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2020] [Revised: 07/27/2020] [Accepted: 08/04/2020] [Indexed: 11/16/2022] Open
Abstract
There are few sources from which to obtain clinical and genetic data for use in research in Saudi Arabia. Numerous obstacles led to the difficulty of integrating these data from silos and scattered sources to provide standardized access to large data sets for patients with common health conditions. To this end, we sought to contribute to this area and offer a practical and easy-to-implement solution. In this paper, we aim to design and implement a "not only SQL" (NoSQL) based integration framework to generate an Integrated Data Repository of Genetic Disorders Data (GENE2D) to integrate data from various genetic clinics and research centers in Saudi Arabia and provide an easy-to-use query interface for researchers to conduct their studies on large datasets. The major components involved in the GENE2D architecture consists of the data sources, the integrated data repository (IDR) as a central database, and the application interface. The IDR uses a NoSQL document store via MongoDB (an open source document-oriented database program) as a backend database. The application interface called Query Builder provides multiple services for data retrieval from the database using a custom query to answer simple or complex research questions. The GENE2D system demonstrates its potential to help grow and develop a national genetic disorders database in Saudi Arabia.
Collapse
Affiliation(s)
- Halima Samra
- Department of Computer Science and Information Technology, La Trobe University, Melbourne, VIC 3086, Australia;
- Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah 21589, Saudi Arabia
- Correspondence:
| | - Alice Li
- La Trobe Business School, La Trobe University, Melbourne, VIC 3086, Australia;
| | - Ben Soh
- Department of Computer Science and Information Technology, La Trobe University, Melbourne, VIC 3086, Australia;
| |
Collapse
|
2
|
Yohe SL, Carter AB, Pfeifer JD, Crawford JM, Cushman-Vokoun A, Caughron S, Leonard DGB. Standards for Clinical Grade Genomic Databases. Arch Pathol Lab Med 2016; 139:1400-12. [PMID: 26516938 DOI: 10.5858/arpa.2014-0568-cp] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
CONTEXT Next-generation sequencing performed in a clinical environment must meet clinical standards, which requires reproducibility of all aspects of the testing. Clinical-grade genomic databases (CGGDs) are required to classify a variant and to assist in the professional interpretation of clinical next-generation sequencing. Applying quality laboratory standards to the reference databases used for sequence-variant interpretation presents a new challenge for validation and curation. OBJECTIVES To define CGGD and the categories of information contained in CGGDs and to frame recommendations for the structure and use of these databases in clinical patient care. DESIGN Members of the College of American Pathologists Personalized Health Care Committee reviewed the literature and existing state of genomic databases and developed a framework for guiding CGGD development in the future. RESULTS Clinical-grade genomic databases may provide different types of information. This work group defined 3 layers of information in CGGDs: clinical genomic variant repositories, genomic medical data repositories, and genomic medicine evidence databases. The layers are differentiated by the types of genomic and medical information contained and the utility in assisting with clinical interpretation of genomic variants. Clinical-grade genomic databases must meet specific standards regarding submission, curation, and retrieval of data, as well as the maintenance of privacy and security. CONCLUSION These organizing principles for CGGDs should serve as a foundation for future development of specific standards that support the use of such databases for patient care.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Debra G B Leonard
- From the Department of Laboratory Medicine and Pathology, University of Minnesota Medical Center, Minneapolis (Dr Yohe); the Department of Pathology and Laboratory Medicine and the Department of Biomedical Informatics, Emory University, Atlanta, Georgia (Dr Carter); the Department of Pathology, Washington University School of Medicine, St. Louis, Missouri (Dr Pfeifer); the Department of Pathology and Laboratory Medicine, Hofstra North Shore-Long Island Jewish School of Medicine, Hempstead, New York (Dr Crawford); the Department of Pathology and Microbiology, University of Nebraska Medical Center, Omaha (Dr Cushman-Vokoun); the MAWD Pathology Group, North Kansas City, Missouri (Dr Caughron); and the Department of Pathology and Laboratory Medicine, University of Vermont College of Medicine, Burlington (Dr Leonard)
| |
Collapse
|
3
|
Hassan NNN, Plazzer JP, Smith TD, Halim-Fikri H, Macrae F, Zubaidi AAL, Zilfalil BA. Harmonizing the interpretation of genetic variants across the world: the Malaysian experience. BMC Res Notes 2016; 9:125. [PMID: 26915360 PMCID: PMC4768322 DOI: 10.1186/s13104-015-1798-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2015] [Accepted: 12/14/2015] [Indexed: 12/18/2022] Open
Abstract
Background Databases for gene variants are very useful for sharing genetic data and to facilitate the understanding of the genetic basis of diseases. This report summarises the issues surrounding the
development of the Malaysian Human Variome Project Country Node. The focus is on human germline variants. Somatic variants, mitochondrial variants and other types of genetic variation have corresponding databases which are not covered here, as they have specific issues that do not necessarily apply to germline variations. Results The ethical, legal, social issues,
intellectual property, ownership of the data, information technology implementation, and efforts to improve the standards and systems used in data sharing are discussed. Conclusion An overarching framework such as provided by the Human Variome Project to co-ordinate activities is invaluable. Country Nodes, such as MyHVP, enable human gene variation associated with human diseases to be collected, stored and shared by all disciplines (clinicians, molecular biologists, pathologists, bioinformaticians) for a consistent interpretation of genetic variants locally and across the world.
Collapse
Affiliation(s)
- Nik Norliza Nik Hassan
- Department of Colorectal Medicine and Genetics, The Royal Melbourne Hospital, Victoria, Australia.
| | - John-Paul Plazzer
- Department of Colorectal Medicine and Genetics, The Royal Melbourne Hospital, Victoria, Australia.
| | - Timothy D Smith
- Human Variome Project, The University of Melbourne, Victoria, Australia.
| | - Hashim Halim-Fikri
- Department of Colorectal Medicine and Genetics, The Royal Melbourne Hospital, Victoria, Australia. .,Faculty of Medicine, Medical Campus, Universiti Sultan Zainal Abidin (UniSZA), Jln. Sultan Mahmud, 20400, Kuala Terengganu, Terengganu, Malaysia.
| | - Finlay Macrae
- Department of Colorectal Medicine and Genetics, The Royal Melbourne Hospital, Victoria, Australia. .,Department of Medicine, The University of Melbourne, Victoria, Australia.
| | - A A L Zubaidi
- Faculty of Medicine, Medical Campus, Universiti Sultan Zainal Abidin (UniSZA), Jln. Sultan Mahmud, 20400, Kuala Terengganu, Terengganu, Malaysia.
| | - Bin Alwi Zilfalil
- Department of Pediatric, School of Medical Sciences, Health Campus, Universiti Sains Malaysia, Kubang Kerian, 16150, Kota Bharu, Kelantan, Malaysia.
| | | |
Collapse
|
5
|
Halim-Fikri H, Etemad A, Abdul Latif AZ, Merican AF, Baig AA, Annuar AA, Ismail E, Salahshourifar I, Liza-Sharmini AT, Ramli M, Shah MI, Johan MF, Hassan NNN, Abdul-Aziz NM, Mohd Noor NH, Nur-Shafawati AR, Hassan R, Bahar R, Zain RB, Yusoff SM, Yusoff S, Tan SG, Thong MK, Wan-Isa H, Abdullah WZ, Mohamed Z, Abdul Latiff Z, Zilfalil BA. The first Malay database toward the ethnic-specific target molecular variation. BMC Res Notes 2015; 8:176. [PMID: 25925844 PMCID: PMC4440489 DOI: 10.1186/s13104-015-1123-y] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2014] [Accepted: 04/15/2015] [Indexed: 12/11/2022] Open
Abstract
BACKGROUND The Malaysian Node of the Human Variome Project (MyHVP) is one of the eighteen official Human Variome Project (HVP) country-specific nodes. Since its inception in 9(th) October 2010, MyHVP has attracted the significant number of Malaysian clinicians and researchers to participate and contribute their data to this project. MyHVP also act as the center of coordination for genotypic and phenotypic variation studies of the Malaysian population. A specialized database was developed to store and manage the data based on genetic variations which also associated with health and disease of Malaysian ethnic groups. This ethnic-specific database is called the Malaysian Node of the Human Variome Project database (MyHVPDb). FINDINGS Currently, MyHVPDb provides only information about the genetic variations and mutations found in the Malays. In the near future, it will expand for the other Malaysian ethnics as well. The data sets are specified based on diseases or genetic mutation types which have three main subcategories: Single Nucleotide Polymorphism (SNP), Copy Number Variation (CNV) followed by the mutations which code for the common diseases among Malaysians. MyHVPDb has been open to the local researchers, academicians and students through the registration at the portal of MyHVP ( http://hvpmalaysia.kk.usm.my/mhgvc/index.php?id=register ). CONCLUSIONS This database would be useful for clinicians and researchers who are interested in doing a study on genomics population and genetic diseases in order to obtain up-to-date and accurate information regarding the population-specific variations and also useful for those in countries with similar ethnic background.
Collapse
Affiliation(s)
- Hashim Halim-Fikri
- Department of Pediatric, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian, 16150, Kelantan, Malaysia. .,Faculty of Medicine, Universiti Sultan Zainal Abidin (UniSZA), 20400, Kuala Terengganu, Terengganu, Malaysia.
| | - Ali Etemad
- Department of Pediatric, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian, 16150, Kelantan, Malaysia.
| | - Ahmad Zubaidi Abdul Latif
- Faculty of Medicine, Universiti Sultan Zainal Abidin (UniSZA), 20400, Kuala Terengganu, Terengganu, Malaysia.
| | - Amir Feisal Merican
- Centre of Research for Computational Sciences and Informatics in Biology, Bioindustry, Environment, Agriculture & Healthcare, University of Malaya, 50603, Kuala Lumpur, Malaysia. .,Institute of Biological Sciences, University of Malaya, 50603, Kuala Lumpur, Malaysia.
| | - Atif Amin Baig
- Faculty of Medicine, Universiti Sultan Zainal Abidin (UniSZA), 20400, Kuala Terengganu, Terengganu, Malaysia. .,Molecular Medicine Cluster, Biomedical Center, Faculty of Medicine, Universiti Sultan Zainal Abidin (UniSZA), 20400, Kuala Terengganu, Terengganu, Malaysia.
| | - Azlina Ahmad Annuar
- Department of Biomedical Science, Faculty of Medicine, University of Malaya, Kuala Lumpur, Malaysia.
| | - Endom Ismail
- School of Biosciences and Biotechnology, National University of Malaysia, 43600, Bangi, Selangor, Malaysia.
| | - Iman Salahshourifar
- Human Genome Center, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian, 16150, Kelantan, Malaysia.
| | - Ahmad Tajudin Liza-Sharmini
- Department of Ophthalmology, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian, 16150, Kelantan, Malaysia.
| | - Marini Ramli
- Department of Hematology, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian, 16150, Kelantan, Malaysia.
| | - Mohamed Irwan Shah
- Molecular Medicine Cluster, Biomedical Center, Faculty of Medicine, Universiti Sultan Zainal Abidin (UniSZA), 20400, Kuala Terengganu, Terengganu, Malaysia.
| | - Muhammad Farid Johan
- Department of Hematology, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian, 16150, Kelantan, Malaysia.
| | | | | | - Noor Haslina Mohd Noor
- Department of Hematology, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian, 16150, Kelantan, Malaysia.
| | - Ab Rajab Nur-Shafawati
- Human Genome Center, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian, 16150, Kelantan, Malaysia.
| | - Rosline Hassan
- Department of Hematology, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian, 16150, Kelantan, Malaysia.
| | - Rosnah Bahar
- Department of Hematology, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian, 16150, Kelantan, Malaysia.
| | - Rosnah Binti Zain
- Oral Cancer Research Coordinating Centre, Faculty of Dentistry, University of Malaya, Kuala Lumpur, Malaysia.
| | - Shafini Mohamed Yusoff
- Department of Hematology, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian, 16150, Kelantan, Malaysia.
| | - Surini Yusoff
- Department of Pediatric, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian, 16150, Kelantan, Malaysia.
| | - Soon Guan Tan
- Faculty of Biotechnology & Biomolecular Sciences, Universiti Putra Malaysia, Serdang, Selangor, Malaysia.
| | - Meow-Keong Thong
- Department of Pediatrics, Faculty of Medicine, University of Malaya, Kuala Lumpur, Malaysia.
| | - Hatin Wan-Isa
- Human Genome Center, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian, 16150, Kelantan, Malaysia.
| | - Wan Zaidah Abdullah
- Department of Hematology, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian, 16150, Kelantan, Malaysia.
| | - Zahurin Mohamed
- Department of Pharmacology, Faculty of Medicine, University of Malaya, Kuala Lumpur, Malaysia.
| | - Zarina Abdul Latiff
- Department of Pediatrics, Faculty of Medicine, Universiti Kebangsaan Malaysia Medical Centre (UKMMC), Kuala Lumpur, Malaysia.
| | - Bin Alwi Zilfalil
- Department of Pediatric, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian, 16150, Kelantan, Malaysia.
| | | |
Collapse
|
6
|
Kaput J, van Ommen B, Kremer B, Priami C, Monteiro JP, Morine M, Pepping F, Diaz Z, Fenech M, He Y, Albers R, Drevon CA, Evelo CT, Hancock REW, Ijsselmuiden C, Lumey LH, Minihane AM, Muller M, Murgia C, Radonjic M, Sobral B, West KP. Consensus statement understanding health and malnutrition through a systems approach: the ENOUGH program for early life. GENES & NUTRITION 2014; 9:378. [PMID: 24363221 PMCID: PMC3896628 DOI: 10.1007/s12263-013-0378-y] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/19/2013] [Accepted: 12/02/2013] [Indexed: 12/20/2022]
Abstract
Nutrition research, like most biomedical disciplines, adopted and often uses experimental approaches based on Beadle and Tatum's one gene-one polypeptide hypothesis, thereby reducing biological processes to single reactions or pathways. Systems thinking is needed to understand the complexity of health and disease processes requiring measurements of physiological processes, as well as environmental and social factors, which may alter the expression of genetic information. Analysis of physiological processes with omics technologies to assess systems' responses has only become available over the past decade and remains costly. Studies of environmental and social conditions known to alter health are often not connected to biomedical research. While these facts are widely accepted, developing and conducting comprehensive research programs for health are often beyond financial and human resources of single research groups. We propose a new research program on essential nutrients for optimal underpinning of growth and health (ENOUGH) that will use systems approaches with more comprehensive measurements and biostatistical analysis of the many biological and environmental factors that influence undernutrition. Creating a knowledge base for nutrition and health is a necessary first step toward developing solutions targeted to different populations in diverse social and physical environments for the two billion undernourished people in developed and developing economies.
Collapse
Affiliation(s)
- Jim Kaput
- Clinical Translation Unit, Nestle Institute of Health Sciences, Lausanne, Switzerland,
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
8
|
Patrinos GP, Smith TD, Howard H, Al-Mulla F, Chouchane L, Hadjisavvas A, Hamed SA, Li XT, Marafie M, Ramesar RS, Ramos FJ, de Ravel T, El-Ruby MO, Shrestha TR, Sobrido MJ, Tadmouri G, Witsch-Baumgartner M, Zilfalil BA, Auerbach AD, Carpenter K, Cutting GR, Dung VC, Grody W, Hasler J, Jorde L, Kaput J, Macek M, Matsubara Y, Padilla C, Robinson H, Rojas-Martinez A, Taylor GR, Vihinen M, Weber T, Burn J, Qi M, Cotton RGH, Rimoin D. Human Variome Project country nodes: documenting genetic information within a country. Hum Mutat 2012; 33:1513-9. [PMID: 22753370 DOI: 10.1002/humu.22147] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2012] [Accepted: 06/04/2012] [Indexed: 11/09/2022]
Abstract
The Human Variome Project (http://www.humanvariomeproject.org) is an international effort aiming to systematically collect and share information on all human genetic variation. The two main pillars of this effort are gene/disease-specific databases and a network of Human Variome Project Country Nodes. The latter are nationwide efforts to document the genomic variation reported within a specific population. The development and successful operation of the Human Variome Project Country Nodes are of utmost importance to the success of Human Variome Project's aims and goals because they not only allow the genetic burden of disease to be quantified in different countries, but also provide diagnosticians and researchers access to an up-to-date resource that will assist them in their daily clinical practice and biomedical research, respectively. Here, we report the discussions and recommendations that resulted from the inaugural meeting of the International Confederation of Countries Advisory Council, held on 12th December 2011, during the 2011 Human Variome Project Beijing Meeting. We discuss the steps necessary to maximize the impact of the Country Node effort for developing regional and country-specific clinical genetics resources and summarize a few well-coordinated genetic data collection initiatives that would serve as paradigms for similar projects.
Collapse
Affiliation(s)
- George P Patrinos
- Department of Pharmacy, School of Health Sciences, University of Patras, Patras, Greece.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|