Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

Download

Total Articles

66
(from Reference Citation Analysis)

Article PDFs (16)

Cited by > 0 (54)

Searched Name

Ka Yee Yeung

Ranked By

Results Analysis

Year Published Analysis
Article Type Analysis
Publication Title Analysis
Category Analysis

Results Analysis

Indexed Articles

Year Published

Show more Refine

Article Type

Show more Refine

Article Statistics

Refine

MESH Headings

Show more Refine

First Author

Show more Refine

First Author Affiliations

Show more Refine

Authors

Show more Refine

Publication Titles

Show more Refine

Grant Agencies

Show more Refine

Countries/Regions

Show more Refine

Affiliations

Show more Refine

Corresponding Author Affiliations

Show more Refine

Category

Show more Refine

Number

Citation Analysis

McCarthy MS, Colburn ZT, Yeung KY, Gillette LH, Hung LH, Elshaw E. A Randomized Controlled Trial of Precision Nutrition Counseling for Service Members at Risk for Metabolic Syndrome. Mil Med 2023;188:606-613. [PMID: 37948286 DOI: 10.1093/milmed/usad276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Revised: 04/13/2023] [Accepted: 07/29/2023] [Indexed: 11/12/2023] Open

Abstract

INTRODUCTION

Metabolic syndrome (MetS) is a threat to the active component military as it impacts health, readiness, retention, and cost to the Military Health System. The most prevalent risk factors documented in service members' health records are high blood pressure (BP), low high-density lipoprotein cholesterol, and elevated triglycerides. Other risk factors include abdominal obesity and elevated fasting blood glucose. Precision nutrition counseling and wellness software applications have demonstrated positive results for weight management when coupled with high levels of participant engagement and motivation.

MATERIALS AND METHODS

In this prospective randomized controlled trial, trained registered dietitians conducted nutrition counseling using results of targeted sequencing, biomarkers, and expert recommendations to reduce the risk for MetS. Upon randomization, the treatment arm initiated six weekly sessions and the control arm received educational pamphlets. An eHealth application captured diet and physical activity. Anthropometrics and BP were measured at baseline, 6 weeks, and 12 weeks, and biomarkers were measured at baseline and 12 weeks. The primary outcome was a change in weight at 12 weeks. Statistical analysis included descriptive statistics and t-tests or analysis of variance with significance set at P < .05.

RESULTS

Overall, 138 subjects enrolled from November 2019 to February 2021 between two military bases; 107 completed the study. Demographics were as follows: 66% male, mean age 31 years, 66% married, and 49% Caucasian and non-Hispanic. Weight loss was not significant between groups or sites at 12 weeks. Overall, 27% of subjects met the diagnostic criteria for MetS on enrollment and 17.8% upon study completion. High deleterious variant prevalence was identified for genes with single-nucleotide polymorphisms linked to obesity (40%), cholesterol (38%), and BP (58%). Overall, 65% of subjects had low 25(OH)D upon enrollment; 45% remained insufficient at study completion. eHealth app had low adherence yet sufficient correlation with a valid reference.

CONCLUSIONS

Early signs of progress with weight loss at 6 weeks were not sustained at 12 weeks. DNA-based nutrition counseling was not efficacious for weight loss.

Collapse

Sala-Torra O, Reddy S, Hung LH, Beppu L, Wu D, Radich J, Yeung KY, Yeung CCS. Rapid detection of myeloid neoplasm fusions using single-molecule long-read sequencing. PLOS Glob Public Health 2023;3:e0002267. [PMID: 37699001 PMCID: PMC10497132 DOI: 10.1371/journal.pgph.0002267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Accepted: 07/17/2023] [Indexed: 09/14/2023]

Hoang V, Hung LH, Perez D, Deng H, Schooley R, Arumilli N, Yeung KY, Lloyd W. Container Profiler: Profiling resource utilization of containerized big data pipelines. Gigascience 2022;12:giad069. [PMID: 37624874 PMCID: PMC10452954 DOI: 10.1093/gigascience/giad069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2023] [Revised: 08/02/2023] [Accepted: 08/15/2023] [Indexed: 08/27/2023] Open

Hung LH, Straw E, Reddy S, Schmitz R, Colburn Z, Yeung KY. Cloud-enabled Biodepot workflow builder integrates image processing using Fiji with reproducible data analysis using Jupyter notebooks. Sci Rep 2022;12:14920. [PMID: 36056115 PMCID: PMC9440253 DOI: 10.1038/s41598-022-19173-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2022] [Accepted: 08/25/2022] [Indexed: 11/16/2022] Open

Abstract

Modern biomedical image analyses workflows contain multiple computational processing tasks giving rise to problems in reproducibility. In addition, image datasets can span both spatial and temporal dimensions, with additional channels for fluorescence and other data, resulting in datasets that are too large to be processed locally on a laptop. For omics analyses, software containers have been shown to enhance reproducibility, facilitate installation and provide access to scalable computational resources on the cloud. However, most image analyses contain steps that are graphical and interactive, features that are not supported by most omics execution engines. We present the containerized and cloud-enabled Biodepot-workflow-builder platform that supports graphics from software containers and has been extended for image analyses. We demonstrate the potential of our modular approach with multi-step workflows that incorporate the popular and open-source Fiji suite for image processing. One of our examples integrates fully interactive ImageJ macros with Jupyter notebooks. Our second example illustrates how the complicated cloud setup of an computationally intensive process such as stitching 3D digital pathology datasets using BigStitcher can be automated and simplified. In both examples, users can leverage a form-based graphical interface to execute multi-step workflows with a single click, using the provided sample data and preset input parameters. Alternatively, users can interactively modify the image processing steps in the workflow, apply the workflows to their own data, change the input parameters and macros. By providing interactive graphics support to software containers, our modular platform supports reproducible image analysis workflows, simplified access to cloud resources for analysis of large datasets, and integration across different applications such as Jupyter.

Collapse

Chan CY, Tang MHY, Wong KC, Chong YK, Yeung KY, Mak TWL. Acute poisoning by dexmedetomidine-containing chewing gum in a child. Pathology 2021;54:666-667. [PMID: 34801281 DOI: 10.1016/j.pathol.2021.08.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2021] [Accepted: 08/20/2021] [Indexed: 10/19/2022]

Reddy S, Hung LH, Sala-Torra O, Radich JP, Yeung CC, Yeung KY. A graphical, interactive and GPU-enabled workflow to process long-read sequencing data. BMC Genomics 2021;22:626. [PMID: 34425749 PMCID: PMC8381503 DOI: 10.1186/s12864-021-07927-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2021] [Accepted: 08/10/2021] [Indexed: 12/18/2022] Open

Abstract

Background

Long-read sequencing has great promise in enabling portable, rapid molecular-assisted cancer diagnoses. A key challenge in democratizing long-read sequencing technology in the biomedical and clinical community is the lack of graphical bioinformatics software tools which can efficiently process the raw nanopore reads, support graphical output and interactive visualizations for interpretations of results. Another obstacle is that high performance software tools for long-read sequencing data analyses often leverage graphics processing units (GPU), which is challenging and time-consuming to configure, especially on the cloud.

Results

We present a graphical cloud-enabled workflow for fast, interactive analysis of nanopore sequencing data using GPUs. Users customize parameters, monitor execution and visualize results through an accessible graphical interface. The workflow and its components are completely containerized to ensure reproducibility and facilitate installation of the GPU-enabled software. We also provide an Amazon Machine Image (AMI) with all software and drivers pre-installed for GPU computing on the cloud. Most importantly, we demonstrate the potential of applying our software tools to reduce the turnaround time of cancer diagnostics by generating blood cancer (NB4, K562, ME1, 238 MV4;11) cell line Nanopore data using the Flongle adapter. We observe a 29x speedup and a 93x reduction in costs for the rate-limiting basecalling step in the analysis of blood cancer cell line data.

Conclusions

Our interactive and efficient software tools will make analyses of Nanopore data using GPU and cloud computing accessible to biomedical and clinical scientists, thus facilitating the adoption of cost effective, fast, portable and real-time long-read sequencing.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12864-021-07927-1.

Collapse

Hung LH, Lloyd W, Agumbe Sridhar R, Athmalingam Ravishankar SD, Xiong Y, Sobie E, Yeung KY. Holistic optimization of an RNA-seq workflow for multi-threaded environments. Bioinformatics 2019;35:4173-4175. [PMID: 30859176 DOI: 10.1093/bioinformatics/btz169] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2018] [Revised: 02/01/2019] [Accepted: 03/09/2019] [Indexed: 11/12/2022] Open

Liang X, Young WC, Hung LH, Raftery AE, Yeung KY. Integration of Multiple Data Sources for Gene Network Inference Using Genetic Perturbation Data. J Comput Biol 2019;26:1113-1129. [PMID: 31009236 PMCID: PMC6786343 DOI: 10.1089/cmb.2019.0036] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open

Young WC, Yeung KY, Raftery AE. Identifying Dynamical Time Series Model Parameters from Equilibrium Samples, with Application to Gene Regulatory Networks. STAT MODEL 2019;19:444-465. [PMID: 33824624 DOI: 10.1177/1471082x18776577] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Fourati S, Talla A, Mahmoudian M, Burkhart JG, Klén R, Henao R, Yu T, Aydın Z, Yeung KY, Ahsen ME, Almugbel R, Jahandideh S, Liang X, Nordling TEM, Shiga M, Stanescu A, Vogel R, Pandey G, Chiu C, McClain MT, Woods CW, Ginsburg GS, Elo LL, Tsalik EL, Mangravite LM, Sieberts SK. A crowdsourced analysis to identify ab initio molecular signatures predictive of susceptibility to viral infection. Nat Commun 2018;9:4418. [PMID: 30356117 PMCID: PMC6200745 DOI: 10.1038/s41467-018-06735-8] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2018] [Accepted: 09/12/2018] [Indexed: 01/17/2023] Open

Affiliation(s)

Slim Fourati Department of Pathology, School of Medicine, Case Western Reserve University, Cleveland, OH, 44106, USA
Aarthi Talla Department of Pathology, School of Medicine, Case Western Reserve University, Cleveland, OH, 44106, USA
Mehrad Mahmoudian Turku Centre for Biotechnology, University of Turku and Åbo Akademi University, FI-20520, Turku, Finland Department of Future Technologies, University of Turku, FI-20014 Turku, Finland
Joshua G Burkhart Department of Medical Informatics and Clinical Epidemiology, School of Medicine, Oregon Health & Science University, Portland, OR, 97239, USA Laboratory of Evolutionary Genetics, Institute of Ecology and Evolution, University of Oregon, Eugene, OR, 97403, USA
Riku Klén Turku Centre for Biotechnology, University of Turku and Åbo Akademi University, FI-20520, Turku, Finland
Ricardo Henao Duke Center for Applied Genomics and Precision Medicine, Duke University School of Medicine, Durham, NC, 27710, USA Department of Electrical and Computer Engineering, Duke University, Durham, NC, 27708, USA
Thomas Yu Sage Bionetworks, Seattle, WA, 98121, USA
Zafer Aydın Department of Computer Engineering, Abdullah Gul University, Kayseri, 38080, Turkey
Ka Yee Yeung School of Engineering and Technology, University of Washington Tacoma, Tacoma, WA, 98402, USA
Mehmet Eren Ahsen Department of Genetics and Genomic Sciences and Icahn Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
Reem Almugbel School of Engineering and Technology, University of Washington Tacoma, Tacoma, WA, 98402, USA
Samad Jahandideh Origent Data Sciences, Inc., Vienna, VA, 22182, USA
Xiao Liang School of Engineering and Technology, University of Washington Tacoma, Tacoma, WA, 98402, USA
Torbjörn E M Nordling Department of Mechanical Engineering, National Cheng Kung University, Tainan, 70101, Taiwan
Motoki Shiga Department of Electrical, Electronic and Computer Engineering, Faculty of Engineering, Gifu University, Gifu, 501-1193, Japan
Ana Stanescu Department of Genetics and Genomic Sciences and Icahn Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA Department of Computer Science, University of West Georgia, Carrolton, GA, 30116, USA
Robert Vogel Department of Genetics and Genomic Sciences and Icahn Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA IBM T.J. Watson Research Center, Yorktown Heights, NY, 10598, USA
Gaurav Pandey Department of Genetics and Genomic Sciences and Icahn Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
Christopher Chiu Section of Infectious Diseases and Immunity, Imperial College London, London, W12 0NN, UK
Micah T McClain Duke Center for Applied Genomics and Precision Medicine, Duke University School of Medicine, Durham, NC, 27710, USA Medical Service, Durham VA Health Care System, Durham, NC, 27705, USA Department of Medicine, Duke University School of Medicine, Durham, NC, 27710, USA
Christopher W Woods Duke Center for Applied Genomics and Precision Medicine, Duke University School of Medicine, Durham, NC, 27710, USA Medical Service, Durham VA Health Care System, Durham, NC, 27705, USA Department of Medicine, Duke University School of Medicine, Durham, NC, 27710, USA
Geoffrey S Ginsburg Duke Center for Applied Genomics and Precision Medicine, Duke University School of Medicine, Durham, NC, 27710, USA Department of Medicine, Duke University School of Medicine, Durham, NC, 27710, USA
Laura L Elo Turku Centre for Biotechnology, University of Turku and Åbo Akademi University, FI-20520, Turku, Finland
Ephraim L Tsalik Duke Center for Applied Genomics and Precision Medicine, Duke University School of Medicine, Durham, NC, 27710, USA Department of Medicine, Duke University School of Medicine, Durham, NC, 27710, USA Emergency Medicine Service, Durham VA Health Care System, Durham, NC, 27705, USA
Lara M Mangravite Sage Bionetworks, Seattle, WA, 98121, USA.
Solveig K Sieberts Sage Bionetworks, Seattle, WA, 98121, USA.

Collapse

Zhang P, Hung LH, Lloyd W, Yeung KY. Hot-starting software containers for STAR aligner. Gigascience 2018;7:5062793. [PMID: 30085034 PMCID: PMC6131214 DOI: 10.1093/gigascience/giy092] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2017] [Revised: 04/07/2018] [Accepted: 07/17/2018] [Indexed: 01/22/2023] Open

Hung LH, Shi K, Wu M, Young WC, Raftery AE, Yeung KY. fastBMA: scalable network inference and transitive reduction. Gigascience 2018;6:1-10. [PMID: 29020744 PMCID: PMC5632288 DOI: 10.1093/gigascience/gix078] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2017] [Accepted: 08/10/2017] [Indexed: 11/15/2022] Open

Mittal V, Hung LH, Keswani J, Kristiyanto D, Lee SB, Yeung KY. GUIdock-VNC: using a graphical desktop sharing system to provide a browser-based interface for containerized software. Gigascience 2018;6:1-6. [PMID: 28327936 PMCID: PMC5530313 DOI: 10.1093/gigascience/giw013] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2016] [Accepted: 12/16/2016] [Indexed: 11/30/2022] Open

Almugbel R, Hung LH, Hu J, Almutairy A, Ortogero N, Tamta Y, Yeung KY. Reproducible Bioconductor workflows using browser-based interactive notebooks and containers. J Am Med Inform Assoc 2018;25:4-12. [PMID: 29092073 PMCID: PMC6381817 DOI: 10.1093/jamia/ocx120] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2017] [Revised: 08/31/2017] [Accepted: 09/28/2017] [Indexed: 11/14/2022] Open

Abstract

Objective

Bioinformatics publications typically include complex software workflows that are difficult to describe in a manuscript. We describe and demonstrate the use of interactive software notebooks to document and distribute bioinformatics research. We provide a user-friendly tool, BiocImageBuilder, that allows users to easily distribute their bioinformatics protocols through interactive notebooks uploaded to either a GitHub repository or a private server.

Materials and methods

We present four different interactive Jupyter notebooks using R and Bioconductor workflows to infer differential gene expression, analyze cross-platform datasets, process RNA-seq data and KinomeScan data. These interactive notebooks are available on GitHub. The analytical results can be viewed in a browser. Most importantly, the software contents can be executed and modified. This is accomplished using Binder, which runs the notebook inside software containers, thus avoiding the need to install any software and ensuring reproducibility. All the notebooks were produced using custom files generated by BiocImageBuilder.

Results

BiocImageBuilder facilitates the publication of workflows with a point-and-click user interface. We demonstrate that interactive notebooks can be used to disseminate a wide range of bioinformatics analyses. The use of software containers to mirror the original software environment ensures reproducibility of results. Parameters and code can be dynamically modified, allowing for robust verification of published results and encouraging rapid adoption of new methods.

Conclusion

Given the increasing complexity of bioinformatics workflows, we anticipate that these interactive software notebooks will become as necessary for documenting software methods as traditional laboratory notebooks have been for documenting bench protocols, and as ubiquitous.

Collapse

Young WC, Raftery AE, Yeung KY. Model-Based Clustering With Data Correction For Removing Artifacts In Gene Expression Data. Ann Appl Stat 2017;11:1998-2026. [PMID: 30740193 DOI: 10.1214/17-aoas1051] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]

Chung PH, Wong CW, Lai CK, Siu HK, Tsang DN, Yeung KY, Ip DK, Tam PK. A prospective interventional study to examine the effect of a silver alloy and hydrogel-coated catheter on the incidence of catheter-associated urinary tract infection. Hong Kong Med J 2017;23:239-45. [PMID: 28211358 DOI: 10.12809/hkmj164906] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open

Abstract

INTRODUCTION

Catheter-associated urinary tract infection is a major hospital-acquired infection. This study aimed to analyse the effect of a silver alloy and hydrogel-coated catheter on the occurrence of catheter-associated urinary tract infection.

METHODS

This was a 1-year prospective study conducted at a single centre in Hong Kong. Adult patients with an indwelling urinary catheter for longer than 24 hours were recruited. The incidence of catheter-associated urinary tract infection in patients with a conventional latex Foley catheter without hydrogel was compared with that in patients with a silver alloy and hydrogel-coated catheter. The most recent definition of urinary tract infection was based on the latest surveillance definition of the National Healthcare Safety Network managed by Centers for Disease Control and Prevention.

RESULTS

A total of 306 patients were recruited with a similar ratio between males and females. The mean (standard deviation) age was 81.1 (10.5) years. The total numbers of catheter-days were 4352 and 7474 in the silver-coated and conventional groups, respectively. The incidences of catheter-associated urinary tract infection per 1000 catheter-days were 6.4 and 9.4, respectively (P=0.095). There was a 31% reduction in the incidence of catheter-associated urinary tract infection per 1000 catheter-days in the silver-coated group. Escherichia coli was the most commonly involved pathogen (36.7%) of all cases. Subgroup analysis revealed that the protective effect of silver-coated catheter was more pronounced in long-term users as well as female patients with a respective 48% (P=0.027) and 42% (P=0.108) reduction in incidence of catheter-associated urinary tract infection. The mean catheterisation time per person was the longest in patients using a silver-coated catheter (17.0 days) compared with those using a conventional (10.8 days) or both types of catheter (13.6 days) [P=0.01].

CONCLUSIONS

Silver alloy and hydrogel-coated catheters appear to be effective in preventing catheter-associated urinary tract infection based on the latest surveillance definition. The effect is perhaps more prominent in long-term users and female patients.

Collapse

Young WC, Raftery AE, Yeung KY. A posterior probability approach for gene regulatory network inference in genetic perturbation data. Math Biosci Eng 2016;13:1241-1251. [PMID: 27775378 DOI: 10.3934/mbe.2016041] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]

Hung LH, Kristiyanto D, Lee SB, Yeung KY. GUIdock: Using Docker Containers with a Common Graphics User Interface to Address the Reproducibility of Research. PLoS One 2016;11:e0152686. [PMID: 27045593 PMCID: PMC4821530 DOI: 10.1371/journal.pone.0152686] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2015] [Accepted: 03/17/2016] [Indexed: 12/03/2022] Open

Fronczuk M, Raftery AE, Yeung KY. CyNetworkBMA: a Cytoscape app for inferring gene regulatory networks. Source Code Biol Med 2015;10:11. [PMID: 26566394 PMCID: PMC4642660 DOI: 10.1186/s13029-015-0043-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/22/2014] [Accepted: 10/31/2015] [Indexed: 12/31/2022]

Becker PS, Schmitt MW, Loeb LA, Gu W, Wei Q, Xie Z, Carson AR, Martins T, Blau CA, Oehler V, Yeung KY. Correlation of genomic analysis by MyAML with chemotherapy drug sensitivity. J Clin Oncol 2015. [DOI: 10.1200/jco.2015.33.15_suppl.7080] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Young WC, Raftery AE, Yeung KY. Fast Bayesian inference for gene regulatory networks using ScanBMA. BMC Syst Biol 2014;8:47. [PMID: 24742092 PMCID: PMC4006459 DOI: 10.1186/1752-0509-8-47] [Citation(s) in RCA: 64] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/27/2013] [Accepted: 04/04/2014] [Indexed: 11/22/2022]

Dickinson A, Yeung KY, Donoghue J, Baker MJ, Kelly RD, McKenzie M, Johns TG, St John JC. The regulation of mitochondrial DNA copy number in glioblastoma cells. Cell Death Differ 2013;20:1644-53. [PMID: 23995230 PMCID: PMC3824586 DOI: 10.1038/cdd.2013.115] [Citation(s) in RCA: 95] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2013] [Revised: 07/10/2013] [Accepted: 07/22/2013] [Indexed: 01/07/2023] Open

Lo K, Raftery AE, Dombek KM, Zhu J, Schadt EE, Bumgarner RE, Yeung KY. Integrating external biological knowledge in the construction of regulatory networks from time-series expression data. BMC Syst Biol 2012;6:101. [PMID: 22898396 PMCID: PMC3465231 DOI: 10.1186/1752-0509-6-101] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/25/2012] [Accepted: 07/24/2012] [Indexed: 01/27/2023]

Raftery AE, Niu X, Hoff PD, Yeung KY. Fast Inference for the Latent Space Network Model Using a Case-Control Approximate Likelihood. J Comput Graph Stat 2012;21:901-919. [PMID: 27570438 DOI: 10.1080/10618600.2012.679240] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]

Yeung KY, Gooley TA, Zhang A, Raftery AE, Radich JP, Oehler VG. Predicting relapse prior to transplantation in chronic myeloid leukemia by integrating expert knowledge and expression data. ACTA ACUST UNITED AC 2012;28:823-30. [PMID: 22296787 DOI: 10.1093/bioinformatics/bts059] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]

Zarbl H, Gallo MA, Glick J, Yeung KY, Vouros P. The vanishing zero revisited: thresholds in the age of genomics. Chem Biol Interact 2010;184:273-8. [PMID: 20109442 DOI: 10.1016/j.cbi.2010.01.031] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2009] [Revised: 01/11/2010] [Accepted: 01/18/2010] [Indexed: 10/19/2022]

Bumgarner RE, Yeung KY. Methods for the inference of biological pathways and networks. Methods Mol Biol 2009;541:225-45. [PMID: 19381545 DOI: 10.1007/978-1-59745-243-4_11] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]

Annest A, Bumgarner RE, Raftery AE, Yeung KY. Iterative Bayesian Model Averaging: a method for the application of survival analysis to high-dimensional microarray data. BMC Bioinformatics 2009;10:72. [PMID: 19245714 PMCID: PMC2657791 DOI: 10.1186/1471-2105-10-72] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2008] [Accepted: 02/26/2009] [Indexed: 11/17/2022] Open

Abstract

BACKGROUND

Microarray technology is increasingly used to identify potential biomarkers for cancer prognostics and diagnostics. Previously, we have developed the iterative Bayesian Model Averaging (BMA) algorithm for use in classification. Here, we extend the iterative BMA algorithm for application to survival analysis on high-dimensional microarray data. The main goal in applying survival analysis to microarray data is to determine a highly predictive model of patients' time to event (such as death, relapse, or metastasis) using a small number of selected genes. Our multivariate procedure combines the effectiveness of multiple contending models by calculating the weighted average of their posterior probability distributions. Our results demonstrate that our iterative BMA algorithm for survival analysis achieves high prediction accuracy while consistently selecting a small and cost-effective number of predictor genes.

RESULTS

We applied the iterative BMA algorithm to two cancer datasets: breast cancer and diffuse large B-cell lymphoma (DLBCL) data. On the breast cancer data, the algorithm selected a total of 15 predictor genes across 84 contending models from the training data. The maximum likelihood estimates of the selected genes and the posterior probabilities of the selected models from the training data were used to divide patients in the test (or validation) dataset into high- and low-risk categories. Using the genes and models determined from the training data, we assigned patients from the test data into highly distinct risk groups (as indicated by a p-value of 7.26e-05 from the log-rank test). Moreover, we achieved comparable results using only the 5 top selected genes with 100% posterior probabilities. On the DLBCL data, our iterative BMA procedure selected a total of 25 genes across 3 contending models from the training data. Once again, we assigned the patients in the validation set to significantly distinct risk groups (p-value = 0.00139).

CONCLUSION

The strength of the iterative BMA algorithm for survival analysis lies in its ability to account for model uncertainty. The results from this study demonstrate that our procedure selects a small number of genes while eclipsing other methods in predictive performance, making it a highly accurate and cost-effective prognostic tool in the clinical setting.

Collapse

Chu VT, Gottardo R, Raftery AE, Bumgarner RE, Yeung KY. MeV+R: using MeV as a graphical user interface for Bioconductor applications in microarray analysis. Genome Biol 2008;9:R118. [PMID: 18652698 PMCID: PMC2530872 DOI: 10.1186/gb-2008-9-7-r118] [Citation(s) in RCA: 79] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2008] [Revised: 06/01/2008] [Accepted: 07/24/2008] [Indexed: 11/10/2022] Open

Gottardo R, Raftery AE, Yeung KY, Bumgarner RE. Bayesian robust inference for differential gene expression in microarrays with multiple samples. Biometrics 2006;62:10-8. [PMID: 16542223 DOI: 10.1111/j.1541-0420.2005.00397.x] [Citation(s) in RCA: 68] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Liu X, Sivaganesan S, Yeung KY, Guo J, Bumgarner RE, Medvedovic M. Context-specific infinite mixtures for clustering gene expression profiles across diverse microarray dataset. Bioinformatics 2006;22:1737-44. [PMID: 16709591 PMCID: PMC1617036 DOI: 10.1093/bioinformatics/btl184] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Gottardo R, Raftery AE, Yeung KY, Bumgarner RE. Quality Control and Robust Estimation for cDNA Microarrays With Replicates. J Am Stat Assoc 2006. [DOI: 10.1198/016214505000001096] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]

Li Q, Fraley C, Bumgarner RE, Yeung KY, Raftery AE. Donuts, scratches and blanks: robust model-based segmentation of microarray images. Bioinformatics 2005;21:2875-82. [PMID: 15845656 DOI: 10.1093/bioinformatics/bti447] [Citation(s) in RCA: 95] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open

Yeung KY, Bumgarner RE, Raftery AE. Bayesian model averaging: development of an improved multi-class, gene selection and classification tool for microarray data. Bioinformatics 2005;21:2394-402. [PMID: 15713736 DOI: 10.1093/bioinformatics/bti319] [Citation(s) in RCA: 200] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Vanasse GJ, Winn RK, Rodov S, Zieske AW, Li JT, Tupper JC, Tang J, Raines EW, Peters MA, Yeung KY, Harlan JM. Bcl-2 Overexpression Leads to Increases in Suppressor of Cytokine Signaling-3 Expression in B Cells and De novo Follicular Lymphoma. Mol Cancer Res 2004. [DOI: 10.1158/1541-7786.620.2.11] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Abstract Abstract The t(14;18)(q32;q21), resulting in deregulated expression of B-cell-leukemia/lymphoma-2 (Bcl-2), represents the genetic hallmark in human follicular lymphomas. Substantial evidence supports the hypothesis that the t(14;18) and Bcl-2 overexpression are necessary but not solely responsible for neoplastic transformation and require cooperating genetic derangements for neoplastic transformation to occur. To investigate genes that cooperate with Bcl-2 to influence cellular signaling pathways important for neoplastic transformation, we used oligonucleotide microarrays to determine differential gene expression patterns in CD19+ B cells isolated from Eμ-Bcl-2 transgenic mice and wild-type littermate control mice. Fifty-seven genes were induced and 94 genes were repressed by ≥2-fold in Eμ-Bcl-2 transgenic mice (P < 0.05). The suppressor of cytokine signaling-3 (SOCS3) gene was found to be overexpressed 5-fold in B cells from Eμ-Bcl-2 transgenic mice. Overexpression of Bcl-2 in both mouse embryo fibroblast-1 and hematopoietic cell lines resulted in induction of SOCS3 protein, suggesting a Bcl-2-associated mechanism underlying SOCS3 induction. Immunohistochemistry with SOCS3 antisera on tissue from a cohort of patients with de novo follicular lymphoma revealed marked overexpression of SOCS3 protein that, within the follicular center cell region, was limited to neoplastic follicular lymphoma cells and colocalized with Bcl-2 expression in 9 of 12 de novo follicular lymphoma cases examined. In contrast, SOCS3 protein expression was not detected in the follicular center cell region of benign hyperplastic tonsil tissue. These data suggest that Bcl-2 overexpression leads to the induction of activated signal transducer and activator of transcription 3 (STAT3) and to the induction of SOCS3, which may contribute to the pathogenesis of follicular lymphoma. Collapse

Vanasse GJ, Winn RK, Rodov S, Zieske AW, Li JT, Tupper JC, Tang J, Raines EW, Peters MA, Yeung KY, Harlan JM. Bcl-2 overexpression leads to increases in suppressor of cytokine signaling-3 expression in B cells and de novo follicular lymphoma. Mol Cancer Res 2004;2:620-31. [PMID: 15561778] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/01/2023]

Abstract

The t(14;18)(q32;q21), resulting in deregulated expression of B-cell-leukemia/lymphoma-2 (Bcl-2), represents the genetic hallmark in human follicular lymphomas. Substantial evidence supports the hypothesis that the t(14;18) and Bcl-2 overexpression are necessary but not solely responsible for neoplastic transformation and require cooperating genetic derangements for neoplastic transformation to occur. To investigate genes that cooperate with Bcl-2 to influence cellular signaling pathways important for neoplastic transformation, we used oligonucleotide microarrays to determine differential gene expression patterns in CD19+ B cells isolated from Emu-Bcl-2 transgenic mice and wild-type littermate control mice. Fifty-seven genes were induced and 94 genes were repressed by > or =2-fold in Emu-Bcl-2 transgenic mice (P < 0.05). The suppressor of cytokine signaling-3 (SOCS3) gene was found to be overexpressed 5-fold in B cells from Emu-Bcl-2 transgenic mice. Overexpression of Bcl-2 in both mouse embryo fibroblast-1 and hematopoietic cell lines resulted in induction of SOCS3 protein, suggesting a Bcl-2-associated mechanism underlying SOCS3 induction. Immunohistochemistry with SOCS3 antisera on tissue from a cohort of patients with de novo follicular lymphoma revealed marked overexpression of SOCS3 protein that, within the follicular center cell region, was limited to neoplastic follicular lymphoma cells and colocalized with Bcl-2 expression in 9 of 12 de novo follicular lymphoma cases examined. In contrast, SOCS3 protein expression was not detected in the follicular center cell region of benign hyperplastic tonsil tissue. These data suggest that Bcl-2 overexpression leads to the induction of activated signal transducer and activator of transcription 3 (STAT3) and to the induction of SOCS3, which may contribute to the pathogenesis of follicular lymphoma.

Collapse

Yeung KY, Medvedovic M, Bumgarner RE. From co-expression to co-regulation: how many microarray experiments do we need? Genome Biol 2004;5:R48. [PMID: 15239833 PMCID: PMC463312 DOI: 10.1186/gb-2004-5-7-r48] [Citation(s) in RCA: 76] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2004] [Revised: 04/19/2004] [Accepted: 05/28/2004] [Indexed: 11/10/2022] Open

Medvedovic M, Yeung KY, Bumgarner RE. Bayesian mixture model based clustering of replicated microarray data. Bioinformatics 2004;20:1222-32. [PMID: 14871871 DOI: 10.1093/bioinformatics/bth068] [Citation(s) in RCA: 145] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Abstract

MOTIVATION

Identifying patterns of co-expression in microarray data by cluster analysis has been a productive approach to uncovering molecular mechanisms underlying biological processes under investigation. Using experimental replicates can generally improve the precision of the cluster analysis by reducing the experimental variability of measurements. In such situations, Bayesian mixtures allow for an efficient use of information by precisely modeling between-replicates variability.

RESULTS

We developed different variants of Bayesian mixture based clustering procedures for clustering gene expression data with experimental replicates. In this approach, the statistical distribution of microarray data is described by a Bayesian mixture model. Clusters of co-expressed genes are created from the posterior distribution of clusterings, which is estimated by a Gibbs sampler. We define infinite and finite Bayesian mixture models with different between-replicates variance structures and investigate their utility by analyzing synthetic and the real-world datasets. Results of our analyses demonstrate that (1) improvements in precision achieved by performing only two experimental replicates can be dramatic when the between-replicates variability is high, (2) precise modeling of intra-gene variability is important for accurate identification of co-expressed genes and (3) the infinite mixture model with the 'elliptical' between-replicates variance structure performed overall better than any other method tested. We also introduce a heuristic modification to the Gibbs sampler based on the 'reverse annealing' principle. This modification effectively overcomes the tendency of the Gibbs sampler to converge to different modes of the posterior distribution when started from different initial positions. Finally, we demonstrate that the Bayesian infinite mixture model with 'elliptical' variance structure is capable of identifying the underlying structure of the data without knowing the 'correct' number of clusters.

AVAILABILITY

The MS Windows based program named Gaussian Infinite Mixture Modeling (GIMM) implementing the Gibbs sampler and corresponding C++ code are available at http://homepages.uc.edu/~medvedm/GIMM.htm SUPPLEMENTAL INFORMATION: http://expression.microslu.washington.edu/expression/kayee/medvedovic2003/medvedovic_bioinf2003.html

Collapse

Yeung KY, Bumgarner RE. Multiclass classification of microarray data with repeated measurements: application to cancer. Genome Biol 2003;4:R83. [PMID: 14659020 PMCID: PMC329422 DOI: 10.1186/gb-2003-4-12-r83] [Citation(s) in RCA: 91] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2003] [Revised: 08/14/2003] [Accepted: 10/17/2003] [Indexed: 11/21/2022] Open

Yeung KY, Medvedovic M, Bumgarner RE. Clustering gene-expression data with repeated measurements. Genome Biol 2003;4:R34. [PMID: 12734014 PMCID: PMC156590 DOI: 10.1186/gb-2003-4-5-r34] [Citation(s) in RCA: 136] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2002] [Revised: 02/11/2003] [Accepted: 03/07/2003] [Indexed: 11/26/2022] Open

Barrett MT, Yeung KY, Ruzzo WL, Hsu L, Blount PL, Sullivan R, Zarbl H, Delrow J, Rabinovitch PS, Reid BJ. Transcriptional analyses of Barrett's metaplasia and normal upper GI mucosae. Neoplasia 2002;4:121-8. [PMID: 11896567 PMCID: PMC1550324 DOI: 10.1038/sj.neo.7900221] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2001] [Accepted: 09/14/2001] [Indexed: 12/29/2022]

Yeung KY, Baum L, Chan WM, Lam DS, Kwok AK, Pang CP. Molecular diagnostics for retinitis pigmentosa. Clin Chim Acta 2001;313:209-15. [PMID: 11694261 DOI: 10.1016/s0009-8981(01)00674-x] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Yeung KY, Fraley C, Murua A, Raftery AE, Ruzzo WL. Model-based clustering and data transformations for gene expression data. Bioinformatics 2001;17:977-87. [PMID: 11673243 DOI: 10.1093/bioinformatics/17.10.977] [Citation(s) in RCA: 594] [Impact Index Per Article: 25.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Abstract

MOTIVATION

Clustering is a useful exploratory technique for the analysis of gene expression data. Many different heuristic clustering algorithms have been proposed in this context. Clustering algorithms based on probability models offer a principled alternative to heuristic algorithms. In particular, model-based clustering assumes that the data is generated by a finite mixture of underlying probability distributions such as multivariate normal distributions. The issues of selecting a 'good' clustering method and determining the 'correct' number of clusters are reduced to model selection problems in the probability framework. Gaussian mixture models have been shown to be a powerful tool for clustering in many applications.

RESULTS

We benchmarked the performance of model-based clustering on several synthetic and real gene expression data sets for which external evaluation criteria were available. The model-based approach has superior performance on our synthetic data sets, consistently selecting the correct model and the number of clusters. On real expression data, the model-based approach produced clusters of quality comparable to a leading heuristic clustering algorithm, but with the key advantage of suggesting the number of clusters and an appropriate model. We also explored the validity of the Gaussian mixture assumption on different transformations of real data. We also assessed the degree to which these real gene expression data sets fit multivariate Gaussian distributions both before and after subjecting them to commonly used data transformations. Suitably chosen transformations seem to result in reasonable fits.

AVAILABILITY

MCLUST is available at http://www.stat.washington.edu/fraley/mclust. The software for the diagonal model is under development.

CONTACT

kayee@cs.washington.edu.

SUPPLEMENTARY INFORMATION

http://www.cs.washington.edu/homes/kayee/model.

Collapse

Chan WM, Yeung KY, Pang CP, Baum L, Lau TC, Kwok AK, Lam DS. Rhodopsin mutations in Chinese patients with retinitis pigmentosa. Br J Ophthalmol 2001;85:1046-8. [PMID: 11520753 PMCID: PMC1724134 DOI: 10.1136/bjo.85.9.1046] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]

Yeung KY, Ruzzo WL. Principal component analysis for clustering gene expression data. Bioinformatics 2001;17:763-74. [PMID: 11590094 DOI: 10.1093/bioinformatics/17.9.763] [Citation(s) in RCA: 456] [Impact Index Per Article: 19.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Baum L, Chan WM, Yeung KY, Lam DS, Kwok AK, Pang CP. RP1 in Chinese: Eight novel variants and evidence that truncation of the extreme C-terminal does not cause retinitis pigmentosa. Hum Mutat 2001;17:436. [PMID: 11317367 DOI: 10.1002/humu.1127] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Yeung KY, Barrett M, Delrow J, Blount P, Reid B, Rabinovitch P. Transcriptional analysis of Barrett's epithelium and normal gastrointestinal tissues. Nat Genet 2001. [DOI: 10.1038/87376] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Yeung KY, Haynor DR, Ruzzo WL. Validating clustering for gene expression data. Bioinformatics 2001;17:309-18. [PMID: 11301299 DOI: 10.1093/bioinformatics/17.4.309] [Citation(s) in RCA: 463] [Impact Index Per Article: 20.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Ahlgren JD, Ellison NM, Gottlieb RJ, Laluna F, Lokich JJ, Sinclair PR, Ueno W, Wampler GL, Yeung KY, Alt D. Hormonal palliation of chemoresistant ovarian cancer: three consecutive phase II trials of the Mid-Atlantic Oncology Program. J Clin Oncol 1993;11:1957-68. [PMID: 7691999 DOI: 10.1200/jco.1993.11.10.1957] [Citation(s) in RCA: 82] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open

Cheung WH, Ha DK, Yeung KY, Hung RP. Methods for enumerating Escherichia coli in subtropical waters. Epidemiol Infect 1991;106:345-54. [PMID: 2019302 PMCID: PMC2272005 DOI: 10.1017/s0950268800048494] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open