Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Zhang H, Tsai CP, Yu CY, Bonney G. Tree-based linkage and association analyses of asthma. Genet Epidemiol 2002;21 Suppl 1:S317-22. [PMID: 11793691 DOI: 10.1002/gepi.2001.21.s1.s317] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

For:	Zhang H, Tsai CP, Yu CY, Bonney G. Tree-based linkage and association analyses of asthma. Genet Epidemiol 2002;21 Suppl 1:S317-22. [PMID: 11793691 DOI: 10.1002/gepi.2001.21.s1.s317] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Number

Cited by Other Article(s)

Song C, Zhang H. TARV: tree-based analysis of rare variants identifying risk modifying variants in CTNNA2 and CNTNAP2 for alcohol addiction. Genet Epidemiol 2014;38:552-9. [PMID: 25041903 DOI: 10.1002/gepi.21843] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2014] [Revised: 06/02/2014] [Accepted: 06/16/2014] [Indexed: 12/18/2022]

Amos C, George V, Bailey-Wilson J, Demenais F. George Bonney (1947-2013) Remembered. Genet Epidemiol 2013. [DOI: 10.1002/gepi.21780] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Lunetta KL, Hayward LB, Segal J, Van Eerdewegh P. Screening large-scale association study data: exploiting interactions using random forests. BMC Genet 2004. [PMID: 15588316 DOI: 10.1186/1471‐2156‐5‐32] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Genome-wide association studies for complex diseases will produce genotypes on hundreds of thousands of single nucleotide polymorphisms (SNPs). A logical first approach to dealing with massive numbers of SNPs is to use some test to screen the SNPs, retaining only those that meet some criterion for further study. For example, SNPs can be ranked by p-value, and those with the lowest p-values retained. When SNPs have large interaction effects but small marginal effects in a population, they are unlikely to be retained when univariate tests are used for screening. However, model-based screens that pre-specify interactions are impractical for data sets with thousands of SNPs. Random forest analysis is an alternative method that produces a single measure of importance for each predictor variable that takes into account interactions among variables without requiring model specification. Interactions increase the importance for the individual interacting variables, making them more likely to be given high importance relative to other variables. We test the performance of random forests as a screening procedure to identify small numbers of risk-associated SNPs from among large numbers of unassociated SNPs using complex disease models with up to 32 loci, incorporating both genetic heterogeneity and multi-locus interaction.

RESULTS

Keeping other factors constant, if risk SNPs interact, the random forest importance measure significantly outperforms the Fisher Exact test as a screening tool. As the number of interacting SNPs increases, the improvement in performance of random forest analysis relative to Fisher Exact test for screening also increases. Random forests perform similarly to the univariate Fisher Exact test as a screening tool when SNPs in the analysis do not interact.

CONCLUSIONS

In the context of large-scale genetic association studies where unknown interactions exist among true risk-associated SNPs or SNPs and environmental covariates, screening SNPs using random forest analyses can significantly reduce the number of SNPs that need to be retained for further study compared to standard univariate screening methods.

Collapse

Lunetta KL, Hayward LB, Segal J, Van Eerdewegh P. Screening large-scale association study data: exploiting interactions using random forests. BMC Genet 2004;5:32. [PMID: 15588316 PMCID: PMC545646 DOI: 10.1186/1471-2156-5-32] [Citation(s) in RCA: 264] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2004] [Accepted: 12/10/2004] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

RESULTS

CONCLUSIONS

Collapse

Costello TJ, Falk CT, Ye KQ. Data mining and computationally intensive methods: summary of Group 7 contributions to Genetic Analysis Workshop 13. Genet Epidemiol 2004;25 Suppl 1:S57-63. [PMID: 14635170 DOI: 10.1002/gepi.10285] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Ghazalpour A, Doss S, Yang X, Aten J, Toomey EM, Van Nas A, Wang S, Drake TA, Lusis AJ. Thematic review series: The pathogenesis of atherosclerosis. Toward a biological network for atherosclerosis. J Lipid Res 2004;45:1793-805. [PMID: 15292376 DOI: 10.1194/jlr.r400006-jlr200] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open

Bureau A, Dupuis J, Hayward B, Falls K, Van Eerdewegh P. Mapping complex traits using Random Forests. BMC Genet 2003. [PMID: 14975132 DOI: 10.1186/1471‐2156‐4‐s1‐s64] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/03/2023] Open

Bureau A, Dupuis J, Hayward B, Falls K, Van Eerdewegh P. Mapping complex traits using Random Forests. BMC Genet 2003;4 Suppl 1:S64. [PMID: 14975132 PMCID: PMC1866502 DOI: 10.1186/1471-2156-4-s1-s64] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open

Hoh J, Ott J. Mathematical multi-locus approaches to localizing complex human trait genes. Nat Rev Genet 2003;4:701-9. [PMID: 12951571 DOI: 10.1038/nrg1155] [Citation(s) in RCA: 210] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Yeh CB, Leckman JF, Wan FJ, Shiah IS, Lu RB. Characteristics of acute stress symptoms and nitric oxide concentration in young rescue workers in Taiwan. Psychiatry Res 2002;112:59-68. [PMID: 12379451 DOI: 10.1016/s0165-1781(02)00179-8] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Zhang H, Leckman JF, Pauls DL, Tsai CP, Kidd KK, Campos MR. Genomewide scan of hoarding in sib pairs in which both sibs have Gilles de la Tourette syndrome. Am J Hum Genet 2002;70:896-904. [PMID: 11840360 PMCID: PMC379118 DOI: 10.1086/339520] [Citation(s) in RCA: 136] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2001] [Accepted: 01/11/2002] [Indexed: 11/04/2022] Open