Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Moretti S, van Leeuwen D, Gmuender H, Bonassi S, van Delft J, Kleinjans J, Patrone F, Merlo DF. Combining Shapley value and statistics to the analysis of gene expression data in children exposed to air pollution. BMC Bioinformatics 2008;9:361. [PMID: 18764936 PMCID: PMC2556684 DOI: 10.1186/1471-2105-9-361] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.4] [Received: 01/28/2008] [Accepted: 09/02/2008] [Indexed: 02/04/2023] Open

Cited by Other Article(s)

Paylar B, Längkvist M, Jass J, Olsson PE. Utilization of Computer Classification Methods for Exposure Prediction and Gene Selection in Daphnia magna Toxicogenomics. BIOLOGY 2023;12:biology12050692. [PMID: 37237504 DOI: 10.3390/biology12050692] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/10/2023] [Revised: 05/02/2023] [Accepted: 05/06/2023] [Indexed: 05/28/2023]

Balestra C, Maj C, Müller E, Mayr A. Redundancy-aware unsupervised ranking based on game theory: Ranking pathways in collections of gene sets. PLoS One 2023;18:e0282699. [PMID: 36893181 PMCID: PMC9997904 DOI: 10.1371/journal.pone.0282699] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/17/2022] [Accepted: 02/13/2023] [Indexed: 03/10/2023] Open

Abstract

In genetics, gene sets are grouped in collections according to their biological function. This often leads to high-dimensional, overlapping, and redundant families of sets, thus precluding a straightforward interpretation of their biological meaning. In data mining, it is often argued that techniques to reduce the dimensionality of data could increase the maneuverability and consequently the interpretability of large data sets. In the past years, moreover, we have witnessed a growing awareness of the importance of understanding data and of interpretable models in the machine learning and bioinformatics communities. On the one hand, there exist techniques aiming to aggregate overlapping gene sets to create larger pathways. While these methods could partly solve the problem of the collections' large size, modifying biological pathways is hardly justifiable in this biological context. On the other hand, the representation methods proposed so far to increase the interpretability of collections of gene sets have proved to be insufficient. Inspired by this bioinformatics context, we propose a method to rank sets within a family of sets based on the distribution of the singletons and their size. We obtain sets' importance scores by computing Shapley values; by making use of microarray games, we do not incur the typical exponential computational complexity. Moreover, we address the challenge of constructing redundancy-aware rankings where, in our case, redundancy is a quantity proportional to the size of intersections among the sets in the collections. We use the obtained rankings to reduce the dimension of the families, therefore showing lower redundancy among sets while still preserving a high coverage of their elements.
We finally evaluate our approach for collections of gene sets and apply Gene Set Enrichment Analysis techniques to the now smaller collections. As expected, the unsupervised nature of the proposed rankings leads to only unremarkable differences in the number of significant gene sets for specific phenotypic traits. In contrast, the number of performed statistical tests can be drastically reduced. The proposed rankings show practical utility in bioinformatics for increasing the interpretability of collections of gene sets, and they are a step toward including redundancy-awareness in Shapley value computations.
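
The abstract above states that microarray games avoid the exponential cost usually associated with Shapley values. The following is a minimal Python sketch of why that can hold, assuming the standard microarray-game definition: the worth of a coalition is the fraction of samples whose support set (the genes flagged as abnormal in that sample) is contained in the coalition. Under that assumption, each sample simply splits one unit of credit equally among the members of its support. The function names and the toy data are hypothetical, samples with an empty support are skipped by assumption, and the sketch does not implement the redundancy-aware ranking the authors propose.

```python
from itertools import combinations
from math import factorial


def microarray_shapley(supports, players):
    """Closed-form Shapley values for a microarray game (linear in the input size).

    supports: list of sets, one per sample (hypothetical toy input).
    Each non-empty support shares 1/len(supports) equally among its members.
    """
    n_samples = len(supports)
    phi = {p: 0.0 for p in players}
    for support in supports:
        if not support:  # assumption: empty supports contribute nothing
            continue
        share = 1.0 / (len(support) * n_samples)
        for p in support:
            phi[p] += share
    return phi


def brute_force_shapley(supports, players):
    """Reference computation straight from the Shapley formula (exponential)."""

    def worth(coalition):
        members = set(coalition)
        hits = sum(1 for s in supports if s and s <= members)
        return hits / len(supports)

    n = len(players)
    phi = {}
    for p in players:
        others = [q for q in players if q != p]
        total = 0.0
        for k in range(n):
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                total += weight * (worth(subset + (p,)) - worth(subset))
        phi[p] = total
    return phi


# Toy data: three samples over three genes (illustrative only).
players = ["g1", "g2", "g3"]
supports = [{"g1", "g2"}, {"g1"}, {"g2", "g3"}]

fast = microarray_shapley(supports, players)
slow = brute_force_shapley(supports, players)
for gene in players:
    print(gene, round(fast[gene], 4), round(slow[gene], 4))
```

On the toy data the two functions agree, which illustrates the shortcut: the closed form visits each sample once, while the reference version enumerates every coalition.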

Serra F, Bottini S, Pratella D, Stathopoulou MG, Sebille W, El-Hami L, Repetto E, Mauduit C, Benahmed M, Grandjean V, Trabucchi M. Systemic CLIP-seq analysis and game theory approach to model microRNA mode of binding. Nucleic Acids Res 2021;49:e66. [PMID: 33823551 PMCID: PMC8216473 DOI: 10.1093/nar/gkab198] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 09/08/2020] [Revised: 02/19/2021] [Accepted: 03/10/2021] [Indexed: 12/18/2022] Open

Sun MW, Moretti S, Paskov KM, Stockham NT, Varma M, Chrisman BS, Washington PY, Jung JY, Wall DP. Game theoretic centrality: a novel approach to prioritize disease candidate genes by combining biological networks with the Shapley value. BMC Bioinformatics 2020;21:356. [PMID: 32787845 PMCID: PMC7430867 DOI: 10.1186/s12859-020-03693-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 12/06/2019] [Accepted: 07/21/2020] [Indexed: 11/13/2022] Open

Sun MW, Gupta A, Varma M, Paskov KM, Jung JY, Stockham NT, Wall DP. Coalitional Game Theory Facilitates Identification of Non-Coding Variants Associated With Autism. BIOMEDICAL INFORMATICS INSIGHTS 2019;11:1178222619832859. [PMID: 30886520 PMCID: PMC6410388 DOI: 10.1177/1178222619832859] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Received: 12/04/2018] [Accepted: 12/17/2018] [Indexed: 12/18/2022]

Abstract

Studies on autism spectrum disorder (ASD) have amassed substantial evidence for the role of genetics in the disease's phenotypic manifestation. A large number of coding and non-coding variants with low penetrance likely act in a combinatorial manner to explain the variable forms of ASD. However, many of these combined interactions, both additive and epistatic, remain undefined. Coalitional game theory (CGT) is an approach that seeks to identify players (individual genetic variants or genes) who tend to improve the performance (the association with a disease phenotype of interest) of any coalition (subset of co-occurring genetic variants) they join. This method has been previously applied to boost biologically informative signal from gene expression data and exome sequencing data but remains to be explored in the context of cooperativity among non-coding genomic regions. We describe our extension of previous work, highlighting non-coding chromosomal regions relevant to ASD using CGT on alteration data of 4595 fully sequenced genomes from 756 multiplex families. Genomes were encoded into binary matrices for three types of non-coding regions previously implicated in ASD and separated into ASD (case) and unaffected (control) samples. A player metric, the Shapley value, enabled determination of individual variant contributions in both cohorts. A total of 30 non-coding positions were found to have significantly elevated player scores and likely represent significant contributors to the genetic coordination underlying ASD. Cross-study analyses revealed that a subset of mutated non-coding regions (all of which are in human accelerated regions (HARs)) and related genes are involved in biological pathways or behavioral outcomes known to be affected in autism, suggesting the importance of single nucleotide polymorphisms (SNPs) within HARs in ASD.
These findings support the use of CGT in identifying hidden yet influential non-coding players from large-scale genomic data, to better understand the precise underpinnings of complex neurodevelopmental disorders such as autism.

Coalitional game theory as a promising approach to identify candidate autism genes. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2018;23:436-447. [PMID: 29218903 PMCID: PMC6055932] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/24/2022]

Abstract

Despite mounting evidence for the strong role of genetics in the phenotypic manifestation of Autism Spectrum Disorder (ASD), the specific genes responsible for the variable forms of ASD remain undefined. ASD may be best explained by a combinatorial genetic model with varying epistatic interactions across many small effect mutations. Coalitional or cooperative game theory is a technique that studies the combined effects of groups of players, known as coalitions, seeking to identify players who tend to improve the performance (the relationship to a specific disease phenotype) of any coalition they join. This method has been previously shown to boost biologically informative signal in gene expression data but to date has not been applied to the search for cooperative mutations among putative ASD genes. We describe our approach to highlight genes relevant to ASD using coalitional game theory on alteration data of 1,965 fully sequenced genomes from 756 multiplex families. Alterations were encoded into binary matrices for ASD (case) and unaffected (control) samples, indicating likely gene-disrupting, inherited mutations in altered genes. To determine individual gene contributions given an ASD phenotype, a "player" metric, referred to as the Shapley value, was calculated for each gene in the case and control cohorts. Sixty-seven genes were found to have significantly elevated player scores and likely represent significant contributors to the genetic coordination underlying ASD. Using network and cross-study analysis, we found that these genes are involved in biological pathways known to be affected in the autism cases and that a subset directly interact with several genes known to have strong associations to autism. These findings suggest that coalitional game theory can be applied to large-scale genomic data to identify hidden yet influential players in complex polygenic disorders such as autism.

Optimal and Novel Hybrid Feature Selection Framework for Effective Data Classification. ACTA ACUST UNITED AC 2017. [DOI: 10.1007/978-981-10-4762-6_48] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Indexed: 04/12/2023]

Sasikala S, Appavu alias Balamurugan S, Geetha S. A novel adaptive feature selector for supervised classification. INFORM PROCESS LETT 2017. [DOI: 10.1016/j.ipl.2016.08.003] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Indexed: 11/16/2022]

Sochat V, David M, Wall DP. Translational Meta-analytical Methods to Localize the Regulatory Patterns of Neurological Disorders in the Human Brain. AMIA ... ANNUAL SYMPOSIUM PROCEEDINGS. AMIA SYMPOSIUM 2015;2015:2073-2082. [PMID: 26958307 PMCID: PMC4765688] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/05/2023]

Fagnocchi L, Bottini S, Golfieri G, Fantappiè L, Ferlicca F, Antunes A, Guadagnuolo S, Del Tordello E, Siena E, Serruto D, Scarlato V, Muzzi A, Delany I. Global transcriptome analysis reveals small RNAs affecting Neisseria meningitidis bacteremia. PLoS One 2015;10:e0126325. [PMID: 25951061 PMCID: PMC4423775 DOI: 10.1371/journal.pone.0126325] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Received: 08/19/2014] [Accepted: 03/31/2015] [Indexed: 12/11/2022] Open

A Novel Feature Selection Technique for Improved Survivability Diagnosis of Breast Cancer. ACTA ACUST UNITED AC 2015. [DOI: 10.1016/j.procs.2015.04.005] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Indexed: 11/20/2022]

Camargo-Rodriguez AV, Kim JT. DoGeNetS: using optimisation to discriminate regulatory network topologies based on gene expression data. IET Syst Biol 2012;6:1-8. [PMID: 22360266 DOI: 10.1049/iet-syb.2011.0004] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Indexed: 11/20/2022] Open

Sajitz-Hermstein M, Nikoloski Z. Restricted cooperative games on metabolic networks reveal functionally important reactions. J Theor Biol 2012;314:192-203. [PMID: 22940237 DOI: 10.1016/j.jtbi.2012.08.018] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Received: 09/15/2011] [Revised: 08/02/2012] [Accepted: 08/16/2012] [Indexed: 11/26/2022]

Moretti S, Vasilakos AV. An overview of recent applications of Game Theory to bioinformatics. Inf Sci (N Y) 2010. [DOI: 10.1016/j.ins.2010.07.019] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Indexed: 11/27/2022]

Moretti S, Fragnelli V, Patrone F, Bonassi S. Using coalitional games on biological networks to measure centrality and power of genes. Bioinformatics 2010;26:2721-30. [PMID: 20817743 DOI: 10.1093/bioinformatics/btq508] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Indexed: 11/12/2022] Open

Craig PM, Hogstrand C, Wood CM, McClelland GB. Gene expression endpoints following chronic waterborne copper exposure in a genomic model organism, the zebrafish, Danio rerio. Physiol Genomics 2009;40:23-33. [PMID: 19789285 DOI: 10.1152/physiolgenomics.00089.2009] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.4] [Indexed: 11/22/2022] Open