1
|
Russell M, Aqi A, Saitou M, Gokcumen O, Masuda N. Gene communities in co-expression networks across different tissues. ARXIV 2023:arXiv:2305.12963v2. [PMID: 37292479 PMCID: PMC10246089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
With the recent availability of tissue-specific gene expression data, e.g., provided by the GTEx Consortium, there is interest in comparing gene co-expression patterns across tissues. One promising approach to this problem is to use a multilayer network analysis framework and perform multilayer community detection. Communities in gene co-expression networks reveal groups of genes similarly expressed across individuals, potentially involved in related biological processes responding to specific environmental stimuli or sharing common regulatory variations. We construct a multilayer network in which each of the four layers is an exocrine gland tissue-specific gene co-expression network. We develop methods for multilayer community detection with correlation matrix input and an appropriate null model. Our correlation matrix input method identifies five groups of genes that are similarly co-expressed in multiple tissues (a community that spans multiple layers, which we call a generalist community) and two groups of genes that are co-expressed in just one tissue (a community that lies primarily within just one layer, which we call a specialist community). We further found gene co-expression communities where the genes physically cluster across the genome significantly more than expected by chance (on chromosomes 1 and 11). This clustering hints at underlying regulatory elements determining similar expression patterns across individuals and cell types. We suggest that KRTAP3-1, KRTAP3-3, and KRTAP3-5 share regulatory elements in skin and pancreas. Furthermore, we find that CELA3A and CELA3B share associated expression quantitative trait loci in the pancreas. The results indicate that our multilayer community detection method for correlation matrix input extracts biologically interesting communities of genes.
Collapse
Affiliation(s)
| | - Alber Aqi
- Department of Biological Sciences, University at Buffalo
| | - Marie Saitou
- Faculty of Biosciences, Norwegian University of Life Sciences
| | - Omer Gokcumen
- Department of Biological Sciences, University at Buffalo
| | - Naoki Masuda
- Department of Mathematics, University at Buffalo
- Institute for Artificial Intelligence and Data Science, University at Buffalo
| |
Collapse
|
2
|
Vasquez PA, Walker B, Bloom K, Kolbin D, Caughman N, Freeman R, Lysy M, Hult C, Newhall KA, Papanikolas M, Edelmaier C, Forest MG. The power of weak, transient interactions across biology: A paradigm of emergent behavior. PHYSICA D. NONLINEAR PHENOMENA 2023; 454:133866. [PMID: 38274029 PMCID: PMC10806540 DOI: 10.1016/j.physd.2023.133866] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/27/2024]
Abstract
A growing list of diverse biological systems and their equally diverse functionalities provides realizations of a paradigm of emergent behavior. In each of these biological systems, pervasive ensembles of weak, short-lived, spatially local interactions act autonomously to convey functionalities at larger spatial and temporal scales. In this article, a range of diverse systems and functionalities are presented in a cursory manner with literature citations for further details. Then two systems and their properties are discussed in more detail: yeast chromosome biology and human respiratory mucus.
Collapse
Affiliation(s)
- Paula A. Vasquez
- Department of Mathematics, University of South Carolina, United States of America
| | - Ben Walker
- Department of Mathematics, University of California at Irvine, United States of America
| | - Kerry Bloom
- Department of Biology, University of North Carolina at Chapel Hill, United States of America
| | - Daniel Kolbin
- Department of Biology, University of North Carolina at Chapel Hill, United States of America
| | - Neall Caughman
- Department of Mathematics, University of North Carolina at Chapel Hill, United States of America
| | - Ronit Freeman
- Department of Applied Physical Sciences, University of North Carolina at Chapel Hill, United States of America
| | - Martin Lysy
- Department of Statistics and Actuarial Science, University of Waterloo, Canada
| | - Caitlin Hult
- Department of Mathematics, Gettysburg College, United States of America
| | - Katherine A. Newhall
- Department of Mathematics, University of North Carolina at Chapel Hill, United States of America
| | - Micah Papanikolas
- Department of Applied Physical Sciences, University of North Carolina at Chapel Hill, United States of America
| | - Christopher Edelmaier
- Department of Applied Physical Sciences, University of North Carolina at Chapel Hill, United States of America
- Center for Computational Biology, Flatiron Institute, United States of America
| | - M. Gregory Forest
- Department of Mathematics, University of North Carolina at Chapel Hill, United States of America
- Department of Applied Physical Sciences, University of North Carolina at Chapel Hill, United States of America
| |
Collapse
|
3
|
Russell M, Aqil A, Saitou M, Gokcumen O, Masuda N. Gene communities in co-expression networks across different tissues. PLoS Comput Biol 2023; 19:e1011616. [PMID: 37976327 PMCID: PMC10691702 DOI: 10.1371/journal.pcbi.1011616] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2023] [Revised: 12/01/2023] [Accepted: 10/19/2023] [Indexed: 11/19/2023] Open
Abstract
With the recent availability of tissue-specific gene expression data, e.g., provided by the GTEx Consortium, there is interest in comparing gene co-expression patterns across tissues. One promising approach to this problem is to use a multilayer network analysis framework and perform multilayer community detection. Communities in gene co-expression networks reveal groups of genes similarly expressed across individuals, potentially involved in related biological processes responding to specific environmental stimuli or sharing common regulatory variations. We construct a multilayer network in which each of the four layers is an exocrine gland tissue-specific gene co-expression network. We develop methods for multilayer community detection with correlation matrix input and an appropriate null model. Our correlation matrix input method identifies five groups of genes that are similarly co-expressed in multiple tissues (a community that spans multiple layers, which we call a generalist community) and two groups of genes that are co-expressed in just one tissue (a community that lies primarily within just one layer, which we call a specialist community). We further found gene co-expression communities where the genes physically cluster across the genome significantly more than expected by chance (on chromosomes 1 and 11). This clustering hints at underlying regulatory elements determining similar expression patterns across individuals and cell types. We suggest that KRTAP3-1, KRTAP3-3, and KRTAP3-5 share regulatory elements in skin and pancreas. Furthermore, we find that CELA3A and CELA3B share associated expression quantitative trait loci in the pancreas. The results indicate that our multilayer community detection method for correlation matrix input extracts biologically interesting communities of genes.
Collapse
Affiliation(s)
- Madison Russell
- Department of Mathematics, State University of New York at Buffalo, Buffalo, New York, United States of America
| | - Alber Aqil
- Department of Biological Sciences, State University of New York at Buffalo, Buffalo, New York, United States of America
| | - Marie Saitou
- Faculty of Biosciences, Norwegian University of Life Sciences, Ås, Norway
| | - Omer Gokcumen
- Department of Biological Sciences, State University of New York at Buffalo, Buffalo, New York, United States of America
| | - Naoki Masuda
- Department of Mathematics, State University of New York at Buffalo, Buffalo, New York, United States of America
- Institute for Artificial Intelligence and Data Science, State University of New York at Buffalo, Buffalo, New York, United States of America
| |
Collapse
|
4
|
Lokmanoglu AD, Nisbet EC, Osborne MT, Tien J, Malloy S, Cueva Chacón L, Villa Turek E, Abhari R. Social Media Sentiment about COVID-19 Vaccination Predicts Vaccine Acceptance among Peruvian Social Media Users the Next Day. Vaccines (Basel) 2023; 11:817. [PMID: 37112729 PMCID: PMC10146388 DOI: 10.3390/vaccines11040817] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Revised: 03/30/2023] [Accepted: 04/04/2023] [Indexed: 04/29/2023] Open
Abstract
Drawing upon theories of risk and decision making, we present a theoretical framework for how the emotional attributes of social media content influence risk behaviors. We apply our framework to understanding how COVID-19 vaccination Twitter posts influence acceptance of the vaccine in Peru, the country with the highest relative number of COVID-19 excess deaths. By employing computational methods, topic modeling, and vector autoregressive time series analysis, we show that the prominence of expressed emotions about COVID-19 vaccination in social media content is associated with the daily percentage of Peruvian social media survey respondents who are vaccine-accepting over 231 days. Our findings show that net (positive) sentiment and trust emotions expressed in tweets about COVID-19 are positively associated with vaccine acceptance among survey respondents one day after the post occurs. This study demonstrates that the emotional attributes of social media content, besides veracity or informational attributes, may influence vaccine acceptance for better or worse based on its valence.
Collapse
Affiliation(s)
- Ayse D. Lokmanoglu
- Department of Communication Studies, Northwestern University, Evanston, IL 60208, USA
| | - Erik C. Nisbet
- Department of Communication Studies, Northwestern University, Evanston, IL 60208, USA
| | - Matthew T. Osborne
- Department of Mathematics, The Ohio State University, Columbus, OH 43210, USA
| | - Joseph Tien
- Department of Mathematics, The Ohio State University, Columbus, OH 43210, USA
| | | | - Lourdes Cueva Chacón
- School of Journalism and Media Studies, San Diego State University, San Diego, CA 92182, USA
| | - Esteban Villa Turek
- Department of Communication Studies, Northwestern University, Evanston, IL 60208, USA
| | - Rod Abhari
- Department of Communication Studies, Northwestern University, Evanston, IL 60208, USA
| |
Collapse
|
5
|
Cho W, Lee D, Kim BJ. A multiresolution framework for the analysis of community structure in international trade networks. Sci Rep 2023; 13:5721. [PMID: 37029219 PMCID: PMC10082076 DOI: 10.1038/s41598-023-32686-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2023] [Accepted: 03/31/2023] [Indexed: 04/09/2023] Open
Abstract
International trade networks are complex systems that consist of overlapping multiple trade blocs of varying sizes. However, the resulting structures of community detection in trade networks often fail to accurately represent the complexity of international trade. To address this issue, we propose a multiresolution framework that integrates information from a range of resolutions to consider trade communities of different sizes and reveal the hierarchical structure of trade networks and their constituent blocks. In addition, we introduce a measure called multiresolution membership inconsistency for each country, which demonstrates the positive correlation between a country's structural inconsistency in terms of network topology and its vulnerability to external intervention in terms of economic and security functioning. Our findings show that network science-based approaches can effectively capture the complex interdependencies between countries and provide new metrics for evaluating the characteristics and behaviors of countries in both economic and political contexts.
Collapse
Affiliation(s)
- Wonguk Cho
- Graduate School of Data Science, Seoul National University, Seoul, 08826, Republic of Korea
- Department of Physics, Sungkyunkwan University, Suwon, 16419, Republic of Korea
| | - Daekyung Lee
- Department of Energy Technology, Korea Institute of Energy Technology, Naju, 58322, Republic of Korea
| | - Beom Jun Kim
- Department of Physics, Sungkyunkwan University, Suwon, 16419, Republic of Korea.
| |
Collapse
|
6
|
Finite-state parameter space maps for pruning partitions in modularity-based community detection. Sci Rep 2022; 12:15928. [PMID: 36151268 PMCID: PMC9508178 DOI: 10.1038/s41598-022-20142-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Accepted: 09/09/2022] [Indexed: 11/08/2022] Open
Abstract
Partitioning networks into communities of densely connected nodes is an important tool used widely across different applications, with numerous methods and software packages available for community detection. Modularity-based methods require parameters to be selected (or assume defaults) to control the resolution and, in multilayer networks, interlayer coupling. Meanwhile, most useful algorithms are heuristics yielding different near-optimal results upon repeated runs (even at the same parameters). To address these difficulties, we combine recent developments into a simple-to-use framework for pruning a set of partitions to a subset that are self-consistent by an equivalence with the objective function for inference of a degree-corrected planted partition stochastic block model (SBM). Importantly, this combined framework reduces some of the problems associated with the stochasticity that is inherent in the use of heuristics for optimizing modularity. In our examples, the pruning typically highlights only a small number of partitions that are fixed points of the corresponding map on the set of somewhere-optimal partitions in the parameter space. We also derive resolution parameter upper bounds for fitting a constrained SBM of K blocks and demonstrate that these bounds hold in practice, further guiding parameter space regions to consider. With publicly available code ( http://github.com/ragibson/ModularityPruning ), our pruning procedure provides a new baseline for using modularity-based community detection in practice.
Collapse
|
7
|
Chen AA, Srinivasan D, Pomponio R, Fan Y, Nasrallah IM, Resnick SM, Beason-Held LL, Davatzikos C, Satterthwaite TD, Bassett DS, Shinohara RT, Shou H. Harmonizing functional connectivity reduces scanner effects in community detection. Neuroimage 2022; 256:119198. [PMID: 35421567 PMCID: PMC9202339 DOI: 10.1016/j.neuroimage.2022.119198] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2021] [Revised: 04/06/2022] [Accepted: 04/07/2022] [Indexed: 12/12/2022] Open
Abstract
Community detection on graphs constructed from functional magnetic resonance imaging (fMRI) data has led to important insights into brain functional organization. Large studies of brain community structure often include images acquired on multiple scanners across different studies. Differences in scanner can introduce variability into the downstream results, and these differences are often referred to as scanner effects. Such effects have been previously shown to significantly impact common network metrics. In this study, we identify scanner effects in data-driven community detection results and related network metrics. We assess a commonly employed harmonization method and propose new methodology for harmonizing functional connectivity that leverage existing knowledge about network structure as well as patterns of covariance in the data. Finally, we demonstrate that our new methods reduce scanner effects in community structure and network metrics. Our results highlight scanner effects in studies of brain functional organization and provide additional tools to address these unwanted effects. These findings and methods can be incorporated into future functional connectivity studies, potentially preventing spurious findings and improving reliability of results.
Collapse
Affiliation(s)
- Andrew A Chen
- Penn Statistics in Imaging and Visualization Center, Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, PA 19104, USA; Center for Biomedical Image Computing and Analytics, University of Pennsylvania, Philadelphia, PA 19104, USA.
| | - Dhivya Srinivasan
- Center for Biomedical Image Computing and Analytics, University of Pennsylvania, Philadelphia, PA 19104, USA; Department of Radiology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Raymond Pomponio
- Department of Biostatistics, Colorado School of Public Health, Aurora, CO 80045, USA
| | - Yong Fan
- Center for Biomedical Image Computing and Analytics, University of Pennsylvania, Philadelphia, PA 19104, USA; Department of Radiology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Ilya M Nasrallah
- Center for Biomedical Image Computing and Analytics, University of Pennsylvania, Philadelphia, PA 19104, USA; Department of Radiology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Susan M Resnick
- Laboratory of Behavioral Neuroscience, National Institute on Aging, Baltimore, MD 21224, USA
| | - Lori L Beason-Held
- Laboratory of Behavioral Neuroscience, National Institute on Aging, Baltimore, MD 21224, USA
| | - Christos Davatzikos
- Center for Biomedical Image Computing and Analytics, University of Pennsylvania, Philadelphia, PA 19104, USA; Department of Radiology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Theodore D Satterthwaite
- Penn Statistics in Imaging and Visualization Center, Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, PA 19104, USA; Center for Biomedical Image Computing and Analytics, University of Pennsylvania, Philadelphia, PA 19104, USA; Penn Lifespan Informatics & Neuroimaging Center, Department of Psychiatry, University of Pennsylvania, Philadelphia, PA 19104, USA; Department of Psychiatry, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Dani S Bassett
- Department of Bioengineering, School of Engineering & Applied Science, University of Pennsylvania, Philadelphia, PA 19104, USA; Department of Electrical & Systems Engineering, School of Engineering & Applied Science, University of Pennsylvania, Philadelphia, PA 19104, USA; Department of Physics & Astronomy, College of Arts & Sciences, University of Pennsylvania, Philadelphia, PA 19104, USA; Department of Nuerology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; Santa Fe Institute, 1399 Hyde Park Rd, Santa Fe, NM 87501, USA
| | - Russell T Shinohara
- Penn Statistics in Imaging and Visualization Center, Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, PA 19104, USA; Center for Biomedical Image Computing and Analytics, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Haochang Shou
- Penn Statistics in Imaging and Visualization Center, Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, PA 19104, USA; Center for Biomedical Image Computing and Analytics, University of Pennsylvania, Philadelphia, PA 19104, USA
| |
Collapse
|
8
|
Chowdhury A, Srinivasan S, Bhowmick S, Mukherjee A, Ghosh K. Constant community identification in million-scale networks. SOCIAL NETWORK ANALYSIS AND MINING 2022; 12:70. [PMID: 35789889 PMCID: PMC9243870 DOI: 10.1007/s13278-022-00895-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2022] [Revised: 05/27/2022] [Accepted: 06/01/2022] [Indexed: 10/29/2022]
Abstract
The inherently stochastic nature of community detection in real-world complex networks poses an important challenge in assessing the accuracy of the results. In order to eliminate the algorithmic and implementation artifacts, it is necessary to identify the groups of vertices that are always clustered together, independent of the community detection algorithm used. Such groups of vertices are called constant communities. Current approaches for finding constant communities are very expensive and do not scale to large networks. In this paper, we use binary edge classification to find constant communities. The key idea is to classify edges based on whether they form a constant community or not. We present two methods for edge classification. The first is a GCN-based semi-supervised approach that we term Line-GCN. The second is an unsupervised approach based on image thresholding methods. Neither of these methods requires explicit detection of communities and can thus scale to very large networks of the order of millions of vertices. Both of our semi-supervised and unsupervised results on real-world graphs demonstrate that the constant communities obtained by our method have higher F1-scores and comparable or higher NMI scores than other state-of-the-art baseline methods for constant community detection. While the training step of Line-GCN can be expensive, the unsupervised algorithm is 10 times faster than the baseline methods. For larger networks, the baseline methods cannot complete, whereas all of our algorithms can find constant communities in a reasonable amount of time. Finally, we also demonstrate that our methods are robust under noisy conditions. We use three different, well-studied noise models to add noise to the networks and show that our results are mostly stable.
Collapse
Affiliation(s)
- Anjan Chowdhury
- Center for Soft Computing Research, Indian Statistical Institute, Kolkata, India
| | - Sriram Srinivasan
- Department of Radiation Oncology, Virginia Commonwealth University, Richmond, USA
| | - Sanjukta Bhowmick
- Department of Computer Science, University of North Texas, Denton, USA
| | - Animesh Mukherjee
- Department of Computer Science and Engineering, IIT Kharagpur, Kharagpur, India
| | - Kuntal Ghosh
- Machine Intelligence Unit, Indian Statistical Institute, Kolkata, India
| |
Collapse
|
9
|
Yeung HW, Shen X, Stolicyn A, de Nooij L, Harris MA, Romaniuk L, Buchanan CR, Waiter GD, Sandu AL, McNeil CJ, Murray A, Steele JD, Campbell A, Porteous D, Lawrie SM, McIntosh AM, Cox SR, Smith KM, Whalley HC. Spectral clustering based on structural magnetic resonance imaging and its relationship with major depressive disorder and cognitive ability. Eur J Neurosci 2021; 54:6281-6303. [PMID: 34390586 DOI: 10.1111/ejn.15423] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Accepted: 08/09/2021] [Indexed: 11/29/2022]
Abstract
There is increasing interest in using data-driven unsupervised methods to identify structural underpinnings of common mental illnesses, including major depressive disorder (MDD) and associated traits such as cognition. However, studies are often limited to severe clinical cases with small sample sizes and most do not include replication. Here, we examine two relatively large samples with structural magnetic resonance imaging (MRI), measures of lifetime MDD and cognitive variables: Generation Scotland (GS subsample, N = 980) and UK Biobank (UKB, N = 8,900), for discovery and replication, using an exploratory approach. Regional measures of FreeSurfer derived cortical thickness (CT), cortical surface area (CSA), cortical volume (CV) and subcortical volume (subCV) were input into a clustering process, controlling for common covariates. The main analysis steps involved constructing participant K-nearest neighbour graphs and graph partitioning with Markov stability to determine optimal clustering of participants. Resultant clusters were (1) checked whether they were replicated in an independent cohort and (2) tested for associations with depression status and cognitive measures. Participants separated into two clusters based on structural brain measurements in GS subsample, with large Cohen's d effect sizes between clusters in higher order cortical regions, commonly associated with executive function and decision making. Clustering was replicated in the UKB sample, with high correlations of cluster effect sizes for CT, CSA, CV and subCV between cohorts across regions. The identified clusters were not significantly different with respect to MDD case-control status in either cohort (GS subsample: pFDR = .2239-.6585; UKB: pFDR = .2003-.7690). Significant differences in general cognitive ability were, however, found between the clusters for both datasets, for CSA, CV and subCV (GS subsample: d = 0.2529-.3490, pFDR < .005; UKB: d = 0.0868-0.1070, pFDR < .005). Our results suggest that there are replicable natural groupings of participants based on cortical and subcortical brain measures, which may be related to differences in cognitive performance, but not to the MDD case-control status.
Collapse
Affiliation(s)
- Hon Wah Yeung
- Division of Psychiatry, University of Edinburgh, Edinburgh, UK
| | - Xueyi Shen
- Division of Psychiatry, University of Edinburgh, Edinburgh, UK
| | - Aleks Stolicyn
- Division of Psychiatry, University of Edinburgh, Edinburgh, UK
| | - Laura de Nooij
- Division of Psychiatry, University of Edinburgh, Edinburgh, UK
| | - Mathew A Harris
- Division of Psychiatry, University of Edinburgh, Edinburgh, UK
| | - Liana Romaniuk
- Division of Psychiatry, University of Edinburgh, Edinburgh, UK
| | - Colin R Buchanan
- Lothian Birth Cohorts group, Department of Psychology, University of Edinburgh, Edinburgh, UK
| | - Gordon D Waiter
- Aberdeen Biomedical Imaging Centre, Institute of Medical Sciences, University of Aberdeen, Aberdeen, UK
| | - Anca-Larisa Sandu
- Aberdeen Biomedical Imaging Centre, Institute of Medical Sciences, University of Aberdeen, Aberdeen, UK
| | - Christopher J McNeil
- Aberdeen Biomedical Imaging Centre, Institute of Medical Sciences, University of Aberdeen, Aberdeen, UK
| | - Alison Murray
- Aberdeen Biomedical Imaging Centre, Institute of Medical Sciences, University of Aberdeen, Aberdeen, UK
| | - J Douglas Steele
- School of Medicine, University of Dundee, Dundee, UK.,Department of Neurology, NHS Tayside, Ninewells Hospital and Medical School, Dundee, UK
| | - Archie Campbell
- Centre for Genomic and Experimental Medicine, Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, UK
| | - David Porteous
- Centre for Genomic and Experimental Medicine, Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, UK.,Centre for Cognitive Ageing and Cognitive Epidemiology, University of Edinburgh, Edinburgh, UK
| | | | - Andrew M McIntosh
- Division of Psychiatry, University of Edinburgh, Edinburgh, UK.,Centre for Genomic and Experimental Medicine, Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, UK
| | - Simon R Cox
- Lothian Birth Cohorts group, Department of Psychology, University of Edinburgh, Edinburgh, UK
| | - Keith M Smith
- Usher Institute, University of Edinburgh, Edinburgh, UK.,Health Data Research UK, London, UK
| | | |
Collapse
|
10
|
Tardiff N, Medaglia JD, Bassett DS, Thompson-Schill SL. The modulation of brain network integration and arousal during exploration. Neuroimage 2021; 240:118369. [PMID: 34242784 PMCID: PMC8507424 DOI: 10.1016/j.neuroimage.2021.118369] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2020] [Revised: 07/01/2021] [Accepted: 07/05/2021] [Indexed: 11/08/2022] Open
Abstract
There is growing interest in how neuromodulators shape brain networks. Recent neuroimaging studies provide evidence that brainstem arousal systems, such as the locus coeruleus-norepinephrine system (LC-NE), influence functional connectivity and brain network topology, suggesting they have a role in flexibly reconfiguring brain networks in order to adapt behavior and cognition to environmental demands. To date, however, the relationship between brainstem arousal systems and functional connectivity has not been assessed within the context of a task with an established relationship between arousal and behavior, with most prior studies relying on incidental variations in arousal or pharmacological manipulation and static brain networks constructed over long periods of time. These factors have likely contributed to a heterogeneity of effects across studies. To address these issues, we took advantage of the association between LC-NE-linked arousal and exploration to probe the relationships between exploratory choice, arousal—as measured indirectly via pupil diameter—and brain network dynamics. Exploration in a bandit task was associated with a shift toward fewer, more weakly connected modules that were more segregated in terms of connectivity and topology but more integrated with respect to the diversity of cognitive systems represented in each module. Functional connectivity strength decreased, and changes in connectivity were correlated with changes in pupil diameter, in line with the hypothesis that brainstem arousal systems influence the dynamic reorganization of brain networks. More broadly, we argue that carefully aligning dynamic network analyses with task designs can increase the temporal resolution at which behaviorally- and cognitively-relevant modulations can be identified, and offer these results as a proof of concept of this approach.
Collapse
Affiliation(s)
- Nathan Tardiff
- Department of Psychology, University of Pennsylvania, Philadelphia, PA, United States.
| | - John D Medaglia
- Department of Psychology, Drexel University, Philadelphia, PA, United States; Department of Neurology, Drexel University, Philadelphia, PA, United States; Department of Neurology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| | - Danielle S Bassett
- Department of Neurology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States; Department of Bioengineering, University of Pennsylvania, Philadelphia, PA, United States; Department of Electrical & Systems Engineering, University of Pennsylvania, Philadelphia, PA, United States; Department of Psychiatry, University of Pennsylvania, Philadelphia, PA, United States; Department of Physics & Astronomy, University of Pennsylvania, Philadelphia, PA, United States; Santa Fe Institute, Santa Fe, NM, United States
| | | |
Collapse
|
11
|
Lee D, Lee SH, Kim BJ, Kim H. Consistency landscape of network communities. Phys Rev E 2021; 103:052306. [PMID: 34134219 DOI: 10.1103/physreve.103.052306] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2021] [Accepted: 04/20/2021] [Indexed: 11/07/2022]
Abstract
The concept of community detection has long been used as a key device for handling the mesoscale structures in networks. Suitably conducted community detection reveals various embedded informative substructures of network topology. However, regarding the practical usage of community detection, it has always been a tricky problem to assign a reasonable community resolution for networks of interest. Because of the absence of the unanimously accepted criterion, most of the previous studies utilized rather ad hoc heuristics to decide the community resolution. In this work, we harness the concept of consistency in community structures of networks to provide the overall community resolution landscape of networks, which we eventually take to quantify the reliability of detected communities for a given resolution parameter. More precisely, we exploit the ambiguity in the results of stochastic detection algorithms and suggest a method that denotes the relative validity of community structures in regard to their stability of global and local inconsistency measures using multiple detection processes. Applying our framework to synthetic and real networks, we confirm that it effectively displays insightful fundamental aspects of community structures.
Collapse
Affiliation(s)
- Daekyung Lee
- Department of Physics, Sungkyunkwan University, Suwon 16419, Korea
| | - Sang Hoon Lee
- Department of Liberal Arts, Gyeongsang National University, Jinju 52725, Korea.,Future Convergence Technology Research Institute, Gyeongsang National University, Jinju 52849, Korea
| | - Beom Jun Kim
- Department of Physics, Sungkyunkwan University, Suwon 16419, Korea
| | - Heetae Kim
- Department of Energy Technology, Korea Institute of Energy Technology, Naju 58322, Korea.,Data Science Institute, Faculty of Engineering, Universidad del Desarrollo, Santiago 7610658, Chile
| |
Collapse
|
12
|
Yang Z, Telesford QK, Franco AR, Lim R, Gu S, Xu T, Ai L, Castellanos FX, Yan CG, Colcombe S, Milham MP. Measurement reliability for individual differences in multilayer network dynamics: Cautions and considerations. Neuroimage 2021; 225:117489. [PMID: 33130272 PMCID: PMC7829665 DOI: 10.1016/j.neuroimage.2020.117489] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2020] [Accepted: 10/21/2020] [Indexed: 01/16/2023] Open
Abstract
Multilayer network models have been proposed as an effective means of capturing the dynamic configuration of distributed neural circuits and quantitatively describing how communities vary over time. Beyond general insights into brain function, a growing number of studies have begun to employ these methods for the study of individual differences. However, test-retest reliabilities for multilayer network measures have yet to be fully quantified or optimized, potentially limiting their utility for individual difference studies. Here, we systematically evaluated the impact of multilayer community detection algorithms, selection of network parameters, scan duration, and task condition on test-retest reliabilities of multilayer network measures (i.e., flexibility, integration, and recruitment). A key finding was that the default method used for community detection by the popular generalized Louvain algorithm can generate erroneous results. Although available, an updated algorithm addressing this issue is yet to be broadly adopted in the neuroimaging literature. Beyond the algorithm, the present work identified parameter selection as a key determinant of test-retest reliability; however, optimization of these parameters and expected reliabilities appeared to be dataset-specific. Once parameters were optimized, consistent with findings from the static functional connectivity literature, scan duration was a much stronger determinant of reliability than scan condition. When the parameters were optimized and scan duration was sufficient, both passive (i.e., resting state, Inscapes, and movie) and active (i.e., flanker) tasks were reliable, although reliability in the movie watching condition was significantly higher than in the other three tasks. The minimal data requirement for achieving reliable measures for the movie watching condition was 20 min, and 30 min for the other three tasks. Our results caution the field against the use of default parameters without optimization based on the specific datasets to be employed - a process likely to be limited for most due to the lack of test-retest samples to enable parameter optimization.
Collapse
Affiliation(s)
- Zhen Yang
- Center for Biomedical Imaging and Neuromodulation, The Nathan S. Kline Institute for Psychiatric Research, 140 Old Orangeburg Rd, Orangeburg, NY 10962, United States; Department of Psychiatry, NYU Grossman School of Medicine, 550 1st Avenue, New York, NY 10016, United States.
| | - Qawi K Telesford
- Center for Biomedical Imaging and Neuromodulation, The Nathan S. Kline Institute for Psychiatric Research, 140 Old Orangeburg Rd, Orangeburg, NY 10962, United States
| | - Alexandre R Franco
- Center for Biomedical Imaging and Neuromodulation, The Nathan S. Kline Institute for Psychiatric Research, 140 Old Orangeburg Rd, Orangeburg, NY 10962, United States; Department of Psychiatry, NYU Grossman School of Medicine, 550 1st Avenue, New York, NY 10016, United States; Center for the Developing Brain, The Child Mind Institute, 101 East 56th Street, New York, NY 10022, United States
| | - Ryan Lim
- Center for Biomedical Imaging and Neuromodulation, The Nathan S. Kline Institute for Psychiatric Research, 140 Old Orangeburg Rd, Orangeburg, NY 10962, United States
| | - Shi Gu
- University of Electronic Science and Technology of China, Chengdu, Sichuan, China
| | - Ting Xu
- Center for the Developing Brain, The Child Mind Institute, 101 East 56th Street, New York, NY 10022, United States
| | - Lei Ai
- Center for the Developing Brain, The Child Mind Institute, 101 East 56th Street, New York, NY 10022, United States
| | - Francisco X Castellanos
- Center for Biomedical Imaging and Neuromodulation, The Nathan S. Kline Institute for Psychiatric Research, 140 Old Orangeburg Rd, Orangeburg, NY 10962, United States; Department of Child and Adolescent Psychiatry, NYU Grossman School of Medicine, New York, NY 10016, United States
| | - Chao-Gan Yan
- CAS Key Laboratory of Behavioral Science, Institute of Psychology, Beijing, China
| | - Stan Colcombe
- Center for Biomedical Imaging and Neuromodulation, The Nathan S. Kline Institute for Psychiatric Research, 140 Old Orangeburg Rd, Orangeburg, NY 10962, United States; Department of Psychiatry, NYU Grossman School of Medicine, 550 1st Avenue, New York, NY 10016, United States
| | - Michael P Milham
- Center for Biomedical Imaging and Neuromodulation, The Nathan S. Kline Institute for Psychiatric Research, 140 Old Orangeburg Rd, Orangeburg, NY 10962, United States; Center for the Developing Brain, The Child Mind Institute, 101 East 56th Street, New York, NY 10022, United States.
| |
Collapse
|
13
|
Abstract
K-mer based comparisons have emerged as powerful complements to BLAST-like alignment algorithms, particularly when the sequences being compared lack direct evolutionary relationships. In this chapter, we describe methods to compare k-mer content between groups of long noncoding RNAs (lncRNAs), to identify communities of lncRNAs with related k-mer contents, to identify the enrichment of protein-binding motifs in lncRNAs, and to scan for domains of related k-mer contents in lncRNAs. Our step-by-step instructions are complemented by Python code deposited in Github. Though our chapter focuses on lncRNAs, the methods we describe could be applied to any set of nucleic acid sequences.
Collapse
Affiliation(s)
- Jessime M Kirk
- Department of Pharmacology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
- Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
- Invitae Corporation, San Francisco, CA, USA
| | - Daniel Sprague
- Department of Pharmacology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
- Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
- Curriculum in Pharmacology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
- Flagship Pioneering, Boston, MA, USA
| | - J Mauro Calabrese
- Department of Pharmacology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.
- Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.
- Curriculum in Pharmacology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.
| |
Collapse
|
14
|
Sanchez-Rodriguez LM, Iturria-Medina Y, Mouches P, Sotero RC. Detecting brain network communities: Considering the role of information flow and its different temporal scales. Neuroimage 2020; 225:117431. [PMID: 33045336 DOI: 10.1016/j.neuroimage.2020.117431] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2020] [Revised: 09/22/2020] [Accepted: 09/29/2020] [Indexed: 12/16/2022] Open
Abstract
The identification of community structure in graphs continues to attract great interest in several fields. Network neuroscience is particularly concerned with this problem considering the key roles communities play in brain processes and functionality. Most methods used for community detection in brain graphs are based on the maximization of a parameter-dependent modularity function that often obscures the physical meaning and hierarchical organization of the partitions of network nodes. In this work, we present a new method able to detect communities at different scales in a natural, unrestricted way. First, to obtain an estimation of the information flow in the network we release random walkers to freely move over it. The activity of the walkers is separated into oscillatory modes by using empirical mode decomposition. After grouping nodes by their co-occurrence at each time scale, k-modes clustering returns the desired partitions. Our algorithm was first tested on benchmark graphs with favorable performance. Next, it was applied to real and simulated anatomical and/or functional connectomes in the macaque and human brains. We found a clear hierarchical repertoire of community structures in both the anatomical and the functional networks. The observed partitions range from the evident division in two hemispheres -in which all processes are managed globally- to specialized communities seemingly shaped by physical proximity and shared function. Additionally, the spatial scales of a network's community structure (characterized by a measure we term within-communities path length) appear inversely proportional to the oscillatory modes' average frequencies. The proportionality constant may constitute a network-specific propagation velocity for the information flow. Our results stimulate the research of hierarchical community organization in terms of temporal scales of information flow in the brain network.
Collapse
Affiliation(s)
- Lazaro M Sanchez-Rodriguez
- Department of Radiology and Hotchkiss Brain Institute, University of Calgary, Calgary, AB, Canada; Department of Neurology and Neurosurgery, Montreal Neurological Institute, McGill Univ., Montreal, Canada; McConnel Brain Imaging Center, Montreal Neurological Institute, McGill Univ., Montreal, Canada; Ludmer Centre for Neuroinformatics and Mental Health, McGill Univ., Montreal, Canada.
| | - Yasser Iturria-Medina
- Department of Neurology and Neurosurgery, Montreal Neurological Institute, McGill Univ., Montreal, Canada; McConnel Brain Imaging Center, Montreal Neurological Institute, McGill Univ., Montreal, Canada; Ludmer Centre for Neuroinformatics and Mental Health, McGill Univ., Montreal, Canada
| | - Pauline Mouches
- Department of Radiology and Hotchkiss Brain Institute, University of Calgary, Calgary, AB, Canada; Biomedical Engineering Graduate Program, University of Calgary, Calgary, Alberta, Canada
| | - Roberto C Sotero
- Department of Radiology and Hotchkiss Brain Institute, University of Calgary, Calgary, AB, Canada; Biomedical Engineering Graduate Program, University of Calgary, Calgary, Alberta, Canada.
| |
Collapse
|
15
|
Betzel RF, Bertolero MA, Gordon EM, Gratton C, Dosenbach NUF, Bassett DS. The community structure of functional brain networks exhibits scale-specific patterns of inter- and intra-subject variability. Neuroimage 2019; 202:115990. [PMID: 31291606 PMCID: PMC7734597 DOI: 10.1016/j.neuroimage.2019.07.003] [Citation(s) in RCA: 56] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2018] [Revised: 06/28/2019] [Accepted: 07/01/2019] [Indexed: 02/01/2023] Open
Abstract
The network organization of the human brain varies across individuals, changes with development and aging, and differs in disease. Discovering the major dimensions along which this variability is displayed remains a central goal of both neuroscience and clinical medicine. Such efforts can be usefully framed within the context of the brain's modular network organization, which can be assessed quantitatively using computational techniques and extended for the purposes of multi-scale analysis, dimensionality reduction, and biomarker generation. Although the concept of modularity and its utility in describing brain network organization is clear, principled methods for comparing multi-scale communities across individuals and time are surprisingly lacking. Here, we present a method that uses multi-layer networks to simultaneously discover the modular structure of many subjects at once. This method builds upon the well-known multi-layer modularity maximization technique, and provides a viable and principled tool for studying differences in network communities across individuals and within individuals across time. We test this method on two datasets and identify consistent patterns of inter-subject community variability, demonstrating that this variability - which would be undetectable using past approaches - is associated with measures of cognitive performance. In general, the multi-layer, multi-subject framework proposed here represents an advance over current approaches by straighforwardly mapping community assignments across subjects and holds promise for future investigations of inter-subject community variation in clinical populations or as a result of task constraints.
Collapse
Affiliation(s)
- Richard F Betzel
- Department of Bioengineering, School of Engineering and Applied Sciences, University of Pennsylvania, Philadelphia, PA, 19104, USA; Department of Psychological and Brain Sciences, Indiana University, Bloomington, IN, 47401, USA; Cognitive Science Program, Indiana University, Bloomington, IN, 47401, USA
| | - Maxwell A Bertolero
- Department of Bioengineering, School of Engineering and Applied Sciences, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Evan M Gordon
- VISN 17 Center of Excellence for Research on Returning War Veterans, Waco, TX, 76711, USA; Center for Vital Longevity, School of Behavioral and Brain Sciences, University of Texas at Dallas, Dallas, TX, 75235, USA
| | - Caterina Gratton
- Department of Neurology, Washington University School of Medicine, St. Louis, MO, 63110, USA; Department of Psychology, Northwestern University, Evanston, IL, 60208, USA
| | - Nico U F Dosenbach
- Department of Neurology, Washington University School of Medicine, St. Louis, MO, 63110, USA; Program in Occupational Therapy, Washington University School of Medicine, St. Louis, MO, 63110, USA; Department of Pediatrics, Washington University School of Medicine, St. Louis, MO, 63110, USA
| | - Danielle S Bassett
- Department of Bioengineering, School of Engineering and Applied Sciences, University of Pennsylvania, Philadelphia, PA, 19104, USA; Department of Electrical & Systems Engineering, School of Engineering and Applied Sciences, University of Pennsylvania, Philadelphia, PA, 19104, USA; Department of Neurology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA; Department of Physics & Astronomy, School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA, 19104, USA; Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA; Santa Fe Institute, 1399 Hyde Park Rd, Santa Fe, NM, 87501, USA.
| |
Collapse
|
16
|
Calatayud J, Bernardo-Madrid R, Neuman M, Rojas A, Rosvall M. Exploring the solution landscape enables more reliable network community detection. Phys Rev E 2019; 100:052308. [PMID: 31869919 DOI: 10.1103/physreve.100.052308] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2019] [Indexed: 06/10/2023]
Abstract
To understand how a complex system is organized and functions, researchers often identify communities in the system's network of interactions. Because it is practically impossible to explore all solutions to guarantee the best one, many community-detection algorithms rely on multiple stochastic searches. But for a given combination of network and stochastic algorithms, how many searches are sufficient to find a solution that is good enough? The standard approach is to pick a reasonably large number of searches and select the network partition with the highest quality or derive a consensus solution based on all network partitions. However, if different partitions have similar qualities such that the solution landscape is degenerate, the single best partition may miss relevant information, and a consensus solution may blur complementary communities. Here we address this degeneracy problem with coarse-grained descriptions of the solution landscape. We cluster network partitions based on their similarity and suggest an approach to determine the minimum number of searches required to describe the solution landscape adequately. To make good use of all partitions, we also propose different ways to explore the solution landscape, including a significance clustering procedure. We test these approaches on synthetic networks and a real-world network using two contrasting community-detection algorithms: The algorithm that can identify more general structures requires more searches, and networks with clearer community structures require fewer searches. We also find that exploring the coarse-grained solution landscape can reveal complementary solutions and enable more reliable community detection.
Collapse
Affiliation(s)
- Joaquín Calatayud
- Integrated Science Lab, Department of Physics, Umeå University, Sweden
| | | | - Magnus Neuman
- Integrated Science Lab, Department of Physics, Umeå University, Sweden
| | - Alexis Rojas
- Integrated Science Lab, Department of Physics, Umeå University, Sweden
| | - Martin Rosvall
- Integrated Science Lab, Department of Physics, Umeå University, Sweden
| |
Collapse
|
17
|
Trogdon JG, Weir WH, Shai S, Mucha PJ, Kuo TM, Meyer AM, Stitzenberg KB. Comparing Shared Patient Networks Across Payers. J Gen Intern Med 2019; 34:2014-2020. [PMID: 30945065 PMCID: PMC6816773 DOI: 10.1007/s11606-019-04978-9] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/11/2018] [Revised: 11/21/2018] [Accepted: 02/19/2019] [Indexed: 11/24/2022]
Abstract
BACKGROUND Measuring care coordination in administrative data facilitates important research to improve care quality. OBJECTIVE To compare shared patient networks constructed from administrative claims data across multiple payers. DESIGN Social network analysis of pooled cross sections of physicians treating prevalent colorectal cancer patients between 2003 and 2013. PARTICIPANTS Surgeons, medical oncologists, and radiation oncologists identified from North Carolina Central Cancer Registry data linked to Medicare claims (N = 1735) and private insurance claims (N = 1321). MAIN MEASURES Provider-level measures included the number of patients treated, the number of providers with whom they share patients (by specialty), the extent of patient sharing with each specialty, and network centrality. Network-level measures included the number of providers and shared patients, the density of shared-patient relationships among providers, and the size and composition of clusters of providers with a high level of patient sharing. RESULTS For 24.5% of providers, total patient volume rank differed by at least one quintile group between payers. Medicare claims missed 14.6% of all shared patient relationships between providers, but captured a greater number of patient-sharing relationships per provider compared with the private insurance database, even after controlling for the total number of patients (27.242 vs 26.044, p < 0.001). Providers in the private network shared a higher fraction of patients with other providers (0.226 vs 0.127, p < 0.001) compared to the Medicare network. Clustering coefficients for providers, weighted betweenness, and eigenvector centrality varied greatly across payers. Network differences led to some clusters of providers that existed in the combined network not being detected in Medicare alone. CONCLUSION Many features of shared patient networks constructed from a single-payer database differed from similar networks constructed from other payers' data. Depending on a study's goals, shortcomings of single-payer networks should be considered when using claims data to draw conclusions about provider behavior.
Collapse
Affiliation(s)
- Justin G Trogdon
- Department of Health Policy and Management, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA. .,Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.
| | - W H Weir
- Carolina Center for Interdisciplinary Applied Mathematics, Department of Mathematics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.,Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - S Shai
- Department of Mathematics and Computer Science, Wesleyan University, Marion, IN, USA
| | - P J Mucha
- Carolina Center for Interdisciplinary Applied Mathematics, Department of Mathematics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - T M Kuo
- Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | | | - K B Stitzenberg
- Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.,Division of Surgical Oncology and Endocrinology Surgery, Department of Surgery, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| |
Collapse
|
18
|
Walker B, Taylor D, Lawrimore J, Hult C, Adalsteinsson D, Bloom K, Forest MG. Transient crosslinking kinetics optimize gene cluster interactions. PLoS Comput Biol 2019; 15:e1007124. [PMID: 31433796 PMCID: PMC6730938 DOI: 10.1371/journal.pcbi.1007124] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2019] [Revised: 09/06/2019] [Accepted: 07/18/2019] [Indexed: 02/05/2023] Open
Abstract
Our understanding of how chromosomes structurally organize and dynamically interact has been revolutionized through the lens of long-chain polymer physics. Major protein contributors to chromosome structure and dynamics are condensin and cohesin that stochastically generate loops within and between chains, and entrap proximal strands of sister chromatids. In this paper, we explore the ability of transient, protein-mediated, gene-gene crosslinks to induce clusters of genes, thereby dynamic architecture, within the highly repeated ribosomal DNA that comprises the nucleolus of budding yeast. We implement three approaches: live cell microscopy; computational modeling of the full genome during G1 in budding yeast, exploring four decades of timescales for transient crosslinks between 5kbp domains (genes) in the nucleolus on Chromosome XII; and, temporal network models with automated community (cluster) detection algorithms applied to the full range of 4D modeling datasets. The data analysis tools detect and track gene clusters, their size, number, persistence time, and their plasticity (deformation). Of biological significance, our analysis reveals an optimal mean crosslink lifetime that promotes pairwise and cluster gene interactions through "flexible" clustering. In this state, large gene clusters self-assemble yet frequently interact (merge and separate), marked by gene exchanges between clusters, which in turn maximizes global gene interactions in the nucleolus. This regime stands between two limiting cases each with far less global gene interactions: with shorter crosslink lifetimes, "rigid" clustering emerges with clusters that interact infrequently; with longer crosslink lifetimes, there is a dissolution of clusters. These observations are compared with imaging experiments on a normal yeast strain and two condensin-modified mutant cell strains. We apply the same image analysis pipeline to the experimental and simulated datasets, providing support for the modeling predictions.
Collapse
Affiliation(s)
- Benjamin Walker
- Department of Mathematics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Dane Taylor
- Department of Mathematics, University at Buffalo, State University of New York, Buffalo, New York, United States of America
- Computational and Data Enabled Science and Engineering, University at Buffalo, State University of New York, Buffalo, New York, United States of America
| | - Josh Lawrimore
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Caitlin Hult
- Department of Microbiology and Immunology, University of Michigan Medical School, Ann Arbor, Michigan, United States of America
- Department of Chemical Engineering, University of Michigan, Ann Arbor, Michigan, United States of America
| | - David Adalsteinsson
- Department of Mathematics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Kerry Bloom
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - M. Gregory Forest
- Department of Mathematics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- Departments of Applied Physical Sciences and Biomedical Engineering, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- * E-mail:
| |
Collapse
|
19
|
Robinson JI, Weir WH, Crowley JR, Hink T, Reske KA, Kwon JH, Burnham CAD, Dubberke ER, Mucha PJ, Henderson JP. Metabolomic networks connect host-microbiome processes to human Clostridioides difficile infections. J Clin Invest 2019; 129:3792-3806. [PMID: 31403473 DOI: 10.1172/jci126905] [Citation(s) in RCA: 58] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2019] [Accepted: 06/11/2019] [Indexed: 12/15/2022] Open
Abstract
Clostridioides difficile infection (CDI) accounts for a substantial proportion of deaths attributable to antibiotic-resistant bacteria in the United States. Although C. difficile can be an asymptomatic colonizer, its pathogenic potential is most commonly manifested in patients with antibiotic-modified intestinal microbiomes. In a cohort of 186 hospitalized patients, we showed that host and microbe-associated shifts in fecal metabolomes had the potential to distinguish patients with CDI from those with non-C. difficile diarrhea and C. difficile colonization. Patients with CDI exhibited a chemical signature of Stickland amino acid fermentation that was distinct from those of uncolonized controls. This signature suggested that C. difficile preferentially catabolizes branched chain amino acids during CDI. Unexpectedly, we also identified a series of noncanonical, unsaturated bile acids that were depleted in patients with CDI. These bile acids may derive from an extended host-microbiome dehydroxylation network in uninfected patients. Bile acid composition and leucine fermentation defined a prototype metabolomic model with potential to distinguish clinical CDI from asymptomatic C. difficile colonization.
Collapse
Affiliation(s)
- John I Robinson
- Center for Women's Infectious Disease Research, Division of Infectious Diseases, Department of Internal Medicine, Washington University School of Medicine, St. Louis, Missouri, USA
| | - William H Weir
- Carolina Center for Interdisciplinary Applied Mathematics, Department of Mathematics, and Curriculum in Bioinformatics & Computational Biology, University of North Carolina, Chapel Hill, North Carolina, USA
| | - Jan R Crowley
- Center for Women's Infectious Disease Research, Division of Infectious Diseases, Department of Internal Medicine, Washington University School of Medicine, St. Louis, Missouri, USA
| | - Tiffany Hink
- Center for Women's Infectious Disease Research, Division of Infectious Diseases, Department of Internal Medicine, Washington University School of Medicine, St. Louis, Missouri, USA
| | - Kimberly A Reske
- Center for Women's Infectious Disease Research, Division of Infectious Diseases, Department of Internal Medicine, Washington University School of Medicine, St. Louis, Missouri, USA
| | - Jennie H Kwon
- Center for Women's Infectious Disease Research, Division of Infectious Diseases, Department of Internal Medicine, Washington University School of Medicine, St. Louis, Missouri, USA
| | - Carey-Ann D Burnham
- Department of Pathology and Immunology, Washington University in St. Louis, St. Louis, Missouri, USA
| | - Erik R Dubberke
- Center for Women's Infectious Disease Research, Division of Infectious Diseases, Department of Internal Medicine, Washington University School of Medicine, St. Louis, Missouri, USA
| | - Peter J Mucha
- Carolina Center for Interdisciplinary Applied Mathematics, Department of Mathematics, and Curriculum in Bioinformatics & Computational Biology, University of North Carolina, Chapel Hill, North Carolina, USA
| | - Jeffrey P Henderson
- Center for Women's Infectious Disease Research, Division of Infectious Diseases, Department of Internal Medicine, Washington University School of Medicine, St. Louis, Missouri, USA
| |
Collapse
|
20
|
Gates KM, Fisher ZF, Arizmendi C, Henry TR, Duffy KA, Mucha PJ. Assessing the robustness of cluster solutions obtained from sparse count matrices. Psychol Methods 2019; 24:675-689. [PMID: 30742473 DOI: 10.1037/met0000204] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Psychological researchers often seek to obtain cluster solutions from sparse count matrices (e.g., social networks; counts of symptoms that are in common for 2 given individuals; structural brain imaging). Increasingly, community detection methods are being used to subset the data in a data-driven manner. While many of these approaches perform well in simulation studies and thus offer some improvement upon traditional clustering approaches, there is no readily available approach for evaluating the robustness of these solutions in empirical data. Researchers have no way of knowing if their results are due to noise. We describe here 2 approaches novel to the field of psychology that enable evaluation of cluster solution robustness. This tutorial also explains the use of an associated R package, perturbR, which provides researchers with the ability to use the methods described herein. In the first approach, the cluster assignment from the original matrix is compared against cluster assignments obtained by randomly perturbing the edges in the matrix. Stable cluster solutions should not demonstrate large changes in the presence of small perturbations. For the second approach, Monte Carlo simulations of random matrices that have the same properties as the original matrix are generated. The distribution of quality scores ("modularity") obtained from the cluster solutions from these matrices are then compared with the score obtained from the original matrix results. From this, one can assess if the results are better than what would be expected by chance. perturbR automates these 2 methods, providing an easy-to-use resource for psychological researchers. We demonstrate the utility of this package using benchmark simulated data generated from a previous study and then apply the methods to publicly available empirical data obtained from social networks and structural neuroimaging. (PsycINFO Database Record (c) 2019 APA, all rights reserved).
Collapse
Affiliation(s)
- Kathleen M Gates
- Department of Psychology and Neuroscience, University of North Carolina at Chapel Hill
| | - Zachary F Fisher
- Department of Psychology and Neuroscience, University of North Carolina at Chapel Hill
| | - Cara Arizmendi
- Department of Psychology and Neuroscience, University of North Carolina at Chapel Hill
| | - Teague R Henry
- Department of Psychology and Neuroscience, University of North Carolina at Chapel Hill
| | - Kelly A Duffy
- Department of Psychology and Neuroscience, University of North Carolina at Chapel Hill
| | - Peter J Mucha
- Department of Mathematics, University of North Carolina at Chapel Hill
| |
Collapse
|
21
|
Kirk JM, Kim SO, Inoue K, Smola MJ, Lee DM, Schertzer MD, Wooten JS, Baker AR, Sprague D, Collins DW, Horning CR, Wang S, Chen Q, Weeks KM, Mucha PJ, Calabrese JM. Functional classification of long non-coding RNAs by k-mer content. Nat Genet 2018; 50:1474-1482. [PMID: 30224646 PMCID: PMC6262761 DOI: 10.1038/s41588-018-0207-8] [Citation(s) in RCA: 154] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2017] [Accepted: 07/24/2018] [Indexed: 12/30/2022]
Abstract
The functions of most long non-coding RNAs (lncRNAs) are unknown. In contrast to proteins, lncRNAs with similar functions often lack linear sequence homology; thus, the identification of function in one lncRNA rarely informs the identification of function in others. We developed a sequence comparison method to deconstruct linear sequence relationships in lncRNAs and evaluate similarity based on the abundance of short motifs called kmers. We found that lncRNAs of related function often had similar kmer profiles despite lacking linear homology, and that kmer profiles correlated with protein binding to lncRNAs and with their subcellular localization. Using a novel assay to quantify Xist-like regulatory potential, we directly demonstrated that evolutionarily unrelated lncRNAs can encode similar function through different spatial arrangements of related sequence motifs. Kmer-based classification is a powerful approach to detect recurrent relationships between sequence and function in lncRNAs.
Collapse
Affiliation(s)
- Jessime M Kirk
- Department of Pharmacology and Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.,Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Susan O Kim
- Department of Pharmacology and Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.,National Institute of Environmental Health Sciences, Research Triangle Park, NC, USA
| | - Kaoru Inoue
- Department of Pharmacology and Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.,National Institute of Environmental Health Sciences, Research Triangle Park, NC, USA
| | - Matthew J Smola
- Department of Chemistry, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.,Ribometrix, Durham, NC, USA
| | - David M Lee
- Department of Pharmacology and Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.,Curriculum in Genetics and Molecular Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Megan D Schertzer
- Department of Pharmacology and Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.,Curriculum in Genetics and Molecular Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Joshua S Wooten
- Department of Pharmacology and Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.,Curriculum in Genetics and Molecular Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Allison R Baker
- Department of Pharmacology and Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.,Harvard Medical School, Ph.D. Program in Biological and Biomedical Sciences, Boston, MA, USA
| | - Daniel Sprague
- Department of Pharmacology and Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.,Curriculum in Pharmacology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - David W Collins
- Department of Computer Science, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Christopher R Horning
- Department of Computer Science, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Shuo Wang
- Department of Computer Science, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Qidi Chen
- Department of Computer Science, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Kevin M Weeks
- Department of Chemistry, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Peter J Mucha
- Carolina Center for Interdisciplinary Applied Mathematics, Department of Mathematics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - J Mauro Calabrese
- Department of Pharmacology and Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.
| |
Collapse
|
22
|
Muldoon SF. Multilayer network modeling creates opportunities for novel network statistics. Phys Life Rev 2018; 24:143-145. [DOI: 10.1016/j.plrev.2017.12.005] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2017] [Accepted: 12/21/2017] [Indexed: 11/26/2022]
|
23
|
Jeub LGS, Sporns O, Fortunato S. Multiresolution Consensus Clustering in Networks. Sci Rep 2018; 8:3259. [PMID: 29459635 PMCID: PMC5818657 DOI: 10.1038/s41598-018-21352-7] [Citation(s) in RCA: 79] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2017] [Accepted: 02/02/2018] [Indexed: 11/16/2022] Open
Abstract
Networks often exhibit structure at disparate scales. We propose a method for identifying community structure at different scales based on multiresolution modularity and consensus clustering. Our contribution consists of two parts. First, we propose a strategy for sampling the entire range of possible resolutions for the multiresolution modularity quality function. Our approach is directly based on the properties of modularity and, in particular, provides a natural way of avoiding the need to increase the resolution parameter by several orders of magnitude to break a few remaining small communities, necessitating the introduction of ad-hoc limits to the resolution range with standard sampling approaches. Second, we propose a hierarchical consensus clustering procedure, based on a modified modularity, that allows one to construct a hierarchical consensus structure given a set of input partitions. While here we are interested in its application to partitions sampled using multiresolution modularity, this consensus clustering procedure can be applied to the output of any clustering algorithm. As such, we see many potential applications of the individual parts of our multiresolution consensus clustering procedure in addition to using the procedure itself to identify hierarchical structure in networks.
Collapse
Affiliation(s)
- Lucas G S Jeub
- School of Informatics, Computing and Engineering, Indiana University, Indiana, United States.
| | - Olaf Sporns
- Department of Psychological and Brain Sciences, Indiana University, Indiana, United States
- Network Science Institute (IUNI), Indiana University, Indiana, United States
| | - Santo Fortunato
- School of Informatics, Computing and Engineering, Indiana University, Indiana, United States
- Network Science Institute (IUNI), Indiana University, Indiana, United States
| |
Collapse
|