1
|
Li N, Jin D, Wei J, Huang Y, Xu J. Functional brain abnormalities in major depressive disorder using a multiscale community detection approach. Neuroscience 2022; 501:1-10. [PMID: 35964834 DOI: 10.1016/j.neuroscience.2022.08.007] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Revised: 08/04/2022] [Accepted: 08/05/2022] [Indexed: 11/28/2022]
Abstract
Major depressive disorder (MDD) is a serious disease associated with abnormal brain regions, however, the interconnection between specific brain regions related to depression has not been fully explored. To solve this problem, the paper proposes a novel multiscale community detection method to compare the differences in brain regions between normal controls (NC) and MDD patients. This study adopted the Brainnetome Atlas to divide the brain into 246 regions and extract the time series of each region. The Pearson correlation was used to measure the similarity among different brain regions to conduct the brain functional network and to perform multiscale community detection. The optimal brain community structure of each group was further explored based on the modularized Qcut algorithm, normalized mutual information (NMI), and variation of information (VI). The Jaccard index was then applied to compare the abnormalities of each brain region from different community environments between the brain function networks of NC and MDD patients. The experiments revealed several abnormal brain regions between NC and MDD, including the superior frontal gyrus, middle frontal gyrus, inferior frontal gyrus, orbital gyrus, superior temporal gyrus, middle temporal gyrus, inferior temporal gyrus, posterior superior temporal sulcus, inferior parietal gyrus, precuneus, postcentral gyrus, insular gyrus, cingulate gyrus, hippocampus and basal ganglia. Finally, a new subnetwork related to cognitive function was discovered, which was composed of the island gyrus and inferior frontal gyrus. All experiments indicated that the proposed method is useful in detecting functional brain abnormalities in MDD, and it can provide valuable insights into the diagnosis and treatment of MDD.
Collapse
Affiliation(s)
- Na Li
- Tianjin Key Lab of Cognitive Computing and Application, College of Intelligence and Computing, Tianjin University, Tianjin, China
| | - Di Jin
- Tianjin Key Lab of Cognitive Computing and Application, College of Intelligence and Computing, Tianjin University, Tianjin, China
| | - Jianguo Wei
- Tianjin Key Lab of Cognitive Computing and Application, College of Intelligence and Computing, Tianjin University, Tianjin, China
| | - Yuxiao Huang
- Columbian College of Arts & Sciences, George Washington University, Washington D.C., USA
| | - Junhai Xu
- Tianjin Key Lab of Cognitive Computing and Application, College of Intelligence and Computing, Tianjin University, Tianjin, China.
| |
Collapse
|
2
|
Jin D, Li B, Jiao P, He D, Shan H, Zhang W. Modeling with Node Popularities for Autonomous Overlapping Community Detection. ACM T INTEL SYST TEC 2020. [DOI: 10.1145/3373760] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Abstract
Overlapping community detection has triggered recent research in network analysis. One of the promising techniques for finding overlapping communities is the popular stochastic models, which, unfortunately, have some common drawbacks. They do not support an important observation that highly connected nodes are more likely to reside in the overlapping regions of communities in the network. These methods are in essence not truly unsupervised, since they require a threshold on probabilistic memberships to derive overlapping structures and need the number of communities to be specified
a priori
. We develop a new method to address these issues for overlapping community detection. We first present a stochastic model to accommodate the relative importance and the expected degree of every node in each community. We then infer every overlapping community by ranking the nodes according to their importance. Second, we determine the number of communities under the Bayesian framework. We evaluate our method and compare it with five state-of-the-art methods. The results demonstrate the superior performance of our method. We also apply this new method to two applications, showing its superb performance on practical problems.
Collapse
Affiliation(s)
- Di Jin
- College of Intelligence and Computing, Tianjin University, China
| | - Bingyi Li
- College of Intelligence and Computing, Tianjin University, China
| | - Pengfei Jiao
- College of Intelligence and Computing, Center of Biosafety Research and Strategy, Tianjin University, China
| | - Dongxiao He
- College of Intelligence and Computing, Tianjin University, China
| | - Hongyu Shan
- College of Intelligence and Computing, Tianjin University, China
| | - Weixiong Zhang
- Department of Computer Science and Engineering, Washington University in St. Louis, St. Louis, Missouri
| |
Collapse
|
3
|
|
4
|
A no self-edge stochastic block model and a heuristic algorithm for balanced anti-community detection in networks. Inf Sci (N Y) 2020. [DOI: 10.1016/j.ins.2020.01.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
5
|
|
6
|
Chen Y, Wang X, Tang B. Structural regularity exploration in multidimensional networks via Bayesian inference. Neural Comput Appl 2018. [DOI: 10.1007/s00521-017-3041-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
7
|
A degree-based block model and a local expansion optimization algorithm for anti-community detection in networks. PLoS One 2018; 13:e0195226. [PMID: 29668688 PMCID: PMC5906029 DOI: 10.1371/journal.pone.0195226] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2017] [Accepted: 02/24/2018] [Indexed: 11/19/2022] Open
Abstract
Anti-community detection in networks can discover negative relations among objects. However, a few researches pay attention to detecting anti-community structure and they do not consider the node degree and most of them require high computational cost. Block models are promising methods for exploring modular regularities, but their results are highly dependent on the observed structure. In this paper, we first propose a Degree-based Block Model (DBM) for anti-community structure. DBM takes the node degree into consideration and evolves a new objective function Q(C) for evaluation. And then, a Local Expansion Optimization Algorithm (LEOA), which preferentially considers the nodes with high degree, is proposed for anti-community detection. LEOA consists of three stages: structural center detection, local anti-community expansion and group membership adjustment. Based on the formulation of DBM, we develop a synthetic benchmark DBM-Net for evaluating comparison algorithms in detecting known anti-community structures. Experiments on DBM-Net with up to 100000 nodes and 17 real-world networks demonstrate the effectiveness and efficiency of LEOA for anti-community detection in networks.
Collapse
|
8
|
A parameter selection method of the deterministic anti-annealing algorithm for network exploring. Neurocomputing 2017. [DOI: 10.1016/j.neucom.2016.11.050] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
9
|
Jiang JQ. Stochastic block model and exploratory analysis in signed networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2015; 91:062805. [PMID: 26172752 DOI: 10.1103/physreve.91.062805] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/02/2015] [Indexed: 06/04/2023]
Abstract
We propose a generalized stochastic block model to explore the mesoscopic structures in signed networks by grouping vertices that exhibit similar positive and negative connection profiles into the same cluster. In this model, the group memberships are viewed as hidden or unobserved quantities, and the connection patterns between groups are explicitly characterized by two block matrices, one for positive links and the other for negative links. By fitting the model to the observed network, we can not only extract various structural patterns existing in the network without prior knowledge, but also recognize what specific structures we obtained. Furthermore, the model parameters provide vital clues about the probabilities that each vertex belongs to different groups and the centrality of each vertex in its corresponding group. This information sheds light on the discovery of the networks' overlapping structures and the identification of two types of important vertices, which serve as the cores of each group and the bridges between different groups, respectively. Experiments on a series of synthetic and real-life networks show the effectiveness as well as the superiority of our model.
Collapse
Affiliation(s)
- Jonathan Q Jiang
- Department of Computer Science, City University of Hong Kong, 83 Tat Chee Avenue, Kowloon, Hong Kong, China
| |
Collapse
|
10
|
Cao X, Wang X, Jin D, Guo X, Tang X. A stochastic model for detecting overlapping and hierarchical community structure. PLoS One 2015; 10:e0119171. [PMID: 25822148 PMCID: PMC4379187 DOI: 10.1371/journal.pone.0119171] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2014] [Accepted: 01/19/2015] [Indexed: 12/01/2022] Open
Abstract
Community detection is a fundamental problem in the analysis of complex networks. Recently, many researchers have concentrated on the detection of overlapping communities, where a vertex may belong to more than one community. However, most current methods require the number (or the size) of the communities as a priori information, which is usually unavailable in real-world networks. Thus, a practical algorithm should not only find the overlapping community structure, but also automatically determine the number of communities. Furthermore, it is preferable if this method is able to reveal the hierarchical structure of networks as well. In this work, we firstly propose a generative model that employs a nonnegative matrix factorization (NMF) formulization with a l2,1 norm regularization term, balanced by a resolution parameter. The NMF has the nature that provides overlapping community structure by assigning soft membership variables to each vertex; the l2,1 regularization term is a technique of group sparsity which can automatically determine the number of communities by penalizing too many nonempty communities; and hence the resolution parameter enables us to explore the hierarchical structure of networks. Thereafter, we derive the multiplicative update rule to learn the model parameters, and offer the proof of its correctness. Finally, we test our approach on a variety of synthetic and real-world networks, and compare it with some state-of-the-art algorithms. The results validate the superior performance of our new method.
Collapse
Affiliation(s)
- Xiaochun Cao
- School of Computer Science and Technology, Tianjin University, Tianjin 300072, China
- State Key Laboratory of Information Security, Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100093, China
| | - Xiao Wang
- School of Computer Science and Technology, Tianjin University, Tianjin 300072, China
- * E-mail:
| | - Di Jin
- School of Computer Science and Technology, Tianjin University, Tianjin 300072, China
| | - Xiaojie Guo
- State Key Laboratory of Information Security, Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100093, China
| | - Xianchao Tang
- School of Computer Science and Technology, Tianjin University, Tianjin 300072, China
| |
Collapse
|
11
|
He D, Jin D, Chen Z, Zhang W. Identification of hybrid node and link communities in complex networks. Sci Rep 2015; 5:8638. [PMID: 25728010 PMCID: PMC4345336 DOI: 10.1038/srep08638] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2013] [Accepted: 01/27/2015] [Indexed: 12/03/2022] Open
Abstract
Identifying communities in complex networks is an effective means for analyzing complex systems, with applications in diverse areas such as social science, engineering, biology and medicine. Finding communities of nodes and finding communities of links are two popular schemes for network analysis. These schemes, however, have inherent drawbacks and are inadequate to capture complex organizational structures in real networks. We introduce a new scheme and an effective approach for identifying complex mixture structures of node and link communities, called hybrid node-link communities. A central piece of our approach is a probabilistic model that accommodates node, link and hybrid node-link communities. Our extensive experiments on various real-world networks, including a large protein-protein interaction network and a large network of semantically associated words, illustrated that the scheme for hybrid communities is superior in revealing network characteristics. Moreover, the new approach outperformed the existing methods for finding node or link communities separately.
Collapse
Affiliation(s)
- Dongxiao He
- School of Computer Science and Technology, Tianjin University, Tianjin. 300072, P. R. China
| | - Di Jin
- School of Computer Science and Technology, Tianjin University, Tianjin. 300072, P. R. China
| | - Zheng Chen
- Department of Computer Science and Engineering, Washington University, St. Louis. MO 63130, USA
- Institute for Systems Biology, Jianghan University, Wuhan. Hubei 430056, P. R. China
| | - Weixiong Zhang
- Department of Computer Science and Engineering, Washington University, St. Louis. MO 63130, USA
- Institute for Systems Biology, Jianghan University, Wuhan. Hubei 430056, P. R. China
- Department of Genetics, Washington University, St. Louis. MO 63130, USA
| |
Collapse
|
12
|
Jin D, Gabrys B, Dang J. Combined node and link partitions method for finding overlapping communities in complex networks. Sci Rep 2015; 5:8600. [PMID: 25715829 PMCID: PMC4341207 DOI: 10.1038/srep08600] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2014] [Accepted: 01/28/2015] [Indexed: 11/09/2022] Open
Abstract
Community detection in complex networks is a fundamental data analysis task in various domains, and how to effectively find overlapping communities in real applications is still a challenge. In this work, we propose a new unified model and method for finding the best overlapping communities on the basis of the associated node and link partitions derived from the same framework. Specifically, we first describe a unified model that accommodates node and link communities (partitions) together, and then present a nonnegative matrix factorization method to learn the parameters of the model. Thereafter, we infer the overlapping communities based on the derived node and link communities, i.e., determine each overlapped community between the corresponding node and link community with a greedy optimization of a local community function conductance. Finally, we introduce a model selection method based on consensus clustering to determine the number of communities. We have evaluated our method on both synthetic and real-world networks with ground-truths, and compared it with seven state-of-the-art methods. The experimental results demonstrate the superior performance of our method over the competing ones in detecting overlapping communities for all analysed data sets. Improved performance is particularly pronounced in cases of more complicated networked community structures.
Collapse
Affiliation(s)
- Di Jin
- School of Computer Science and Technology, Tianjin University, Tianjin 300073, P. R. China
| | - Bogdan Gabrys
- Data Science Institute, Faculty of Science and Technology, Bournemouth University, Poole, Dorset BH12 5BB, UK
| | - Jianwu Dang
- 1] School of Computer Science and Technology, Tianjin University, Tianjin 300073, P. R. China [2] School of Information Science, Japan Advanced Institute of Science and Technology, Japan
| |
Collapse
|
13
|
Larremore DB, Clauset A, Jacobs AZ. Efficiently inferring community structure in bipartite networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2014; 90:012805. [PMID: 25122340 PMCID: PMC4137326 DOI: 10.1103/physreve.90.012805] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/12/2014] [Indexed: 05/23/2023]
Abstract
Bipartite networks are a common type of network data in which there are two types of vertices, and only vertices of different types can be connected. While bipartite networks exhibit community structure like their unipartite counterparts, existing approaches to bipartite community detection have drawbacks, including implicit parameter choices, loss of information through one-mode projections, and lack of interpretability. Here we solve the community detection problem for bipartite networks by formulating a bipartite stochastic block model, which explicitly includes vertex type information and may be trivially extended to k-partite networks. This bipartite stochastic block model yields a projection-free and statistically principled method for community detection that makes clear assumptions and parameter choices and yields interpretable results. We demonstrate this model's ability to efficiently and accurately find community structure in synthetic bipartite networks with known structure and in real-world bipartite networks with unknown structure, and we characterize its performance in practical contexts.
Collapse
Affiliation(s)
- Daniel B Larremore
- Center for Communicable Disease Dynamics, Harvard School of Public Health, Boston, Massachusetts 02115, USA and Department of Epidemiology, Harvard School of Public Health, Boston, Massachusetts 02115, USA
| | - Aaron Clauset
- Department of Computer Science, University of Colorado, Boulder, Colorado 80309, USA and Santa Fe Institute, Santa Fe, New Mexico 87501, USA and BioFrontiers Institute, University of Colorado, Boulder, Colorado 80303, USA
| | - Abigail Z Jacobs
- Department of Computer Science, University of Colorado, Boulder, Colorado 80309, USA
| |
Collapse
|
14
|
He D, Jin D, Baquero C, Liu D. Link community detection using generative model and nonnegative matrix factorization. PLoS One 2014; 9:e86899. [PMID: 24489803 PMCID: PMC3904957 DOI: 10.1371/journal.pone.0086899] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2013] [Accepted: 12/16/2013] [Indexed: 11/18/2022] Open
Abstract
Discovery of communities in complex networks is a fundamental data analysis problem with applications in various domains. While most of the existing approaches have focused on discovering communities of nodes, recent studies have shown the advantages and uses of link community discovery in networks. Generative models provide a promising class of techniques for the identification of modular structures in networks, but most generative models mainly focus on the detection of node communities rather than link communities. In this work, we propose a generative model, which is based on the importance of each node when forming links in each community, to describe the structure of link communities. We proceed to fit the model parameters by taking it as an optimization problem, and solve it using nonnegative matrix factorization. Thereafter, in order to automatically determine the number of communities, we extend the above method by introducing a strategy of iterative bipartition. This extended method not only finds the number of communities all by itself, but also obtains high efficiency, and thus it is more suitable to deal with large and unexplored real networks. We test this approach on both synthetic benchmarks and real-world networks including an application on a large biological network, and compare it with two highly related methods. Results demonstrate the superior performance of our approach over competing methods for the detection of link communities.
Collapse
Affiliation(s)
- Dongxiao He
- College of Computer Science and Technology, Jilin University, Changchun, China
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, China
| | - Di Jin
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, China
- School of Computer Science and Technology, Tianjin University, Tianjin, China
- School of Design, Engineering, and Computing, Bournemouth University, Poole, Dorset, United Kingdom
- * E-mail: (DJ); (DL)
| | - Carlos Baquero
- HASLab, INESC TEC and University of Minho, Braga, Portugal
| | - Dayou Liu
- College of Computer Science and Technology, Jilin University, Changchun, China
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, China
- * E-mail: (DJ); (DL)
| |
Collapse
|
15
|
Cao X, Wang X, Jin D, Cao Y, He D. Identifying overlapping communities as well as hubs and outliers via nonnegative matrix factorization. Sci Rep 2013; 3:2993. [PMID: 24129402 PMCID: PMC3797436 DOI: 10.1038/srep02993] [Citation(s) in RCA: 61] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2013] [Accepted: 09/27/2013] [Indexed: 12/02/2022] Open
Abstract
Community detection is important for understanding networks. Previous studies observed that communities are not necessarily disjoint and might overlap. It is also agreed that some outlier vertices participate in no community, and some hubs in a community might take more important roles than others. Each of these facts has been independently addressed in previous work. But there is no algorithm, to our knowledge, that can identify these three structures altogether. To overcome this limitation, we propose a novel model where vertices are measured by their centrality in communities, and define the identification of overlapping communities, hubs, and outliers as an optimization problem, calculated by nonnegative matrix factorization. We test this method on various real networks, and compare it with several competing algorithms. The experimental results not only demonstrate its ability of identifying overlapping communities, hubs, and outliers, but also validate its superior performance in terms of clustering quality.
Collapse
Affiliation(s)
- Xiaochun Cao
- 1] School of Computer Science and Technology, Tianjin University, Tianjin 300072, China [2] State Key Laboratory of Information Security, Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100093, China
| | | | | | | | | |
Collapse
|
16
|
Bao P, Shen HW, Chen W, Cheng XQ. Cumulative effect in information diffusion: empirical study on a microblogging network. PLoS One 2013; 8:e76027. [PMID: 24098422 PMCID: PMC3788071 DOI: 10.1371/journal.pone.0076027] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2013] [Accepted: 08/22/2013] [Indexed: 11/18/2022] Open
Abstract
Cumulative effect in social contagion underlies many studies on the spread of innovation, behavior, and influence. However, few large-scale empirical studies are conducted to validate the existence of cumulative effect in information diffusion on social networks. In this paper, using the population-scale dataset from the largest Chinese microblogging website, we conduct a comprehensive study on the cumulative effect in information diffusion. We base our study on the diffusion network of message, where nodes are the involved users and links characterize forwarding relationship among them. We find that multiple exposures to the same message indeed increase the possibility of forwarding it. However, additional exposures cannot further improve the chance of forwarding when the number of exposures crosses its peak at two. This finding questions the cumulative effect hypothesis in information diffusion. Furthermore, to clarify the forwarding preference among users, we investigate both structural motif in the diffusion network and temporal pattern in information diffusion process. Findings provide some insights for understanding the variation of message popularity and explain the characteristics of diffusion network.
Collapse
Affiliation(s)
- Peng Bao
- Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
| | - Hua-Wei Shen
- Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
- * E-mail:
| | - Wei Chen
- Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
| | - Xue-Qi Cheng
- Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
17
|
Fronczak P, Fronczak A, Bujok M. Exponential random graph models for networks with community structure. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2013; 88:032810. [PMID: 24125315 DOI: 10.1103/physreve.88.032810] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/17/2013] [Indexed: 06/02/2023]
Abstract
Although the community structure organization is an important characteristic of real-world networks, most of the traditional network models fail to reproduce the feature. Therefore, the models are useless as benchmark graphs for testing community detection algorithms. They are also inadequate to predict various properties of real networks. With this paper we intend to fill the gap. We develop an exponential random graph approach to networks with community structure. To this end we mainly built upon the idea of blockmodels. We consider both the classical blockmodel and its degree-corrected counterpart and study many of their properties analytically. We show that in the degree-corrected blockmodel, node degrees display an interesting scaling property, which is reminiscent of what is observed in real-world fractal networks. A short description of Monte Carlo simulations of the models is also given in the hope of being useful to others working in the field.
Collapse
Affiliation(s)
- Piotr Fronczak
- Faculty of Physics, Warsaw University of Technology, Koszykowa 75, PL-00-662 Warsaw, Poland
| | | | | |
Collapse
|
18
|
Chai BF, Yu J, Jia CY, Yang TB, Jiang YW. Combining a popularity-productivity stochastic block model with a discriminative-content model for general structure detection. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2013; 88:012807. [PMID: 23944518 DOI: 10.1103/physreve.88.012807] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/27/2012] [Revised: 04/17/2013] [Indexed: 06/02/2023]
Abstract
Latent community discovery that combines links and contents of a text-associated network has drawn more attention with the advance of social media. Most of the previous studies aim at detecting densely connected communities and are not able to identify general structures, e.g., bipartite structure. Several variants based on the stochastic block model are more flexible for exploring general structures by introducing link probabilities between communities. However, these variants cannot identify the degree distributions of real networks due to a lack of modeling of the differences among nodes, and they are not suitable for discovering communities in text-associated networks because they ignore the contents of nodes. In this paper, we propose a popularity-productivity stochastic block (PPSB) model by introducing two random variables, popularity and productivity, to model the differences among nodes in receiving links and producing links, respectively. This model has the flexibility of existing stochastic block models in discovering general community structures and inherits the richness of previous models that also exploit popularity and productivity in modeling the real scale-free networks with power law degree distributions. To incorporate the contents in text-associated networks, we propose a combined model which combines the PPSB model with a discriminative model that models the community memberships of nodes by their contents. We then develop expectation-maximization (EM) algorithms to infer the parameters in the two models. Experiments on synthetic and real networks have demonstrated that the proposed models can yield better performances than previous models, especially on networks with general structures.
Collapse
Affiliation(s)
- Bian-fang Chai
- Beijing Key Lab of Traffic Data Analysis and Mining, Beijing Jiaotong University, Beijing 100044, China
| | | | | | | | | |
Collapse
|
19
|
Sun XQ, Shen HW, Cheng XQ, Wang ZY. Degree-strength correlation reveals anomalous trading behavior. PLoS One 2012; 7:e45598. [PMID: 23082114 PMCID: PMC3474833 DOI: 10.1371/journal.pone.0045598] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2012] [Accepted: 08/23/2012] [Indexed: 11/19/2022] Open
Abstract
Manipulation is an important issue for both developed and emerging stock markets. Many efforts have been made to detect manipulation in stock markets. However, it is still an open problem to identify the fraudulent traders, especially when they collude with each other. In this paper, we focus on the problem of identifying the anomalous traders using the transaction data of eight manipulated stocks and forty-four non-manipulated stocks during a one-year period. By analyzing the trading networks of stocks, we find that the trading networks of manipulated stocks exhibit significantly higher degree-strength correlation than the trading networks of non-manipulated stocks and the randomized trading networks. We further propose a method to detect anomalous traders of manipulated stocks based on statistical significance analysis of degree-strength correlation. Experimental results demonstrate that our method is effective at distinguishing the manipulated stocks from non-manipulated ones. Our method outperforms the traditional weight-threshold method at identifying the anomalous traders in manipulated stocks. More importantly, our method is difficult to be fooled by colluded traders.
Collapse
Affiliation(s)
| | | | - Xue-Qi Cheng
- Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
- * E-mail:
| | | |
Collapse
|