1
|
Liu XL, Zhao C. A converging reputation ranking iteration method via the eigenvector. PLoS One 2022; 17:e0274567. [PMID: 36190970 PMCID: PMC9529115 DOI: 10.1371/journal.pone.0274567] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2021] [Accepted: 08/30/2022] [Indexed: 11/07/2022] Open
Abstract
Ranking user reputation and object quality in online rating systems is of great significance for the construction of reputation systems. In this paper we put forward an iterative algorithm for ranking reputation and quality in terms of eigenvector, named EigenRank algorithm, where the user reputation and object quality interact and the user reputation converges to the eigenvector associated to the greatest eigenvalue of a certain matrix. In addition, we prove the convergence of EigenRank algorithm, and analyse the speed of convergence. Meanwhile, the experimental results for the synthetic networks show that the AUC values and Kendall’s τ of the EigenRank algorithm are greater than the ones from the IBeta method and Vote Aggregation method with different proportions of random/malicious ratings. The results for the empirical networks show that the EigenRank algorithm performs better in accuracy and robustness compared to the IBeta method and Vote Aggregation method in the random and malicious rating attack cases. This work provides an expectable ranking algorithm for the online user reputation identification.
Collapse
Affiliation(s)
- Xiao-Lu Liu
- School of Management Science and Engineering, Shandong University of Finance and Economics, Jinan, PR China
- * E-mail: (XLL); (CZ)
| | - Chong Zhao
- School of Mathematics, Shandong University, Jinan, PR China
- * E-mail: (XLL); (CZ)
| |
Collapse
|
2
|
Zhao M, Deng Q, Wang K, Wu R, Tao J, Fan C, Chen L, Cui P. Bilateral Filtering Graph Convolutional Network for Multi-relational Social Recommendation in the Power-law Networks. ACM T INFORM SYST 2022. [DOI: 10.1145/3469799] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/01/2022]
Abstract
In recent years, advances in Graph Convolutional Networks (GCNs) have given new insights into the development of social recommendation. However, many existing GCN-based social recommendation methods often directly apply GCN to capture user-item and user-user interactions, which probably have two main limitations: (a) Due to the power-law property of the degree distribution, the vanilla GCN with static normalized adjacency matrix has limitations in learning node representations, especially for the long-tail nodes; (b) multi-typed social relationships between users that are ubiquitous in the real world are rarely considered. In this article, we propose a novel Bilateral Filtering Heterogeneous Attention Network (BFHAN), which improves long-tail node representations and leverages multi-typed social relationships between user nodes. First, we propose a novel graph convolutional filter for the user-item bipartite network and extend it to the user-user homogeneous network. Further, we theoretically analyze the correlation between the convergence values of different graph convolutional filters and node degrees after stacking multiple layers. Second, we model multi-relational social interactions between users as the multiplex network and further propose a multiplex attention network to capture distinctive inter-layer influences for user representations. Last but not least, the experimental results demonstrate that our proposed method outperforms several state-of-the-art GCN-based methods for social recommendation tasks.
Collapse
|
3
|
Motivating participation in crowdsourcing contests: The role of instruction-writing strategy. INFORMATION & MANAGEMENT 2022. [DOI: 10.1016/j.im.2022.103616] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
4
|
Yin X, Wang H, Yin P, Zhu H, Zhang Z. A co-occurrence based approach of automatic keyword expansion using mass diffusion. Scientometrics 2020. [DOI: 10.1007/s11192-020-03601-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
5
|
Zhu X, Wang X, Zhao H, Pei T, Kuang L, Wang L. BHCMDA: A New Biased Heat Conduction Based Method for Potential MiRNA-Disease Association Prediction. Front Genet 2020; 11:384. [PMID: 32425979 PMCID: PMC7212362 DOI: 10.3389/fgene.2020.00384] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2020] [Accepted: 03/27/2020] [Indexed: 01/04/2023] Open
Abstract
Recent studies have indicated that microRNAs (miRNAs) are closely related to sundry human sophisticated diseases. According to the surmise that functionally similar miRNAs are more likely associated with phenotypically similar diseases, researchers have proposed a variety of valid computational models through integrating known miRNA-disease associations, disease semantic similarity, miRNA functional similarity, and Gaussian interaction profile kernel similarity to discover the potential miRNA-disease relationships in biomedical researches. Taking account of the limitations of previous computational models, a new computational model based on biased heat conduction for MiRNA-Disease Association prediction (BHCMDA) was proposed in this paper, which can achieve the AUC of 0.8890 in LOOCV (Leave-One-Out Cross Validation) and the mean AUC of 0.9060, 0.8931 under the framework of twofold cross validation, fivefold cross validation, respectively. In addition, BHCMDA was further implemented to the case studies of three vital human cancers, and simulation results illustrated that there were 88% (Esophageal Neoplasms), 92% (Colonic Neoplasms) and 92% (Lymphoma) out of top 50 predicted miRNAs having been confirmed by experimental literatures, separately, which demonstrated the good performance of BHCMDA as well. Thence, BHCMDA would be a useful calculative resource for potential miRNA-disease association prediction.
Collapse
Affiliation(s)
- Xianyou Zhu
- College of Computer Science and Technology, Hengyang Normal University, Hengyang, China
| | - Xuzai Wang
- Key Laboratory of Hunan Province for Internet of Things and Information Security, Xiangtan University, Xiangtan, China
| | - Haochen Zhao
- Key Laboratory of Hunan Province for Internet of Things and Information Security, Xiangtan University, Xiangtan, China
| | - Tingrui Pei
- Key Laboratory of Hunan Province for Internet of Things and Information Security, Xiangtan University, Xiangtan, China
| | - Linai Kuang
- College of Computer Science and Technology, Hengyang Normal University, Hengyang, China.,Key Laboratory of Hunan Province for Internet of Things and Information Security, Xiangtan University, Xiangtan, China
| | - Lei Wang
- Key Laboratory of Hunan Province for Internet of Things and Information Security, Xiangtan University, Xiangtan, China.,College of Computer Engineering & Applied Mathematics, Changsha University, Changsha, China
| |
Collapse
|
6
|
|
7
|
Zhu X, Yang Y, Chen G, Medo M, Tian H, Cai SM. Information filtering based on corrected redundancy-eliminating mass diffusion. PLoS One 2017; 12:e0181402. [PMID: 28749976 PMCID: PMC5531469 DOI: 10.1371/journal.pone.0181402] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2017] [Accepted: 07/02/2017] [Indexed: 11/18/2022] Open
Abstract
Methods used in information filtering and recommendation often rely on quantifying the similarity between objects or users. The used similarity metrics often suffer from similarity redundancies arising from correlations between objects' attributes. Based on an unweighted undirected object-user bipartite network, we propose a Corrected Redundancy-Eliminating similarity index (CRE) which is based on a spreading process on the network. Extensive experiments on three benchmark data sets-Movilens, Netflix and Amazon-show that when used in recommendation, the CRE yields significant improvements in terms of recommendation accuracy and diversity. A detailed analysis is presented to unveil the origins of the observed differences between the CRE and mainstream similarity indices.
Collapse
Affiliation(s)
- Xuzhen Zhu
- State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing, 100876, China
| | - Yujie Yang
- State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing, 100876, China
| | - Guilin Chen
- State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing, 100876, China
| | - Matus Medo
- Department of Physics, University of Fribourg, Chemin du Musée 3, CH-1700 Fribourg, Switzerland
| | - Hui Tian
- State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing, 100876, China
| | - Shi-Min Cai
- Web Sciences Center, School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, 611731, P.R.China
- Big Data Research Center, University of Electronic Science and Technology of China, Chengdu, 611731, P.R.China
- * E-mail:
| |
Collapse
|
8
|
Liu JG, Liu XL, Guo Q, Han JT. Identifying the perceptive users for online social systems. PLoS One 2017; 12:e0178118. [PMID: 28704382 PMCID: PMC5509131 DOI: 10.1371/journal.pone.0178118] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2017] [Accepted: 05/07/2017] [Indexed: 11/18/2022] Open
Abstract
In this paper, the perceptive user, who could identify the high-quality objects in their initial lifespan, is presented. By tracking the ratings given to the rewarded objects, we present a method to identify the user perceptibility, which is defined as the capability that a user can identify these objects at their early lifespan. Moreover, we investigate the behavior patterns of the perceptive users from three dimensions: User activity, correlation characteristics of user rating series and user reputation. The experimental results for the empirical networks indicate that high perceptibility users show significantly different behavior patterns with the others: Having larger degree, stronger correlation of rating series and higher reputation. Furthermore, in view of the hysteresis in finding the rewarded objects, we present a general framework for identifying the high perceptibility users based on user behavior patterns. The experimental results show that this work is helpful for deeply understanding the collective behavior patterns for online users.
Collapse
Affiliation(s)
- Jian-Guo Liu
- Data Science and Cloud Service Research Centre, Shanghai University of Finance and Economics, Shanghai 200433, PR China
- Department of Physics, Fribourg University, CH-1700 Fribourg, Switzerland
- * E-mail:
| | - Xiao-Lu Liu
- Research Center of Complex Systems Science, University of Shanghai for Science and Technology, Shanghai 200093, PR China
| | - Qiang Guo
- Research Center of Complex Systems Science, University of Shanghai for Science and Technology, Shanghai 200093, PR China
| | - Jing-Ti Han
- Data Science and Cloud Service Research Centre, Shanghai University of Finance and Economics, Shanghai 200433, PR China
| |
Collapse
|
9
|
Liu JG, Hou L, Pan X, Guo Q, Zhou T. Stability of similarity measurements for bipartite networks. Sci Rep 2016; 6:18653. [PMID: 26725688 PMCID: PMC4698667 DOI: 10.1038/srep18653] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2015] [Accepted: 11/23/2015] [Indexed: 11/25/2022] Open
Abstract
Similarity is a fundamental measure in network analyses and machine learning algorithms, with wide applications ranging from personalized recommendation to socio-economic dynamics. We argue that an effective similarity measurement should guarantee the stability even under some information loss. With six bipartite networks, we investigate the stabilities of fifteen similarity measurements by comparing the similarity matrixes of two data samples which are randomly divided from original data sets. Results show that, the fifteen measurements can be well classified into three clusters according to their stabilities, and measurements in the same cluster have similar mathematical definitions. In addition, we develop a top-n-stability method for personalized recommendation, and find that the unstable similarities would recommend false information to users, and the performance of recommendation would be largely improved by using stable similarity measurements. This work provides a novel dimension to analyze and evaluate similarity measurements, which can further find applications in link prediction, personalized recommendation, clustering algorithms, community detection and so on.
Collapse
Affiliation(s)
- Jian-Guo Liu
- Data Science and Cloud Service Research Centre, Shanghai University of Finance and Economics, Shanghai 200433, PR China.,Research Center of Complex Systems Science, University of Shanghai for Science and Technology, Shanghai 200093, PR China
| | - Lei Hou
- Research Center of Complex Systems Science, University of Shanghai for Science and Technology, Shanghai 200093, PR China.,Informatics Research Center, Henley Business School, University of Reading, Whiteknights, RG6 6UD, United Kingdom
| | - Xue Pan
- Research Center of Complex Systems Science, University of Shanghai for Science and Technology, Shanghai 200093, PR China.,Informatics Research Center, Henley Business School, University of Reading, Whiteknights, RG6 6UD, United Kingdom
| | - Qiang Guo
- Research Center of Complex Systems Science, University of Shanghai for Science and Technology, Shanghai 200093, PR China
| | - Tao Zhou
- Web Sciences Center, University of Electronic Science and Technology of China, Chengdu 610054, PR China
| |
Collapse
|
10
|
Zhang FG, Zeng A. Information Filtering via Heterogeneous Diffusion in Online Bipartite Networks. PLoS One 2015; 10:e0129459. [PMID: 26125631 PMCID: PMC4488376 DOI: 10.1371/journal.pone.0129459] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2014] [Accepted: 05/10/2015] [Indexed: 11/18/2022] Open
Abstract
The rapid expansion of Internet brings us overwhelming online information, which is impossible for an individual to go through all of it. Therefore, recommender systems were created to help people dig through this abundance of information. In networks composed by users and objects, recommender algorithms based on diffusion have been proven to be one of the best performing methods. Previous works considered the diffusion process from user to object, and from object to user to be equivalent. We show in this work that it is not the case and we improve the quality of the recommendation by taking into account the asymmetrical nature of this process. We apply this idea to modify the state-of-the-art recommendation methods. The simulation results show that the new methods can outperform these existing methods in both recommendation accuracy and diversity. Finally, this modification is checked to be able to improve the recommendation in a realistic case.
Collapse
Affiliation(s)
- Fu-Guo Zhang
- School of Information Technology, Jiangxi University of Finance and Economics, Nanchang 330013, P.R. China
- Jiangxi Key Laboratory of Data and Knowledge Engineering, Jiangxi University of Finance and Economics, Nanchang, 330013, P. R. China
| | - An Zeng
- School of Systems Science, Beijing Normal University, Beijing, 100875, P. R. China
- * E-mail:
| |
Collapse
|
11
|
Abstract
As one of the major challenges, cold-start problem plagues nearly all recommender systems. In particular, new items will be overlooked, impeding the development of new products online. Given limited resources, how to utilize the knowledge of recommender systems and design efficient marketing strategy for new items is extremely important. In this paper, we convert this ticklish issue into a clear mathematical problem based on a bipartite network representation. Under the most widely used algorithm in real e-commerce recommender systems, the so-called item-based collaborative filtering, we show that to simply push new items to active users is not a good strategy. Interestingly, experiments on real recommender systems indicate that to connect new items with some less active users will statistically yield better performance, namely, these new items will have more chance to appear in other users' recommendation lists. Further analysis suggests that the disassortative nature of recommender systems contributes to such observation. In a word, getting in-depth understanding on recommender systems could pave the way for the owners to popularize their cold-start products with low costs.
Collapse
|
12
|
Similarity from multi-dimensional scaling: solving the accuracy and diversity dilemma in information filtering. PLoS One 2014; 9:e111005. [PMID: 25343243 PMCID: PMC4208813 DOI: 10.1371/journal.pone.0111005] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2014] [Accepted: 09/19/2014] [Indexed: 11/22/2022] Open
Abstract
Recommender systems are designed to assist individual users to navigate through the rapidly growing amount of information. One of the most successful recommendation techniques is the collaborative filtering, which has been extensively investigated and has already found wide applications in e-commerce. One of challenges in this algorithm is how to accurately quantify the similarities of user pairs and item pairs. In this paper, we employ the multidimensional scaling (MDS) method to measure the similarities between nodes in user-item bipartite networks. The MDS method can extract the essential similarity information from the networks by smoothing out noise, which provides a graphical display of the structure of the networks. With the similarity measured from MDS, we find that the item-based collaborative filtering algorithm can outperform the diffusion-based recommendation algorithms. Moreover, we show that this method tends to recommend unpopular items and increase the global diversification of the networks in long term.
Collapse
|
13
|
Zeng W, Zeng A, Liu H, Shang MS, Zhou T. Uncovering the information core in recommender systems. Sci Rep 2014; 4:6140. [PMID: 25142186 PMCID: PMC4139954 DOI: 10.1038/srep06140] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2014] [Accepted: 07/17/2014] [Indexed: 11/24/2022] Open
Abstract
With the rapid growth of the Internet and overwhelming amount of information that people are confronted with, recommender systems have been developed to effectively support users' decision-making process in online systems. So far, much attention has been paid to designing new recommendation algorithms and improving existent ones. However, few works considered the different contributions from different users to the performance of a recommender system. Such studies can help us improve the recommendation efficiency by excluding irrelevant users. In this paper, we argue that in each online system there exists a group of core users who carry most of the information for recommendation. With them, the recommender systems can already generate satisfactory recommendation. Our core user extraction method enables the recommender systems to achieve 90% of the accuracy of the top-L recommendation by taking only 20% of the users into account. A detailed investigation reveals that these core users are not necessarily the large-degree users. Moreover, they tend to select high quality objects and their selections are well diversified.
Collapse
Affiliation(s)
- Wei Zeng
- 1] Web Sciences Center, University of Electronic Science and Technology of China, Chengdu 611731, P.R. China [2] State Key Laboratory of Networking and Switching Technology, Beijing 100876, P.R. China
| | - An Zeng
- 1] Department of Physics, University of Fribourg, Fribourg CH1700, Switzerland [2] School of Systems Science, Beijing Normal University, Beijing 100875, PR China
| | - Hao Liu
- Department of Physics, University of Fribourg, Fribourg CH1700, Switzerland
| | - Ming-Sheng Shang
- 1] Web Sciences Center, University of Electronic Science and Technology of China, Chengdu 611731, P.R. China [2] Department of Physics, University of Fribourg, Fribourg CH1700, Switzerland
| | - Tao Zhou
- Web Sciences Center, University of Electronic Science and Technology of China, Chengdu 611731, P.R. China
| |
Collapse
|
14
|
Nie DC, Zhang ZK, Zhou JL, Fu Y, Zhang K. Information filtering on coupled social networks. PLoS One 2014; 9:e101675. [PMID: 25003525 PMCID: PMC4086959 DOI: 10.1371/journal.pone.0101675] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2014] [Accepted: 06/10/2014] [Indexed: 11/29/2022] Open
Abstract
In this paper, based on the coupled social networks (CSN), we propose a hybrid algorithm to nonlinearly integrate both social and behavior information of online users. Filtering algorithm, based on the coupled social networks, considers the effects of both social similarity and personalized preference. Experimental results based on two real datasets, Epinions and Friendfeed, show that the hybrid pattern can not only provide more accurate recommendations, but also enlarge the recommendation coverage while adopting global metric. Further empirical analyses demonstrate that the mutual reinforcement and rich-club phenomenon can also be found in coupled social networks where the identical individuals occupy the core position of the online system. This work may shed some light on the in-depth understanding of the structure and function of coupled social networks.
Collapse
Affiliation(s)
- Da-Cheng Nie
- Web Sciences Center, School of Computer Science & Engineering, University of Electronic Science and Technology of China, Chengdu, People's Republic of China
| | - Zi-Ke Zhang
- College of Communication Engineering, Chongqing University, Chongqing, People's Republic of China
- Alibaba Research Center for Complexity Sciences, Hangzhou Normal University, Hangzhou, People's Republic of China
- Alibaba Research Institute, Hangzhou, People's Republic of China
- * E-mail:
| | - Jun-Lin Zhou
- Web Sciences Center, School of Computer Science & Engineering, University of Electronic Science and Technology of China, Chengdu, People's Republic of China
| | - Yan Fu
- Web Sciences Center, School of Computer Science & Engineering, University of Electronic Science and Technology of China, Chengdu, People's Republic of China
| | - Kui Zhang
- College of Communication Engineering, Chongqing University, Chongqing, People's Republic of China
| |
Collapse
|
15
|
Liu JH, Zhang ZK, Chen L, Liu C, Yang C, Wang X. Gravity effects on information filtering and network evolving. PLoS One 2014; 9:e91070. [PMID: 24622162 PMCID: PMC3951341 DOI: 10.1371/journal.pone.0091070] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2013] [Accepted: 02/08/2014] [Indexed: 11/18/2022] Open
Abstract
In this paper, based on the gravity principle of classical physics, we propose a tunable gravity-based model, which considers tag usage pattern to weigh both the mass and distance of network nodes. We then apply this model in solving the problems of information filtering and network evolving. Experimental results on two real-world data sets, Del.icio.us and MovieLens, show that it can not only enhance the algorithmic performance, but can also better characterize the properties of real networks. This work may shed some light on the in-depth understanding of the effect of gravity model.
Collapse
Affiliation(s)
- Jin-Hu Liu
- Web Sciences Center, University of Electronic Science and Technology of China, Chengdu, People's Republic of China
- Institute of Information Economy, Hangzhou Normal University, Hangzhou, People's Republic of China
- Alibaba Research Center for Complexity Sciences, Hangzhou Normal University, Hangzhou, People's Republic of China
| | - Zi-Ke Zhang
- Institute of Information Economy, Hangzhou Normal University, Hangzhou, People's Republic of China
- Alibaba Research Center for Complexity Sciences, Hangzhou Normal University, Hangzhou, People's Republic of China
- * E-mail: (ZKZ); (XQW)
| | - Lingjiao Chen
- Web Sciences Center, University of Electronic Science and Technology of China, Chengdu, People's Republic of China
- Institute of Information Economy, Hangzhou Normal University, Hangzhou, People's Republic of China
- Alibaba Research Center for Complexity Sciences, Hangzhou Normal University, Hangzhou, People's Republic of China
| | - Chuang Liu
- Institute of Information Economy, Hangzhou Normal University, Hangzhou, People's Republic of China
- Alibaba Research Center for Complexity Sciences, Hangzhou Normal University, Hangzhou, People's Republic of China
| | - Chengcheng Yang
- Web Sciences Center, University of Electronic Science and Technology of China, Chengdu, People's Republic of China
| | - Xueqi Wang
- Division of Translational Medicine, Shanghai Changzheng Hospital, Second Military Medical University, Shanghai, People's Republic of China
- * E-mail: (ZKZ); (XQW)
| |
Collapse
|
16
|
Zeng W, Zeng A, Shang MS, Zhang YC. Information filtering in sparse online systems: recommendation via semi-local diffusion. PLoS One 2013; 8:e79354. [PMID: 24260206 PMCID: PMC3832491 DOI: 10.1371/journal.pone.0079354] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2013] [Accepted: 09/28/2013] [Indexed: 11/18/2022] Open
Abstract
With the rapid growth of the Internet and overwhelming amount of information and choices that people are confronted with, recommender systems have been developed to effectively support users' decision-making process in the online systems. However, many recommendation algorithms suffer from the data sparsity problem, i.e. the user-object bipartite networks are so sparse that algorithms cannot accurately recommend objects for users. This data sparsity problem makes many well-known recommendation algorithms perform poorly. To solve the problem, we propose a recommendation algorithm based on the semi-local diffusion process on the user-object bipartite network. The simulation results on two sparse datasets, Amazon and Bookcross, show that our method significantly outperforms the state-of-the-art methods especially for those small-degree users. Two personalized semi-local diffusion methods are proposed which further improve the recommendation accuracy. Finally, our work indicates that sparse online systems are essentially different from the dense online systems, so it is necessary to reexamine former algorithms and conclusions based on dense data in sparse systems.
Collapse
Affiliation(s)
- Wei Zeng
- Web Sciences Center, University of Electronic Science and Technology of China, Chengdu, People’s Republic of China
- Department of Physics, University of Fribourg, Fribourg, Switzerland
| | - An Zeng
- Department of Physics, University of Fribourg, Fribourg, Switzerland
- * E-mail: (M-SS); (AZ)
| | - Ming-Sheng Shang
- Web Sciences Center, University of Electronic Science and Technology of China, Chengdu, People’s Republic of China
- Institute of Information Economy, Hangzhou Normal University, Hangzhou, People’s Republic of China
- * E-mail: (M-SS); (AZ)
| | - Yi-Cheng Zhang
- Web Sciences Center, University of Electronic Science and Technology of China, Chengdu, People’s Republic of China
- Department of Physics, University of Fribourg, Fribourg, Switzerland
- Institute of Information Economy, Hangzhou Normal University, Hangzhou, People’s Republic of China
| |
Collapse
|
17
|
Zhou Y, Lü L, Liu W, Zhang J. The power of ground user in recommender systems. PLoS One 2013; 8:e70094. [PMID: 23936380 PMCID: PMC3732260 DOI: 10.1371/journal.pone.0070094] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2013] [Accepted: 06/19/2013] [Indexed: 11/19/2022] Open
Abstract
Accuracy and diversity are two important aspects to evaluate the performance of recommender systems. Two diffusion-based methods were proposed respectively inspired by the mass diffusion (MD) and heat conduction (HC) processes on networks. It has been pointed out that MD has high recommendation accuracy yet low diversity, while HC succeeds in seeking out novel or niche items but with relatively low accuracy. The accuracy-diversity dilemma is a long-term challenge in recommender systems. To solve this problem, we introduced a background temperature by adding a ground user who connects to all the items in the user-item bipartite network. Performing the HC algorithm on the network with ground user (GHC), it showed that the accuracy can be largely improved while keeping the diversity. Furthermore, we proposed a weighted form of the ground user (WGHC) by assigning some weights to the newly added links between the ground user and the items. By turning the weight as a free parameter, an optimal value subject to the highest accuracy is obtained. Experimental results on three benchmark data sets showed that the WGHC outperforms the state-of-the-art method MD for both accuracy and diversity.
Collapse
Affiliation(s)
- Yanbo Zhou
- Institute of Information Economy, Alibaba Business College, Hangzhou Normal University, Hangzhou, People’s Republic of China
- Department of Physics, University of Fribourg, Fribourg, Switzerland
| | - Linyuan Lü
- Institute of Information Economy, Alibaba Business College, Hangzhou Normal University, Hangzhou, People’s Republic of China
- Department of Physics, University of Fribourg, Fribourg, Switzerland
- * E-mail:
| | - Weiping Liu
- Department of Physics, University of Fribourg, Fribourg, Switzerland
| | - Jianlin Zhang
- Institute of Information Economy, Alibaba Business College, Hangzhou Normal University, Hangzhou, People’s Republic of China
- Department of Physics, University of Fribourg, Fribourg, Switzerland
| |
Collapse
|
18
|
Qiu T, Zhang ZK, Chen G. Information filtering via a scaling-based function. PLoS One 2013; 8:e63531. [PMID: 23696829 PMCID: PMC3656959 DOI: 10.1371/journal.pone.0063531] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2012] [Accepted: 04/02/2013] [Indexed: 11/19/2022] Open
Abstract
Finding a universal description of the algorithm optimization is one of the key challenges in personalized recommendation. In this article, for the first time, we introduce a scaling-based algorithm (SCL) independent of recommendation list length based on a hybrid algorithm of heat conduction and mass diffusion, by finding out the scaling function for the tunable parameter and object average degree. The optimal value of the tunable parameter can be abstracted from the scaling function, which is heterogeneous for the individual object. Experimental results obtained from three real datasets, Netflix, MovieLens and RYM, show that the SCL is highly accurate in recommendation. More importantly, compared with a number of excellent algorithms, including the mass diffusion method, the original hybrid method, and even an improved version of the hybrid method, the SCL algorithm remarkably promotes the personalized recommendation in three other aspects: solving the accuracy-diversity dilemma, presenting a high novelty, and solving the key challenge of cold start problem.
Collapse
Affiliation(s)
- Tian Qiu
- School of Information Engineering, Nanchang Hangkong University, Nanchang, P. R. China.
| | | | | |
Collapse
|
19
|
Liu JG, Shi K, Guo Q. Solving the accuracy-diversity dilemma via directed random walks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2012; 85:016118. [PMID: 22400636 DOI: 10.1103/physreve.85.016118] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/18/2011] [Revised: 12/24/2011] [Indexed: 05/31/2023]
Abstract
Random walks have been successfully used to measure user or object similarities in collaborative filtering (CF) recommender systems, which is of high accuracy but low diversity. A key challenge of a CF system is that the reliably accurate results are obtained with the help of peers' recommendation, but the most useful individual recommendations are hard to be found among diverse niche objects. In this paper we investigate the direction effect of the random walk on user similarity measurements and find that the user similarity, calculated by directed random walks, is reverse to the initial node's degree. Since the ratio of small-degree users to large-degree users is very large in real data sets, the large-degree users' selections are recommended extensively by traditional CF algorithms. By tuning the user similarity direction from neighbors to the target user, we introduce a new algorithm specifically to address the challenge of diversity of CF and show how it can be used to solve the accuracy-diversity dilemma. Without relying on any context-specific information, we are able to obtain accurate and diverse recommendations, which outperforms the state-of-the-art CF methods. This work suggests that the random-walk direction is an important factor to improve the personalized recommendation performance.
Collapse
Affiliation(s)
- Jian-Guo Liu
- Research Center of Complex Systems Science, University of Shanghai for Science and Technology, Shanghai 200093, People's Republic of China.
| | | | | |
Collapse
|