1
|
Vaccario G, Xu S, Mariani MS, Medo M. The quest for an unbiased scientific impact indicator remains open. Proc Natl Acad Sci U S A 2024; 121:e2410021121. [PMID: 39348539 PMCID: PMC11474024 DOI: 10.1073/pnas.2410021121] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/02/2024] Open
Affiliation(s)
- Giacomo Vaccario
- Chair of Systems Design, Department of Management, Technology, and Economics, ETH Zürich, ZürichCH-8006, Switzerland
| | - Shuqi Xu
- Institute of Dataspace, Comprehensive National Science Center, Hefei230088, People’s Republic of China
| | - Manuel S. Mariani
- University Research Priority Program Social Networks, Department of Business Administration, University of Zurich, ZurichCH-8050, Switzerland
| | - Matúš Medo
- Department for BioMedical Research, Inselspital, Bern University Hospital, University of Bern, BernCH-3008, Switzerland
- Department of Radiation Oncology, Inselspital, Bern University Hospital, University of Bern, BernCH-3008, Switzerland
| |
Collapse
|
2
|
Liu X, Li X. The impact of patentees assessment based on the heterogeneous patent innovation network. Heliyon 2024; 10:e30317. [PMID: 38803966 PMCID: PMC11128836 DOI: 10.1016/j.heliyon.2024.e30317] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Revised: 04/22/2024] [Accepted: 04/23/2024] [Indexed: 05/29/2024] Open
Abstract
As a vital factor in technological innovation, patentee plays a significant role in the process of scientific and technological innovation, researching patentee has attracted the attention of experts and scholars. Previously, scholars have mainly quantified patent indicators or constructed homogeneous information networks to analyze patentees, but these methods cannot objectively measure the impact of patentees. Therefore, this study proposes a novel approach to assessing patentee impact based on a heterogeneous information network. The proposed method distinguishes the weight of different types of nodes using a weighted mechanism and extracts three types of fine-grained characteristics of network nodes. This approach results in the construction of a heterogeneous patent innovation network and the development of a new patentee impact assessment algorithm called CWAPN. Using Chinese green patents in the field of energy conservation and environmental protection as an example, experimental results show that the CWAPN algorithm can effectively assess the impact of patentees. Thereby identifying patentees who have made outstanding contributions to sustainable development in China.
Collapse
Affiliation(s)
- Xipeng Liu
- School of Management, Shanghai University, Shanghai, 200444, China
| | - Xinmiao Li
- School of Information Management and Engineering, Shanghai University of Finance and Economics, Shanghai, 200433, China
| |
Collapse
|
3
|
Jiang L, Raza A, El Ariss AB, Chen D, Danaher-Garcia N, Lee J, He S. Impact of medical technologies may be predicted using constructed graph bibliometrics. Sci Rep 2024; 14:2419. [PMID: 38287044 PMCID: PMC10824713 DOI: 10.1038/s41598-024-52233-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2023] [Accepted: 01/16/2024] [Indexed: 01/31/2024] Open
Abstract
Scientific research is driven by allocation of funding to different research projects based in part on the predicted scientific impact of the work. Data-driven algorithms can inform decision-making of scarce funding resources by identifying likely high-impact studies using bibliometrics. Compared to standardized citation-based metrics alone, we utilize a machine learning pipeline that analyzes high-dimensional relationships among a range of bibliometric features to improve the accuracy of predicting high-impact research. Random forest classification models were trained using 28 bibliometric features calculated from a dataset of 1,485,958 publications in medicine to retrospectively predict whether a publication would become high-impact. For each random forest model, the balanced accuracy score was above 0.95 and the area under the receiver operating characteristic curve was above 0.99. The high performance of high impact research prediction using our proposed models show that machine learning technologies are promising algorithms that can support funding decision-making for medical research.
Collapse
Affiliation(s)
| | | | | | - David Chen
- Temerty Faculty of Medicine, University of Toronto, Toronto, Canada
| | | | - Jarone Lee
- Department of Emergency Medicine, Massachusetts General Hospital, Boston, USA
- Trauma, Emergency Surgery, Surgical Critical Care, Massachusetts General Hospital, Boston, USA
| | - Shuhan He
- Department of Emergency Medicine, Massachusetts General Hospital, Boston, USA
- School of Healthcare Leadership, Institute of Health Professions, Boston, USA
- Trauma, Emergency Surgery, Surgical Critical Care, Massachusetts General Hospital, Boston, USA
- Lab of Computer Science, Massachusetts General Hospital, Boston, USA
| |
Collapse
|
4
|
Wang M, Zhang X, Zhong H, Wang D. AIRank: An algorithm on evaluating the academic influence of papers based on heterogeneous academic network. J Inf Sci 2023. [DOI: 10.1177/01655515231151406] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Evaluation of papers’ academic influence is a hot issue in the field of scientific research management. Academic big data provides a data treasure with the coexistence of different types of academic entities, which can be used to evaluate academic influence from a more macro and comprehensive perspective. Based on academic big data, a heterogeneous academic network composed of links within and between three types of academic entities (authors, papers and venues) is constructed. In addition, a new academic influence ranking algorithm, AIRank, is proposed to evaluate papers’ academic influence. Different from the existing academic influence ranking algorithms, AIRank has made innovations in the following two aspects. (1) AIRank distinguishes the influence transmission intensity between different node pairs. Different from the strategy of evenly distributing influence among different node pairs, AIRank quantifies the intensity of influence transmission between node pairs based on investigating the citation emotional attribute, semantic similarity and academic quality differences between node pairs. Based on the intensity characteristics, AIRank realises the distribution and transmission of influence among different node pairs. (2) AIRank incorporates the influence transmission from heterogeneous neighbours in evaluating papers’ influence. According to the academic influence of author nodes and venue nodes, AIRank fine-tunes the iteration formula of paper influence to obtain the ranking of papers under the joint influence of homogeneous and heterogeneous neighbours. Experimental results show that, compared with the ranking results based on citation frequency and PageRank algorithm, AIRank algorithm can produce more differentiated and reasonable academic influence ranking results.
Collapse
Affiliation(s)
- Mingyang Wang
- College of Information and Computer Engineering, Northeast Forestry University, People’s Republic of China
| | - Xinyue Zhang
- College of Information and Computer Engineering, Northeast Forestry University, People’s Republic of China
| | - Hongwei Zhong
- College of Information and Computer Engineering, Northeast Forestry University, People’s Republic of China
| | - Dailin Wang
- Library, Northeast Forestry University, People’s Republic of China
| |
Collapse
|
5
|
Evaluating scientists by citation and disruption of their representative works. Scientometrics 2023. [DOI: 10.1007/s11192-023-04631-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
|
6
|
The influence of disruption on evaluating the scientific significance of papers. Scientometrics 2022. [DOI: 10.1007/s11192-022-04505-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2022]
|
7
|
Mining Algorithm of Relatively Important Nodes Based on Edge Importance Greedy Strategy. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12126099] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Relatively important node mining has always been an essential research topic in complex networks. Existing relatively important node mining algorithms suffer from high time complexity and poor accuracy. Therefore, this paper proposes an algorithm for mining relatively important nodes based on the edge importance greedy strategy (EG). This method considers the importance of the edge to represent the degree of association between two connected nodes. Therefore, the greater the value of the connection between a node and a known important node, the more likely it is to be an important node. If the importance of the edges in an undirected network is measured, a greedy strategy can find important nodes. Compared with other relatively important node mining methods on real network data sets, such as SARS and 9/11, the experimental results show that the EG algorithm excels in both accuracy and applicability, which makes it a competitive algorithm in the mining of important nodes in a network.
Collapse
|
8
|
Zhou Y, Wang R, Zhang YC, Zeng A, Medo M. Improving PageRank using sports results modeling. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.108168] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
9
|
Wang HC, Cheng JW, Yang CT. SentCite: a sentence-level citation recommender based on the salient similarity among multiple segments. Scientometrics 2022. [DOI: 10.1007/s11192-022-04339-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
10
|
|
11
|
Katchanov YL, Markova YV. Dynamics of senses of new physics discourse: Co-keywords analysis. J Informetr 2022. [DOI: 10.1016/j.joi.2021.101245] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
12
|
Wang J, Xu S, Mariani MS, Lü L. The local structure of citation networks uncovers expert-selected milestone papers. J Informetr 2021. [DOI: 10.1016/j.joi.2021.101220] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
13
|
Weis JW, Jacobson JM. Learning on knowledge graph dynamics provides an early warning of impactful research. Nat Biotechnol 2021; 39:1300-1307. [PMID: 34002098 DOI: 10.1038/s41587-021-00907-6] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2020] [Revised: 12/29/2020] [Accepted: 03/22/2021] [Indexed: 11/09/2022]
Abstract
The scientific ecosystem relies on citation-based metrics that provide only imperfect, inconsistent and easily manipulated measures of research quality. Here we describe DELPHI (Dynamic Early-warning by Learning to Predict High Impact), a framework that provides an early-warning signal for 'impactful' research by autonomously learning high-dimensional relationships among features calculated across time from the scientific literature. We prototype this framework and deduce its performance and scaling properties on time-structured publication graphs from 1980 to 2019 drawn from 42 biotechnology-related journals, including over 7.8 million individual nodes, 201 million relationships and 3.8 billion calculated metrics. We demonstrate the framework's performance by correctly identifying 19/20 seminal biotechnologies from 1980 to 2014 via a blinded retrospective study and provide 50 research papers from 2018 that DELPHI predicts will be in the top 5% of time-rescaled node centrality in the future. We propose DELPHI as a tool to aid in the construction of diversified, impact-optimized funding portfolios.
Collapse
Affiliation(s)
- James W Weis
- MIT Media Lab, Massachusetts Institute of Technology, Cambridge, MA, USA. .,Department of Computational and Systems Biology, Massachusetts Institute of Technology, Cambridge, MA, USA.
| | - Joseph M Jacobson
- MIT Media Lab, Massachusetts Institute of Technology, Cambridge, MA, USA.,MIT Center for Bits and Atoms, Massachusetts Institute of Technology, Cambridge, MA, USA
| |
Collapse
|
14
|
Yu D, Pan T. Tracing the main path of interdisciplinary research considering citation preference: A case from blockchain domain. J Informetr 2021. [DOI: 10.1016/j.joi.2021.101136] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
15
|
He C, Wu J, Zhang Q. Characterizing research leadership on geographically weighted collaboration network. Scientometrics 2021; 126:4005-4037. [PMID: 33776165 PMCID: PMC7980806 DOI: 10.1007/s11192-021-03943-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2020] [Accepted: 03/04/2021] [Indexed: 10/29/2022]
Abstract
Research collaborations, especially long-distance and international collaborations, have become increasingly prevalent worldwide. Recent studies highlighted the significant role of research leadership in collaborations. However, existing measures of the research leadership do not take into account the intensity of leadership in the co-authorship network. More importantly, the spatial features, which influence the collaboration patterns and research outcomes, have not been incorporated in measuring the research leadership. To fill the gap, we construct an institution-level weighted co-authorship network that integrates two types of weight on the edges: the intensity of collaborations and the spatial score (the geographical distance adjusted by the cross-linguistic-border nature). Based on this network, we propose a novel metric, namely the spatial research leadership rank, to identify the leading institutions while considering both the collaboration intensity and the spatial features. The leadership of an institution is measured by the following three criteria: (a) the institution frequently plays the corresponding rule in papers with other institutions; (b) the institution frequently plays the corresponding rule in longer distance and even cross-linguistic-border collaborations; (c) the participating institutions led by the institution have high leadership status themselves. Harnessing a dataset of 323,146 journal publications in pharmaceutical sciences during 2010-2018, we perform a comprehensive analysis of the geographical distribution and dynamic patterns of research leadership flows at the institution level. The results demonstrate that the SpatialLeaderRank outperforms baseline metrics in predicting the scholarly impact of institutions. And the result remains robust in the field of Information Science and Library Science. SUPPLEMENTARY INFORMATION The online version contains supplementary material available at 10.1007/s11192-021-03943-w.
Collapse
Affiliation(s)
- Chaocheng He
- School of Information Management, Wuhan University, Wuhan, Hubei China
- School of Data Science, City University of Hong Kong, Kowloon, Hong Kong, China
| | - Jiang Wu
- School of Information Management, Wuhan University, Wuhan, Hubei China
| | - Qingpeng Zhang
- School of Data Science, City University of Hong Kong, Kowloon, Hong Kong, China
| |
Collapse
|