1
|
Early-career factors largely determine the future impact of prominent researchers: evidence across eight scientific fields. Sci Rep 2023; 13:18794. [PMID: 37914796 PMCID: PMC10620415 DOI: 10.1038/s41598-023-46050-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2023] [Accepted: 10/26/2023] [Indexed: 11/03/2023] Open
Abstract
Can we help predict the future impact of researchers using early-career factors? We analyze early-career factors of the world's 100 most prominent researchers across 8 scientific fields and identify four key drivers in researchers' initial career: working at a top 25 ranked university, publishing a paper in a top 5 ranked journal, publishing most papers in top quartile (high-impact) journals and co-authoring with other prominent researchers in their field. We find that over 95% of prominent researchers across multiple fields had at least one of these four features in the first 5 years of their career. We find that the most prominent scientists who had an early career advantage in terms of citations and h-index are more likely to have had all four features, and that this advantage persists throughout their career after 10, 15 and 20 years. Our findings show that these few early-career factors help predict researchers' impact later in their careers. Our research thus points to the need to enhance fairness and career mobility among scientists who have not had a jump start early on.
Collapse
|
2
|
SciSciNet: A large-scale open data lake for the science of science research. Sci Data 2023; 10:315. [PMID: 37264014 DOI: 10.1038/s41597-023-02198-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2022] [Accepted: 05/02/2023] [Indexed: 06/03/2023] Open
Abstract
The science of science has attracted growing research interests, partly due to the increasing availability of large-scale datasets capturing the innerworkings of science. These datasets, and the numerous linkages among them, enable researchers to ask a range of fascinating questions about how science works and where innovation occurs. Yet as datasets grow, it becomes increasingly difficult to track available sources and linkages across datasets. Here we present SciSciNet, a large-scale open data lake for the science of science research, covering over 134M scientific publications and millions of external linkages to funding and public uses. We offer detailed documentation of pre-processing steps and analytical choices in constructing the data lake. We further supplement the data lake by computing frequently used measures in the literature, illustrating how researchers may contribute collectively to enriching the data lake. Overall, this data lake serves as an initial but useful resource for the field, by lowering the barrier to entry, reducing duplication of efforts in data processing and measurements, improving the robustness and replicability of empirical claims, and broadening the diversity and representation of ideas in the field.
Collapse
|
3
|
Data, measurement and empirical methods in the science of science. Nat Hum Behav 2023:10.1038/s41562-023-01562-4. [PMID: 37264084 DOI: 10.1038/s41562-023-01562-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Accepted: 02/17/2023] [Indexed: 06/03/2023]
Abstract
The advent of large-scale datasets that trace the workings of science has encouraged researchers from many different disciplinary backgrounds to turn scientific methods into science itself, cultivating a rapidly expanding 'science of science'. This Review considers this growing, multidisciplinary literature through the lens of data, measurement and empirical methods. We discuss the purposes, strengths and limitations of major empirical approaches, seeking to increase understanding of the field's diverse methodologies and expand researchers' toolkits. Overall, new empirical developments provide enormous capacity to test traditional beliefs and conceptual frameworks about science, discover factors associated with scientific productivity, predict scientific outcomes and design policies that facilitate scientific progress.
Collapse
|
4
|
Enhancing the robustness of the disruption metric against noise. Scientometrics 2023. [DOI: 10.1007/s11192-023-04644-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/13/2023]
|
5
|
The effect of structural holes on producing novel and disruptive research in physics. Scientometrics 2023. [DOI: 10.1007/s11192-023-04635-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
|
6
|
Collaborative Team Recognition: A Core Plus Extension Structure. J Informetr 2022. [DOI: 10.1016/j.joi.2022.101346] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
7
|
|
8
|
Quantifying the online long-term interest in research. J Informetr 2022. [DOI: 10.1016/j.joi.2022.101288] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
9
|
|
10
|
Understanding the onset of hot streaks across artistic, cultural, and scientific careers. Nat Commun 2021; 12:5392. [PMID: 34518529 PMCID: PMC8438033 DOI: 10.1038/s41467-021-25477-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2021] [Accepted: 08/04/2021] [Indexed: 11/09/2022] Open
Abstract
Across a range of creative domains, individual careers are characterized by hot streaks, which are bursts of high-impact works clustered together in close succession. Yet it remains unclear if there are any regularities underlying the beginning of hot streaks. Here, we analyze career histories of artists, film directors, and scientists, and develop deep learning and network science methods to build high-dimensional representations of their creative outputs. We find that across all three domains, individuals tend to explore diverse styles or topics before their hot streak, but become notably more focused after the hot streak begins. Crucially, hot streaks appear to be associated with neither exploration nor exploitation behavior in isolation, but a particular sequence of exploration followed by exploitation, where the transition from exploration to exploitation closely traces the onset of a hot streak. Overall, these results may have implications for identifying and nurturing talents across a wide range of creative domains.
Collapse
|
11
|
Abstract
Abstract
Modern science is dominated by scientific productions from teams. A recent finding shows that teams of both large and small sizes are essential in research, prompting us to analyze the extent to which a country’s scientific work is carried out by big or small teams. Here, using over 26 million publications from Web of Science, we find that China’s research output is more dominated by big teams than the rest of the world, which is particularly the case in fields of natural science. Despite the global trend that more papers are written by big teams, China’s drop in small team output is much steeper. As teams in China shift from small to large size, the team diversity that is essential for innovative work does not increase as much as that in other countries. Using the national average as the baseline, we find that the National Natural Science Foundation of China (NSFC) supports fewer small teams than the National Science Foundation (NSF) of the United States does, implying that big teams are preferred by grant agencies in China. Our finding provides new insights into the concern of originality and innovation in China, which indicates a need to balance small and big teams.
Collapse
|
12
|
'If I had a hedge fund, I would cure diabetes': endogenous mechanisms for creating public goods. SN BUSINESS & ECONOMICS 2021; 1:120. [PMID: 34778817 PMCID: PMC8419655 DOI: 10.1007/s43546-021-00115-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/08/2021] [Accepted: 07/26/2021] [Indexed: 02/08/2023]
Abstract
We consider the problem of organizing capital to produce public goods with broad societal value. We review why market corrections via government subsidies or philanthropic initiatives are inadequate, in addition to considering the paradox of patents. Our proposed mechanism (an Ever-growing Prize and a Patent Repository) directs capital towards two innovation problems routinely overlooked: (1) problems for which the reward is insufficient even with established mechanisms (e.g. patents or academic prestige), and (2) problems for which the reward is large, but the effort risk is incalculable. The proposed hedge fund mechanism facilitates crowdsourcing, addressing the challenge of determining problems with broad societal interest; the ever-growing prize allows for an emergent rather than predetermined reward; the patent repository turns private intellectual property into a public good for target problems while circumventing the inventors' threat of patent expiration. We guide this discussion by considering two problems: treating Cystic Fibrosis and curing Diabetes.
Collapse
|
13
|
EnrichrBot: Twitter bot tracking tweets about human genes. Bioinformatics 2020; 36:3932-3934. [PMID: 32277816 DOI: 10.1093/bioinformatics/btaa240] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2020] [Revised: 03/10/2020] [Accepted: 04/06/2020] [Indexed: 12/27/2022] Open
Abstract
MOTIVATION Micro-blogging with Twitter to communicate new results, discuss ideas and share techniques is becoming central. While most Twitter users are real people, the Twitter API provides the opportunity to develop Twitter bots and to analyze global trends in tweets. RESULTS EnrichrBot is a bot that tracks and tweets information about human genes implementing six principal functions: (i) tweeting information about under-studied genes including non-coding lncRNAs, (ii) replying to requests for information about genes, (iii) responding to GWASbot, another bot that tweets Manhattan plots from genome-wide association study analysis of the UK Biobank, (iv) tweeting randomly selected gene sets from the Enrichr database for analysis with Enrichr, (v) responding to mentions of human genes in tweets with additional information about these genes and (vi) tweeting a weekly report about the most trending genes on Twitter. AVAILABILITY AND IMPLEMENTATION https://twitter.com/botenrichr; source code: https://github.com/MaayanLab/EnrichrBot. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|
14
|
Abstract
Abstract
Conflict is a common feature in conservation and resource management. Environmental conflicts are frequently attributed to differences in values; however, variability in the perception of facts, rooted in social and cultural differences also underlies conflicts. Such differences in perception have been termed the Rashomon effect after the Kurosawa film. In the present article, we explore a conservation Rashomon effect—a phenomenon that results from a combination of differences in perspective, plausible alternative perspectives of a conservation issue, and the absence of evidence to elevate one perspective above others. As a remedy to the Rashomon effect, policy-makers have turned to scientists as honest brokers who share a common environmental reality. We evaluate this supposition and suggest that scientists, themselves, display Rashomon effects. We suggest that Rashomon effects can be reduced by acknowledging the plurality of reality, embracing epistemic pluralism, and prioritizing an inclusive process of resource management.
Collapse
|
15
|
Abstract
The foundation of the scientific method rests on access to data, and yet such access is often restricted or costly. We investigate how improved data access shifts the quantity, quality, and diversity of scientific research. We examine the impact of reductions in cost and sharing restrictions for satellite imagery data from NASA's Landsat program (the longest record of remote-sensing observations of the Earth) on academic science using a sample of about 24,000 Landsat publications by over 34,000 authors matched to almost 3,000 unique study locations. Analyses show that improved access had a substantial and positive effect on the quantity and quality of Landsat-enabled science. Improved data access also democratizes science by disproportionately helping scientists from the developing world and lower-ranked institutions to publish using Landsat data. This democratization in turn increases the geographic and topical diversity of Landsat-enabled research. Scientists who start using Landsat data after access is improved tend to focus on previously understudied regions close to their home location and introduce novel research topics. These findings suggest that policies that improve access to valuable scientific data may promote scientific progress, reduce inequality among scientists, and increase the diversity of scientific research.
Collapse
|
16
|
Scientific elite revisited: patterns of productivity, collaboration, authorship and impact. J R Soc Interface 2020; 17:20200135. [PMID: 32316884 DOI: 10.1098/rsif.2020.0135] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Throughout history, a relatively small number of individuals have made a profound and lasting impact on science and society. Despite long-standing, multi-disciplinary interests in understanding careers of elite scientists, there have been limited attempts for a quantitative, career-level analysis. Here, we leverage a comprehensive dataset we assembled, allowing us to trace the entire career histories of nearly all Nobel laureates in physics, chemistry, and physiology or medicine over the past century. We find that, although Nobel laureates were energetic producers from the outset, producing works that garner unusually high impact, their careers before winning the prize follow relatively similar patterns to those of ordinary scientists, being characterized by hot streaks and increasing reliance on collaborations. We also uncovered notable variations along their careers, often associated with the Nobel Prize, including shifting coauthorship structure in the prize-winning work, and a significant but temporary dip in the impact of work they produce after winning the Nobel Prize. Together, these results document quantitative patterns governing the careers of scientific elites, offering an empirical basis for a deeper understanding of the hallmarks of exceptional careers in science.
Collapse
|
17
|
Abstract
Setbacks are an integral part of a scientific career, yet little is known about their long-term effects. Here we examine junior scientists applying for National Institutes of Health R01 grants. By focusing on proposals fell just below and just above the funding threshold, we compare near-miss with narrow-win applicants, and find that an early-career setback has powerful, opposing effects. On the one hand, it significantly increases attrition, predicting more than a 10% chance of disappearing permanently from the NIH system. Yet, despite an early setback, individuals with near misses systematically outperform those with narrow wins in the longer run. Moreover, this performance advantage seems to go beyond a screening mechanism, suggesting early-career setback appears to cause a performance improvement among those who persevere. Overall, these findings are consistent with the concept that “what doesn’t kill me makes me stronger,” which may have broad implications for identifying, training and nurturing junior scientists. Little is known about the long-term effects of early-career setback. Here, the authors compare junior scientists who were awarded a NIH grant to those with similar track records, who were not, and find that individuals with the early setback systematically performed better in the longer term.
Collapse
|
18
|
Abstract
A central question in the science of science concerns how to develop a quantitative understanding of the evolution and impact of individual careers. Over the course of history, a relatively small fraction of individuals have made disproportionate, profound, and lasting impacts on science and society. Despite a long-standing interest in the careers of scientific elites across diverse disciplines, it remains difficult to collect large-scale career histories that could serve as training sets for systematic empirical and theoretical studies. Here, by combining unstructured data collected from CVs, university websites, and Wikipedia, together with the publication and citation database from Microsoft Academic Graph (MAG), we reconstructed publication histories of nearly all Nobel prize winners from the past century, through both manual curation and algorithmic disambiguation procedures. Data validation shows that the collected dataset presents among the most comprehensive collection of publication records for Nobel laureates currently available. As our quantitative understanding of science deepens, this dataset is expected to have increasing value. It will not only allow us to quantitatively probe novel patterns of productivity, collaboration, and impact governing successful scientific careers, it may also help us unearth the fundamental principles underlying creativity and the genesis of scientific breakthroughs.
Collapse
|
19
|
Who Is (Likely) Peer-Reviewing Your Papers? A Partial Insight into the World’s Top Reviewers. PUBLICATIONS 2019. [DOI: 10.3390/publications7010015] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Scientific publishing is experiencing unprecedented growth in terms of outputs across all fields. Inevitably this creates pressure throughout the system on a number of entities. One key element is represented by peer-reviewers, whose demand increases at an even higher pace than that of publications, since more than one reviewer per paper is needed and not all papers that get reviewed get published. The relatively recent Publons platform allows for unprecedented insight into the usual ‘blindness’ of the peer-review system. At a time where the world’s top peer-reviewers are announced and celebrated, we have taken a step back in order to attempt a partial mapping of their profiles to identify trends and key dimensions of this community of ‘super-reviewers’. This commentary focuses necessarily on a limited sample due to manual processing of data, which needs to be done within a single day for the type of information we seek. In investigating the numbers of performed reviews vs. academic citations, our analysis suggests that most reviews are carried out by relatively inexperienced academics. For some of these early career academics, peer-reviewing seems to be the only activity they engage with, given the high number of reviews performed (e.g., three manuscripts per day) and the lack of outputs (zero academic papers and citations in some cases). Additionally, the world’s top researchers (i.e., highly-cited researchers) are understandably busy with research activities and therefore far less active in peer-reviewing. Lastly, there seems to be an uneven distribution at a national level between scientific outputs (e.g., publications) and reviews performed. Our analysis contributes to the ongoing global discourse on the health of scientific peer-review, and it raises some important questions for further discussion.
Collapse
|
20
|
|