1
|
Yale SE, Brown MEL, Byrne MHV. Using freedom of information requests to access novel data sources in health professions education research. Postgrad Med J 2025; 101:481-486. [PMID: 39591525 DOI: 10.1093/postmj/qgae166] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2024] [Revised: 10/09/2024] [Accepted: 11/14/2024] [Indexed: 11/28/2024]
Abstract
Educators and researchers are reliant upon access to data to drive teaching methods, curricular improvements, and progress in medical education research. However, data are not always accessible, due to resource constraints, institutional policies, and privacy concerns. Researchers have attempted to access novel data sources through surveys, semistructured interviews, and databases; however, these methodologies are limited. To improve access to data, Freedom of Information (FOI) Acts grant researchers the ability to formally request data that any public institute holds. Researchers have been reluctant to use this tool due to negative perceptions, despite its unique benefits. To increase awareness of this underutilized methodology, we summarize the process of FOI Act requests, its strengths and weaknesses, and the ways in which health professions education can leverage FOI requests within research. We provide examples of the use of FOI requests as a research method within adjacent fields and nascent use within the field of health professions research. In doing so, we hope to highlight how FOI requests can be a useful tool in health professions education researchers and its potential to increase access to unique data sources.
Collapse
Affiliation(s)
- Sophie E Yale
- University of Sunderland, City Campus, Edinburgh Building, Chester Rd, Sunderland SR1 3SD, United Kingdom
| | - Megan E L Brown
- School of Medicine, The Medical School, Newcastle University, Framlington Place, Newcastle upon Tyne NE2 4HH, United Kingdom
| | - Matthew H V Byrne
- Nuffield Department of Surgical Sciences, University of Oxford, Churchill Hospital, Old Rd, Headington, Oxford OX3 7LE, United Kingdom
| |
Collapse
|
2
|
Enriquez D. Publishing publicly available interview data: an empirical example of the experience of publishing interview data. FRONTIERS IN SOCIOLOGY 2024; 9:1157514. [PMID: 38903395 PMCID: PMC11188393 DOI: 10.3389/fsoc.2024.1157514] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Accepted: 02/19/2024] [Indexed: 06/22/2024]
Abstract
In September 2021 I made a collection of interview transcripts available for public use under a CreativeCommons license through the Princeton DataSpace. The interviews include 39 conversations I had with gig workers at AmazonFlex, Uber, and Lyft in 2019 as part of a study on automation efforts within these organizations. I made this decision because (1) I was required to contribute to a publicly available data set as a requirement of my funding and (2) I saw it as an opportunity to engage in the collaborative qualitative science experiments emerging in Science and Technology studies. This article documents my thought process and step-by-step design decisions for designing a study, gathering data, masking it, and publishing it in a public archive. Importantly, once I decided to publish these data, I determined that each choice about how the study would be designed and implemented had to be assessed for risk to the interviewee in a very deliberate way. It is not meant to be comprehensive and cover every possible condition a researcher may face while producing qualitative data. I aimed to be transparent both in my interview data and the process it took to gather and publish these data. I use this article to illustrate my thought process as I made each design decision for this study in hopes that it could be useful to a future researcher considering their own data publishing process.
Collapse
Affiliation(s)
- Diana Enriquez
- Department of Sociology, Princeton University, Princeton, NJ, United States
| |
Collapse
|
3
|
Global environmental equities and investor sentiment: the role of social media and Covid-19 pandemic crisis. REVIEW OF MANAGERIAL SCIENCE 2023. [DOI: 10.1007/s11846-022-00614-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
AbstractAccording to researchers, information generated from social media provides useful data for understanding the behaviour of various types of financial assets, using the sentiment expressed by these network users as an explanatory variable of asset prices. In a context in which investment based on sustainability and environmental preservation values is vital, there is no known scientific work that analyses the relationship between social networks and environmental investment, which is closely related to the 2030 Agenda for Sustainable Development. In this study, we aim to identify how investor sentiment, generated from social networks, influences environmental investment and whether this influence depends on the time variable, as well the role of the pandemic crisis and the Russia-Ukraine war. Our results show different forms of behaviour for the different periods considered, with the proximity between the two types of variables being time-varying. For shorter periods, proximity occurred mainly during the pandemic crisis, repeatedly revealing that sentiment is a risk factor in environmental investment and in particular how important the information generated from social networks can be in pricing environmental assets. For longer periods, no common stochastic trends were identified. The mechanisms generating the series are thus characterised by a certain autonomy.
Collapse
|
4
|
Munnes S, Harsch C, Knobloch M, Vogel JS, Hipp L, Schilling E. Examining Sentiment in Complex Texts. A Comparison of Different Computational Approaches. Front Big Data 2022; 5:886362. [PMID: 35600329 PMCID: PMC9114298 DOI: 10.3389/fdata.2022.886362] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Accepted: 04/14/2022] [Indexed: 11/13/2022] Open
Abstract
Can we rely on computational methods to accurately analyze complex texts? To answer this question, we compared different dictionary and scaling methods used in predicting the sentiment of German literature reviews to the “gold standard” of human-coded sentiments. Literature reviews constitute a challenging text corpus for computational analysis as they not only contain different text levels—for example, a summary of the work and the reviewer's appraisal—but are also characterized by subtle and ambiguous language elements. To take the nuanced sentiments of literature reviews into account, we worked with a metric rather than a dichotomous scale for sentiment analysis. The results of our analyses show that the predicted sentiments of prefabricated dictionaries, which are computationally efficient and require minimal adaption, have a low to medium correlation with the human-coded sentiments (r between 0.32 and 0.39). The accuracy of self-created dictionaries using word embeddings (both pre-trained and self-trained) was considerably lower (r between 0.10 and 0.28). Given the high coding intensity and contingency on seed selection as well as the degree of data pre-processing of word embeddings that we found with our data, we would not recommend them for complex texts without further adaptation. While fully automated approaches appear not to work in accurately predicting text sentiments with complex texts such as ours, we found relatively high correlations with a semiautomated approach (r of around 0.6)—which, however, requires intensive human coding efforts for the training dataset. In addition to illustrating the benefits and limits of computational approaches in analyzing complex text corpora and the potential of metric rather than binary scales of text sentiment, we also provide a practical guide for researchers to select an appropriate method and degree of pre-processing when working with complex texts.
Collapse
Affiliation(s)
- Stefan Munnes
- WZB Berlin Social Science Center, Berlin, Germany
- *Correspondence: Stefan Munnes
| | | | | | - Johannes S. Vogel
- WZB Berlin Social Science Center, Berlin, Germany
- Faculty of Economics and Social Sciences Chair of Inequality Research and Social Stratification Analysis, University of Potsdam, Potsdam, Germany
| | - Lena Hipp
- WZB Berlin Social Science Center, Berlin, Germany
- Faculty of Economics and Social Sciences Chair of Inequality Research and Social Stratification Analysis, University of Potsdam, Potsdam, Germany
| | - Erik Schilling
- Institute for German Philology, Ludwig Maximilian University of Munich, Munich, Germany
| |
Collapse
|
5
|
Barczak G, Hopp C, Kaminski J, Piller F, Pruschak G. How open is innovation research? – An empirical analysis of data sharing among innovation scholars. INDUSTRY AND INNOVATION 2022; 29:186-218. [DOI: 10.1080/13662716.2021.1967727] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/04/2025]
Affiliation(s)
- Gloria Barczak
- D’Amore-McKim School of Business, Northeastern University, Boston, MA, USA
| | - Christian Hopp
- School of Business & Economics, RWTH Aachen University, Aachen, Germany
- Business School, Bern University of Applied Sciences, Bern, Switzerland
| | - Jermain Kaminski
- School of Business and Economics, Maastricht University, Maastricht, The Netherlands
| | - Frank Piller
- School of Business & Economics, RWTH Aachen University, Aachen, Germany
| | - Gernot Pruschak
- School of Business & Economics, RWTH Aachen University, Aachen, Germany
- Business School, Bern University of Applied Sciences, Bern, Switzerland
- Department of Business Decisions and Analytics, University of Vienna, Vienna, Austria
| |
Collapse
|
6
|
|
7
|
Altman M, Cohen PN. The Scholarly Knowledge Ecosystem: Challenges and Opportunities for the Field of Information. Front Res Metr Anal 2022; 6:751553. [PMID: 35178498 PMCID: PMC8843814 DOI: 10.3389/frma.2021.751553] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2021] [Accepted: 12/15/2021] [Indexed: 01/09/2023] Open
Abstract
The scholarly knowledge ecosystem presents an outstanding exemplar of the challenges of understanding, improving, and governing information ecosystems at scale. This article draws upon significant reports on aspects of the ecosystem to characterize the most important research challenges and promising potential approaches. The focus of this review article is the fundamental scientific research challenges related to developing a better understanding of the scholarly knowledge ecosystem. Across a range of disciplines, we identify reports that are conceived broadly, published recently, and written collectively. We extract the critical research questions, summarize these using quantitative text analysis, and use this quantitative analysis to inform a qualitative synthesis. Three broad themes emerge from this analysis: the need for multi-sectoral cooperation and coordination, for mixed methods analysis at multiple levels, and interdisciplinary collaboration. Further, we draw attention to an emerging consensus that scientific research in this area should by a set of core human values.
Collapse
Affiliation(s)
- Micah Altman
- Center for Research in Equitable and Open Scholarship, MIT Libraries, Massachusetts Institute of Technology, Cambridge, MA, United States
- *Correspondence: Micah Altman
| | - Philip N. Cohen
- Department of Sociology, University of Maryland, College Park, MD, United States
| |
Collapse
|
8
|
Query-based-learning mortality-related decoders for the developed island economy. Sci Rep 2022; 12:956. [PMID: 35046447 PMCID: PMC8770507 DOI: 10.1038/s41598-022-04855-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2021] [Accepted: 12/30/2021] [Indexed: 11/09/2022] Open
Abstract
Search volumes from Google Trends over clear-defined temporal and spatial scales were reported beneficial in predicting influenza or disease outbreak. Recent studies showed Wiener Model shares merits of interpretability, implementation, and adaptation to nonlinear fluctuation in terms of real-time decoding. Previous work reported Google Trends effectively predicts death-related trends for the continent economy, yet whether it applies to the island economy is unclear. To this end, a framework of the mortality-related model for a developed island economy Taiwan was built based on potential death causes from Google Trends, aiming to provide new insights into death-related online search behavior at a population level. Our results showed estimated trends based on the Wiener model significantly correlated to actual trends, outperformed those with multiple linear regression and seasonal autoregressive integrated moving average. Meanwhile, apart from that involved all possible features, two other sets of feature selecting strategies were proposed to optimize pre-trained models, either by weights or waveform periodicity of features, resulting in estimated death-related dynamics along with spectrums of risk factors. In general, high-weight features were beneficial to both "die" and "death", whereas features that possessed clear periodic patterns contributed more to "death". Of note, normalization before modeling improved decoding performances.
Collapse
|
9
|
Abstract
Abstract
This paper identifies the potential benefits of data sharing and open science, supported by artificial intelligence tools and services, and dives into the challenges to make data open and findable, accessible, interoperable, and reusable (FAIR).
Collapse
|
10
|
Mir SA, Bhat MS, Rather G, Mattoo D. Role of big geospatial data in the COVID-19 crisis. DATA SCIENCE FOR COVID-19 2022. [PMCID: PMC8988928 DOI: 10.1016/b978-0-323-90769-9.00031-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
The outbreak of the 2019 novel coronavirus disease (COVID-19) has infected 4 million people worldwide and has caused more than 300,000 deaths worldwide. With infection and death rates on rise, COVID-19 poses a serious threat to social functioning, human health, economies, and geopolitics. Geographic information systems and big geospatial technologies have come to the forefront in this fight against COVID-19 by playing an important role by integrating multisourced data, enhanced and rapid analytics of mapping services, location analytics, and spatial tracking of confirmed, forecasting transmission trajectories, spatial clustering of risk on epidemiologic levels, public awareness on the elimination of panic spread and decision-making support for the government and research institutions for effective prevention and control of COVID-19 cases. Big geospatial data has turned itself as the major support system for governments in dealing with this global healthcare crisis because of its advanced and innovative technological capabilities from preparation of data to modeling the results with quick and large accessibility to every spatial scale. This robust data-driven system using the accurate and prediction geoanalysis is being widely used by governments and public health institutions interfaced with both health and nonhealth digital data repositories for mining the individual and regional datasets for breaking the transmission chain. Profiling of confirmed cases on the basis of location and temporality and then visualizing them effectively coupled with behavioral and critical geographic variables such as mobility patterns, demographic data, and population density enhance the predictive analytics of big geospatial data. With the intersection of artificial intelligence, geospatial data enables real-time visualization and syndromic surveillance of epidemic data based on spatiotemporal dynamics and the data are then accurately geopositioned. This chapter aims to reflect on the relevance of big geospatial data and health geoinformatics in containing and preventing the further spread of COVID-19 and how countries and research organizations around the world have used it as accurate, fast, and comprehensive dataset in their containing strategy and management of this public health crisis. China and Taiwan are used as case studies as in how these countries have applied the computational architecture of big geospatial data and location analytics surveillance techniques for prediction and monitoring of COVID-19-positive cases.
Collapse
|
11
|
Khusanova R, Kang SW, Choi SB. Work Engagement Among Public Employees: Antecedents and Consequences. Front Psychol 2021; 12:684495. [PMID: 34744859 PMCID: PMC8569609 DOI: 10.3389/fpsyg.2021.684495] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2021] [Accepted: 09/20/2021] [Indexed: 11/13/2022] Open
Abstract
This study is an investigation of the relationships among job meaningfulness, work engagement, and performance, including testing for a possible mediation effect of work engagement on the relationship between job meaningfulness and performance. We examine task interdependence as a boundary condition that facilitates employee engagement using two-stage multiple-source respondent data drawn from a sample of 183 Uzbek employees from public organizations and their 47 supervisors to test the hypotheses. The research findings confirm a positive association between job meaningfulness and engagement and the relationship between work engagement and performance. Mediation analysis using bootstrapping indicated that work engagement explained the influence of meaningfulness on performance. Furthermore, task interdependence negatively moderated the relationship between meaningfulness and engagement. This study responds to calls for researchers to identify the key and situational drivers of work engagement as well as examine the importance of meaningfulness in the public sector. It also increases the external validity of the findings by examining the relationship between engagement and performance in a non-Western context, namely, Islamic Uzbekistan. Despite the limitations of this research, the empirical findings contribute to the growing body of research on work engagement and meaningfulness in public organizations.
Collapse
Affiliation(s)
| | - Seung-Wan Kang
- College of Business, Gachon University, Seongnam, South Korea
| | - Suk Bong Choi
- College of Global Business, Korea University, Sejong City, South Korea
| |
Collapse
|
12
|
Hays R, Gibbs T, Hunt J, Jennings B, Masters K. The changing face of MedEdPublish. MEDEDPUBLISH 2021; 11:1. [PMID: 37869175 PMCID: PMC10587817 DOI: 10.12688/mep.17500.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2023] Open
Abstract
MedEdPublish has come a long way since it was launched in 2016 by AMEE as an independent academic e-journal that supports scholarship in health professions education. Beginning as a relatively small, in-house publication on a web platform adapted for the purpose, we invited members of our community of practice to submit articles on any topic in health professions education, and encouraged a wide range of article types. All articles were published so long as they met editing criteria and where within scope. Reviews were welcomed from both members of our Review panel and the general readership, all published openly with contributors identified. Many articles attracted several reviews, responses and comments, creating interactive discussion threads that provided learning opportunities for all. The outcome surpassed our expectations, with over 500 articles submitted during 2020, beyond the capacity of our editing team and platform to achieve our promise of rapid publishing. We have now moved to a much larger and powerful web platform, developed by F1000 Research and within the Taylor and Francis stable, the home of AMEE’s other journal, Medical Teacher. Most of our innovations are supported by the new platform and there is scope for further developments. We look forward to an exciting new phase of innovation, powered by the F1000 platform.
Collapse
Affiliation(s)
- Richard Hays
- College of Medicine and Dentristy, James Cook University, Townsville, QLD, 4814, Australia
| | | | - Julie Hunt
- College of Veterinary Medicine, Lincoln Memorial University, Harrogate, Tennessee, TN 37752, USA
| | - Barbara Jennings
- Norwich Medical School, UNiversity of East Anglia, Norwich, Norfolk, NR4 7TJ, UK
| | - Ken Masters
- Medical Education and Informatics, Sultan Qaboos University, Al Khoudh, Oman
| |
Collapse
|
13
|
Kouper I, Tucker KL, Tharp K, van Booven ME, Clark A. Active Curation of Large Longitudinal Surveys: A Case Study. JOURNAL OF ESCIENCE LIBRARIANSHIP 2021. [DOI: 10.7191/jeslib.2021.1210] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022] Open
Abstract
In this paper we take an in-depth look at the curation of a large longitudinal survey and activities and procedures involved in moving the data from its generation to the state that is needed to conduct scientific analysis. Using a case study approach, we describe how large surveys generate a range of data assets that require many decisions well before the data is considered for analysis and publication. We use the notion of active curation to describe activities and decisions about the data objects that are “live,” i.e., when they are still being collected and processed for the later stages of the data lifecycle. Our efforts illustrate a gap in the existing discussions on curation. On one hand, there is an acknowledged need for active or upstream curation as an engagement of curators close to the point of data creation. On the other hand, the recommendations on how to do that are scattered across multiple domain-oriented data efforts.
In describing the complexities of active curation of survey data and providing general recommendations we aim to draw attention to the practices of active curation, stimulate the development of interoperable tools, standards, and techniques needed at the initial stages of research projects, and encourage collaborations between libraries and other academic units.
Collapse
|
14
|
Sawchuk SL, Khair S. Computational Reproducibility: A Practical Framework for Data Curators. JOURNAL OF ESCIENCE LIBRARIANSHIP 2021. [DOI: 10.7191/jeslib.2021.1206] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022] Open
Abstract
Introduction: This paper presents concrete and actionable steps to guide researchers, data curators, and data managers in improving their understanding and practice of computational reproducibility.
Objectives: Focusing on incremental progress rather than prescriptive rules, researchers and curators can build their knowledge and skills as the need arises. This paper presents a framework of incremental curation for reproducibility to support open science objectives.
Methods: A computational reproducibility framework developed for the Canadian Data Curation Forum serves as the model for this approach. This framework combines learning about reproducibility with recommended steps to improving reproducibility.
Conclusion: Computational reproducibility leads to more transparent and accurate research. The authors warn that fear of a crisis and focus on perfection should not prevent curation that may be ‘good enough.’
Collapse
|
15
|
A Robust Approach for Identifying the Major Components of the Bribery Tolerance Index. MATHEMATICS 2021. [DOI: 10.3390/math9131570] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The paper aims to emphasize the advantages of several advanced statistical and data mining techniques when applied to the dense literature on corruption measurements and determinants. For this purpose, we used all seven waves of the World Values Survey and we employed the Naive Bayes technique in SQL Server Analysis Services 2016, the LASSO package together with logit and melogit regressions with raw coefficients in Stata 16. We further conducted different types of tests and cross-validations on the wave, country, gender, and age categories. For eliminating multicollinearity, we used predictor correlation matrices. Moreover, we assessed the maximum computed variance inflation factor (VIF) against a maximum acceptable threshold, depending on the model’s R squared in Ordinary Least Square (OLS) regressions. Our main contribution consists of a methodology for exploring and validating the most important predictors of the risk associated with bribery tolerance. We found the significant role of three influences corresponding to questions about attitudes towards the property, authority, and public services, and other people in terms of anti-cheating, anti-evasion, and anti-violence. We used scobit, probit, and logit regressions with average marginal effects to build and test the index based on these attitudes. We successfully tested the index using also risk prediction nomograms and accuracy measurements (AUCROC > 0.9).
Collapse
|
16
|
Botta F, Gutiérrez-Roig M. Modelling urban vibrancy with mobile phone and OpenStreetMap data. PLoS One 2021; 16:e0252015. [PMID: 34077441 PMCID: PMC8172046 DOI: 10.1371/journal.pone.0252015] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2020] [Accepted: 05/07/2021] [Indexed: 11/30/2022] Open
Abstract
The concept of urban vibrancy has become increasingly important in the study of cities. A vibrant urban environment is an area of a city with high levels of human activity and interactions. Traditionally, studying our cities and what makes them vibrant has been very difficult, due to challenges in data collection on urban environments and people’s location and interactions. Here, we rely on novel sources of data to investigate how different features of our cities may relate to urban vibrancy. In particular, we explore whether there are any differences in which urban features make an environment vibrant for different age groups. We perform this quantitative analysis by extracting urban features from OpenStreetMap and the Italian census, and using them in spatial models to describe urban vibrancy. Our analysis shows a strong relationship between urban features and urban vibrancy, and particularly highlights the importance of third places, which are urban places offering opportunities for social interactions. Our findings provide evidence that a combination of mobile phone data with crowdsourced urban features can be used to better understand urban vibrancy.
Collapse
Affiliation(s)
- Federico Botta
- Department of Computer Science, University of Exeter, Exeter, United Kingdom
- * E-mail:
| | - Mario Gutiérrez-Roig
- Department of Mathematical Sciences, University of Essex, Colchester, United Kingdom
| |
Collapse
|
17
|
Botta F. Quantifying the differences in call detail records. ROYAL SOCIETY OPEN SCIENCE 2021; 8:201443. [PMID: 34234948 PMCID: PMC8242929 DOI: 10.1098/rsos.201443] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/13/2020] [Accepted: 06/04/2021] [Indexed: 06/13/2023]
Abstract
The increasing availability of mobile phone data has attracted the attention of several researchers interested in studying our collective behaviour. Our interactions with the phone network can take several forms, from SMS messages to phone calls and data usage. Typically, mobile phone data are released to researchers in the form of call detail records, which contain records of different types of interactions, and can be used to analyse various aspects of our behaviour. However, the inherently behavioural nature of these interactions may result in differences between how we make phone calls and receive text messages. Studies which rely on data derived from these interactions, therefore, need to carefully consider these differences. Here, we aim to investigate differences and limitations of different types of mobile phone interactions data by analysing a large mobile phone dataset. We study the relationship between different types of interactions and show how it changes over time. We anticipate our findings to be of interest to all researchers working in the area of computational social science.
Collapse
Affiliation(s)
- Federico Botta
- Department of Computer Science, University of Exeter, Exeter, UK
| |
Collapse
|
18
|
Kim Y. An empirical study of research ethics and their role in psychologists’ data sharing intentions using consequentialism theory of ethics. JOURNAL OF LIBRARIANSHIP AND INFORMATION SCIENCE 2021. [DOI: 10.1177/09610006211008967] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The purpose of this study is to examine how different ethical dimensions of egoism, utilitarianism, and deontology all help in the formation of psychologists’ research ethics for data sharing, and how the research ethics eventually affect psychologists making decisions regarding whether to engage in data sharing. This research utilized consequentialism theory of ethics as its theoretical framework to develop its research model of psychologists’ data sharing as mediated by research ethics. It conducted an online survey with psychologists in US academic institutions and collected a total of 362 valid responses. Then, it employed the structural equation modeling technique to evaluate the research model and related hypotheses of psychologists’ data sharing intentions as mediated by the profession’s research ethics. This research found that perceived career benefit, perceived community benefit, and norm of data sharing all significantly contribute to the formation of psychologists’ research ethics for data sharing, and then these research ethics, along with perceived community benefit and norm of data sharing, significantly influence psychologists’ data sharing intentions. This study suggests that the consequentialism theory of ethics nicely explains psychologists’ formation of their research ethics for data sharing and their decision to engage in data sharing. The study also suggests that research communities can better promote researchers’ data sharing behaviors by stimulating their research ethics through different ethical dimensions, including egoism (career benefit), utilitarianism (community benefit), and deontology (norm of data sharing).
Collapse
|
19
|
Cybulski JL, Scheepers R. Data science in organizations: Conceptualizing its breakthroughs and blind spots. JOURNAL OF INFORMATION TECHNOLOGY 2021. [DOI: 10.1177/0268396220988539] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The field of data science emerged in recent years, building on advances in computational statistics, machine learning, artificial intelligence, and big data. Modern organizations are immersed in data and are turning toward data science to address a variety of business problems. While numerous complex problems in science have become solvable through data science, not all scientific solutions are equally applicable to business. Many data-intensive business problems are situated in complex socio-political and behavioral contexts that still elude commonly used scientific methods. To what extent can such problems be addressed through data science? Does data science have any inherent blind spots in this regard? What types of business problems are likely to be addressed by data science in the near future, which will not, and why? We develop a conceptual framework to inform the application of data science in business. The framework draws on an extensive review of data science literature across four domains: data, method, interfaces, and cognition. We draw on Ashby’s Law of Requisite Variety as theoretical principle. We conclude that data-scientific advances across the four domains, in aggregate, could constitute requisite variety for particular types of business problems. This explains why such problems can be fully or only partially addressed, solved, or automated through data science. We distinguish between situations that can be improved due to cross-domain compensatory effects, and problems where data science, at best, only contributes merely to better understanding of complex phenomena.
Collapse
|
20
|
Armann-Keown V, Patterson L. Content analysis in library and information research: An analysis of trends. LIBRARY & INFORMATION SCIENCE RESEARCH 2020. [DOI: 10.1016/j.lisr.2020.101048] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
21
|
Nagaraj A, Shears E, de Vaan M. Improving data access democratizes and diversifies science. Proc Natl Acad Sci U S A 2020; 117:23490-23498. [PMID: 32900947 PMCID: PMC7519325 DOI: 10.1073/pnas.2001682117] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The foundation of the scientific method rests on access to data, and yet such access is often restricted or costly. We investigate how improved data access shifts the quantity, quality, and diversity of scientific research. We examine the impact of reductions in cost and sharing restrictions for satellite imagery data from NASA's Landsat program (the longest record of remote-sensing observations of the Earth) on academic science using a sample of about 24,000 Landsat publications by over 34,000 authors matched to almost 3,000 unique study locations. Analyses show that improved access had a substantial and positive effect on the quantity and quality of Landsat-enabled science. Improved data access also democratizes science by disproportionately helping scientists from the developing world and lower-ranked institutions to publish using Landsat data. This democratization in turn increases the geographic and topical diversity of Landsat-enabled research. Scientists who start using Landsat data after access is improved tend to focus on previously understudied regions close to their home location and introduce novel research topics. These findings suggest that policies that improve access to valuable scientific data may promote scientific progress, reduce inequality among scientists, and increase the diversity of scientific research.
Collapse
Affiliation(s)
- Abhishek Nagaraj
- Haas School of Business, University of California, Berkeley, CA 94720;
| | - Esther Shears
- Energy & Resources Group, University of California, Berkeley, CA 94720
| | - Mathijs de Vaan
- Haas School of Business, University of California, Berkeley, CA 94720
| |
Collapse
|
22
|
Abstract
In recent years, many studies have used social media data to make estimates of electoral outcomes and public opinion. This paper reports the findings from a meta-analysis examining the predictive power of social media data by focusing on various sources of data and different methods of prediction; i.e., (1) sentiment analysis, and (2) analysis of structural features. Our results, based on the data from 74 published studies, show significant variance in the accuracy of predictions, which were on average behind the established benchmarks in traditional survey research. In terms of the approaches used, the study shows that machine learning-based estimates are generally superior to those derived from pre-existing lexica, and that a combination of structural features and sentiment analyses provides the most accurate predictions. Furthermore, our study shows some differences in the predictive power of social media data across different levels of political democracy and different electoral systems. We also note that since the accuracy of election and public opinion forecasts varies depending on which statistical estimates are used, the scientific community should aim to adopt a more standardized approach to analyzing and reporting social media data-derived predictions in the future.
Collapse
|
23
|
Burgelman JC, Pascu C, Szkuta K, Von Schomberg R, Karalopoulos A, Repanas K, Schouppe M. Open Science, Open Data, and Open Scholarship: European Policies to Make Science Fit for the Twenty-First Century. Front Big Data 2019; 2:43. [PMID: 33693366 PMCID: PMC7931888 DOI: 10.3389/fdata.2019.00043] [Citation(s) in RCA: 43] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2019] [Accepted: 11/18/2019] [Indexed: 11/13/2022] Open
Abstract
Open science will make science more efficient, reliable, and responsive to societal challenges. The European Commission has sought to advance open science policy from its inception in a holistic and integrated way, covering all aspects of the research cycle from scientific discovery and review to sharing knowledge, publishing, and outreach. We present the steps taken with a forward-looking perspective on the challenges laying ahead, in particular the necessary change of the rewards and incentives system for researchers (for which various actors are co-responsible and which goes beyond the mandate of the European Commission). Finally, we discuss the role of artificial intelligence (AI) within an open science perspective.
Collapse
Affiliation(s)
| | - Corina Pascu
- Open Science, DG Research and Innovation, European Commission, Brussels, Belgium
| | | | | | | | | | | |
Collapse
|
24
|
Jha N, Potnuru RKG, Sareen P, Shaju S. Employee voice, engagement and organizational effectiveness: a mediated model. EUROPEAN JOURNAL OF TRAINING AND DEVELOPMENT 2019. [DOI: 10.1108/ejtd-10-2018-0097] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Purpose
This study is based on social exchange theory and aims at understanding the role of employee engagement as a mediator between employee voice and organizational effectiveness.
Design/methodology/approach
Data was collected to test the mediating role of employee engagement between employee voice and organizational effectiveness. The respondents were employees in different IT companies located in major cities in India. The model was tested for full and partial mediation of employee engagement using structural equation modeling.
Findings
Considering the self-reported survey from 232 employees from companies in the IT sector, the findings reveal that there exists a significant association between employee voice and organizational effectiveness. The results reflect a close association between employee engagement and organizational effectiveness too. However, no significant association was found between employee voice and organizational effectiveness. Employee engagement is found to mediate the relationship between employee voice and organizational effectiveness.
Research limitations/implications
The foremost limitation of the study is the sample group that is limited to employees working in IT companies in Bangalore city. The results cannot be generalized to the entire IT industry in India. Although attempts are made to eliminate common method bias, there are chances of an overstated relationship by common method variance that cannot be neglected completely.
Practical implications
The paper will provide a deep insight to the practitioners about the role of employee voice in the engagement of employees. It will also indicate to the managers how the effectiveness of an organization can be heightened by creating opportunities for employees to voice their opinion in the organization.
Originality/value
The present study indicated that though there is an association between the independent variable, employee voice, and the dependent variable, organizational effectiveness, the relationship becomes more significant in the presence of employee engagement between them.
Collapse
|
25
|
Dockray S, O'Neill S, Jump O. Measuring the Psychobiological Correlates of Daily Experience in Adolescents. JOURNAL OF RESEARCH ON ADOLESCENCE : THE OFFICIAL JOURNAL OF THE SOCIETY FOR RESEARCH ON ADOLESCENCE 2019; 29:595-612. [PMID: 31573767 DOI: 10.1111/jora.12473] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Mapping the psychobiological correlates of social contexts, experiences, and emotional responses of adolescents in their daily lives provides insight into how adolescent well-being shapes, and is shaped by, experience. Measures of these psychobiological correlates are enabled by devices and technologies that must be precise and suitable for adolescent participants. The present report reviews the most often used research measures, and suggests strategies for best practice, drawn from practical experience. The rapid advances in technological methods to collect attuned measures of psychological processes, social context, and biological function indicate the promise for multimodal measures in ecological settings. Attaining these methodological goals will support research to secure comprehensive, quality data, and advance the understanding of psychobiological function in ambulatory settings.
Collapse
|
26
|
Olteanu A, Castillo C, Diaz F, Kıcıman E. Social Data: Biases, Methodological Pitfalls, and Ethical Boundaries. Front Big Data 2019; 2:13. [PMID: 33693336 PMCID: PMC7931947 DOI: 10.3389/fdata.2019.00013] [Citation(s) in RCA: 100] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2019] [Accepted: 05/27/2019] [Indexed: 11/24/2022] Open
Abstract
Social data in digital form-including user-generated content, expressed or implicit relations between people, and behavioral traces-are at the core of popular applications and platforms, driving the research agenda of many researchers. The promises of social data are many, including understanding "what the world thinks" about a social issue, brand, celebrity, or other entity, as well as enabling better decision-making in a variety of fields including public policy, healthcare, and economics. Many academics and practitioners have warned against the naïve usage of social data. There are biases and inaccuracies occurring at the source of the data, but also introduced during processing. There are methodological limitations and pitfalls, as well as ethical boundaries and unexpected consequences that are often overlooked. This paper recognizes the rigor with which these issues are addressed by different researchers varies across a wide range. We identify a variety of menaces in the practices around social data use, and organize them in a framework that helps to identify them. "For your own sanity, you have to remember that not all problems can be solved. Not all problems can be solved, but all problems can be illuminated." -Ursula Franklin.
Collapse
Affiliation(s)
- Alexandra Olteanu
- Microsoft Research, New York, NY, United States
- Microsoft Research, Montreal, QC, Canada
| | - Carlos Castillo
- Department of Information and Communication Technologies, Universitat Pompeu Fabra, Barcelona, Spain
| | | | | |
Collapse
|
27
|
Ottenbacher KJ, Graham JE, Fisher SR. Data Science in Physical Medicine and Rehabilitation. Phys Med Rehabil Clin N Am 2019; 30:459-471. [DOI: 10.1016/j.pmr.2018.12.003] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
28
|
Waylen KA, Blackstock KL, van Hulst FJ, Damian C, Horváth F, Johnson RK, Kanka R, Külvik M, Macleod CJA, Meissner K, Oprina-Pavelescu MM, Pino J, Primmer E, Rîșnoveanu G, Šatalová B, Silander J, Špulerová J, Suškevičs M, Van Uytvanck J. Policy-driven monitoring and evaluation: Does it support adaptive management of socio-ecological systems? THE SCIENCE OF THE TOTAL ENVIRONMENT 2019; 662:373-384. [PMID: 30690371 DOI: 10.1016/j.scitotenv.2018.12.462] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/12/2018] [Revised: 12/29/2018] [Accepted: 12/30/2018] [Indexed: 05/17/2023]
Abstract
Inadequate Monitoring and Evaluation (M&E) is often thought to hinder adaptive management of socio-ecological systems. A key influence on environmental management practices are environmental policies: however, their consequences for M&E practices have not been well-examined. We examine three policy areas - the Water Framework Directive, the Natura 2000 Directives, and the Agri-Environment Schemes of the Common Agricultural Policy - whose statutory requirements influence how the environment is managed and monitored across Europe. We use a comparative approach to examine what is monitored, how monitoring is carried out, and how results are used to update management, based on publicly available documentation across nine regional and national cases. The requirements and guidelines of these policies have provided significant impetus for monitoring: however, we find this policy-driven M&E usually does not match the ideals of what is needed to inform adaptive management. There is a tendency to focus on understanding state and trends rather than tracking the effect of interventions; a focus on specific biotic and abiotic indicators at the expense of understanding system functions and processes, especially social components; and limited attention to how context affects systems, though this is sometimes considered via secondary data. The resulting data are sometimes publicly-accessible, but it is rarely clear if and how these influence decisions at any level, whether this be in the original policy itself or at the level of measures such as site management plans. Adjustments to policy-driven M&E could better enable learning for adaptive management, by reconsidering what supports a balanced understanding of socio-ecological systems and decision-making. Useful strategies include making more use of secondary data, and more transparency in data-sharing and decision-making. Several countries and policy areas already offer useful examples. Such changes are essential given the influence of policy, and the urgency of enabling adaptive management to safeguard socio-ecological systems.
Collapse
Affiliation(s)
- Kerry A Waylen
- Social, Economic & Geographical Sciences, The James Hutton Institute, Cragiebuckler, Scotland AB15 8QH, UK.
| | - Kirsty L Blackstock
- Social, Economic & Geographical Sciences, The James Hutton Institute, Cragiebuckler, Scotland AB15 8QH, UK
| | - Freddy J van Hulst
- Social, Economic & Geographical Sciences, The James Hutton Institute, Cragiebuckler, Scotland AB15 8QH, UK
| | - Carmen Damian
- Department of Systems Ecology and Sustainability, University of Bucharest, 91-95 Spl. Independentei, Bucharest 050095, Romania
| | - Ferenc Horváth
- Institute of Ecology and Botany, Centre for Ecological Research, Hungarian Academy of Sciences, Alkotmány u. 2-4, 2163 Vácrátót, Hungary
| | - Richard K Johnson
- Department of Aquatic Sciences and Assessment, Swedish University of Agricultural Sciences, Box 7050, 750 07 Uppsala, Sweden
| | - Robert Kanka
- Institute of Landscape Ecology of the Slovak Academy of Sciences, Stefanikova 3, 814 99 Bratislava, Slovakia
| | - Mart Külvik
- Institute of Agricultural and Environmental Sciences, Estonian University of Life Sciences, Kreutzwaldi 5, 51006 Tartu, Estonia
| | - Christopher J A Macleod
- Information and Computational Sciences, The James Hutton Institute, Cragiebuckler, Scotland AB15 8QH, UK
| | - Kristian Meissner
- Programme for Environmental Information, Finnish Environment Institute - SYKE, Survontie 9a, 40500 Jyväskylä, Finland
| | - Mihaela M Oprina-Pavelescu
- Department of Systems Ecology and Sustainability, University of Bucharest, 91-95 Spl. Independentei, Bucharest 050095, Romania
| | - Joan Pino
- Centre for Research on Ecology and Forestry Applications - CREAF, Universitat Autònoma de Barcelona, E08193 Bellaterra (Cerdanyola del Vallès), Catalonia, Spain
| | - Eeva Primmer
- Programme for Environmental Information, Finnish Environment Institute - SYKE, Survontie 9a, 40500 Jyväskylä, Finland
| | - Geta Rîșnoveanu
- Department of Systems Ecology and Sustainability, University of Bucharest, 91-95 Spl. Independentei, Bucharest 050095, Romania
| | - Barbora Šatalová
- Institute of Landscape Ecology of the Slovak Academy of Sciences, Stefanikova 3, 814 99 Bratislava, Slovakia
| | - Jari Silander
- Freshwater Centre, Finnish Environment Institute - SYKE, P.O. Box 140 00251, Helsinki, Finland
| | - Jana Špulerová
- Institute of Landscape Ecology of the Slovak Academy of Sciences, Stefanikova 3, 814 99 Bratislava, Slovakia
| | - Monika Suškevičs
- Institute of Agricultural and Environmental Sciences, Estonian University of Life Sciences, Kreutzwaldi 5, 51006 Tartu, Estonia
| | - Jan Van Uytvanck
- Research Institute for Nature and Forest (INBO), Havenlaan 88 bus 73, 1000 Brussels, Belgium
| |
Collapse
|
29
|
Happy parents’ tweets: An exploration of Italian Twitter data using sentiment analysis. DEMOGRAPHIC RESEARCH 2019. [DOI: 10.4054/demres.2019.40.25] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open
|
30
|
Employee engagement and job performance in Lebanon: the mediating role of creativity. INTERNATIONAL JOURNAL OF PRODUCTIVITY AND PERFORMANCE MANAGEMENT 2019. [DOI: 10.1108/ijppm-02-2018-0052] [Citation(s) in RCA: 39] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Purpose
The purpose of this paper is to explore the relationship between employee engagement and job performance in the country of Lebanon, and to test whether creativity mediates the relationship between engagement and performance.
Design/methodology/approach
The research sample consisted of 186 respondents working in Lebanese firms. The questionnaire included established measures relating to employee engagement, job performance and creativity – in addition to various demographic questions. Stepwise multiple regression and bootstrapping methods were employed in the analysis of the data.
Findings
The findings showed a significant positive effect of employee engagement on job performance. However, mediation analysis using bootstrapping methods has shown that creativity has fully mediated the relationship between engagement and performance.
Originality/value
The study extends previous research and increases the external validity of the findings by investigating the relationship between engagement and performance in new non-western contexts. Moreover, this is one of the first research studies that explores the role of creativity in the relationship between the two variables; this helps in improving our understanding of the model and aids in enhancing the effect of engagement on performance.
Collapse
|
31
|
Abstract
The local-national gap is a problem currently plaguing the adoption of emerging technologies targeted at resolving energy transition issues that are characterized by disparities in the adoption of innovations and policies on a local level in response to national policy implementation. These disparities reflect a complex system of technical, economic, social, political, and ecological factors linked to the perceptions held by communities and how they see energy development and national/global policy goals. This dataset is an attempt to bridge the local-national gap regarding solar PV adoption in the State of Georgia (U.S.) by aggregating variables from seven different publicly-available sources. The objective of this activity was to design a resource that would help researchers interested in the context underlying solar adoption on the local scale of governance (e.g., the county level). The SolarView database includes information necessary for informing policy-making activities such as solar installation information, a historical county zip code directory, county-level census data, housing value indexes, renewable energy incentive totals, PV rooftop suitability percentages, and utility rates. As this is a database from multiple sources, incomplete data entries are noted.
Collapse
|
32
|
Social media analytics – Challenges in topic discovery, data collection, and data preparation. INTERNATIONAL JOURNAL OF INFORMATION MANAGEMENT 2018. [DOI: 10.1016/j.ijinfomgt.2017.12.002] [Citation(s) in RCA: 358] [Impact Index Per Article: 51.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
33
|
Big data can cause big mistakes: using the Societas Europaea Herpetologica atlas by Sillero et al. (2014), the distribution of Emys orbicularis will be misunderstood. Biologia (Bratisl) 2018. [DOI: 10.1007/bf00341563] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
34
|
Big data can cause big mistakes: using the Societas Europaea Herpetologica atlas by Sillero et al. (2014), the distribution of Emys orbicularis will be misunderstood. Biologia (Bratisl) 2018. [DOI: 10.2478/s11756-018-0033-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
35
|
An entropic barriers diffusion theory of decision-making in multiple alternative tasks. PLoS Comput Biol 2018; 14:e1005961. [PMID: 29499036 PMCID: PMC5851639 DOI: 10.1371/journal.pcbi.1005961] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2017] [Revised: 03/14/2018] [Accepted: 01/05/2018] [Indexed: 12/03/2022] Open
Abstract
We present a theory of decision-making in the presence of multiple choices that departs from traditional approaches by explicitly incorporating entropic barriers in a stochastic search process. We analyze response time data from an on-line repository of 15 million blitz chess games, and show that our model fits not just the mean and variance, but the entire response time distribution (over several response-time orders of magnitude) at every stage of the game. We apply the model to show that (a) higher cognitive expertise corresponds to the exploration of more complex solution spaces, and (b) reaction times of users at an on-line buying website can be similarly explained. Our model can be seen as a synergy between diffusion models used to model simple two-choice decision-making and planning agents in complex problem solving. Decision-making has been studied in great detail relying on binary choices, modeled as the noisy accumulation of a decision variable to a threshold. We show that it breaks down when used to describe real-life human decision involving multiple options. We show instead that including obstacles in the diffusion model (a natural conceptual extension) can describe the data with great degree of accuracy. We evaluate this new model by capitalizing on the advent of big data, analyzing a vast corpus of decision making obtained from on-line chess servers. The present manuscript resolves a conflict between current theories of decision-making and concrete data, it solves this data with a concrete theoretical proposal and analyzes specific predictions of the model.
Collapse
|
36
|
Gutmann MP, Merchant EK, Roberts E. "Big data" in economic history. THE JOURNAL OF ECONOMIC HISTORY 2018; 78:268-299. [PMID: 29713093 PMCID: PMC5922781 DOI: 10.1017/s0022050718000177] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Big data is an exciting prospect for the field of economic history, which has long depended on the acquisition, keying, and cleaning of scarce numerical information about the past. This article examines two areas in which economic historians are already using big data - population and environment - discussing ways in which increased frequency of observation, denser samples, and smaller geographic units allow us to analyze the past with greater precision and often to track individuals, places, and phenomena across time. We also explore promising new sources of big data: organically created economic data, high resolution images, and textual corpora.
Collapse
Affiliation(s)
- Myron P Gutmann
- Department of History and Institute of Behavioral Science, University of Colorado
| | | | - Evan Roberts
- Department of Sociology and Minnesota Population Center, University of Minnesota
| |
Collapse
|
37
|
Abstract
Purpose
The purpose of this paper is twofold: first, to further develop Paul Edwards’ concept of “data friction” by examining the socio-material forces that are shaping data movements in the cases of research data and online communications data, second, to articulate a politics of data friction, identifying the interrelated infrastructural, socio-cultural and regulatory dynamics of data friction, and how these are contributing to the constitution of social relations.
Design/methodology/approach
The paper develops a hermeneutic review of the literature on socio-material factors influencing the movement of digital data between social actors in the cases of research data sharing and online communications data. Parallels between the two cases are identified and used to further develop understanding of the politics of “data friction” beyond the concept’s current usage within the Science Studies literature.
Findings
A number of overarching parallels are identified relating to the ways in which new data flows and the frictions that shape them bring social actors into new forms of relation with one another, the platformisation of infrastructures for data circulation, and state action to influence the dynamics of data movement. Moments and sites of “data friction” are identified as deeply political – resulting from the collective decisions of human actors who experience significantly different levels of empowerment with regard to shaping the overall outcome.
Research limitations/implications
The paper further develops Paul Edwards’ concept of “data friction” beyond its current application in Science Studies. Analysis of the broader dynamics of data friction across different cases identifies a number of parallels that require further empirical examination and theorisation.
Practical implications
The observation that sites of data friction are deeply political has significant implications for all engaged in the practice and management of digital data production, circulation and use.
Social implications
It is argued that the concept of “data friction” can help social actors identify, examine and act upon some of the complex socio-material dynamics shaping emergent data movements across a variety of domains, and inform deliberation at all levels – from everyday practice to international regulation – about how such frictions can be collectively shaped towards the creation of more equitable and just societies.
Originality/value
The paper makes an original contribution to the literature on friction in the dynamics of digital data movement, arguing that in many cases data friction may be something to enable and foster, rather than overcome. It also brings together literature from diverse disciplinary fields to examine these frictional dynamics within two cases that have not previously been examined in relation to one another.
Collapse
|
38
|
Tian R, Yang P, Wang KZ. Joint Registration System under the Background of Big Data. Chin Med J (Engl) 2017; 130:2524-2526. [PMID: 29067949 PMCID: PMC5678248 DOI: 10.4103/0366-6999.217079] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2017] [Indexed: 01/06/2023] Open
Affiliation(s)
- Run Tian
- Department of Orthopedics, The Second Affiliated Hospital of Xi’an Jiaotong University, Xi’an, Shaanxi 710004, China
| | - Pei Yang
- Department of Orthopedics, The Second Affiliated Hospital of Xi’an Jiaotong University, Xi’an, Shaanxi 710004, China
| | - Kun-Zheng Wang
- Department of Orthopedics, The Second Affiliated Hospital of Xi’an Jiaotong University, Xi’an, Shaanxi 710004, China
| |
Collapse
|
39
|
GUPTA VISHAL, SINGH SHAILENDRA, BHATTACHARYA ABHIJIT. THE RELATIONSHIPS BETWEEN LEADERSHIP, WORK ENGAGEMENT AND EMPLOYEE INNOVATIVE PERFORMANCE: EMPIRICAL EVIDENCE FROM THE INDIAN R&D CONTEXT. INTERNATIONAL JOURNAL OF INNOVATION MANAGEMENT 2017. [DOI: 10.1142/s1363919617500554] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Integrating the behavioural theory of leadership with job demands–resources theory of engagement, the present study examines the process through which leadership impact R&D professionals’ innovative work behaviours and innovative performance (measured through peer-reviewed journal papers, patents, PhDs guided and keynote addresses delivered). Data from 467 scientists working in India’s largest civilian R&D organisation were collected and analysed using structural equation modelling. The study found that work engagement was positively related to innovative work behaviours as well as innovative performance. Leader behaviours had significant indirect effects on innovative work behaviours as well as innovative performance via work engagement. While the total effect of leadership on innovative work behaviours, the total effect was non-significant for innovative performance. Implications for theory and practice are discussed.
Collapse
Affiliation(s)
- VISHAL GUPTA
- Organizational Behavior Area, Indian Institute of Management Ahmedabad, Gujarat, India
| | - SHAILENDRA SINGH
- Human Resource Management Group, Indian Institute of Management Lucknow, Uttar Pradesh, India
| | - ABHIJIT BHATTACHARYA
- Decision Science Group, Indian Institute of Management Lucknow, Uttar Pradesh, India
| |
Collapse
|
40
|
Affiliation(s)
- Gianmaria Silvello
- Department of Information Engineering; University of Padua; Via Gradenigo 6/b, Padua Italy
| |
Collapse
|
41
|
Zadelaar JN, Agelink van Rentergem JA, Huizenga HM. Univariate comparisons given aggregated normative data. Clin Neuropsychol 2017; 31:1155-1172. [PMID: 28679311 DOI: 10.1080/13854046.2017.1348542] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
OBJECTIVE Normative comparison is a method to compare an individual to a norm group. It is commonly used in neuropsychological assessment to determine if a patient's cognitive capacities deviate from those of a healthy population. Neuropsychological assessment often involves multiple testing, which might increase the familywise error rate (FWER). Recently, several correction methods have been proposed to reduce the FWER. However these methods require that multivariate normative data are available, which is often not the case. We propose to obtain these data by merging the control group data of existing studies into an aggregated database. In this paper, we study how the correction methods fare given such an aggregated normative database. METHODS In a simulation study mimicking the aggregated database situation, we compared applying no correction, the Bonferroni correction, a maximum distribution approach and a stepwise approach on their FWER and their power to detect genuine deviations. RESULTS If the aggregated database contained data on all neuropsychological tests, the stepwise approach outperformed the other methods with respect to the FWER and power. However, if data were missing, the Bonferroni correction produced the lowest FWER. DISCUSSION Overall, the stepwise approach appears to be the most suitable normative comparison method for use in neuropsychological assessment. When the norm data contained large amounts of missing data, the Bonferroni correction proved best. Advice of which method to use in different situations is provided.
Collapse
Affiliation(s)
| | | | - Hilde M Huizenga
- a Department of Psychology , University of Amsterdam , Amsterdam , The Netherlands.,b Amsterdam Brain and Cognition Center Amsterdam, University of Amsterdam , Amsterdam , The Netherlands.,c Research Priority Area Yield , University of Amsterdam , Amsterdam , The Netherlands
| |
Collapse
|
42
|
|
43
|
Fdez-Arroyabe P, Roye D. Co-creation and Participatory Design of Big Data Infrastructures on the Field of Human Health Related Climate Services. STUDIES IN BIG DATA 2017. [DOI: 10.1007/978-3-319-49736-5_9] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
|
44
|
Lepri B, Staiano J, Sangokoya D, Letouzé E, Oliver N. The Tyranny of Data? The Bright and Dark Sides of Data-Driven Decision-Making for Social Good. STUDIES IN BIG DATA 2017. [DOI: 10.1007/978-3-319-54024-5_1] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
|
45
|
Elliott KC, Cheruvelil KS, Montgomery GM, Soranno PA. Conceptions of Good Science in Our Data-Rich World. Bioscience 2016; 66:880-889. [PMID: 29599533 PMCID: PMC5862324 DOI: 10.1093/biosci/biw115] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Scientists have been debating for centuries the nature of proper scientific methods. Currently, criticisms being thrown at data-intensive science are reinvigorating these debates. However, many of these criticisms represent long-standing conflicts over the role of hypothesis testing in science and not just a dispute about the amount of data used. Here, we show that an iterative account of scientific methods developed by historians and philosophers of science can help make sense of data-intensive scientific practices and suggest more effective ways to evaluate this research. We use case studies of Darwin's research on evolution by natural selection and modern-day research on macrosystems ecology to illustrate this account of scientific methods and the innovative approaches to scientific evaluation that it encourages. We point out recent changes in the spheres of science funding, publishing, and education that reflect this richer account of scientific practice, and we propose additional reforms.
Collapse
Affiliation(s)
- Kevin C Elliott
- Kevin C. Elliott is an associate professor in Lyman Briggs College, the Department of Fisheries and Wildlife, and the Department of Philosophy; Kendra S. Cheruvelil is an associate professor in Lyman Briggs College and the Department of Fisheries and Wildlife; Georgina M. Montgomery is an associate professor in Lyman Briggs College and the Department of History; and Patricia A. Soranno is a professor in the Department of Fisheries and Wildlife at Michigan State University, in East Lansing. All authors contributed equally to the conceptualization of the paper and the supporting research. KCE organized the collaboration and initiated the writing process. All authors contributed text, reviewed manuscript drafts, and approved the final version
| | - Kendra S Cheruvelil
- Kevin C. Elliott is an associate professor in Lyman Briggs College, the Department of Fisheries and Wildlife, and the Department of Philosophy; Kendra S. Cheruvelil is an associate professor in Lyman Briggs College and the Department of Fisheries and Wildlife; Georgina M. Montgomery is an associate professor in Lyman Briggs College and the Department of History; and Patricia A. Soranno is a professor in the Department of Fisheries and Wildlife at Michigan State University, in East Lansing. All authors contributed equally to the conceptualization of the paper and the supporting research. KCE organized the collaboration and initiated the writing process. All authors contributed text, reviewed manuscript drafts, and approved the final version
| | - Georgina M Montgomery
- Kevin C. Elliott is an associate professor in Lyman Briggs College, the Department of Fisheries and Wildlife, and the Department of Philosophy; Kendra S. Cheruvelil is an associate professor in Lyman Briggs College and the Department of Fisheries and Wildlife; Georgina M. Montgomery is an associate professor in Lyman Briggs College and the Department of History; and Patricia A. Soranno is a professor in the Department of Fisheries and Wildlife at Michigan State University, in East Lansing. All authors contributed equally to the conceptualization of the paper and the supporting research. KCE organized the collaboration and initiated the writing process. All authors contributed text, reviewed manuscript drafts, and approved the final version
| | - Patricia A Soranno
- Kevin C. Elliott is an associate professor in Lyman Briggs College, the Department of Fisheries and Wildlife, and the Department of Philosophy; Kendra S. Cheruvelil is an associate professor in Lyman Briggs College and the Department of Fisheries and Wildlife; Georgina M. Montgomery is an associate professor in Lyman Briggs College and the Department of History; and Patricia A. Soranno is a professor in the Department of Fisheries and Wildlife at Michigan State University, in East Lansing. All authors contributed equally to the conceptualization of the paper and the supporting research. KCE organized the collaboration and initiated the writing process. All authors contributed text, reviewed manuscript drafts, and approved the final version
| |
Collapse
|
46
|
Fitzhugh SM, Ben Gibson C, Spiro ES, Butts CT. Spatio-temporal filtering techniques for the detection of disaster-related communication. SOCIAL SCIENCE RESEARCH 2016; 59:137-154. [PMID: 27480377 DOI: 10.1016/j.ssresearch.2016.04.023] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/30/2015] [Revised: 04/15/2016] [Accepted: 04/26/2016] [Indexed: 06/06/2023]
Abstract
Individuals predominantly exchange information with one another through informal, interpersonal channels. During disasters and other disrupted settings, information spread through informal channels regularly outpaces official information provided by public officials and the press. Social scientists have long examined this kind of informal communication in the rumoring literature, but studying rumoring in disrupted settings has posed numerous methodological challenges. Measuring features of informal communication-timing, content, location-with any degree of precision has historically been extremely challenging in small studies and infeasible at large scales. We address this challenge by using online, informal communication from a popular microblogging website and for which we have precise spatial and temporal metadata. While the online environment provides a new means for observing rumoring, the abundance of data poses challenges for parsing hazard-related rumoring from countless other topics in numerous streams of communication. Rumoring about disaster events is typically temporally and spatially constrained to places where that event is salient. Accordingly, we use spatio and temporal subsampling to increase the resolution of our detection techniques. By filtering out data from known sources of error (per rumor theories), we greatly enhance the signal of disaster-related rumoring activity. We use these spatio-temporal filtering techniques to detect rumoring during a variety of disaster events, from high-casualty events in major population centers to minimally destructive events in remote areas. We consistently find three phases of response: anticipatory excitation where warnings and alerts are issued ahead of an event, primary excitation in and around the impacted area, and secondary excitation which frequently brings a convergence of attention from distant locales onto locations impacted by the event. Our results demonstrate the promise of spatio-temporal filtering techniques for "tuning" measurement of hazard-related rumoring to enable observation of rumoring at scales that have long been infeasible.
Collapse
Affiliation(s)
- Sean M Fitzhugh
- Department of Sociology, University of California, 3151 Social Science Plaza A, Irvine, CA 92697, USA.
| | - C Ben Gibson
- Department of Sociology, University of California, 3151 Social Science Plaza A, Irvine, CA 92697, USA.
| | - Emma S Spiro
- Information School, Mary Gates Hall, 370, University of Washington, Seattle, WA 98195, USA.
| | - Carter T Butts
- Department of Sociology, University of California, 3151 Social Science Plaza A, Irvine, CA 92697, USA; Institute for Mathematical Behavioral Sciences, Department of Statistics, Bren Hall 2019, University of California, Irvine, CA 92697, USA; Department of Electrical Engineering and Computer Sciences, 2200 Engineering Hall, University of California, Irvine, CA 92697, USA.
| |
Collapse
|
47
|
Park HW, Yoon J, Leydesdorff L. The normalization of co-authorship networks in the bibliometric evaluation: the government stimulation programs of China and Korea. Scientometrics 2016. [DOI: 10.1007/s11192-016-1978-2] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
48
|
Gutiérrez-Roig M, Sagarra O, Oltra A, Palmer JRB, Bartumeus F, Díaz-Guilera A, Perelló J. Active and reactive behaviour in human mobility: the influence of attraction points on pedestrians. ROYAL SOCIETY OPEN SCIENCE 2016; 3:160177. [PMID: 27493774 PMCID: PMC4968466 DOI: 10.1098/rsos.160177] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/10/2016] [Accepted: 06/16/2016] [Indexed: 06/06/2023]
Abstract
Human mobility is becoming an accessible field of study, thanks to the progress and availability of tracking technologies as a common feature of smart phones. We describe an example of a scalable experiment exploiting these circumstances at a public, outdoor fair in Barcelona (Spain). Participants were tracked while wandering through an open space with activity stands attracting their attention. We develop a general modelling framework based on Langevin dynamics, which allows us to test the influence of two distinct types of ingredients on mobility: reactive or context-dependent factors, modelled by means of a force field generated by attraction points in a given spatial configuration and active or inherent factors, modelled from intrinsic movement patterns of the subjects. The additive and constructive framework model accounts for some observed features. Starting with the simplest model (purely random walkers) as a reference, we progressively introduce different ingredients such as persistence, memory and perceptual landscape, aiming to untangle active and reactive contributions and quantify their respective relevance. The proposed approach may help in anticipating the spatial distribution of citizens in alternative scenarios and in improving the design of public events based on a facts-based approach.
Collapse
Affiliation(s)
- M. Gutiérrez-Roig
- Departament de Física Fonamental, Universitat de Barcelona. Martí i Franqués 1, 08028 Barcelona, Spain
| | - O. Sagarra
- Departament de Física Fonamental, Universitat de Barcelona. Martí i Franqués 1, 08028 Barcelona, Spain
- Universitat de Barcelona Institute of Complex Systems UBICS, 08028 Barcelona, Spain
| | - A. Oltra
- Centre d’Estudis Avançats de Blanes (CEAB-CSIC), Accés a la Cala Sant Francesc, 17300 Blanes, Spain
- Centre de Recerca Ecològica i Aplicacions Forestals (CREAF), Campus de Bellaterra (UAB) Edifici C, 08193 Cerdanyola del Vallès, Spain
| | - J. R. B. Palmer
- Centre de Recerca Ecològica i Aplicacions Forestals (CREAF), Campus de Bellaterra (UAB) Edifici C, 08193 Cerdanyola del Vallès, Spain
| | - F. Bartumeus
- Centre d’Estudis Avançats de Blanes (CEAB-CSIC), Accés a la Cala Sant Francesc, 17300 Blanes, Spain
- Centre de Recerca Ecològica i Aplicacions Forestals (CREAF), Campus de Bellaterra (UAB) Edifici C, 08193 Cerdanyola del Vallès, Spain
- ICREA, Passeig Lluis Companys 23, 08010 Barcelona, Spain
| | - A. Díaz-Guilera
- Departament de Física Fonamental, Universitat de Barcelona. Martí i Franqués 1, 08028 Barcelona, Spain
- Universitat de Barcelona Institute of Complex Systems UBICS, 08028 Barcelona, Spain
| | - J. Perelló
- Departament de Física Fonamental, Universitat de Barcelona. Martí i Franqués 1, 08028 Barcelona, Spain
- Universitat de Barcelona Institute of Complex Systems UBICS, 08028 Barcelona, Spain
| |
Collapse
|
49
|
Realising the technological promise of smartphones in addiction research and treatment: An ethical review. THE INTERNATIONAL JOURNAL OF DRUG POLICY 2016; 36:47-57. [PMID: 27455467 DOI: 10.1016/j.drugpo.2016.05.013] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2015] [Revised: 05/25/2016] [Accepted: 05/25/2016] [Indexed: 01/20/2023]
Abstract
BACKGROUND Smartphone technologies and mHealth applications (or apps) promise unprecedented scope for data collection, treatment intervention, and relapse prevention when used in the field of substance abuse and addiction. This potential also raises new ethical challenges that researchers, clinicians, and software developers must address. AIMS This paper aims to identify ethical issues in the current uses of smartphones in addiction research and treatment. METHODS A search of three databases (PubMed, Web of Science and PsycInfo) identified 33 studies involving smartphones or mHealth applications for use in the research and treatment of substance abuse and addiction. A content analysis was conducted to identify how smartphones are being used in these fields and to highlight the ethical issues raised by these studies. RESULTS Smartphones are being used to collect large amounts of sensitive information, including personal information, geo-location, physiological activity, self-reports of mood and cravings, and the consumption of illicit drugs, alcohol and nicotine. Given that detailed information is being collected about potentially illegal behaviour, we identified the following ethical considerations: protecting user privacy, maximising equity in access, ensuring informed consent, providing participants with adequate clinical resources, communicating clinically relevant results to individuals, and the urgent need to demonstrate evidence of safety and efficacy of the technologies. CONCLUSIONS mHealth technology offers the possibility to collect large amounts of valuable personal information that may enhance research and treatment of substance abuse and addiction. To realise this potential researchers, clinicians and app-developers must address these ethical concerns to maximise the benefits and minimise risks of harm to users.
Collapse
|
50
|
Seresinhe CI, Preis T, Moat HS. Quantifying the link between art and property prices in urban neighbourhoods. ROYAL SOCIETY OPEN SCIENCE 2016; 3:160146. [PMID: 27152228 PMCID: PMC4852651 DOI: 10.1098/rsos.160146] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/29/2016] [Accepted: 03/29/2016] [Indexed: 06/05/2023]
Abstract
Is there an association between art and changes in the economic conditions of urban neighbourhoods? While the popular media and policymakers commonly believe this to be the case, quantitative evidence remains lacking. Here, we use metadata of geotagged photographs uploaded to the popular image-sharing platform Flickr to quantify the presence of art in London neighbourhoods. We estimate the presence of art in neighbourhoods by determining the proportion of Flickr photographs which have the word 'art' attached. We compare this with the relative gain in residential property prices for each Inner London neighbourhood. We find that neighbourhoods which have a higher proportion of 'art' photographs also have greater relative gains in property prices. Our findings demonstrate how online data can be used to quantify aspects of the visual environment at scale and reveal new connections between the visual environment and crucial socio-economic measurements.
Collapse
|