Machine-driven meta-research : the application of big data approaches to map openness and transparency across the biomedical research literature

Placeholder Show Content

Abstract/Contents

Abstract
Recent concerns about the transparency and reproducibility of science have led to several calls for more open and transparent research practice. However, with almost 25,000 new biomedical articles published per week, manually mapping and understanding changes in transparency is unrealistic. In this dissertation, we develop the computational tools to study biomedical literature at a large scale. We first extract and study all preprints ever published in the biomedical literature to understand the extent to which the community has embraced this medium of quick and direct communication of information. We then extract and study all retracted articles to understand the extent to which the scientific community and the lay public are made aware of such retractions. We finally extract the entire open biomedical literature, develop automated algorithms to map indicators of transparency within this literature and characterize indicator distribution across time, fields of science and journals. Our results indicate that researchers are rapidly embracing opportunities to share their research as preprints and that these preprints receive substantial attention. They also indicate that instead of acting as a warning against inaccurate results, retractions of popular articles do not receive much attention and often inadvertently promote the original article. Finally, our results indicate that transparency of information is increasing over time, but uptake of some indicators (e.g. data sharing) lags behind others (e.g. conflicts of interest) and that practice varies enormously between fields of science. This work has also established two open source packages and an integrated database of the entire open biomedical literature to facilitate future research in the field. Our automated approach enables large-scale analyses that would have otherwise been unrealistic and establishes the foundations for computational and big data approaches to studying scientific research itself

Description

Type of resource text
Form electronic resource; remote; computer; online resource
Extent 1 online resource
Place California
Place [Stanford, California]
Publisher [Stanford University]
Copyright date 2020; ©2020
Publication date 2020; 2020
Issuance monographic
Language English

Creators/Contributors

Author Serghiou, Stylianos
Degree supervisor Iōannidēs, Iōannēs P. A
Thesis advisor Iōannidēs, Iōannēs P. A
Thesis advisor Baiocchi, Michael
Thesis advisor Goodman, Steven N
Thesis advisor Sainani, Kristin
Degree committee member Baiocchi, Michael
Degree committee member Goodman, Steven N
Degree committee member Sainani, Kristin
Associated with Stanford University, Program in Epidemiology

Subjects

Genre Theses
Genre Text

Bibliographic information

Statement of responsibility Stylianos Serghiou
Note Submitted to the Program in Epidemiology
Thesis Thesis Ph.D. Stanford University 2020
Location electronic resource

Access conditions

Copyright
© 2020 by Stylianos Serghiou
License
This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).

Also listed in

Loading usage metrics...