Author

Monya Baker

Bio: Monya Baker is an academic researcher. The author has contributed to research on topics including Agency (sociology) and Lift (data mining). The author has an h-index of 5 and has co-authored 7 publications receiving 1,971 citations.

Papers
Journal ArticleDOI
26 May 2016-Nature

2,609 citations

Journal ArticleDOI
13 Sep 2016-Nature
TL;DR: 'Reproducibility editor' Victoria Stodden explains the growing movement to make code and data available to others.
Abstract: 'Reproducibility editor' Victoria Stodden explains the growing movement to make code and data available to others.

23 citations

Journal ArticleDOI
14 Jun 2016-Nature
TL;DR: Researchers tease out different definitions of a crucial scientific term in a bid to clarify its meaning.
Abstract: Researchers tease out different definitions of a crucial scientific term.

22 citations

Journal ArticleDOI
20 Jul 2016-Nature
TL;DR: A three-year pilot devotes €3 million to verifying other studies.
Abstract: Three-year pilot devotes €3 million to verifying other studies.

13 citations

Journal ArticleDOI
03 Mar 2016-Nature
TL;DR: A reanalysis of last year's enormous replication study argues that there is no need to be so pessimistic about its results.
Abstract: Reanalysis of last year's enormous replication study argues that there is no need to be so pessimistic.

12 citations


Cited by
Journal ArticleDOI
TL;DR: This work argues for the adoption of measures to optimize key elements of the scientific process: methods, reporting and dissemination, reproducibility, evaluation and incentives, in the hope that this will facilitate action toward improving the transparency, reproducibility and efficiency of scientific research.
Abstract: Improving the reliability and efficiency of scientific research will increase the credibility of the published scientific literature and accelerate discovery. Here we argue for the adoption of measures to optimize key elements of the scientific process: methods, reporting and dissemination, reproducibility, evaluation and incentives. There is some evidence from both simulations and empirical studies supporting the likely effectiveness of these measures, but their broad adoption by researchers, institutions, funders and journals will require iterative evaluation and improvement. We discuss the goals of these measures, and how they can be implemented, in the hope that this will facilitate action toward improving the transparency, reproducibility and efficiency of scientific research.

1,951 citations

Journal ArticleDOI
TL;DR: It is found that peer beliefs of replicability are strongly related to replicability, suggesting that the research community could predict which results would replicate and that failures to replicate were not the result of chance alone.
Abstract: Being able to replicate scientific findings is crucial for scientific progress. We replicate 21 systematically selected experimental studies in the social sciences published in Nature and Science between 2010 and 2015. The replications follow analysis plans reviewed by the original authors and pre-registered prior to the replications. The replications are high powered, with sample sizes on average about five times higher than in the original studies. We find a significant effect in the same direction as the original study for 13 (62%) studies, and the effect size of the replications is on average about 50% of the original effect size. Replicability varies between 12 (57%) and 14 (67%) studies for complementary replicability indicators. Consistent with these results, the estimated true-positive rate is 67% in a Bayesian analysis. The relative effect size of true positives is estimated to be 71%, suggesting that both false positives and inflated effect sizes of true positives contribute to imperfect reproducibility. Furthermore, we find that peer beliefs of replicability are strongly related to replicability, suggesting that the research community could predict which results would replicate and that failures to replicate were not the result of chance alone.

759 citations

Journal ArticleDOI
TL;DR: The authors present Bioconda, a distribution of bioinformatics software for the lightweight, multi-platform and language-agnostic package manager Conda, which improves analysis reproducibility by allowing users to define isolated environments with defined software versions.
Abstract: We present Bioconda (https://bioconda.github.io), a distribution of bioinformatics software for the lightweight, multi-platform and language-agnostic package manager Conda. Currently, Bioconda offers a collection of over 3000 software packages, which is continuously maintained, updated, and extended by a growing global community of more than 200 contributors. Bioconda improves analysis reproducibility by allowing users to define isolated environments with defined software versions, all of which are easily installed and managed without administrative privileges.
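As a rough, non-authoritative illustration of the mechanism this abstract describes (isolated environments with pinned software versions), the minimal Python sketch below shells out to the conda CLI. It assumes conda is installed and on PATH; the environment name, channel order and package versions are placeholders chosen for the example, not taken from the paper.

```python
# Minimal sketch (not from the paper): create an isolated Bioconda environment
# with pinned tool versions by calling the conda CLI. Assumes conda is
# installed and on PATH; names and versions below are illustrative.
import subprocess

def create_bioconda_env(name: str, packages: list[str]) -> None:
    """Create a named conda environment pulling from conda-forge and bioconda."""
    subprocess.run(
        ["conda", "create", "--yes", "--name", name,
         "--channel", "conda-forge", "--channel", "bioconda",
         *packages],
        check=True,  # raise if conda exits with a non-zero status
    )

if __name__ == "__main__":
    # Pinning exact versions is what makes the analysis environment reproducible.
    create_bioconda_env("rnaseq-analysis", ["samtools=1.17", "salmon=1.10.1"])
```

Because conda installs everything under the user's own environment directory, no administrative privileges are required, which is the property the abstract highlights.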

699 citations

Journal ArticleDOI
TL;DR: The nf-core framework is introduced as a means for the development of collaborative, peer-reviewed, best-practice analysis pipelines that can be used across all institutions and research facilities and that offer a higher degree of portability compared with custom in-house scripts.
Abstract: To the Editor — The standardization, portability and reproducibility of analysis pipelines are key issues within the bioinformatics community. Most bioinformatics pipelines are designed for use on-premises; as a result, the associated software dependencies and execution logic are likely to be tightly coupled with proprietary computing environments. This can make it difficult or even impossible for others to reproduce the ensuing results, which is a fundamental requirement for the validation of scientific findings. Here, we introduce the nf-core framework as a means for the development of collaborative, peer-reviewed, best-practice analysis pipelines (Fig. 1). All nf-core pipelines are written in Nextflow and so inherit the ability to be executed on most computational infrastructures, as well as having native support for container technologies such as Docker and Singularity. The nf-core community (Supplementary Fig. 1) has developed a suite of tools that automate pipeline creation, testing, deployment and synchronization. Our goal is to provide a framework for high-quality bioinformatics pipelines that can be used across all institutions and research facilities. Being able to reproduce scientific results is the central tenet of the scientific method. However, moving toward FAIR (findable, accessible, interoperable and reusable) research methods [1] in data-driven science is complex [2,3]. Central repositories, such as bio.tools [4], omictools [5] and the Galaxy toolshed [6], make it possible to find existing pipelines and their associated tools. However, it is still notoriously challenging to develop analysis pipelines that are fully reproducible and interoperable across multiple systems and institutions, primarily because of differences in hardware, operating systems and software versions. Although the recommended guidelines for some analysis pipelines have become standardized (for example, GATK best practices [7]), the actual implementations are usually developed on a case-by-case basis. As such, there is often little incentive to test, document and implement pipelines in a way that permits their reuse by other researchers. This can hamper sustainable sharing of data and tools, and results in a proliferation of heterogeneous analysis pipelines, making it difficult for newcomers to find what they need to address a specific analysis question. As the scale of -omics data and their associated analytical tools has grown, the scientific community is increasingly moving toward the use of specialized workflow management systems to build analysis pipelines [8]. They separate the requirements of the underlying compute infrastructure from the analysis and workflow description, introducing a higher degree of portability as compared to custom in-house scripts. One such popular tool is Nextflow [9]. Using Nextflow, software packages can be bundled with analysis pipelines using built-in integration for package managers, such as Conda, and containerization platforms, such as Docker and Singularity. Moreover, support for most common high-performance computing batch schedulers and cloud providers allows simple deployment of analysis pipelines on almost any infrastructure. The opportunity to run pipelines locally during initial development and then to proceed seamlessly to large-scale computational resources in high-performance computing or cloud settings provides users and developers with great flexibility. The nf-core community project collects a curated set of best-practice analysis pipelines built using Nextflow.
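As a rough illustration of the launch pattern the letter describes (and not part of the letter itself), the short Python sketch below invokes Nextflow to run an nf-core pipeline. It assumes the nextflow binary and Docker are installed; the pipeline name, profile and output directory are placeholders chosen for the example.

```python
# Minimal sketch (not from the letter): launch an nf-core pipeline via Nextflow.
# Assumes `nextflow` and Docker are installed; the pipeline, profile and output
# directory below are placeholders for this example.
import subprocess

def run_nf_core(pipeline: str, profile: str, outdir: str) -> None:
    """Run an nf-core pipeline; Nextflow pulls the pipeline code on demand."""
    subprocess.run(
        ["nextflow", "run", pipeline, "-profile", profile, "--outdir", outdir],
        check=True,  # raise if the pipeline run fails
    )

if __name__ == "__main__":
    # The bundled 'test' profile plus Docker gives a small, self-contained run.
    run_nf_core("nf-core/rnaseq", "test,docker", "results")
```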

663 citations

Journal ArticleDOI
TL;DR: Both the updated CONSORT (Consolidated Standards of Reporting Trials) extension for NPT trials and the CONSORT extension for NPT trial abstracts should help authors, editors, and peer reviewers improve the transparency of NPT trial reports.
Abstract: Incomplete and inadequate reporting is an avoidable waste that reduces the usefulness of research. The CONSORT (Consolidated Standards of Reporting Trials) Statement is an evidence-based reporting guideline that aims to improve research transparency and reduce waste. In 2008, the CONSORT Group developed an extension to the original statement that addressed methodological issues specific to trials of nonpharmacologic treatments (NPTs), such as surgery, rehabilitation, or psychotherapy. This article describes an update of that extension and presents an extension for reporting abstracts of NPT trials. To develop these materials, the authors reviewed pertinent literature published up to July 2016; surveyed authors of NPT trials; and conducted a consensus meeting with editors, trialists, and methodologists. Changes to the CONSORT Statement extension for NPT trials include wording modifications to improve readers' understanding and the addition of 3 new items. These items address whether and how adherence of participants to interventions is assessed or enhanced, description of attempts to limit bias if blinding is not possible, and specification of the delay between randomization and initiation of the intervention. The CONSORT extension for abstracts of NPT trials includes 2 new items that were not specified in the original CONSORT Statement for abstracts. The first addresses reporting of eligibility criteria for centers where the intervention is performed and for care providers. The second addresses reporting of important changes to the intervention versus what was planned. Both the updated CONSORT extension for NPT trials and the CONSORT extension for NPT trial abstracts should help authors, editors, and peer reviewers improve the transparency of NPT trial reports.

650 citations