The Replication Crisis and the Workflow of Data Analysis
Loading...
Can’t use the file because of accessibility barriers? Contact us with the title of the item, permanent link, and specifics of your accommodation need.
Date
2020-01-17
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Indiana University Workshop in Methods
Permanent Link
Abstract
Many disciplines are paying increasing attention to reproducible results. The fundamental idea is that other scientists should have access to your data and be able to obtain the same results—this is reproducibility. More generally, your results should be robust so that other scientists can confirm your findings using other data. Increasingly journals require authors to provide their data and analysis file before a paper is accepted to verify that that results. Producing reproducible results is highly dependent on your workflow for data analysis. This workflow encompasses the entire process of scientific research: Planning, documenting, and organizing your work; creating, labeling, naming, and verifying variables; performing and presenting statistical analyses; preserving your work; and ending with reproducible results. Most of the work in statistics classes focuses on estimating and interpreting models. In “real world” research projects, these activities may involve less than 10% of the total work. Professor Long’s talk is about the other 90% of the work. An efficient workflow saves time, introduces greater reliability into the steps of the analysis, and generates reproducible results.
Description
Dr. Long is Distinguished Professor and Chancellor’s Professor Emeritus of Sociology and Statistics at Indiana University.
Keywords
Citation
Journal
DOI
Link(s) to data and video for this item
Relation
Rights
Type
Presentation