Workflow of Data Preparation
Loading...
Can’t use the file because of accessibility barriers? Contact us with the title of the item, permanent link, and specifics of your accommodation need.
Date
2023-02-03
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Indiana University Workshop in Methods
Permanent Link
Abstract
Before conducting analyses, we all know that we need to “clean the data”, but what exactly does that mean? What steps are involved and in what order? How do we decide what needs to be done? Data preparation involves all the steps that occur between data collection and analysis (e.g., merging, appending, labeling, data analytics, cross-validation, constructing/re-constructing variables for analysis, identifying missing data). This seminar will provide a general framework for approaching these processes. The framework informs decisions about an ideal order in which data cleaning should be conducted to represent data both accurately and fully. This framework also delves into some of the trickier issues. For example, when you come across anomalous, vague, or missing data – what kinds of things should you consider? I will also provide guidance for ensuring that your findings are reproducible. Finally, I will discuss how to prepare data for analysis as efficiently as possible.
Description
Bianca Manago is an Assistant Professor in the Department of Sociology at Vanderbilt University. Her research addresses questions at the intersection of research methods, medical sociology, inequality, and social psychology. Her research has been funded by the National Science Foundation and the American Sociological Association and has appeared in American Sociological Review, Social Forces, Social Science and Medicine, Sociological Methodology, Social Science Research, Proceedings of the National Academy of Sciences, and Annual Review of Sociology.
Keywords
Citation
Journal
DOI
Link(s) to data and video for this item
Relation
Rights
Type
Presentation