Checklist: Data Cleaning

Get a printable version here. For more detailed instructions on how to implement the tasks in this checklist, see Data Cleaning. Note that this checklist displays best in Chrome, Firefox, Safari, or any other modern browser.

Additional Resources

DIME Analytics’ guidelines on data cleaning 1 and 2
The Stata Cheat Sheets on Data Processing and Data Transformation are helpful reminders of relevant Stata code.
The Quartz guide to bad data on GitHub has many helpful tips for dealing with the kinds of data problems that often come up in real-world settings.
See this data cleaning checklist to ensure that common cleaning actions have been completed. Note that the list is not exhaustive. No such list can be, because each dataset and each analysis requires different cleaning depending on context.