Duplicates and Survey Logs

Read First

  • Both steps described on this page must be completed to confirm that the survey data is complete. Skipping either one may leave you with an incomplete data set.

Data Duplicates

Before analyzing the outcomes of quality checks, and sometimes even before running real-time quality checks, we need to check the data for duplicates. Duplicates are common in ODK/SurveyCTO data and need to be removed before starting other data quality checks.

To remove duplicates, you can use DIME's Stata command ieduplicates, which is part of the ietoolkit Stata package.
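The logic behind a duplicate check can be illustrated with a short Python sketch. This is not how ieduplicates works internally and it is not a substitute for it; it only shows the idea of grouping submissions by the ID that should be unique and listing the IDs that appear more than once. The field names hhid and key and the sample submissions are hypothetical.

```python
from collections import defaultdict


def find_duplicates(records, id_field="hhid"):
    """Group records by ID and return only the IDs that occur more than once."""
    groups = defaultdict(list)
    for record in records:
        groups[record[id_field]].append(record)
    return {id_value: rows for id_value, rows in groups.items() if len(rows) > 1}


# Hypothetical submissions: "hhid" should be unique per household,
# "key" stands in for a per-submission identifier.
submissions = [
    {"hhid": "1001", "key": "uuid:a1"},
    {"hhid": "1002", "key": "uuid:b2"},
    {"hhid": "1001", "key": "uuid:c3"},
]

duplicates = find_duplicates(submissions)
for hhid, rows in duplicates.items():
    # List the submissions that share an ID so someone can decide which to keep.
    print(hhid, [row["key"] for row in rows])
```

The sketch only reports duplicates. Deciding which submission to keep, drop, or correct still requires a documented decision by the field team.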

Comparing Server Data to Field Logs

Comparing server data to field logs confirms that all the data collected during the survey has reached your server. To do this, write code that generates a survey log counting the number of surveys on the server, and then match that log against the field logs.
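The comparison described above could be sketched as follows in Python. This is a minimal illustration under assumptions, not a prescribed method: it assumes the survey log counts submissions per date and enumerator, and that the field log has been entered with the same grouping. The field names date, enumerator, and hhid and all sample values are hypothetical.

```python
from collections import Counter


def build_survey_log(server_records, group_fields=("date", "enumerator")):
    """Count server submissions per group (here: per date and enumerator)."""
    return Counter(tuple(record[f] for f in group_fields) for record in server_records)


def compare_logs(server_log, field_log):
    """Return {group: (server_count, field_count)} for groups whose counts differ."""
    mismatches = {}
    for group in set(server_log) | set(field_log):
        server_count = server_log.get(group, 0)
        field_count = field_log.get(group, 0)
        if server_count != field_count:
            mismatches[group] = (server_count, field_count)
    return mismatches


# Hypothetical server data, assumed to be already free of duplicates.
server_records = [
    {"date": "2020-01-05", "enumerator": "E01", "hhid": "1001"},
    {"date": "2020-01-05", "enumerator": "E01", "hhid": "1002"},
    {"date": "2020-01-05", "enumerator": "E02", "hhid": "1003"},
]

# Hypothetical field log: surveys each enumerator reported completing per day.
field_log = {
    ("2020-01-05", "E01"): 2,
    ("2020-01-05", "E02"): 2,
}

mismatches = compare_logs(build_survey_log(server_records), field_log)
for group, (on_server, in_field) in sorted(mismatches.items()):
    print(group, "server:", on_server, "field log:", in_field)
```

A group where the server count is lower than the field log count points to surveys that were collected but have not reached the server. A higher server count may point to duplicates that were not yet removed or to an incomplete field log.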
