Difference between revisions of "Monitoring Data Quality"

Jump to: navigation, search
Line 5: Line 5:
== Steps important in the quality checks ==
== Steps important in the quality checks ==
It is very important to do quality checks on data during the survey as it is difficult to fix the problem/recollect the data if the error is found after the completion of the survey.
It is very important to do quality checks on data during the survey as it is difficult to fix the problem/recollect the data if the error is found after the completion of the survey.
*Testing for Duplicates - Since SurveyCTO/ODK data has a lot of duplicates, the first thing you need to do is check for duplicates and remove the duplicates. To see how to remove duplicates using Stata, please see the main article at [[verifying all the data is on the server | Data Completion Verification]]
*Testing for Duplicates - Since SurveyCTO/ODK data has a lot of duplicates, the first thing you need to do is check for duplicates and remove the duplicates. To see how to remove duplicates using Stata, please see the main article at [[ Data Completion Verification | verifying all the data is on the server]]
*Test that all data from the field is on the server.  
*Test that all data from the field is on the server.  
*High frequency tests of data quality
*High frequency tests of data quality

Revision as of 20:08, 25 January 2017

Read First

  • Data quality checks should be done before and during the survey, as there is little that we can do after a survey if the data contains errors.
  • Lots of preparation should be made before the survey and steps should be followed during the survey so any error that is caught can be changed quickly before it is too late.

Steps important in the quality checks

It is very important to do quality checks on data during the survey as it is difficult to fix the problem/recollect the data if the error is found after the completion of the survey.

  • Testing for Duplicates - Since SurveyCTO/ODK data has a lot of duplicates, the first thing you need to do is check for duplicates and remove the duplicates. To see how to remove duplicates using Stata, please see the main article at verifying all the data is on the server
  • Test that all data from the field is on the server.
  • High frequency tests of data quality
    • IPA Template only (Template assumes SurvyCTO)
      • if not written in SurveyCTO -possible to adapt data to template, or template to data, but might be easier to write your own tests in Stata
    • IPA Template + additional tests in Stata
    • Test written in Stata only
      • Option if data is not collected with SurveyCTO
  • Follow up using the Data Explorer in SurveyCTO
  • Back Checks

--Something about being close to the survey location and redoing the survey if absolutely necessary.

Data Quality Checks

Comparing back checks with the main data

Back to Parent

This article is part of the topic Monitoring data quality

Additional Resources

  • list here other articles related to this topic, with a brief description and link