Primary Data Collection
Primary data collection is the process of gathering data through surveys, interviews or experiments. Household surveys are the prototypical example of primary data. Researchers can personally direct primary data collection to ensure that data meets the standards of quality, availability, statistical power and sampling required for a particular research question. With globally increasing access to survey tools and software, field manuals, and specialized survey firms, primary data has become the dominant source for empirical inquiry in development economics.
- The large majority of primary data comes from personal interviews. Depending on the research, these may take the form of household surveys, business (firm) surveys, or agricultural (farm) surveys.
- Primary data collection typically requires a great deal of foresight, planning and coordination. Steps to preparing and conducting primary data collection follow.
The following are critical steps in preparing for and conducting primary data collection:
- Pre-register research and consider producing a pre-analysis plan.
- Acquire human subjects approval and get set up with the proper tools for encryption and de-identification.
- Compile the survey budget.
- Determine the sampling frame, calculate sample size, conduct sampling and power calculations, and randomize treatment.
- Design and translate the survey instrument.
- Program the instrument if data is being collected electronically via a Computer-Assisted Personal Interviews (CAPI) or Computer-Assisted Field Entry (CAFE) survey.
- Establish survey protocols.
- Pilot the survey instrument – both the content and protocols.
- Procure a a survey firm, taking care to prepare detailed Terms of Reference.
- Train enumerators.
- Monitor data quality through backchecks, high frequency checks, and other methods.
- Maintained an organized data folder via
- Clean and analyze data.
Back to Parent
This article is part of the topic Primary Data Collection
Oxfam provides a detailed case study of how to use electronic data collection (SurveyCTO) combined with Stata code to improve data quality in the field.