Secondary Data Sources

Secondary data is data collected by any party other than the researcher, including administrative data from programs, geodata from specialized sources, and census or other population data from governments. Secondary data provides important context for any investigation, and in some cases (such as administrative program data), it is the only source which covers the full population needed to conduct a research project.

Read First

Research teams usually rely on two broad categories of data - primary data, and secondary data.
Impact evaluations rely on many different sources of secondary data, such as: administrative, geospatial, sensor, telecom, and crowd-sourcing.
Research teams should decide on the kind of data they want to use, based on context and project needs.

Types of Secondary Data

Administrative and Monitoring Data

Administrative data includes all data collected through existing government ministries, programs and projects. It is a potentially rich source of data for an impact evaluation. Some of the key challenges with administrative data include:

Digitization: in a lot of cases, the data is in paper format only.
Restricted access: it is also difficult to get access to certain data because it contains sensitive information.
Lack of unique ID: in some cases, administrative datasets might be missing a numeric ID variable.

National Survey Data

Existing survey data may be of use depending on the sampling frame for the impact evaluation, level of representativity of the existing data, and availability of disaggregated data. National Statistics Office typically collect a wide array of nationally-representative data, such as Living Standards Measurement Surveys and censuses. International survey efforts such as the Demographic and Health Surveys [1] and Enterprise Surveys [2] are also good sources.

Additional Resources

JPAL, Handbook on Using Administrative Data

Relevant DIME Analytics Trainings

Here's the lecture slides on Secondary Data Sources
DIME Continuing Education training recording on Spatial Data in Stata

Navigation

Tools

Secondary Data Sources

Contents

Read First

Types of Secondary Data

Administrative and Monitoring Data

National Survey Data

Geo Spatial Data

Remote Sensing

Telecom Data

Crowd-sourced Data

Related Pages

Additional Resources

Relevant DIME Analytics Trainings

Secondary Data Sources

Read First

Types of Secondary Data

Administrative and Monitoring Data

National Survey Data

Geo Spatial Data

Remote Sensing

Telecom Data

Crowd-sourced Data

Related Pages

Additional Resources

Relevant DIME Analytics Trainings

follow us

newsletter