Search results

ID Variable Properties
...usehold, etc) with a distinct value. This article lists five properties of ID variables that researchers should keep in mind when creating, collecting, a ...y identifying, constant across a project, constant throughout the duration of the project, and anonymous.

7 KB (1,257 words) - 15:17, 13 April 2021
Unit of Observation
...n the context of surveys and datasets and explains how to confirm the unit of observation for a given dataset. ...ng with a dataset that you have not created yourself, identifying the unit of observation is the first step to understanding the data.

5 KB (763 words) - 17:50, 21 May 2019
Ieduplicates
...'research team''' should run <code>ieduplicates</code> with each new batch of incoming data to ensure [[Monitoring Data Quality|high quality data]] befor * <code>ieduplicates</code> is part of the package <code>[[iefieldkit]]</code>, which has been developed by [https

13 KB (1,931 words) - 20:40, 11 August 2023
Data Management
...a folder, the structure of the data sets in the folder, and identification of the observations in the data sets is critical. * An important step before starting with '''data management''' is creating a [[D

7 KB (1,089 words) - 16:17, 26 June 2023
Iecompdup
...ID Variable Properties | ID variables]] exist, so they can be resolved. '''ID variables''' are '''variables''' that uniquely identify every observation i * <code>iecompdup</code> is part of the package <code>[[iefieldkit]]</code>, which has been developed by [https

11 KB (1,642 words) - 19:57, 15 August 2023
Secondary Data Sources
...pact Evaluation Team|Research teams]] usually rely on two broad categories of data - [[Primary Data Collection|primary data]], and '''secondary data'''. * '''Impact evaluations''' rely on many different sources of '''secondary data''', such as: [[Administrative and Monitoring Data|adminis

3 KB (444 words) - 15:42, 16 August 2023
Data Flow Charts
...the '''analysis'''. They are also a useful tool to communicate to the rest of the team, and [[Data Documentation|document]] how the '''analysis datasets' ...esearch/dime/data-and-analytics DIME Analytics] to organize 3 main aspects of data work:

7 KB (1,090 words) - 20:34, 14 August 2023
Randomization in Stata
...ation in SurveyCTO | randomizing in survey software]]. The main advantages of randomizing in Stata follow: * The researcher has more control of the process and can check [[Balance tests | randomization balance]] and [[S

5 KB (706 words) - 20:36, 20 July 2022
Master Dataset
...t up, they significantly reduce sources of error, and simplify the process of working with datasets from multiple sources - baseline data, endline data, * '''Master data sets''' are a crucial component of using a [[Data Map|data map]] to organize [[Research Data Work|data work]].

13 KB (1,902 words) - 20:27, 3 August 2023
Data Cleaning
...Research_Assistant|Research Assistant]] (RA). This page outlines the goals of data cleaning, recommends role division, outlines common issues encountered *The goal of data cleaning is to clean individual data points and to make the [[Master D

15 KB (2,248 words) - 18:09, 14 August 2023
Variable Construction
..., [[Data Cleaning|data cleaning]], and [[Data Analysis|data analysis]]. '''Variable construction''' involves processing cleaned data to make the data points mo * '''Variable construction''' is a part of the '''data work''' process. The other stages are [[De-identification|de-id

14 KB (1,948 words) - 14:23, 13 April 2021
De-identification
...entities be connected with data. De-identification is a critical component of [[Research Ethics | ethical]] [[Protecting Human Research Subjects | human The following steps ensure proper handling and storage of [[Personally Identifying Information (PII)|PII]]:

11 KB (1,543 words) - 14:00, 17 August 2023
High Frequency Checks
...n and code these '''high frequency checks''' is in parallel to the process of [[Questionnaire Design | questionnaire design]] and [[Questionnaire Program ...|back checks]], '''High frequency checks (HFCs)''' are an important aspect of the [[Data Quality Assurance Plan|data quality assurance plan]].

9 KB (1,394 words) - 18:29, 28 June 2023
Publishing Data
...oducibility]]. Preparation for data publication begins in the early stages of research: effective [[Data Management | data management]] and analytics thr ...ng information (PII)]] such as names, locations, or financial records, all of which are not [[Research Ethics | ethical]] to publish.

6 KB (833 words) - 17:07, 21 June 2021
SurveyCTO Coding Practices
Thousands of users in more than 150 countries depend on '''SurveyCTO''' to conduct [[Com * '''SurveyCTO''' also lets users change the format of question text using [[SurveyCTO HTML Input|basic HTML commands]].

11 KB (1,644 words) - 23:53, 20 July 2023
DataWork Survey Round
...t folder. The DataWork Survey Round folder is useful throughout all stages of a project. ...[iefolder]]</code> creates the DataWork Survey Round folder as a component of the DataWork folder, which is typically housed in [https://www.box.com Box]

10 KB (1,577 words) - 14:28, 12 June 2019
Reproducible Research
...the same results as the original study, which strengthens the conclusions of the original study. It is important to push researchers towards publishing ...smxz/ bootcamp on reproducible research], which covers the various aspects of '''reproducibility'''.

22 KB (3,101 words) - 17:28, 30 June 2023

Navigation

Tools

Search results

Search results

follow us

newsletter