Search results

Jump to: navigation, search
  • ...usehold, etc) with a distinct value. This article lists five properties of ID variables that researchers should keep in mind when creating, collecting, a ...y identifying, constant across a project, constant throughout the duration of the project, and anonymous.
    7 KB (1,257 words) - 15:17, 13 April 2021
  • ...n the context of surveys and datasets and explains how to confirm the unit of observation for a given dataset. ...ng with a dataset that you have not created yourself, identifying the unit of observation is the first step to understanding the data.
    5 KB (763 words) - 17:50, 21 May 2019
  • ...'research team''' should run <code>ieduplicates</code> with each new batch of incoming data to ensure [[Monitoring Data Quality|high quality data]] befor * <code>ieduplicates</code> is part of the package <code>[[iefieldkit]]</code>, which has been developed by [https
    13 KB (1,931 words) - 20:40, 11 August 2023
  • ...a folder, the structure of the data sets in the folder, and identification of the observations in the data sets is critical. * An important step before starting with '''data management''' is creating a [[D
    7 KB (1,089 words) - 16:17, 26 June 2023
  • ...ID Variable Properties | ID variables]] exist, so they can be resolved. '''ID variables''' are '''variables''' that uniquely identify every observation i * <code>iecompdup</code> is part of the package <code>[[iefieldkit]]</code>, which has been developed by [https
    11 KB (1,642 words) - 19:57, 15 August 2023
  • ...pact Evaluation Team|Research teams]] usually rely on two broad categories of data - [[Primary Data Collection|primary data]], and '''secondary data'''. * '''Impact evaluations''' rely on many different sources of '''secondary data''', such as: [[Administrative and Monitoring Data|adminis
    3 KB (444 words) - 15:42, 16 August 2023
  • ...the '''analysis'''. They are also a useful tool to communicate to the rest of the team, and [[Data Documentation|document]] how the '''analysis datasets' ...esearch/dime/data-and-analytics DIME Analytics] to organize 3 main aspects of data work:
    7 KB (1,090 words) - 20:34, 14 August 2023
  • ...ation in SurveyCTO | randomizing in survey software]]. The main advantages of randomizing in Stata follow: * The researcher has more control of the process and can check [[Balance tests | randomization balance]] and [[S
    5 KB (706 words) - 20:36, 20 July 2022
  • ...t up, they significantly reduce sources of error, and simplify the process of working with datasets from multiple sources - baseline data, endline data, * '''Master data sets''' are a crucial component of using a [[Data Map|data map]] to organize [[Research Data Work|data work]].
    13 KB (1,902 words) - 20:27, 3 August 2023
  • ...Research_Assistant|Research Assistant]] (RA). This page outlines the goals of data cleaning, recommends role division, outlines common issues encountered *The goal of data cleaning is to clean individual data points and to make the [[Master D
    15 KB (2,248 words) - 18:09, 14 August 2023
  • ..., [[Data Cleaning|data cleaning]], and [[Data Analysis|data analysis]]. '''Variable construction''' involves processing cleaned data to make the data points mo * '''Variable construction''' is a part of the '''data work''' process. The other stages are [[De-identification|de-id
    14 KB (1,948 words) - 14:23, 13 April 2021
  • ...entities be connected with data. De-identification is a critical component of [[Research Ethics | ethical]] [[Protecting Human Research Subjects | human The following steps ensure proper handling and storage of [[Personally Identifying Information (PII)|PII]]:
    11 KB (1,543 words) - 14:00, 17 August 2023
  • ...n and code these '''high frequency checks''' is in parallel to the process of [[Questionnaire Design | questionnaire design]] and [[Questionnaire Program ...|back checks]], '''High frequency checks (HFCs)''' are an important aspect of the [[Data Quality Assurance Plan|data quality assurance plan]].
    9 KB (1,394 words) - 18:29, 28 June 2023
  • ...oducibility]]. Preparation for data publication begins in the early stages of research: effective [[Data Management | data management]] and analytics thr ...ng information (PII)]] such as names, locations, or financial records, all of which are not [[Research Ethics | ethical]] to publish.
    6 KB (833 words) - 17:07, 21 June 2021
  • Thousands of users in more than 150 countries depend on '''SurveyCTO''' to conduct [[Com * '''SurveyCTO''' also lets users change the format of question text using [[SurveyCTO HTML Input|basic HTML commands]].
    11 KB (1,644 words) - 23:53, 20 July 2023
  • ...t folder. The DataWork Survey Round folder is useful throughout all stages of a project. ...[iefolder]]</code> creates the DataWork Survey Round folder as a component of the DataWork folder, which is typically housed in [https://www.box.com Box]
    10 KB (1,577 words) - 14:28, 12 June 2019
  • ...the same results as the original study, which strengthens the conclusions of the original study. It is important to push researchers towards publishing ...smxz/ bootcamp on reproducible research], which covers the various aspects of '''reproducibility'''.
    22 KB (3,101 words) - 17:28, 30 June 2023