Difference between revisions of "Crowd-sourced Data"

Revision as of 20:49, 12 April 2019

Crowdsourced data collection is a participatory method of building a dataset with the help of a large group of people. This page provides a brief overview of crowdsourced data collection in development and highlights points to consider when crowdsourcing data.

Read First

Through crowdsourced data collection, researchers can collect plentiful, valuable, and geographically dispersed data at a cost typically lower than that of traditional data collection methods.
Crowdsourced data may introduce sampling issues. Consider the trade-offs between sample size and sampling issues before deciding to crowdsource data.
Make sure the platform on which you are collecting crowdsourced data is well-tested.

Overview

Crowdsourced data collection allows researchers to cheaply outsource simple tasks or questionnaires, gather data in real time, and, because of its relatively low cost, obtain far more numerous and widespread observations than traditional data collection. Notably, crowdsourced data collection allows researchers to more easily reach people and places, giving researchers insight into local markets, events, or even prices. Researchers may crowdsource data collection via a number of platforms, including mobile apps and internet marketplaces like Amazon Mechanical Turk.

Considerations when crowdsourcing data

Ensure a large network of contributors: this is essential to crowdsourcing success. If collecting geographically specific data, keep in mind that the potential for crowdsourcing is limited in rural areas due to technology constraints and low levels of connectivity.
Follow network growth carefully. Crowdsourcing requires a crowd, not a handful!
Consider the trade-offs between sample size and sampling issues. The reliability of crowdsourced data is often questioned because of the lack of an underlying sampling frame. Crowdsourcing may not be the right tool if you need rigorous sampling and data structure.
Request simple tasks from contributors. The instruments used in crowdsourced data collection should not look like traditional questionnaires that include skip codes, relevancies, and constraints. Remember that contributors will not have the training of typical enumerators.
Ensure that the platform on which you are collecting crowdsourced data is well-tested: in one case, DIME took the promises of a Silicon Valley partner at face value -- but the available version of their technology delivered less than hoped.
Quantify trade-offs carefully. What are the cost savings compared to traditional enumeration? Will they offset losses in precision or quality?
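
One rough way to quantify the last trade-off is to compare the error each method can achieve for the same budget. The sketch below is only an illustration under stated assumptions: the function name `rmse_for_budget`, the budget, the per-observation costs, the standard deviations, and the bias are all hypothetical values chosen for the example, not figures from any study. It treats the outcome as a simple sample mean and models the missing sampling frame as a fixed bias that does not shrink as the sample grows.

```python
import math


def rmse_for_budget(budget, cost_per_obs, sd, bias=0.0):
    """Root-mean-square error of a sample mean affordable within a budget.

    All inputs are illustrative assumptions supplied by the caller.
    """
    # Number of observations the budget can pay for.
    n = int(budget // cost_per_obs)
    if n < 1:
        raise ValueError("budget does not cover a single observation")
    # Sampling noise falls with the square root of the sample size.
    standard_error = sd / math.sqrt(n)
    # Bias (e.g. from a non-representative crowd) does not fall with n.
    return math.sqrt(bias ** 2 + standard_error ** 2)


# Hypothetical inputs: same budget, different cost, noise, and bias.
budget = 10_000
traditional = rmse_for_budget(budget, cost_per_obs=20.0, sd=10.0, bias=0.0)
crowdsourced = rmse_for_budget(budget, cost_per_obs=1.0, sd=15.0, bias=1.0)

print(f"traditional RMSE:  {traditional:.3f}")
print(f"crowdsourced RMSE: {crowdsourced:.3f}")
```

With these made-up numbers the crowdsourced sample is twenty times larger, yet its error is higher because the assumed bias dominates. With a smaller assumed bias the comparison can reverse, which is why the inputs need to be estimated rather than guessed.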

Back to Parent

This article is part of the topic Secondary Data Sources

Additional Resources

Hunt and Specht’s Crowdsourced Mapping in Crisis Zones: Collaboration, Organisation and Impact
Bott, Gigler and Young's The Role of Crowdsourcing for Better Governance in Fragile State Contexts
Komarov, Reinecke and Gajos’ Crowdsourcing Performance Evaluations of User Interfaces tests whether Amazon Mechanical Turk results differ from traditional questionnaire results
In a DAI blogpost, Kelsey Stern Buchbinder explains the use of crowdsourced data in development and its role in providing on-the-ground insights

@@ Line 1: / Line 1: @@
-<span style="font-size:150%">
+<onlyinclude>
-<span style="color:#ff0000"> '''NOTE: this article is only a template. Please add content!''' </span>
+Crowdsourced data collection is a participatory method of building a dataset with the help of a large group of people. This page provides a brief overview of crowdsourced data collection in development and highlights points to consider when crowdsourcing data.</onlyinclude>
-</span>
-add introductory 1-2 sentences here
 == Read First ==
-* include here key points you want to make sure all readers understand
+*Through crowdsourced data collection, researchers can collect plentiful, valuable, and geographically dispersed data at a cost typically lower than that of traditional data collection methods.
+*Crowdsourced data may introduce sampling issues. Consider the trade-offs between sample size and sampling issues before deciding to crowdsource data.
+* Make sure the platform on which you are collecting crowdsourced data is well-tested.
+== Overview ==
+Crowdsourced data collection allows researchers to cheaply outsource simple tasks or questionnaires, gather data in real time, and, because of its relatively low cost, obtain far more numerous and widespread observations than traditional data collection. Notably, crowdsourced data collection allows researchers to more easily reach people and places, giving researchers insight into [https://mdp.berkeley.edu/data-crowdsourcing-the-gap-between-ideation-and-implementation/ local markets], [http://www.lse.ac.uk/international-development/conflict-and-civil-society/current-projects/crowdsourcing-conflict-and-peace-events-in-the-syrian-conflict events], or even [https://www.technologyreview.com/s/520151/crowdsourcing-mobile-app-takes-the-globes-economic-pulse/ prices]. Researchers may crowdsource data collection via a number of platforms, including mobile apps and internet marketplaces like [https://www.mturk.com/ Amazon Mechanical Turk].
-== Guidelines ==
+== Considerations when crowdsourcing data ==
-* organize information on the topic into subsections. for each subsection, include a brief description / overview, with links to articles that provide details
+* Ensure a large network of contributors: this is essential to crowdsourcing success. If collecting geographically specific data, keep in mind that the potential for crowdsourcing is limited in rural areas due to technology constraints and low levels of connectivity.
-===Subsection 1===
+* Follow network growth carefully. Crowdsourcing requires a crowd, not a handful!
-===Subsection 2===
+* Consider the trade-offs between sample size and sampling issues. The reliability of crowdsourced data is often questioned because of the lack of an underlying sampling frame. Crowdsourcing may not be the right tool if you need rigorous sampling and data structure.
-===Subsection 3===
+* Request simple tasks from contributors. The instruments used in crowdsourced data collection should not look like traditional [[Questionnaire Design | questionnaires]] that include skip codes, relevancies, and constraints. Remember that contributors will not have the training of typical [[Enumerator Training | enumerators]].
+* Ensure that the platform on which you are collecting crowdsourced data is well-tested: in one case, DIME [https://blogs.worldbank.org/impactevaluations/lessons-crowdsourcing-failure took the promises] of a Silicon Valley partner at face value -- but the available version of their technology delivered less than hoped.
+* Quantify trade-offs carefully. What are the cost savings compared to traditional enumeration? Will they offset losses in precision or quality?
 == Back to Parent ==
-This article is part of the topic [[*topic name, as listed on main page*]]
+This article is part of the topic [[Secondary Data Sources]]
 == Additional Resources ==
-* list here other articles related to this topic, with a brief description and link
+*Hunt and Specht’s [https://jhumanitarianaction.springeropen.com/articles/10.1186/s41018-018-0048-1 Crowdsourced Mapping in Crisis Zones: Collaboration, Organisation and Impact]
+*Bott, Gigler and Young's [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.474.2448&rep=rep1&type=pdf The Role of Crowdsourcing for Better Governance in Fragile State Contexts]
+*Komarov, Reinecke and Gajos’ [https://dash.harvard.edu/bitstream/handle/1/12363924/Crowdsourcing%20Performance%20Evaluations.pdf?sequence=1&isAllowed=y Crowdsourcing Performance Evaluations of User Interfaces] tests whether Amazon Mechanical Turk results differ from traditional questionnaire results
+*In a DAI [https://dai-global-digital.com/crowdsourced-data-collection-provides-on-the-ground-insights.html blogpost], Kelsey Stern Buchbinder explains the use of crowdsourced data in development and its role in providing on-the-ground insights
-[[Category: *category name* ]]
+[[Category: Secondary Data Sources ]]
