Difference between revisions of "Spatial Analysis"

Jump to: navigation, search
 
(5 intermediate revisions by 2 users not shown)
Line 1: Line 1:
==Introduction==
Spatial analysis involves utilizing geographical information to create descriptive and informative outputs.


Spatial analysis involves utilizing geographical information to create descriptive and informative outputs. These can be maps or other images and/or statistical data such as distances to other points, boundaries, roads or public services. The development world is rapidly expanding its capacity to access and analyze spatial data ranging from [http://pubdocs.worldbank.org/en/395241501768694367/KePSIE-Methodology-Note.pdf market clustering] to [http://www.worldbank.org/en/topic/sustainabledevelopment/brief/earth-observation-for-development satellite imagery] to agricultural plot mapping in SurveyCTO, the ability to process this data is also improving quickly.
==Read First==
*The info used can be maps or other images and/or statistical data such as distances to other points, boundaries, roads or public services.  
*The development world is rapidly expanding its capacity to access and analyze spatial data ranging from [http://pubdocs.worldbank.org/en/395241501768694367/KePSIE-Methodology-Note.pdf market clustering] to [http://www.worldbank.org/en/topic/sustainabledevelopment/brief/earth-observation-for-development satellite imagery] to agricultural plot mapping in SurveyCTO and the ability to process this data is also improving quickly.


==Interactive Spatial Analysis Tools==
==Interactive Spatial Analysis Tools==


Historically, spatial analysis tools have been challenging to code in a reproducible way, due to proprietary software systems with little extensibility. ArcGIS is a prime example of this, as the default implementation can only be coded reproducibly using [https://libraries.mit.edu/files/gis/PythonProgrammingforArcgis.pdf Python]. The open-source equivalent [http://qgis.org/en/site/ QGIS] has [http://www.qgistutorials.com/en/docs/getting_started_with_pyqgis.html similar limitations]. Both are useful for exploring spatial data interactively, but poor for saving and reproducing output workflows.
Historically, spatial analysis tools have been challenging to code in a [[Reproducible Research|reproducible]] way, due to proprietary software systems with little extensibility. ArcGIS is a prime example of this, as the default implementation can only be coded '''reproducibly''' using [https://libraries.mit.edu/files/gis/PythonProgrammingforArcgis.pdf Python]. The open-source equivalent [http://qgis.org/en/site/ QGIS] has [http://www.qgistutorials.com/en/docs/getting_started_with_pyqgis.html similar limitations]. Both are useful for exploring spatial data interactively, but poor for saving and '''reproducing''' output workflows.


[https://www.google.com/earth/download/gep/agree.html Google Earth Pro] (free) provides an extensive interface for exploring data, particularly when combined with a tool that can export data in the associated KML format, such as <code>dta2kml</code> ([https://ideas.repec.org/c/boc/bocode/s457748.html SSC], [https://github.com/worldbank/stata/tree/master/src/dta2kml GitHub]).
[https://www.google.com/earth/download/gep/agree.html Free Google Earth Pro] provides an extensive interface for exploring data, particularly when combined with a tool that can export data in the associated KML format, such as <code>dta2kml</code> ([https://ideas.repec.org/c/boc/bocode/s457748.html SSC], [https://github.com/worldbank/stata/tree/master/src/dta2kml GitHub]).


==Reproducible Coding for Spatial Analysis==
==Reproducible Coding for Spatial Analysis==


By contrast, programs such as [https://www.r-project.org R] now support an [http://www.rspatial.org/index.html extensive suite] of open-source tools for the reproducible manipulation of spatial datasets. These tools are essential for tasks that involve creating new data – especially when specific formulae or measurements are needed or parameters may want to be varied in subsequent runs.  
By contrast, programs such as [https://www.r-project.org R] now support an [http://www.rspatial.org/index.html extensive suite] of open-source tools for the [[Reproducible Research|reproducible]] manipulation of spatial [[Master Dataset|datasets]]. These tools are essential for tasks that involve creating new data – especially when specific formulae or measurements are needed or parameters may want to be varied in subsequent runs.  


[https://www.stata.com Stata] also supports some [https://www.stata.com/support/faqs/graphics/spmap-and-maps/ commands like spmap] for spatial analysis, such as displaying maps with data. However it is not well-equipped to calculate new statistics, such as linking points to underlying political geographies or surrounding features, as it handles non-point information poorly in general.
[https://www.stata.com Stata] also supports some [https://www.stata.com/support/faqs/graphics/spmap-and-maps/ commands like spmap] for spatial analysis, such as displaying maps with data. However it is not well-equipped to calculate new statistics, such as linking points to underlying political geographies or surrounding features, as it handles non-point information poorly in general.
Line 17: Line 19:
==Cloud Computing for Spatial Analysis==
==Cloud Computing for Spatial Analysis==


As the size and complexity of geospatial data grows to include resources like [https://www.digitalglobe.com global scale satellite imagery], the very size of some data, let alone the computing complexity, has become infeasible on personal computers. Specialized firms, which are often kept on a subscription basis for large consumers like the World Bank, provide data and cloud computing services on that material. For example, a researcher may wonder whether traffic in a certain part of a transport corridor has increased in specific ways; by using machine learning on a series of satellite images of the same location, the firm can produce estimates of the changes in the number of cars on the road over time. This functionality can extend to land use, settlement quality (roof type), and, where imagery are of high enough resolution, even identifying crop usage on individual agricultural plots.
As the size and complexity of [[Geo Spatial Data|geospatial data]] grows to include resources like [https://www.digitalglobe.com global scale satellite imagery], the size of some data, let alone the computing complexity, has become infeasible on personal computers. Specialized [[Survey Firm|firms]], which are often kept on a subscription basis for large consumers like the World Bank, provide data and cloud computing services on that material. For example, a researcher may wonder whether traffic in a certain part of a transport corridor has increased in specific ways; by using machine learning on a series of satellite images of the same location, the '''firm''' can produce estimates of the changes in the number of cars on the road over time. This functionality can extend to land use, settlement quality (roof type), and, where imagery is of high enough resolution, even identifying crop usage on individual agricultural plots.


==Resources==
==Additional Resources==


* [http://www.openstreetmap.org OpenStreetMap]
* [http://www.openstreetmap.org OpenStreetMap]

Latest revision as of 14:22, 9 August 2023

Spatial analysis involves utilizing geographical information to create descriptive and informative outputs.

Read First

  • The info used can be maps or other images and/or statistical data such as distances to other points, boundaries, roads or public services.
  • The development world is rapidly expanding its capacity to access and analyze spatial data ranging from market clustering to satellite imagery to agricultural plot mapping in SurveyCTO and the ability to process this data is also improving quickly.

Interactive Spatial Analysis Tools

Historically, spatial analysis tools have been challenging to code in a reproducible way, due to proprietary software systems with little extensibility. ArcGIS is a prime example of this, as the default implementation can only be coded reproducibly using Python. The open-source equivalent QGIS has similar limitations. Both are useful for exploring spatial data interactively, but poor for saving and reproducing output workflows.

Free Google Earth Pro provides an extensive interface for exploring data, particularly when combined with a tool that can export data in the associated KML format, such as dta2kml (SSC, GitHub).

Reproducible Coding for Spatial Analysis

By contrast, programs such as R now support an extensive suite of open-source tools for the reproducible manipulation of spatial datasets. These tools are essential for tasks that involve creating new data – especially when specific formulae or measurements are needed or parameters may want to be varied in subsequent runs.

Stata also supports some commands like spmap for spatial analysis, such as displaying maps with data. However it is not well-equipped to calculate new statistics, such as linking points to underlying political geographies or surrounding features, as it handles non-point information poorly in general.

Cloud Computing for Spatial Analysis

As the size and complexity of geospatial data grows to include resources like global scale satellite imagery, the size of some data, let alone the computing complexity, has become infeasible on personal computers. Specialized firms, which are often kept on a subscription basis for large consumers like the World Bank, provide data and cloud computing services on that material. For example, a researcher may wonder whether traffic in a certain part of a transport corridor has increased in specific ways; by using machine learning on a series of satellite images of the same location, the firm can produce estimates of the changes in the number of cars on the road over time. This functionality can extend to land use, settlement quality (roof type), and, where imagery is of high enough resolution, even identifying crop usage on individual agricultural plots.

Additional Resources