Transform to Open Science Logo that shows a top as a rocket taking off and the text Transform to Open Science in the white vapor plume around the launch site

Project Status: WIP – Initial development is in progress, but there has not yet been a stable, usable release suitable for the public. Docker Image :whale2:


This project seeks to introduce cloud-native approaches to geospatial analysis in R & Python through the lens of environmental justice applications. This is not meant as a complete course in geospatial analysis – though we encourage interested readers to consider Geocomputation in R or Python as an excellent resource. We present opinionated recipes meant to empower users with the following design goals:

  • Open science: open source software, open data, open standards, and reproducibility are emphasized.
  • Recipes are presented as reproducible computational notebooks (Quarto and Jupyter) set in narrative analysis.
  • Access to cloud-based compute is not required. The high speed networks of commercial cloud providers can accelerate network-bound operations, but can run from anywhere.
  • Access to a powerful laptop or desktop is not required either. Examples can run from a browser interface on free compute resources such as Codespaces.
  • Portable software environments: a Docker container with precise software used to produce these examples. The container is build from the included Dockerfile using the included GitHub Action and is deployed by the Codespaces instance.
  • Alignment between R and Python: recipes will try to show approaches in both languages and seek to utilize common interfaces (such as STAC and GDAL) that can be shared across these platforms.

This is a work in progress! Check back often, and feedback welcome! Test these modules, file an issue or pull request, launch into a Codespaces environment, or reach out on the discussion board.

What is “cloud-native” anyway? We define cloud-native to mean simply that data is accessed over http range request methods, rather than downloading entire files. Code-based examples will develop why this is important and how it differs from renting cloud-based compute. The core philosophy is that what many users already know how to do locally translates pretty seamlessly here, and then a bit extra is required to coerce certain software to stay in ‘range request’ mode and not get greedy trying to download everything. Some authors define this concept somewhat differently, and certainly not all range requests give the same performance, nor are http range requests best in all cases.

Acknowledgements

Thanks to input, suggestions, feedback and ideas from so many amazing folks in the OpenScapes community and input and financial support from the NASA TOPS community.

Transform to Open Science Logo that shows a top as a rocket taking off and the text Transform to Open Science in the white vapor plume around the launch site

OpenScapes Hex Logo