GSoC/GCI Archive
Google Summer of Code 2015 The Eclipse Foundation

GeoTrellis: Cassandra Backend to GeoTrellis-Spark

by Alex Kmoch for The Eclipse Foundation

GeoTrellis is a Scala framework for fast, parallel processing of geospatial data. GeoTrellis also supports raster data processing on Apache Spark. GeoTrellis supports Hadoop HDFS and Accumulo as Spark backends. Cassandra is another popular distributed data store. This project aims to improve the GeoTrellis Catalog prototype implementation for Cassandra to allow processing of raster layers via Spark RDDs as well as add vector RDD capabilties, with a focus on a performance-based indexing scheme.