SCynergy 2026 - GeoAI Workshop β Cheat Sheet¶
π§ End-to-End Pipeline¶
This workshop demonstrates a full GeoAI workflow:
- Data Acquisition β selecting relevant satellite scenes using STAC
- Data Alignment β ensuring all datasets share the same spatial grid
- Data Packaging β converting large scenes into model-ready tensors
- Model Inference β applying a pretrained TerraMind flood model
π Key message: AI performance depends heavily on correct geospatial preprocessing.
π Notebook 1 β Data Acquisition¶
Key Concept: STAC¶
STAC (SpatioTemporal Asset Catalog) is a structured way to search geospatial datasets by location, time, and metadata. It means that, with STAC, you are querying a catalog, not manually downloading files.
Temporal Phases¶
Why the workflow uses:
- pre-event
- event
- post-event
π Floods are dynamic β comparing time periods improves detection.
Scene Selection Logic¶
Why:
- Sentinel-2 β filter clouds
- Sentinel-1 β prioritize time + coverage
π βBest sceneβ depends on task, not just recency.
Output: Manifest¶
This ensures reproducibility by recording selected scenes.
π Notebook 2 β Data Packaging¶
Common Grid¶
All datasets are transformed into:
- same coordinate system
- same resolution
- same spatial extent
π This enables pixel-wise comparison across modalities.
Reprojection¶
Reprojection is a geometric transformation, not just format conversion.
Handling Missing Data¶
Invalid pixels become NaN and are masked to prevent incorrect learning.
Chip Extraction¶
Large images are divided into smaller patches (e.g., 256Γ256):
- fits GPU memory
- enables batch processing
Memory Efficiency¶
Data is stored using memory mapping to handle large files efficiently.
π Notebook 3 β Model Inference¶
Tensor Structure¶
Inputs follow shape: [B, C, T, H, W] meaning [Batch, Channels, Time, Height, Width]
Multimodal Fusion¶
The model combines:
- optical (Sentinel-2)
- radar (Sentinel-1)
- terrain (DEM)
Temporal Handling¶
DEM is repeated across time because the model expects temporal inputs.
Output¶
Model predicts flood vs non-flood segmentation masks.
Important Notes¶
- Alignment enables fusion.
- Data quality drives model quality.
- This pipeline is reusable beyond floods.