Home » Use Cases » Climate and air pollution prediction from spatio-temporal observational constraints
Use Case icon

Climate and air pollution prediction from spatio-temporal observational constraints


Current climate models cannot resolve key small‑scale processes such as clouds, aerosols, and ocean eddies, which introduces significant uncertainties. Their evaluation is generally performed with spatio‑temporal aggregated observations. At the same time, increasingly sophisticated retrievals from satellites are becoming available at matching spatial resolution, which would in theory provide strong constraints; yet in practice, data volumes are too large for routine pixel‑level evaluation. Novel approaches are required.

This use case addresses the identified issue by developing embedding‑based methods to analyse, evaluate, and track small‑scale processes in compressed latent space:

  • Another strand focuses on benchmarking the compression of atmospheric remote sensing data, ensuring that novel neural compression schemes follow the requirements of weather experts with a focus on clouds.
  • One strand of experiments focuses on understanding the utility of embeddings for specific applications in atmospheric science, and how the atmosphere interacts with Earth’s surface.

Atmospherically turbulent cloud scene
Fig. 1: Pseudo-RGB of an atmospherically turbulent cloud scene as observed by the MODIS mission. The snapshot covers the day-night terminator line with night at the bottom and day on top. Land and water below clouds are marked by brown and blue colors, respectively.

Quantisation of feature space to discretise embeddings of Earth observation data into a collection of discrete tokens
Fig. 2: Quantisation of feature space to discretise embeddings of Earth observation data into a collection of discrete tokens. These embeddings enable cloud and aerosol classification from sparse labels, support self‑supervised learning for detecting clouds, dust storms, and air‑pollution features, and allow forecasting of remote observations across satellite data and global cloud‑resolving models.


Expected Impact


Neural compression of remote sensing data into foundation model embedding spaces is key to analyse multi‑petabyte geospatial datasets produced by projects such as NextGEMS, DYAMOND3 global cloud, and future DestinationEarth simulations.

Embeddings offer a scalable way to evaluate and constrain clouds, aerosols, and air‑pollution processes in both satellite observations and next‑generation cloud‑resolving climate models. By enabling classification from sparse labels, self‑supervised detection of clouds, dust storms, and pollution features, and the tracking of convective systems and aerosol plumes, embeddings provide a practical route to extract scientific information from datasets that are otherwise too large for routine pixel‑level comparison. They also create a compact space for detecting changes linked to temperature shifts or emission reductions, identifying extreme events and outlier climate responses, and revealing potential model issues.


Book

References



  • Embed2Scale Partners to Present at EGU2026

    Embed2Scale Partners to Present at EGU2026

    Read more