CloudDiff: A Conditional Diffusion Model to Generate Mesoscale Cloud Structures
Abstract:
Understanding the driving forces behind mesoscale cloud organization is fundamental to reducing uncertainties in cloud climate feedbacks. Traditional climate models cannot explicitly resolve mesoscale cloud structures due to their limited resolution, leading to large uncertainties in cloud climate feedback estimates. Storm-resolving models that simulate the atmosphere at kilometre resolution have the potential to reduce these uncertainties. Yet, these models are still biased in their organizational structure when compared to satellite observations. Approaches constraining cloud feedbacks directly from the satellite records are promising but often rely on manually chosen cloud controlling factors (CCFs) that do not necessarily capture all the information necessary to explain mesoscale organizational structures and generally only utilise linear models to predict cloud radiative properties from CCFs.We present CloudDiff, a probabilistic machine learning model that generates mesoscale cloud structures at kilometre resolution conditioned on environmental conditions in the atmosphere, namely the temperature and humidity profiles as well as vertical and horizontal winds. The model is trained on MODIS Level 1 satellite data and environmental conditions from ECMWF ERA5 reanalysis data. CloudDiff is able to reconstruct realistic MODIS observations from matching ERA5 environmental conditions and achieves a lower reconstruction error compared to generating MODIS observations solely from pre-defined CCFs. In CloudDiff’s generation stage, the environmental conditions are compressed into a latent representation using an attention mechanism. This latent representation can be interpreted as a set of CCFs that have been learned purely from data. We’ll discuss the properties of the learned CCFs including how they relate to existing CCFs, their geographical distribution, and their predictive power of the radiative properties of cloud fields.Calibration of climate model parameterizations using Bayesian experimental design
Sensitivity analysis for climate science with generative flow models
Abstract:
Sensitivity analysis is a cornerstone of climate science, essential for understanding phenomena ranging from storm intensity to long-term climate feedbacks. However, computing these sensitivities using traditional physical models is often prohibitively expensive in terms of both computation and development time. While modern AI-based generative models are orders of magnitude faster to evaluate, computing sensitivities with them remains a significant bottleneck. This work addresses this challenge by applying the adjoint state method for calculating gradients in generative flow models. We apply this method to the cBottle generative model, trained on ERA5 and ICON data, to perform sensitivity analysis of any atmospheric variable with respect to sea surface temperatures. We quantitatively validate the computed sensitivities against the model’s own outputs. Our results provide initial evidence that this approach can produce reliable gradients, reducing the computational cost of sensitivity analysis from weeks on a supercomputer with a physical model to hours on a GPU, thereby simplifying a critical workflow in climate science. The code can be found at https://github.com/Kwartzl8/ cbottle_adjoint_sensitivity.Lossy neural compression for geospatial analytics: a review
Abstract:
Over the past decades, there has been an explosion in the amount of available Earth observation (EO) data. The unprecedented coverage of Earth’s surface and atmosphere by satellite imagery has resulted in large volumes of data that must be transmitted to ground stations, stored in data centers, and distributed to end users. Modern Earth system models (ESMs) face similar challenges, operating at high spatial and temporal resolutions, producing petabytes of data per simulated day. Data compression has gained relevance over the past decade, with neural compression (NC) emerging from deep learning and information theory, making EO data and ESM outputs ideal candidates because of their abundance of unlabeled data.
In this review, we outline recent developments in NC applied to geospatial data. We introduce the fundamental concepts of NC, including seminal works in its traditional applications to image and video compression domains with a focus on lossy compression. We discuss the unique characteristics of EO and ESM data, contrasting them with “natural images,” and we explain the additional challenges and opportunities they present. Additionally, we review current applications of NC across various EO modalities and explore the limited efforts in ESM compression to date. The advent of self-supervised learning (SSL) and foundation models (FMs) has advanced methods to efficiently distill representations from vast amounts of unlabeled data. We connect these developments to NC for EO, highlighting the similarities between the two fields and elaborate on the potential of transferring compressed feature representations for machine-to-machine communication. Based on insights drawn from this review, we devise future directions relevant to applications in EO and ESMs.