New Paper on Transformer Limits in Metocean Forecasting

Our team has published a new paper in Journal of Computational Science on when transformer models work well for metocean forecasting, and when simpler convolutional models remain a better choice.

The article, “Exploring the limitations of transformer models for metocean forecasting,” was authored by Julia Borisova, Mikhail Borisov, Stanislava Vostrikova, Viktor Golikov, Andrey Kuznetsov, Gleb Solovev, Alexander A. Stepanets, Mikhail Krinitskiy, and Nikolay O. Nikitin.

The paper studies three metocean forecasting settings: long-term Arctic sea ice forecasting, atmospheric bias correction, and ocean forecasting. The authors compare transformer-based architectures with convolutional neural networks and show that transformer limitations are conditional rather than universal.

For Arctic sea ice forecasting, transformer models such as TimeSformer and SwinLSTM struggled with annual dynamics, including summer melt, while lightweight CNN baselines performed better and improved error metrics by up to 30% against existing state-of-the-art systems. In atmospheric bias correction, CNNs also outperformed transformer-based alternatives, reducing errors in Global Forecast System fields by about 20% relative to transformers.

The ocean forecasting case produced a different result: transformer models with contrastive pre-training achieved stronger performance across ocean variables, including a 40% reduction in mixed layer depth error. The study therefore offers a practical message for metocean machine learning: start with CNNs when data are limited or fine spatial structure is critical, and use transformers when representation learning and pre-training can support the task.

Read the paper

Mikhail Krinitskiy
Mikhail Krinitskiy
Head of lab

Current research interests are machine learning and deep learning of various flavours applied in Earth Sciences started with observational applications, now shifted to generic data mining and natural processes modeling. The main applications are in Atmospheric sciences, including remote sensing, and also in Ocean sciences. There are also some applications in geochemistry and paleoreconstruction applications. Lecturing masters courses “Machine learning for Earth Sciences” and “Deep learning for Earth Sciences,” a.k.a. ML4ES and DL4ES (Rus.) in Moscow Institute of Physics and Technology and in Lomonosov Moscow State University.