--- layout: '@/layouts/Doc.astro' title: 'AuroraGPT: Training Foundation Models on Supercomputers' date: '2025-12-16' location: 'ANL' --- Sam Foreman 2025-12-16 - [🧰 AuroraGPT: Toolbox](#toolbox-auroragpt-toolbox) - [👥 Team Leads](#busts_in_silhouette-team-leads) - [🤝 Teams](#handshake-teams) - [🏋️ Challenges](#weight_lifting-challenges) - [💾 AuroraGPT: Training](#floppy_disk-auroragpt-training) - [🍹 AuroraGPT: Blending Data, Efficiently](#tropical_drink-auroragpt-blending-data-efficiently) - [📉 Training AuroraGPT-7B on 2T Tokens](#chart_with_downwards_trend-training-auroragpt-7b-on-2t-tokens) - [📉 Training AuroraGPT-2B on 7T Tokens](#chart_with_downwards_trend-training-auroragpt-2b-on-7t-tokens) - [✨ Features](#sparkles-features) - [✨ Features (even more!)](#sparkles-features-even-more) - [🧬 MProt-DPO](#dna-mprot-dpo) - [🧬 Scaling Results (2024)](#dna-scaling-results-2024) - [🧬 MProt-DPO: Scaling Results](#dna-mprot-dpo-scaling-results) - [🚂 Loooooooooong Sequence Lengths](#steam_locomotive-loooooooooong-sequence-lengths) - [🌎 AERIS (2025)](#earth_americas-aeris-2025) - [👀 High-Level Overview of AERIS](#eyes-high-level-overview-of-aeris) - [➕ Contributions](#heavy_plus_sign-contributions) - [⚠️ Issues with the Deterministic Approach](#warning-issues-with-the-deterministic-approach) - [🎲 Transitioning to a Probabilistic Model](#game_die-transitioning-to-a-probabilistic-model) - [🌀 Sequence-Window-Pipeline Parallelism `SWiPe`](#cyclone-sequence-window-pipeline-parallelism-swipe) - [🚀 AERIS: Scaling Results](#rocket-aeris-scaling-results) - [🌪️ Hurricane Laura](#tornado-hurricane-laura) - [📓 References](#notebook-references) - [❤️ Acknowledgements](#heart-acknowledgements) - [Extras](#extras) ## 🧰 AuroraGPT: Toolbox - **Datasets and data pipelines** (how do we deal with scientific data?) - **Software infrastructure and workflows** (scalable, robust, extensible) - **Evaluation of state-of-the-art LLM Models** (how do they perform on scientific tasks?)
Figure 13: Hurricane Laura tracks (top) and intensity (bottom).
Initialized 7(a), 5(b) and 3(c) days prior to 2020-08-28T00z.