Estimating the Carbon Footprint of BLOOM, a 176B Parameter Language Model
This article co-authored by Alexandra Sasha Luccioni, Sylvain Viguier, and Anne-Laure Ligozat estimates that BLOOM, a Large Language Model, with 176 billion parameters, emits 24.7 tonnes of CO2 during a single training.
Abstract: Progress in machine learning (ML) comes with a cost to the environment, given that training ML models requires computational resources, energy and materials. In the present article, we aim to quantify the carbon footprint of BLOOM, a 176-billion parameter language model, across its life cycle. We estimate that BLOOM’s final training emitted approximately 24.7 tonnes of CO2eq if we consider only the dynamic power consumption, and 50.5 tonnes if we account for all processes ranging from equipment manufacturing to energy-based operational consumption. We also carry out an empirical study to measure the energy requirements and carbon emissions of its deployment for inference via an API endpoint receiving user queries in real-time. We conclude with a discussion regarding the difficulty of precisely estimating the carbon footprint of ML models and future research directions that can contribute towards improving carbon emissions reporting.
Click on this link to read the full article.
Image credit: Image by Freepik