TY - GEN
T1 - Untangling GPU Power Consumption
T2 - 2026 European Conference on Computer Systems, EUROSYS 2026
AU - Jacquet, Pierre
AU - Agusti, Maxime
AU - Caron, Eddy
AU - Coti, Camille
AU - De Assunção, Marcos Dias
AU - Lefèvre, Laurent
AU - Orgerie, Anne Cécile
N1 - Publisher Copyright:
© 2026 Copyright held by the owner/author(s)
PY - 2026/4/26
Y1 - 2026/4/26
N2 - As the demand for AI-driven workloads increases, the energy consumption of Graphics Processing Units (GPUs) devices has come under intense scrutiny, particularly in hyperscale data centers where large numbers of accelerators are centralized and leased to diverse clients. In the context of cloud hyperscalers, GPUs power monitoring presents several challenges that vary depending on the product offered. The monitoring capabilities of physical devices may be limited or even absent for some products. However, given the substantial energy demands of GPUs, power monitoring is essential for both cloud providers and clients. Operators require tools to manage power distribution effectively, such as balancing workloads across Power Distribution Units (PDUs), while clients need visibility into power usage to optimize their workloads for energy efficiency. To address these challenges, we propose methods for estimating the energy consumption of jobs running on GPU devices in cloud environments, spanning from shared and managed offerings like ML-as-a-Service (MLaaS) to less managed products (e.g., Infrastructure-as-a-Service (IaaS)). Our models demonstrate the benefits of sharing GPUs for small AI workloads, as well as the current sub-optimal utilization of GPUs in cloud hyperscalers, based on insights from an IaaS GPU cluster.
AB - As the demand for AI-driven workloads increases, the energy consumption of Graphics Processing Units (GPUs) devices has come under intense scrutiny, particularly in hyperscale data centers where large numbers of accelerators are centralized and leased to diverse clients. In the context of cloud hyperscalers, GPUs power monitoring presents several challenges that vary depending on the product offered. The monitoring capabilities of physical devices may be limited or even absent for some products. However, given the substantial energy demands of GPUs, power monitoring is essential for both cloud providers and clients. Operators require tools to manage power distribution effectively, such as balancing workloads across Power Distribution Units (PDUs), while clients need visibility into power usage to optimize their workloads for energy efficiency. To address these challenges, we propose methods for estimating the energy consumption of jobs running on GPU devices in cloud environments, spanning from shared and managed offerings like ML-as-a-Service (MLaaS) to less managed products (e.g., Infrastructure-as-a-Service (IaaS)). Our models demonstrate the benefits of sharing GPUs for small AI workloads, as well as the current sub-optimal utilization of GPUs in cloud hyperscalers, based on insights from an IaaS GPU cluster.
KW - Cloud computing
KW - GPU
KW - Power consumption
UR - https://www.scopus.com/pages/publications/105038436791
U2 - 10.1145/3767295.3769333
DO - 10.1145/3767295.3769333
M3 - Contribution to conference proceedings
AN - SCOPUS:105038436791
T3 - EUROSYS 2026 - Proceedings of the 2026 European Conference on Computer Systems
SP - 624
EP - 640
BT - EUROSYS 2026 - Proceedings of the 2026 European Conference on Computer Systems
PB - Association for Computing Machinery, Inc
Y2 - 27 April 2026 through 30 April 2026
ER -