Key Takeaways

  • Inflationary pressure and weaker revenue growth for cloud service providers are leading to price increases. In most geographies the impact is marginal, though the increases are significant in some European countries.
  • Buying a vendor’s GenAI solution is usually more affordable than building one, thanks to economies of scale and in-house GPU availability. But building your own can make sense in certain scenarios.
  • For many years, x86 has dominated computer architecture, but Advanced RISC Machine (ARM) options are gaining in popularity thanks to their power efficiency and good performance at a lower cost.

For the first time since the debut of cloud services, prices began to edge up in the first six months of 2023, according to the Nimbus Pricing Index (NPI). Many have wondered if inflationary pressures and weaker revenue growth for cloud service providers (CSPs) would eventually lead to price increases. That time seems to have arrived. Although the impact is still marginal in most geographies, the price increases are significant in some European countries.

About the Series

BCG is pleased to present the latest installment of our Cloud Cover series. The goal of this series is to share the latest data, insights, and news on the evolving cloud industry, with a particular focus on three major cloud service providers: Amazon Web Services, Microsoft Azure, and Google Cloud Platform. As well as reviewing price movements in the cloud industry, this update focuses on the white-hot topic of generative AI and the debate over ARM versus x86 architectures.

The NPI, which reflects average pricing in US dollars across the three main CSPs, recorded an average increase of 0.3% per price point (in other words, the price offered by one vendor in one location). The largest single increase was about 2% in South America. (See Appendix 1 for the full NPI table.) We also note that price ranges in some locales nudged upward in the first half of 2023 compared to the second half of 2022.
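
As a rough illustration of the arithmetic behind a figure like the 0.3% average, the sketch below computes an average change per price point. The vendors, locations, and prices are invented placeholders for illustration only, not NPI data or actual CSP list prices.

```python
# Illustrative sketch only: all vendors, locations, and prices below are
# hypothetical placeholders, not NPI data or actual CSP list prices.
# A "price point" is one vendor's price in one location.
price_points = [
    # (vendor, location, price_h2_2022_usd, price_h1_2023_usd)
    ("VendorA", "location-1", 100.00, 100.20),
    ("VendorB", "location-1", 98.00, 98.00),
    ("VendorC", "location-2", 105.00, 107.10),  # roughly a 2% increase
]

# Percentage change for each price point, then the simple average across them.
changes = [(new - old) / old * 100.0 for _, _, old, new in price_points]
average_change = sum(changes) / len(changes)
print(f"Average change per price point: {average_change:+.2f}%")
```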

The four locations in Exhibit 1 are hubs where all three major CSPs have an offering that meets the specifications of the NPI. These modest increases indicate that CSPs continue to look for ways to boost revenue without resorting to direct price-list increases—especially in computing power, which is the largest component of most clients’ cloud billing.

But the story is a bit different when you consider pricing in other currencies. Because Azure changed its foreign currency exchange rates for some locations in Europe, billings denominated in euros, pounds, Danish kroner, Norwegian kroner, or Swedish kronor jumped by 9% to 15%. Elsewhere, Google Cloud Platform has raised network egress pricing, as well as pricing for some types of storage in specific locations such as Indonesia and South America.
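
To see how a change in billing exchange rates alone can move local-currency invoices by that much, consider the simple calculation below; the rates are hypothetical and do not reflect Azure’s actual figures.

```python
# Hypothetical example: a service with a flat USD list price billed in EUR.
# The exchange rates below are invented for illustration only.
usd_list_price = 1_000.00     # unchanged USD list price
old_billing_rate = 0.85       # EUR per USD before the repricing (hypothetical)
new_billing_rate = 0.95       # EUR per USD after the repricing (hypothetical)

old_invoice = usd_list_price * old_billing_rate   # 850.00 EUR
new_invoice = usd_list_price * new_billing_rate   # 950.00 EUR
increase = (new_invoice - old_invoice) / old_invoice * 100.0
print(f"Local-currency invoice increase: {increase:.1f}%")  # about 11.8%
```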

Generative AI Models: Build or Buy?

GenAI is a white-hot topic across industries, and that holds true for cloud services. Cloud workloads that perform AI training require a more sophisticated set of specifications than workloads built for standard operations, including large amounts of memory and specialized hardware such as optimized graphics processing units (GPUs).

GenAI raises the age-old tech question of whether to build or buy. There are many deployment options available for those looking to utilize GenAI, but in our analysis we chose to compare the two extreme ends of this spectrum: a from-scratch custom model versus a zero-customization vendor option. (See Exhibit 2.)

The models studied ranged from a low-volume, low-complexity model—one that sorts internal IT tickets to find high-priority items, for example—to a high-volume, high-complexity model, such as a customer-facing service chatbot for a large firm. This analysis provides us with a range of possible spending that includes computing power, storage, training, API costs, and human resourcing. (See Appendix 2 for details on the complexity and volume parameters for model types.) In future installments of Cloud Cover we will analyze other options within this range, such as using open-source models.

Learn More: Cloud Cover Series

According to our analysis, the vendor solution is always more affordable thanks to significant economies of scale and in-house GPU availability. Still, the decision about whether to build or buy is not always clear-cut, especially for low-volume models. For example, it’s only 1.2 times more expensive to build a low-volume, simple model in-house than to purchase an off-the-shelf version. (See Exhibit 3.)

Because off-the-shelf solutions limit customization (and potentially performance), a company might justify building its model in-house. But as volume and complexity rise, the additional spending needed to build in-house is harder to justify—for example, the cost of building a high-volume simple model is 11.4 times that of purchasing an off-the-shelf version.
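
A minimal sketch of this kind of comparison appears below. The cost categories mirror the ones listed earlier (computing power, storage, training, API calls, and human resourcing), but the function names, the per-call pricing model, and every dollar figure are hypothetical placeholders rather than figures from our analysis.

```python
# Minimal build-vs-buy sketch. All dollar figures, call volumes, and per-call
# prices are hypothetical placeholders used only to show the shape of the math.

def build_cost(compute, storage, training, staffing):
    """Annual cost of building and running a custom model in-house."""
    return compute + storage + training + staffing

def buy_cost(api_calls, price_per_call, staffing):
    """Annual cost of an off-the-shelf vendor model billed per API call."""
    return api_calls * price_per_call + staffing

# Low-volume, simple model (hypothetical figures)
low_build = build_cost(compute=40_000, storage=5_000, training=30_000, staffing=150_000)
low_buy = buy_cost(api_calls=500_000, price_per_call=0.01, staffing=180_000)

# High-volume, simple model (hypothetical figures): in-house GPU compute and
# staffing scale steeply, while vendor per-call pricing benefits from scale.
high_build = build_cost(compute=900_000, storage=30_000, training=150_000, staffing=400_000)
high_buy = buy_cost(api_calls=50_000_000, price_per_call=0.002, staffing=150_000)

print(f"Low volume:  build is {low_build / low_buy:.1f}x the cost of buying")
print(f"High volume: build is {high_build / high_buy:.1f}x the cost of buying")
```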

Vendor pricing for out-of-the-box models of similar complexity varies. (See Appendix 3 for select vendor pricing.) But the choice of vendor does not heavily affect overall cost, except perhaps for high-volume models, where API call costs begin to outweigh other costs such as resourcing.

Choosing the Right Architecture

As companies look for ways to keep cloud architecture costs down, more are considering switching their computing architecture from x86 to Advanced RISC Machine (ARM). For many years, x86 has dominated virtual machine deployment because these processors perform well with a high throughput, and because they are deeply embedded in existing hardware. But x86 also consumes significant energy and generates more heat.

For these reasons, CSPs have begun rolling out ARM-based options. ARM processors originated as mobile chips, frequently used in smartphones and other devices where power consumption and size are key factors. Amazon Web Services introduced its Graviton ARM-based processors in 2018; Azure followed with virtual machines based on the Ampere Altra processor and Google with its Tau T2A virtual machine series, both in 2022.

Along with their power efficiency (which can assist with ESG commitments) and good performance at a lower cost compared to x86, ARM services are highly scalable and can readily support spikes in usage. Research shows that using ARM does indeed yield savings: in most locations, ARM prices are about 20% lower than those of like-for-like x86 architectures, with discounts ranging from 19% all the way to 52%. (See Exhibit 4.)
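
The like-for-like comparison itself is simple arithmetic, as the short sketch below shows; the hourly rates are hypothetical, not actual CSP prices.

```python
# Hypothetical hourly rates for instances of comparable specification.
# Neither figure is an actual CSP list price.
x86_hourly = 0.20   # USD per hour, x86-based virtual machine
arm_hourly = 0.16   # USD per hour, like-for-like ARM-based virtual machine

saving = (x86_hourly - arm_hourly) / x86_hourly * 100.0
hours_per_year = 24 * 365
annual_delta = (x86_hourly - arm_hourly) * hours_per_year
print(f"ARM discount vs x86: {saving:.0f}%")                     # 20%
print(f"Annual saving per always-on VM: ${annual_delta:,.2f}")   # $350.40
```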

There are, however, downsides to using ARM architectures. ARM processors have a simplified instruction set, so while they can calculate quickly, their ability to perform multiple actions simultaneously and run parallel tasks through a single CPU is more limited. Another significant drawback is that firms need to rearchitect existing x86 applications to work on ARM-based virtual machines to achieve optimal performance. But when this rearchitecting occurs, companies can improve performance over x86 equivalents due to ARM’s higher ratio of physical CPUs per vCPU.

Please keep an eye out for the next issue of Cloud Cover, where we’ll continue to develop the NPI and address key questions on cloud services and pricing—including insights on specialist offerings, other CSPs, and price evolution over time.