VCF 9 enables private AI
VMware Cloud Foundation 9 includes a so-called Private AI Practice: an integrated way to make GPU capacity available, segment it, and connect it to applications and containers. The platform can indeed help set up a sovereign AI environment on private infrastructure. However, it is important to understand what VCF 9 does and does not solve.
VMware has traditionally been a virtualization platform. It divides hardware into smaller, logically separated units from large physical resources. This is useful for standard workloads. But large AI models behave differently. They require as much contiguous GPU memory as possible. In that context, splitting hardware is counterproductive: the more you virtualize, the less memory is available per model.
VCF 9 therefore has a clear use case. It is suited for specific, well-defined scenarios where a smaller model needs to run within an existing private cloud environment. Think of an organization that wants to automate a targeted AI task, keep data private, and does not have the budget or scale for a full AI factory. In that scenario, VCF 9 with its Private AI Practice can deliver a platform at the push of a button that exposes GPUs, connects applications, and manages containers. That is valuable functionality.
However, for organizations that want to host large open-source models to build a broad Claude-like internal experience, VCF 9 is not the primary choice. In that case, bare-metal GPU infrastructure with maximum available memory is the logical path. That has little to do with VMware.
Technology is not ideology. The right choice depends on the use case.
Scale determines the use case
There is a clear split in how organizations approach private AI.
On one side are organizations aiming to build a full internal AI experience. They want to run a large, powerful model on their own hardware, fully sovereign, with full control over data and output. This requires serious infrastructure and a well thought-out business model. Model scale determines output quality, and quality comes at a price.
On the other side are organizations with a specific, well-defined need. They want to automate one process, analyze one category of alerts, or query one data source. For that, a large model is unnecessary. A smaller, fine-tuned local model can provide sufficient intelligence without the infrastructure overhead of a full AI platform—and without the ongoing token costs of an external API.
Between these two extremes sits the mid-market and SME segment: organizations that are thinking about their data but do not want to invest hundreds of thousands in their own hardware. For them, a hybrid approach is often the most logical path: hosting with a provider that already manages the data, offers a sovereign platform, and can expose GPU capacity without requiring a full AI factory to be built in-house.
The question behind the question
When organizations think about Private AI, they are effectively answering four questions at once:
What do we want to do with AI, and how intensive will its use be?
What data is involved, and how sensitive is it?
Which model fits the use case, and how much quality loss is acceptable?
What infrastructure is required, and what does it realistically cost?
Only once these questions are answered does it make sense to discuss platforms, tooling, and architecture.
VCF 9, bare-metal GPUs, Chinese open-source models, European APIs: these are all tools. The point is to choose the right tool for the right job.
Fundaments supports that decision-making process. Not from a preference for any specific platform, but from years of experience with infrastructure under critical workloads and the conviction that architectural choices start with the question, not the technology.