9 min read

The Enterprise AI Journey: Five Phases from ChatGPT to Owning Your AI Factory

Picture of Marc Austin Marc Austin : Updated on June 25, 2026

Every enterprise we talk to is somewhere on the same road. The faces in the boardroom change, the industry changes, the model du jour changes — but the arc does not. It starts with a CEO declaring “we need to do AI.” It ends, eventually, with a sober conversation about owning the AI factory that produces the company’s intelligence advantage.

What most maps of this journey miss is that there are two journeys happening at once. The first is operational: copilots and productivity tools — AI that makes the company cheaper to run. The second matters more: AI built into the products the company sells. The first shows up as Operating Costs. The second shows up as Cost of Good Sold (COGS) — and when inference cost scales with every customer interaction, AI economics stop being an IT line item and start setting your gross margin.

This post maps the five phases of that journey. If you recognize yourself in Phase 1, this is a preview of what’s coming. If you’re already in Phase 3, this is permission to admit it.

Phase	What it looks like	Who feels the pain	The bill that ends it
1	Everything runs on frontier APIs	CFO, CSO	Token opex + AI-product COGS + proprietary data leaving the company’s control
2	Open weights in your hyperscaler VPC	CFO, CIO	2–3x GPU premium + per-GB egress lock-in
3	Neocloud GPUs, DIY everything else	CTO, CSO	Two quarters of platform engineering + security gaps
4	Neocloud with a real cloud network (Hedgehog)	CFO	~20% third-party AI platform tax
5	Own the factory: your infrastructure, Hedgehog network + platform	—	This is the destination

Phase 1: “We Need to Do AI” — The Public API Honeymoon

The CEO returns from an offsite, a customer dinner, or a board meeting and declares the new mandate. The CTO is tasked with developer adoption, the CMO with content, sales gets a copilot pilot, legal gets a research assistant. And — quietly at first — a product team ships the company’s first AI feature: an assistant inside the app, a natural-language interface, an AI summarization layer. All of it runs on frontier APIs, because that is where everyone starts.

For a while, this feels like flying. Productivity climbs. Customers actually use the new feature. The CEO tells the board the company is “AI-native now.”

Then two bills and a memo arrive.

The CFO’s first bill is opex. Token spend, which started as a rounding error, has become a real line item. A few hundred power users running agentic workflows, RAG over internal documents, and long-context summarization can push monthly API spend into six and seven figures — metered consumption with no committed-use discount and no FinOps playbook.

The CFO’s second bill is worse, because it’s COGS. The AI product feature worked. Usage now scales with customers — which means inference cost scales with revenue, and at frontier API prices every new customer arrives with a per-token royalty attached. The gross margin on the company’s first AI product is being set by a vendor’s price list. Pricing strategy meetings start including the phrase “tokens per session.”

The CSO’s memo is shorter. It explains, in the careful language security officers use when they want the executive team to actually read something, that nine months of the company’s crown jewels — its proprietary data: pricing strategy, M&A targets, source code, customer lists, unreleased roadmaps — have been crossing a vendor boundary. The provider’s terms likely promise not to train on API traffic, and that promise is probably kept. That isn’t the point. The data traversed infrastructure the enterprise neither owns nor audits, sat in someone else’s logs, under someone else’s retention policy. In regulated industries, the memo has a second paragraph about contract clauses and a third about regulators.

The conversation that ends Phase 1 usually contains some version of the line: “It’s the data, stupid.” Confidential data is the company’s only durable advantage in the AI era — the one thing competitors cannot replicate, and the raw material of every differentiated AI product it will ever ship. And it has been flowing across a vendor boundary because the API was easy.

Phase 1 ends with a decision: bring the model to the data, not the data to the model.

Phase 2: Open Weights on the Big Three

The CTO’s team gets a new charter: stand up an enterprise AI stack on AWS, Azure, or GCP — wherever the cloud footprint already sits. The pitch writes itself: “We already trust the hyperscaler with our data. We’ll run open-weight models in our own VPC. Same security envelope as the rest of the business.”

The team imports the corpus into the cloud data platform, evaluates open-weight models — Llama, Mistral, Qwen, DeepSeek, gpt-oss — builds RAG against proprietary documents, and fine-tunes. And something important happens: for narrow, high-value tasks — contract review, support triage, and crucially the AI features inside the product — the fine-tuned open model starts beating the frontier APIs on the metrics that matter. The product team now has a model competitors can’t call, trained on data competitors don’t have. For the first time, the company owns differentiated AI rather than renting undifferentiated AI — and the per-token royalty is gone; inference cost is now infrastructure cost.

For a quarter or two, this looks like the answer. Then the CFO gets the next bill.

Hyperscaler GPU pricing carries a structural premium — typically two to three times what specialist GPU providers charge for the same silicon. A modest production deployment — a couple of 8-GPU nodes for inference plus periodic fine-tuning bursts — clears six figures a month before storage, networking, and managed services. And because the AI product is growing, the inference fleet grows with it. The COGS problem hasn’t gone away; it has changed landlords.

The CFO asks for an optimization plan. The CIO comes back with the number nobody wants to look at: the cost of getting the data out. Hyperscaler internet egress runs roughly $0.05–$0.12 per GB depending on provider and volume — call it $50,000 to $90,000+ to move a petabyte, before cross-region replication, NAT processing, and inter-AZ transfer fees compound it. For AI workloads — where that proprietary data is repeatedly re-embedded, re-indexed, and re-trained — data transfer is a materially larger share of the bill than for classic workloads.

The second uncomfortable conversation of the journey: the company didn’t give the farm to the model vendors — it gave it to the hyperscaler. The data is expensive to move, the GPUs are structurally premium, and every byte that wants to leave is monetized.

Phase 2 ends with a new question: is there a cheaper place to run this — without the egress tax?

Phase 3: The Neocloud Experiment — Cheaper GPUs, Missing Cloud

On the surface, yes. The neoclouds were built for exactly this: dense GPU fleets, fast back-end interconnects, free or near-free egress increasingly standard, and pricing that undercuts the hyperscalers dramatically — for a production inference fleet, the difference compounds to hundreds of thousands of dollars per node per year, and a 16-node fleet becomes a seven-figure annual swing. On paper, the case closes itself.

The team migrates. And then it discovers what it bought.

The network gap. Neoclouds invested where the training benchmarks live: the back-end fabric — InfiniBand, rail-optimized topologies, GPUDirect RDMA. They did not invest in the front end. The foundational cloud network services an enterprise takes for granted — VPC isolation, security groups, managed routing into the corporate WAN, firewalls, load balancing — are missing, half-implemented, or bring-your-own. This should surprise no one: the hyperscalers took years and billions of dollars to build those capabilities. AWS went as far as acquiring an entire chip company, Annapurna Labs, and building its Nitro system to deliver cloud networking in custom silicon. A neocloud racing to stand up GPU capacity is not going to recreate that investment on the timeline the enterprise needs it. So the platform team spends the next two quarters building the cloud it thought it was buying: open-source policy engines, load balancers, firewalls, custom BGP, and the Terraform glue to hold it together. Every connection back to the data lake, the VPN, the identity provider is a custom integration. The savings start getting eaten by a platform team that wasn’t in the business case.

The security gap. The CSO’s audit finds tenant boundaries enforced by the scheduling software rather than the network, and management planes exposed in ways the hyperscalers stopped tolerating years ago. The starkest finding is east-west traffic: data moving laterally between workloads — including other customers’ workloads — on the same GPU cluster, often unencrypted and uninspected. The hyperscalers encrypt this traffic at multiple layers as a matter of course; most neoclouds do not, and without the network investment described above, realistically never will. Compliance posture varies widely across providers. For internal workloads, this is uncomfortable. For product workloads serving customer data, it can be disqualifying.

The due-diligence question. Many neoclouds are young, highly leveraged businesses financing enormous capex against multi-year contracts — a structure that works until it doesn’t, and a market that will consolidate. The CFO’s checklist gets longer: What does the provider’s balance sheet look like? How concentrated is its customer base? And the question that matters most: if this provider is acquired, restructured, or fails, what happens to our fine-tuned models, our embeddings, and our customer-facing inference traffic? These risks are manageable — but only if you ask before you migrate, and only if your architecture is portable enough to move.

Phase 3 ends with a hard-won realization: the neocloud GPU economics are right, but the cloud around the GPUs is missing. AI needs a real cloud network — without handing the GPU savings back to get one.

Phase 4: A Neocloud Built Like a Cloud — Enter Hedgehog

This is where Hedgehog enters the story. In Phase 4, the enterprise finds a neocloud — or pushes its provider to become one — whose network is built on Hedgehog: Hedgehog VPC, which enforces tenant isolation in the network itself and provides the familiar cloud networking controls — private networks, routing, security groups — that behave exactly as the enterprise’s teams and security policies already expect; and Hedgehog Gateway, which makes ingress and egress a first-class network service instead of a per-GB profit center.

What changes for the executive team:

For the CSO: segmentation is enforced and auditable at the network layer, producing the evidence regulators and customer security questionnaires actually accept. Compliance becomes a configuration exercise, not a custom integration project — which means the AI product can serve regulated customers.
For the CIO: the data-movement problem inverts. Training data flows in, inference results flow out, regions replicate, the corporate WAN connects — without watching the FinOps dashboard. That’s tens of thousands of dollars per petabyte saved on egress, and the far larger benefit of architecting freely instead of architecting around the bill.
For the CTO: the two-quarter platform detour ends. The foundational network services exist as products, with APIs that look like the ones the team already knows.
For the CFO: the Phase 3 GPU savings finally land in the P&L — on the internal opex line and the product COGS line — without being clawed back by hidden platform engineering.

One residual tax remains. Most neoclouds sell GPUs and storage, not platforms. To get from “we have GPUs” to “we have fine-tuned models serving production traffic,” most enterprises bolt on a third-party AI platform layer — Together AI, Modal, Anyscale, RunPod’s serverless tier, or similar. These platforms are good: model serving, autoscaling, fine-tuning pipelines, observability. They also charge a real premium — commonly around 20 percent on top of the underlying GPU cost. Phase 4 is the right architecture with one remaining toll booth.

Phase 5: Own the Factory

In Phase 5, the enterprise crosses the line from renter to owner. The premise, after four phases, is finally obvious: run open-weight models, fine-tuned on our private data, on AI infrastructure we own — powered by the Hedgehog AI Network and the Hedgehog AI Platform.

Let’s be honest about when this trade makes sense, because the obvious CFO objection is real: ownership means capex, power, facilities, and hardware refresh risk. Owning beats renting when you have sustained baseline load — and by the time an enterprise reaches this point, it almost always does: AI product features serving customers around the clock, copilots and RAG across every business unit, a regular fine-tuning cadence. Rent the bursts and the experiments; own the steady-state factory floor. Most Phase 5 enterprises run a hybrid — owned baseline, rented spikes — and the owned baseline is where the economics compound.

Every premium accumulated across the first four phases comes out of the cost structure. The network stops being a markup embedded in someone else’s GPU rate and becomes a flat, predictable cost the enterprise controls. The platform layer — model serving, autoscaling, fine-tuning, observability — stops being a percentage skimmed off every GPU-hour and becomes part of the stack the enterprise owns. Data movement stops being metered at all. The savings are real and quantifiable, but the strategic benefits are larger:

Gross margin control. For the AI products, inference COGS becomes the amortized cost of owned infrastructure instead of a vendor’s price list. The company — not its supplier — decides what its AI features cost to serve, and therefore how to price them. Pricing power comes home.
Air-gapped security. The data never leaves infrastructure the enterprise controls. No shared tenancy with a competitor — or a model vendor. For financial services, healthcare, defense, and critical infrastructure, air-gapped operation isn’t a feature; it’s the only acceptable answer on the compliance form.
Complete control of enterprise data. Training data, fine-tuning sets, embeddings, inference logs, RAG indexes — every form the company’s proprietary data takes — all on storage the enterprise owns. When a regulator asks where the data is, the answer is one sentence.
Free ingress and egress. The enterprise stops architecting around the bill. Pipelines pull from any source; inference serves any destination; replication is a design choice, not a budget line.
No DIY tax. The temptation is to build the factory from raw components — GPUs, network design, platform integration, a new team. We have watched enterprises spend two to three years and tens of millions doing exactly that, and arrive somewhere permanently behind the market. Hedgehog delivers the AI network and AI platform as products, on infrastructure the enterprise owns. The CTO deploys a factory instead of building one.

How Long Does the Journey Take?

Most enterprises we see are 12 to 18 months into Phase 1 or Phase 2 today. Converging on Phase 5 typically takes another 18 to 36 months of bill shocks, security audits, and architectural rework. The leaders who get there first hold two structural advantages over those still rationalizing Phase 2: a cost advantage on everything AI does inside the company, and a margin-and-data advantage on everything AI does inside their products.

The CEO who started this journey by saying “we need to do AI” eventually says something different: we need to own the means of producing our intelligence. That is the real destination. The five phases are just how long it takes the executive team to agree on it.

Hedgehog builds the AI network and the AI platform for enterprises that have decided to own their AI factory. If you recognize your organization in Phases 1 through 4, we’ll map where you are and model the economics with you — a working TCO comparison of rented APIs, hyperscaler VPC, neocloud, and owned infrastructure, built on your numbers, not ours. The bill shocks are predictable. So is the architecture that ends them.

1 min read

The AI Networking Revolution: Dawn of a New Epoch

Art Fewell : Mar 14, 2025 10:00:54 AM

When we look back at the history of networking, it's clear that the industry moves in distinct epochs. These technological eras aren't merely defined...

AI Network GPU network AI Inference AI Training Machine Learning AI Fine-tuning

1 min read

AI Training vs. Inference: Designing Networks for Real-World AI Workloads

Art Fewell : Apr 23, 2025 5:26:27 PM

The same AI model can require significant differences in infrastructure depending on whether you're training it or running it. Here's why your...

AI Network AI Inference AI Training Machine Learning AI Fine-tuning

1 min read

Dell'Oro Data Center Switch Report: The Market Is Choosing Ethernet for AI

Marc Austin : Mar 10, 2025 4:21:39 PM

Dell'Oro Group just released their 4Q 2024 Ethernet Switch - Data Center Report showing record-breaking sales fueled by AI buildouts and a recovery...

AI Network AI Edge GPU network AI Cloud AI Inference AI Training AI Fine-tuning