Amazon Sells Custom AI Chips to Challenge Nvidia's Market Control

Amazon Sells Custom AI Chips to Challenge Nvidia's Market Control | Fazen Markets

Amazon has confirmed plans to sell its custom-designed inferentia and trainium AI accelerator chips to external customers, directly challenging Nvidia's dominance in the critical AI hardware market. The strategy was reported by Barrons on 18 June 2026, marking a significant escalation in the cloud provider's competition with the premier chip designer. This commercial move follows years of internal development for Amazon Web Services and arrives as Nvidia trades at $210.69, up 1.58% on the day, and Amazon shares hover at $244.39 as of mid-morning UTC today.

Context — why this matters now

Nvidia has established a commanding position in the data center GPU market, with analysts estimating its share exceeds 80% for AI training workloads. The last major challenge to its architectural dominance came in late 2022 when multiple cloud providers, including Google with its TPU and Microsoft with its Maia chip, announced custom silicon projects. These were largely kept for internal use, preserving Nvidia's stranglehold on the commercial market.

The current catalyst is the explosive demand for inference, the phase of AI where trained models answer queries. Inference workloads are forecast to represent over 70% of AI compute spending by 2027, as models like those from OpenAI and Anthropic are deployed at scale. Nvidia's general-purpose GPUs, while powerful, are often seen as over-engineered and expensive for this specific task. Amazon's decision to commercialize its chips now targets this massive, growing segment where cost-per-inference is the primary metric.

Data — what the numbers show

The market data illustrates the scale of the competition Amazon is entering. Nvidia's stock has a market capitalization exceeding $5.2 trillion, buoyed by its AI monopoly. In contrast, Amazon trades with a market cap of approximately $2.5 trillion. Amazon's custom chips, like the Trainium2, claim to deliver up to 4x better price-performance for training large language models compared to previous generations. For inference, the Inferentia2 chip promises 50% higher throughput and 40% lower latency than its predecessor.

A key comparison lies in cloud provider spending. Prior to this move, AWS, Microsoft Azure, and Google Cloud were collectively forecast to spend over $150 billion on data center hardware in 2026, with a significant portion flowing to Nvidia. By selling its own chips, Amazon aims to capture a direct revenue stream while potentially reducing its own capital expenditure on third-party hardware. The financial impact will be measured in gross margins; Nvidia's data center gross margin sits above 70%, a target for any successful challenger.

Analysis — what it means for markets / sectors / tickers

The direct second-order effect is pressure on Nvidia's pricing power and market share in the inference segment. Companies like Meta and Tesla, which operate massive AI inference clusters, could become early customers for Amazon's more cost-optimized chips. This would directly reduce Nvidia's total addressable market. It also benefits semiconductor foundry TSMC, as Amazon's chips are manufactured on its advanced processes, diversifying its customer base beyond Nvidia and AMD.

The acknowledged risk is software lock-in. Nvidia's CUDA platform is the entrenched ecosystem for AI developers. Amazon's success hinges on its ability to provide a competitive software stack, including frameworks like PyTorch and TensorFlow. Without strong software, even superior hardware can struggle for adoption. Recent flow data shows increased institutional interest in semiconductor supply chain companies, like ASML and LAM Research, as the market diversifies away from a single vendor.

Outlook — what to watch next

The first major catalyst is Amazon's Q2 2026 earnings call, scheduled for late July. Management commentary on early customer interest and projected revenue from chip sales will be critical. The second catalyst is the launch timeline for Nvidia's next-generation Rubin architecture, expected in late 2026, which will define the new performance benchmark.

Key levels to watch include Nvidia's stock price holding above its 200-day moving average near $205 and Amazon's ability to break through resistance at its yearly high of $245.73. The performance of the iShares Semiconductor ETF (SOXX) relative to the Nasdaq will indicate whether the market views this as a net positive for the broader sector or a zero-sum share shift. The conditional outcome hinges on whether Amazon can secure design wins with at least one other major cloud provider or large-scale enterprise by year-end.

Frequently Asked Questions

How does Amazon's move compare to Google's TPU strategy?

Google's Tensor Processing Units (TPUs) have been available externally since 2018, but primarily through Google Cloud's platform-as-a-service model. Customers rent access to TPU pods but cannot purchase the physical chips. Amazon's strategy is more direct, selling the chips outright for deployment in customer-owned data centers or other clouds. This is a more aggressive commercial challenge to Nvidia's hardware business model.

What does this mean for companies using AI, like OpenAI or Anthropic?

For AI model developers, increased competition in the hardware market should lead to lower compute costs over time. This could improve the economics of running large-scale inference services. It also provides more architectural choice, allowing companies to optimize different parts of their workload on different chips. However, it may add complexity to their engineering teams, who must now support and optimize for multiple hardware backends.

Will this affect the price of cloud AI services for developers?

Yes, but indirectly. Amazon is likely to use this strategy to lower its own infrastructure costs for AWS. Those savings could be passed on to customers using AWS's AI services, like SageMaker and Bedrock, in the form of lower pricing or more generous free tiers. The competitive pressure may also force Google Cloud and Microsoft Azure to re-evaluate their pricing for AI-centric virtual machine instances to remain competitive.

Bottom Line

Amazon's commercial chip sale directly attacks Nvidia's most profitable market, aiming to fragment the AI hardware ecosystem and capture value.

Disclaimer: This article is for informational purposes only and does not constitute investment advice. CFD trading carries high risk of capital loss.

Amazon Sells Custom AI Chips to Challenge Nvidia's Market Control

Vortex HFT — Free Expert Advisor

Key Takeaways

Trade the Markets Discussed in This Article

Context — why this matters now

Data — what the numbers show

Analysis — what it means for markets / sectors / tickers

Outlook — what to watch next

Frequently Asked Questions

How does Amazon's move compare to Google's TPU strategy?

What does this mean for companies using AI, like OpenAI or Anthropic?

Will this affect the price of cloud AI services for developers?

Bottom Line

Trade XAUUSD on autopilot — free Expert Advisor

Stay informed

Ready to trade the markets?

Related

Intel Stock Jumps 14.5% to $134 on AI Data Center Momentum

Micron Stock Surges 24% to $140 on AI Memory Cycle Optimism

Commonwealth Bank Appoints New CIO and CTO, Boosts Tech Spend

Taiwan Launches Combat Readiness Drill Amid Rising China Tensions