What's new
GR WEB DEV | Buy and Download | Watch and Download | one line of code

Register a free account today to become a member! Once signed in, you'll be able to participate on this site by adding your own topics and posts, as well as connect with other members through your own private inbox!

  • You can directly chat with any of Staff Member for help.

Microsoft announces powerful new chip for AI inference

Maia200-Hero-Image.png


Microsoft announces powerful new chip for AI inference Lucas Ropek 8:00 AM PST · January 26, 2026 Microsoft has announced the launch of its latest chip, the Maia 200, which the company describes as a silicon workhorse designed for scaling AI inference.

The 200, which follows the company’s Maia 100 released in 2023 , has been technically outfitted to run powerful AI models at faster speeds and with more efficiency, the company has said. Maia comes equipped with over 100 billion transistors, delivering over 10 petaflops in 4-bit precision and approximately 5 petaflops of 8-bit performance — a substantial increase over its predecessor.

Inference refers to the computing process of running a model, in contrast with the compute required to train it. As AI companies mature, inference costs have become an increasingly important part of their overall operating cost, leading to renewed interest in ways to optimize the process.

Microsoft is hoping that the Maia 200 can be part of that optimization, making AI businesses run with less disruption and lower power use. “In practical terms, one Maia 200 node can effortlessly run today’s largest models, with plenty of headroom for even bigger models in the future,” the company said.

Microsoft’s new chip is also part of a growing trend of tech giants turning to self-designed chips as a way to lessen their dependence on Nvidia, whose cutting-edge GPUs have become increasingly pivotal to AI companies’ success. Google, for instance, has its TPU, the tensor processing units — which aren’t sold as chips but as compute power made accessible through its cloud . Then there’s Amazon Trainium, the e-commerce giant’s own AI accelerator chip, which just launched its latest version , the Trainium3, in December. In each case, the TPUs can be used to offload some of the compute that would otherwise be assigned to Nvidia GPUs, lessening the overall hardware cost.

With Maia, Microsoft is positioning itself to compete with those alternatives. In its press release Monday, the company noted that Maia delivers 3x the FP4 performance of third-generation Amazon Trainium chips, and FP8 performance above Google’s seventh generation TPU.

Microsoft says that Maia is already hard at work fueling the company’s AI models from its Superintelligence team. It has also been supporting the operations of Copilot, its chatbot. As of Monday, the company said it has invited a variety of parties — including developers, academics, and frontier AI labs — to use its Maia 200 software development kit in their workloads.

Techcrunch event Disrupt 2026 Tickets: One-time offer Tickets are live! Save up to $680 while these rates last, and be among the first 500 registrants to get 50% off your +1 pass. TechCrunch Disrupt brings top leaders from Google Cloud, Netflix, Microsoft, Box, a16z, Hugging Face, and more to 250+ sessions designed to fuel growth and sharpen your edge. Connect with hundreds of innovative startups and join curated networking that drives deals, insights, and inspiration. Disrupt 2026 Tickets: One-time offer Tickets are live! Save up to $680 while these rates last, and be among the first 500 registrants to get 50% off your +1 pass. TechCrunch Disrupt brings top leaders from Google Cloud, Netflix, Microsoft, Box, a16z, Hugging Face, and more to 250+ sessions designed to fuel growth and sharpen your edge. Connect with hundreds of innovative startups and join curated networking that drives deals, insights, and inspiration. San Francisco | October 13-15, 2026 REGISTER NOW Topics

Lucas Ropek Senior Writer, TechCrunch

October 13-15 San Francisco, CA Tickets are live at the lowest rates of the year. Save up to $680 on your pass — and if you’re among the first 500 registrants, score a +1 pass at 50% off . Meet investors. Discover your next portfolio company. Hear from 250+ tech leaders , dive into 200+ sessions , and explore 300+ startups building what’s next. Don’t miss these one-time savings.

Most Popular TikTok users freak out over app’s ‘immigration status’ collection — here’s what it means Sarah Perez

Researchers say Russian government hackers were behind attempted Poland power outage Zack Whittaker

Microsoft gave FBI a set of BitLocker encryption keys to unlock suspects’ laptops: Reports Lorenzo Franceschi-Bicchierai

Capital One acquires Brex for a steep discount to its peak valuation, but early believers are laughing all the way to the bank Connie Loizos

Anthropic’s CEO stuns Davos with Nvidia criticism Connie Loizos

-- --
 
Back
Top