← All Posts

AWS Graviton5 Debuts with 192 Arm Cores and PCIe 6.0

The Hot Take: ARM seems to be breaking out from everywhere. Fujitsu, Nvidia, AWS and ARM. Qualcomm seems to be playing catch up in the server market from the looks of it.

AWS has provided a first look at its next-generation Graviton5 processor, a custom server CPU developed by Annapurna Labs for deployment across the company's cloud computing platform and AI inference infrastructure.

Read the full article

Intel details long-awaited Crescent Island AI GPU at Computex, boasts up to 480 GB of LPDDR5X to combat memory shortages — company shares more details of its Xe3P inference accelerator at Computex

The Hot Take: Intel moving fast to make up lost ground on this front for sure. From the looks trying to hit the $ sweet spot too.

Intel revealed more details of its next-gen Data Center GPU, code-named Crescent Island, at Computex 2026. This inference-optimized chip will feature up to 480GB of LPDDR5X memory for efficient handling of massive AI contexts.

Read the full article

Intel Resurrects On-Package Memory With Razor Lake-AX, Loading Up LPDDR6 to Hunt Down AMD’s Medusa Halo by 2028

The Hot Take: APU's competition starting to heat up for that Ai dollar.

Intel's next-generation Razor Lake-AX chips will compete directly against AMD's Medusa Halo while featuring on-package memory. Intel Is Bringing Back On-Package Memory With Its Next-Gen Razor Lake-AX Chips That Fight Against AMD's Medusa Halo On-Package Memory was last used by Intel for its Lunar Lake SoCs. These SoCs were aimed at low-power mobile platforms, and while the chips themselves offered solid performance in a 30W budget, Intel's next on-package memory solution will be a big one. As per Haze2K1 on X, Intel Razor Lake-AX SoCs will feature on-package memory. This is a big deal as moving the DRAM closer to […]Read full article at https://wccftech.com/intel-resurrects-on-package-memory-with-razor-lake-ax-to-hunt-down-amd-medusa-halo/

Read the full article

'Changing of the Guard'? AMD, Intel, and Micron Soar While Nvidia Lags

The Hot Take: AMD seems to be out performing Intel & Nvidia on the market, while Nvidia is still the preferred Ai holy-grail? Just seems odd.

While Nvidia has dominated the "infrastructure boom" since 2022's launch of ChatGPT and "the generative AI craze," CNBC writes that "This week offered the starkest illustration yet of what MIzuho analyst Jordan Klein said could be a 'changing of the guard in AI.'" Chipmakers Advanced Micro Devices and Intel notched gains of about 25%, while memory maker Micron jumped more than 37% and fiber-optic cable maker Corning climbed about 18%. All four of those companies have more than doubled in value this year, with Intel leading the way, up well over 200%. Nvidia, meanwhile, is only slightly ahead of the Nasdaq in 2026, gaining 15% for the year, aided by an 8% rally this week. In spreading the wealth to a wider swath of hardware companies, investors are clearly betting that the bull market in AI has long legs and that data centers are going to need a wider array of advanced components for years to come. Memory has been the biggest theme of late due to a global shortage that's driven up prices and turned Micron, a 47-year-old company tucked in a sleepy corner of the semiconductor market, into one of the hottest trades over the past 12 months. Micron blew past an $800 billion market capitalization for the first time this week, and the stock is now up over 750% in the past year. CEO Sanjay Mehrotra told CNBC in March that key customers are only getting "50% to two-thirds of their requirements" because of supply issues. The memory market is largely dominated by Micron, along with Korea-based Samsung and SK Hynix, which are also both in the midst of historic rallies... Bank of America estimates the data center CPU market could more than double from $27 billion in 2025 to $60 billion in 2030. AMD's quarterly results this week underscored the emerging trend, as earnings, revenue and guidance sailed past estimates on strong data center growth. The company has long led the CPU charge, and CEO Lisa Su said on the earnings call that AMD now expects 35% growth over the next three to five years in the server CPU market, up from a forecast of 18% growth that the company provided in November. The article cites two other big movers: Intel "is in the midst of a revival sparked by a major investment from the U.S. government last year. Intel's stock had its best month on record in April, more than doubling, and has continued notching massive gains, rising 33% in the early days of May." Nvidia still remains the world's most valuable company "and is expected to show revenue growth of 70% this fiscal year," the article points out — adding that companies like Corning are also benefiting from Nvidia partnerships. "Glass maker Corning, which celebrated its 175th anniversary this week, signed a massive deal with Nvidia on Wednesday that involves the development of three new U.S. factories dedicated entirely to optical technologies... likely a major step in Nvidia's move away from copper cables and towards fiber-optic cables as it builds out its rack-scale systems." Read more of this story at Slashdot.

Read the full article

Claude hitches ride on SpaceX's datacenter capacity

The Hot Take: Ai usage growing pretty steady and fast it would appear.

Anthropic is partnering with SpaceX to ease capacity constraints that have stranded Claude customers, a gesture that may soothe developer discontent about service availability and cost. Ami Vora, chief product officer at Anthropic, announced the expanded rate limits during Code for Claude, a developer event livestreamed from San Francisco. "As of today, we are increasing rate limits for developers on Claude Code and the Claude Platform," said Vora. "More specifically, we are doubling Claude Code's five-hour rate limits for Pro, Max, Team, and seat-based enterprise plans. And we're raising our API limits considerably for Claude Opus." Anthropic is also ending its peak hours limit reduction on Claude Code for Pro and Max accounts. The AI biz is able to do this, she explained, thanks to a partnership with SpaceX that expands available inference capacity. Anthropic has struck a deal to use "all the capacity of [SpaceX’s] Colossus 1 data center." According to SpaceX, "Colossus 1 features over 220,000 Nvidia GPUs, including dense deployments of H100, H200, and next-generation GB200 accelerators." The deal adds more than 300 megawatts of new capacity within the month and follows similar compute arrangements with Amazon and Google/Broadcom. The company's insatiable hunger for processing power may even take it into space. Anthropic says that it "expressed interest in partnering with SpaceX to develop multiple gigawatts of orbital AI compute capacity." In recent months, Anthropic has struggled to meet unexpected demand for Claude services – its models became sufficiently capable to win over skeptical developers and usage patterns shifted as a result of the popularity of OpenClaw's long-running agents. "Year over year, API volume is up nearly 17x on the cloud platform," said Vora. "And on Claude Code, the average developer is now spending 20 hours per week running Claude." Amid this growing popularity, Anthropic has also wrestled with bugs that affected model performance. During her presentation, Vora tempered expectations by noting that no new model would be announced. Instead, she presided over a review of new and recent Claude features in an effort to frame model improvements as exponential. The salient exponent here would be two – the doubling of Claude's five-hour rate limits. Model performance, as measured by benchmarks, has been incremental. Opus 4.7 is a few percentage points better than Opus 4.6 in various measurements, not twice as capable or more. That didn't stop Vora from claiming, "even though model capabilities are improving on an exponential, most organizations are still adopting AI on a linear path." Vora's use of "exponential" may be more of a thematic framing device than a literal assertion of progress, a device to draw a contrast between Claude's capabilities and a more cautious pace of corporate AI adoption. She cast the upcoming feature review as an opportunity for customers to see where Claude development is headed, "So you can plan for it and ride the exponential with us." The remainder of the presentation consisted of a summary of recent Claude feature improvements. These include: multi-agent orchestration, outcomes, and dreaming – a capability that showed up in the recent Claude Code source leak. "With Dreaming," explained Angela Jiang, head of product for the Claude platform, "Claude is actually able to self-learn. It's able to actually inspect over its previous sessions, figure out skills that it missed, lessons it should have learned, and actually apply those directly to memory on its own." Boris Cherny, head of Claude Code, took a turn on stage to remind everyone about Routines, a way to trigger and run Claude jobs locally or on cloud servers. "Routines can be run on a schedule, they can be kicked off by webhooks, or they can even be kicked off by arbitrary API calls, you can run them locally on your machine or on remote cloud compute," he said. Cherny said, "for me personally, a lot of my code nowadays is written by routines. I'm not the one doing the prompting. I'm the one creating a routine that does the prompting." Who wouldn't want to "ride the exponential" when one's company is paying the API bill? ®

Read the full article

Arm and AMD eye agentic AI CPU feast

The Hot Take: So was ARM not breaking into the WinTel market so they shifted to Ai market or are they just chasing the highest dollars?

Investment bank UBS reckons agentic AI will send CPU demand soaring, with Arm and AMD best placed to grab the spoils. UBS analysts believe the growth of agentic AI software will drive strong demand for CPUs in the AI era. The bank said agentic AI increases processor workloads and favours chips with higher core counts and better power efficiency. That view gives Arm the biggest potential upside, followed by AMD. Intel could benefit too, since a growing total addressable market tends to lift more than one silicon boat. In fresh coverage of British chip design house Arm, UBS said CPU demand is surging. The bank said agentic AI computing will favour chips with higher core counts and a bias towards power efficiency. UBS reckons the total server market could grow five times by calendar year 2030. It put the figure at $170bn, up from $30bn in calendar year 2025. Within that market, UBS expects Arm to benefit the most. The bank said Arm could potentially grab as much as 40-45 per cent of the total share, which would make the x86 crowd choke on its roadmaps. The bank’s report cited expert comments behind three main themes explaining the surge in CPU demand. The first is that agentic AI workloads rely more heavily on CPU cores. That shift is expected to require a three- to fivefold increase in CPU core counts per user and per GPU. Servers with standalone CPUs will need more chips, which is the kind of problem chip sellers enjoy having. UBS said that demand for agentic AI will push some workloads to local PCs. It pointed to Anthropic’s Claude Code as an example. The need for higher core counts and power efficiency should tilt demand first towards Arm and then AMD. That sounds grim for anyone still selling yesterday’s watt-guzzling boxes as tomorrow’s AI answer. Chipzilla could still serve this market through its Coral Rapids platform, according to UBS. The catch is that benefiting from a bigger market and winning the best bits of it are not quite the same trick.  

Read the full article

Tenstorrent Vows to ‘Crush Everyone’ as Galaxy Blackhole Hits 350 Tokens/s on DeepSeek R1, Undercutting NVIDIA’s GB300 5x AI TCO

The Hot Take: RISC-V breaking the GPU strangle hold on GPU's? YES PLEASE.

Tenstorrent made a bold claim during their TT-Deploy livestream, saying they are going to crush everyone at everything, including AI, with their Galaxy servers. Tenstorrent Galaxy Supercluster Offers 10x Faster GenAI Video, And Destroys Current-Gen GPUs With "Blitz Mode", Offering 350+ Tokens/s In DeepSeek R1 Jim Keller and his Tenstorrent are on a mission to challenge the existing AI hierarchy with their RISC-V-powered platforms. As such, the company unveiled its latest Galaxy Blackhole servers for AI at scale. With Galaxy Blackhole, Tenstorrent offers a fully Networked and native AI solution that includes compute, memory, and networking, all unified into a […]Read full article at https://wccftech.com/tenstorrent-vows-to-crush-everyone-galaxy-blackhole-hits-350-tokens-on-deepseek-r1-undercut-nvidia-gb300-ai-tco/

Read the full article

Intel & AMD Work On APX, The Next Major Step In The Evolution of x86 Architectures, Adds More Performance Without Requiring More Die Area & Power

The Hot Take: Just what we need Ai specific instructions for them to gobble up all the CPU's now.

APX or Advanced Performance Extensions are the next evolution of x86 as Intel & AMD co-develop new standards for the architecture. APX Expands the x86 Instruction Set, Bringing Faster Performance & New Features That Will Benefit Both Intel and AMD's Next-Gen Chips Two days ago, we talked about ACE (AI Compute Extensions), which is a unified instruction set that aims to increase matrix-multiply performance for next-gen x86 chips. ACE is just one part of the grander scheme in which both Intel and AMD are working together to evolve the x86 architecture under a single unified framework through the recently established […]Read full article at https://wccftech.com/intel-amd-work-on-apx-the-next-major-step-in-the-evolution-of-x86-architectures/

Read the full article