Inference - Search News

Report: Nvidia is working on a top secret AI inference chip that could debut next month

The new inference platform is expected to be launched at Nvidia’s annual GTC developer conference in San Jose later this ...

10d

Taalas Launches Hardcore Chip With ‘Insane’ AI Inference Performance

Taalas has launched an AI accelerator that puts the entire AI model into silicon, delivering 1-2 orders of magnitude greater ...

Forget AI Training: AI Inference Is the Real Money Maker in 2026. Here Are 2 Stocks to Own.

Inference will take over for training as the primary AI compute moving forward. Broadcom has struck gold with its custom ...

2don MSN

Nvidia Plans New Chip to Speed AI Processing, Shake Up Computing Market

Under pressure from rivals, the chip giant is set to offer a new product focused on rapid processing of AI queries for ...

Nutanix, AMD Bet $250 Million On Enterprise AI Inference

Nutanix partners with AMD on $250 million enterprise AI deal. Strategic investment includes $150M equity stake and $100M for ...

Microsoft's new AI training method eliminates bloated system prompts without sacrificing model performance

Microsoft researchers have developed On-Policy Context Distillation (OPCD), a training method that permanently embeds ...

Blockonomi

Nvidia Partners with Groq on New Inference Platform as OpenAI Seeks Speed

Nvidia develops new Groq-powered inference platform for OpenAI after $20B licensing deal, set for GTC reveal next month. NVDA ...

Researchers baked 3x inference speedups directly into LLM weights — without speculative decoding

Researchers from the University of Maryland, Lawrence Livermore, Columbia and TogetherAI have developed a training technique that triples LLM inference speed without auxiliary models or infrastructure ...

Yahoo Finance

AI Inference Company Evaluation Report 2025 | NVIDIA, AMD, and Intel Compete for Dominance with Diverse Hardware and Strategic Partnerships

Dublin, Aug. 05, 2025 (GLOBE NEWSWIRE) -- The "AI inference - Company Evaluation Report, 2025" report has been added to ResearchAndMarkets.com's offering. The AI Inference Market Companies Quadrant is ...

17d

Nvidia Deepens AI Inference Push With Groq Deal And Rubin Platform

Nvidia agreed to acquire Groq's AI inference chip assets for $20b, aiming to expand its position in AI deployment hardware. The company introduced its new Rubin chip platform, designed around next ...

Sandisk partners with SK hynix to create global standard of high-bandwidth flash for AI inference

Sandisk and SK hynix push High Bandwidth Flash (HBF) standard via OCP to cut AI inference costs and boost scalability.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results