The new inference platform is expected to be launched at Nvidia’s annual GTC developer conference in San Jose later this ...
Taalas has launched an AI accelerator that puts the entire AI model into silicon, delivering 1-2 orders of magnitude greater ...
Inference will take over for training as the primary AI compute moving forward. Broadcom has struck gold with its custom ...
Under pressure from rivals, the chip giant is set to offer a new product focused on rapid processing of AI queries for ...
Nutanix partners with AMD on $250 million enterprise AI deal. Strategic investment includes $150M equity stake and $100M for ...
Microsoft researchers have developed On-Policy Context Distillation (OPCD), a training method that permanently embeds ...
Nvidia develops new Groq-powered inference platform for OpenAI after $20B licensing deal, set for GTC reveal next month. NVDA ...
Researchers from the University of Maryland, Lawrence Livermore, Columbia and TogetherAI have developed a training technique that triples LLM inference speed without auxiliary models or infrastructure ...
Dublin, Aug. 05, 2025 (GLOBE NEWSWIRE) -- The "AI inference - Company Evaluation Report, 2025" report has been added to ResearchAndMarkets.com's offering. The AI Inference Market Companies Quadrant is ...
Nvidia agreed to acquire Groq's AI inference chip assets for $20b, aiming to expand its position in AI deployment hardware. The company introduced its new Rubin chip platform, designed around next ...
Sandisk and SK hynix push High Bandwidth Flash (HBF) standard via OCP to cut AI inference costs and boost scalability.