LLM Inference Optimization

LLM Data Mixture Breaks When Training Pools Shift: Causal Inference Offers Fix

LLM training data mixture optimization breaks when training pools shift — every prior proxy experiment becomes stale.

DeepSeek open sources DSpark, a new framework to speed up LLM inference by up to 85%

DSpark can make decoding faster, but acceptance quality still determines how much speed the system actually realizes.

OpenAI and Broadcom unveil 'Jalapeño' Intelligence Processor for LLM inference

"A blank-slate design for modern LLM inference, not a general-purpose accelerator adapted from earlier AI workloads" ...

11d

From ChatGPT to Chips: OpenAI Unveils Jalapeño to Power Faster LLMs and More Affordable AI

OpenAI and Broadcom unveiled Jalapeño, a custom AI inference chip designed for LLMs, promising higher efficiency, lower costs ...

VentureBeat

New LLM optimization technique slashes memory costs up to 75%

Researchers at the Tokyo-based startup Sakana AI have developed a new technique that enables language models to use memory more efficiently, helping enterprises cut the costs of building applications ...

OpenAI, Broadcom (AVGO) Unveil “Jalapeño” AI Accelerator for Enhanced LLM Inference

Broadcom Inc. (NASDAQ:AVGO) is one of the best stocks for beginners to buy now. On June 24, OpenAI and Broadcom introduced ...

11d

OpenAI and Broadcom Unveil LLM-Optimized Intelligence Processor

Built from the ground up for current and future LLMs across the industryDeveloped from design to production in nine months, accelerated by ...

Business Wire

MangoBoost Launches Mango LLMBoost™: AI Inference Optimization Software with Up to 12.6x Relative Performance Improvement and 92% Cost Savings

BELLEVUE, Wash.--(BUSINESS WIRE)--MangoBoost, a provider of cutting-edge system solutions designed to maximize AI data center efficiency, is announcing the launch of Mango LLMBoost™, system ...

Tweakers

Senior LLM Inference Engineer

Senior LLM Inference Engineer. Netherlands - Amsterdam. PDT - Data Science & AI / 1. Role: Permanent / Hybrid. apply for this job. Join our AI team at Prosus, the largest cons ...

11d

OpenAI and Broadcom unveil Jalapeño Intelligence Processor for LLM workloads

OpenAI and Broadcom have announced Jalapeño, OpenAI’s first Intelligence Processor. The AI accelerator is designed around ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results