The great AI knowledge transfer: Apple researchers quantify optimal conditions for teacher-student model distillation
Recent discourse in the AI community has centered on model distillation, fueled in part by speculation surrounding DeepSeek's R1 model. Distillation, in essence, is a technique in which the outputs of a large, high-performing "teacher" model are used to train a smaller, more efficient "student" model. This process, exemplified by the rumored training of R1 on outputs from larger frontier models, lets a compact model approach the capabilities of a far more expensive one at a fraction of the cost.
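To make the idea concrete, here is a minimal sketch of the classic soft-label distillation recipe (Hinton et al., 2015) in PyTorch: the student is trained to match the teacher's temperature-softened output distribution, blended with ordinary cross-entropy on the true labels. The `temperature` and `alpha` values are illustrative defaults, and this is a generic sketch of the technique, not the specific setup the Apple researchers study.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=2.0, alpha=0.5):
    """Blend a soft-target loss (match the teacher's softened output
    distribution) with standard hard-label cross-entropy.

    temperature and alpha are illustrative hyperparameters, not values
    from the paper discussed in this article."""
    # Soften both distributions; a higher temperature spreads probability
    # mass across classes, exposing the teacher's relative preferences.
    soft_targets = F.softmax(teacher_logits / temperature, dim=-1)
    log_student = F.log_softmax(student_logits / temperature, dim=-1)

    # KL divergence between the teacher's and student's soft distributions.
    # The T^2 factor keeps gradient magnitudes comparable across temperatures.
    soft_loss = F.kl_div(log_student, soft_targets,
                         reduction="batchmean") * temperature ** 2

    # Standard cross-entropy against the ground-truth labels.
    hard_loss = F.cross_entropy(student_logits, labels)

    return alpha * soft_loss + (1.0 - alpha) * hard_loss

# Toy usage: random "teacher" and "student" logits over 10 classes.
if __name__ == "__main__":
    torch.manual_seed(0)
    student_logits = torch.randn(8, 10, requires_grad=True)
    teacher_logits = torch.randn(8, 10)
    labels = torch.randint(0, 10, (8,))
    loss = distillation_loss(student_logits, teacher_logits, labels)
    loss.backward()
    print(f"distillation loss: {loss.item():.4f}")
```

In practice, the soft targets carry more information per example than one-hot labels, which is a large part of why a small student can learn efficiently from a strong teacher.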