Daily digest — 2026-06-23

Communication Dynamics Neural Networks: FFT-Diagonalized Layers for Improved Hessian Conditioning at Reduced Parameter Count

The article introduces Communication Dynamics Neural Networks (CDNNs) and presents CDLinear, a block-circulant linear layer that cuts the parameter count to 1/B of a dense layer's while maintaining performance. On the 8x8 MNIST benchmark, CDLinear reaches 97.50% test accuracy with 2,380 parameters, about 27% of the dense baseline's 8,970. Its mean Hessian condition number is 1.9e4, roughly 300 times lower than the dense baseline's 5.9e6. For practitioners building efficient networks with reduced computational overhead, the work offers a layer design that improves both conditioning and optimization diagnostics.

arXiv cs.AI · 14 d ago · found 12 d ago · Models
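
The summary does not spell out how CDLinear is built, so the following is only a minimal sketch of the general block-circulant idea, not the paper's implementation. It assumes B is the block size, that each B x B block is circulant and stored as its first column, and that the product is computed with real FFTs. The function names `block_circulant_matvec` and `dense_equivalent` are hypothetical.

```python
import numpy as np

def block_circulant_matvec(w, x):
    """Multiply a block-circulant matrix by a vector using FFTs.

    w: array of shape (p, q, B); w[i, j] is the first column of the
       circulant block at block-row i, block-column j (assumed layout).
    x: array of shape (q * B,).
    Returns an array of shape (p * B,).
    """
    p, q, B = w.shape
    xb = x.reshape(q, B)
    # A circulant block acts as a circular convolution, which the FFT
    # turns into an elementwise product in the frequency domain.
    Wf = np.fft.rfft(w, axis=-1)    # shape (p, q, B // 2 + 1)
    Xf = np.fft.rfft(xb, axis=-1)   # shape (q, B // 2 + 1)
    Yf = np.einsum("pqf,qf->pf", Wf, Xf)  # sum over block-columns
    return np.fft.irfft(Yf, n=B, axis=-1).reshape(p * B)

def dense_equivalent(w):
    """Expand the same parameters into the full (p*B, q*B) matrix."""
    p, q, B = w.shape
    # idx[k, l] = (k - l) mod B, so block[k, l] = first_column[(k - l) mod B]
    idx = (np.arange(B)[:, None] - np.arange(B)[None, :]) % B
    blocks = w[:, :, idx]           # shape (p, q, B, B)
    return blocks.transpose(0, 2, 1, 3).reshape(p * B, q * B)

rng = np.random.default_rng(0)
w = rng.standard_normal((2, 3, 4))  # p=2, q=3, B=4
x = rng.standard_normal(12)
y_fft = block_circulant_matvec(w, x)
y_dense = dense_equivalent(w) @ x
```

Under these assumptions the weight array holds p * q * B values, while the equivalent dense matrix holds p * q * B * B, which is where a 1/B reduction in weights would come from. Biases and any non-circulant layers are outside this sketch.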

RAG over Thinking Traces Can Improve Reasoning Tasks

The paper applies retrieval-augmented generation (RAG) to reasoning tasks by retrieving thinking traces (intermediate thinking trajectories from problem-solving attempts) rather than traditional documents. The proposed T3 method converts these traces into structured representations and improves performance on benchmarks such as AIME 2025-2026, with a relative gain of +56.3% for Gemini-2.5-Flash and notable improvements for other models. The results indicate that thinking traces can serve as an effective retrieval corpus for reasoning, which makes the strategy relevant to practitioners working with LLMs.

arXiv cs.AI · 14 d ago · found 12 d ago · RAG

Learning Evidence Highlighting for Frozen LLMs

The paper introduces HiLight, an Evidence Emphasis framework that improves the performance of frozen Large Language Models (LLMs) by decoupling evidence selection from reasoning. HiLight trains a lightweight Emphasis Actor with reinforcement learning to insert highlight tags around critical spans in the input without altering the original text, which improves performance on tasks such as sequential recommendation and long-context question answering. The approach transfers zero-shot across different Solver architectures and does not require task-specific evidence labels, which suggests it could apply broadly to frozen LLMs.

arXiv cs.AI · 14 d ago · found 12 d ago · Inference
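
To make "insert highlight tags without altering the original text" concrete, here is a minimal sketch of the tagging step alone. The tag strings `<hl>` and `</hl>`, the character-offset span format, and both function names are assumptions for illustration; the summary does not describe HiLight's actual tag format, and the learned Emphasis Actor that would choose the spans is not modeled here.

```python
def insert_highlights(text, spans, open_tag="<hl>", close_tag="</hl>"):
    """Wrap non-overlapping (start, end) character spans in tags.

    The tag strings are hypothetical placeholders, not HiLight's format.
    """
    pieces = []
    cursor = 0
    for start, end in sorted(spans):
        if start < cursor or start >= end or end > len(text):
            raise ValueError(f"invalid or overlapping span: ({start}, {end})")
        pieces.append(text[cursor:start])                      # untouched text
        pieces.append(open_tag + text[start:end] + close_tag)  # emphasized span
        cursor = end
    pieces.append(text[cursor:])
    return "".join(pieces)

def strip_highlights(tagged, open_tag="<hl>", close_tag="</hl>"):
    """Remove the tags; the result should equal the original input."""
    return tagged.replace(open_tag, "").replace(close_tag, "")

text = "The user bought a tent, then a stove."
start = text.index("tent")
tagged = insert_highlights(text, [(start, start + len("tent"))])
restored = strip_highlights(tagged)
```

Because the tags only wrap existing characters, stripping them recovers the input exactly, provided the input does not already contain the tag strings.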

The day in AI, distilled.

