Daily digest — 2026-06-27

Prefilling-dLLM: Predictive Prefilling for Long-Context Inference in Diffusion Language Models

The paper introduces Prefilling-dLLM, a framework designed to optimize long-context inference in diffusion language models (dLLMs) by partitioning the input prefix into N chunks and caching their key-value (KV) representations. This method reduces computational complexity from quadratic in the full sequence length to quadratic only in the decode length, achieving state-of-the-art performance on benchmarks like LongBench and InfiniteBench, with speedups of 9.1–28.0x for 8K–32K contexts. The findings are significant for practitioners as they enable efficient handling of long contexts in dLLMs, improving both speed and resource utilization.

arXiv cs.CL — 18 d ago · found 16 d agoTraining

Small Data, Big Noise: Adversarial Training for Robust Parameter-Efficient Fine-Tuning

The paper presents SDBN (Small Data Big Noise), a novel framework that integrates adversarial training with Parameter-Efficient Fine-Tuning (PEFT) to enhance robustness and generalization in NLP tasks, particularly when training data is limited. It introduces two variants: SDBN-h, which utilizes character-level edits for robust optimization, and SDBN-p, which employs LLM-generated variants, demonstrating significant performance improvements across benchmarks in low-resource scenarios. This work is crucial for practitioners as it addresses the challenges of noise and data scarcity in PEFT, enabling more reliable model adaptations without increasing parameter counts or computational demands.

arXiv cs.CL — 18 d ago · found 16 d agoTraining

Which LoRA? An Empirical Study on the Effectiveness of LoRA Techniques During Multilingual Instruction Tuning

The study published in arXiv investigates the effectiveness of various LoRA variants in multilingual instruction tuning, revealing no significant advantages of more complex LoRA techniques over basic LoRA. Experiments conducted on two datasets across diverse languages indicate that layer-wise language representations remain consistent across models fine-tuned with different LoRA methods. This finding suggests that practitioners may not need to adopt more complex LoRA variants for improving cross-lingual transfer and knowledge retention in multilingual tasks.

arXiv cs.CL — 18 d ago · found 16 d agoTraining

The day in AI, distilled.

Prefilling-dLLM: Predictive Prefilling for Long-Context Inference in Diffusion Language Models

Small Data, Big Noise: Adversarial Training for Robust Parameter-Efficient Fine-Tuning

Which LoRA? An Empirical Study on the Effectiveness of LoRA Techniques During Multilingual Instruction Tuning

Models & Releases

Training Techniques

Safety & Security

Tooling & Open Source