
Unlocking Generative Power: Multi-Token Prediction for Next-Gen LLMs
19 Jul 2025
We conclude our work on multi-token prediction as a superior method for training LLMs, delivering enhanced performance for generative/reasoning tasks

Defining the Frontier: Multi-Token Prediction's Place in LLM Evolution
19 Jul 2025
Explore the landscape of language modeling losses, multi-token prediction, and self-speculative decoding. highlights.

Unraveling Multi-Token Prediction: Bridging Training-Inference Gaps with Lookahead
18 Jul 2025
Dive into the core reasons behind multi-token prediction's superior LLM performance, exploring how it mitigates distributional discrepancy

Unveiling LLM Intelligence: Multi-Token Prediction Drives Qualitative Reasoning Shifts
18 Jul 2025
Explore how multi-token prediction fundamentally alters LLM capabilities, dramatically improving induction and algorithmic reasoning

Unrivaled LLM Efficacy: Multi-Token Prediction Revolutionizes Performance Across Domains
18 Jul 2025
Witness multi-token prediction's transformative power across seven large-scale experiments: unlocking exponential gains with model size, 3x faster inference

Optimizing LLM Learning: Multi-Token Cross-Entropy Loss Explained
18 Jul 2025
Explore the core of our approach: a generalized cross-entropy loss that enables LLMs to predict multiple future tokens simultaneously

Beyond Next-Token: Multi-Token Prediction Reshapes LLM Training Paradigms
18 Jul 2025
Challenge traditional LLM training! Our multi-token prediction introduces a simple, cost-free auxiliary loss yielding profound improvements

Cited Works: Collider Physics, Quantum Field Theory, and Particle Detectors
18 Jul 2025
A list of scholarly references at the intersection of collider physics, quantum field theory, electroweak interactions, and the development of particle detector

Unveiling Axion-Like Particles: Coupling to Electroweak Field Strengths
18 Jul 2025
Explore a simplified model of axion-like particles (ALPs) that couple to electroweak field strengths, leveraging the ALPsEFT FeynRules model file