Between Diffusion Language Models and Autoregressive Models
Exploring the latest research on diffusion language models, data efficiency, and hybrid approaches with autoregressive models.
Comparing autoregressive models and diffusion language models in LLM inference from a B/F (bytes per FLOP) perspective.
And do you know about Muon?
Exploring the latest research trends in 1-bit LLMs and the accuracy reversal phenomenon under extreme quantization.