Streaming-dLLM: Accelerating Diffusion LLMs via Suffix Pruning and Dynamic Decoding
arXiv:2601.17917v2 Announce Type: replace-cross Abstract: Diffusion Large Language Models (dLLMs) offer a compelling paradigm for natural language generation, leveraging parallel…
