diffusion/denoising LLMs @ IESL

Firing on all tokens.   A working group at the Information Extraction and Synthesis Lab, UMass Amherst.

dIESL is a working group at the Information Extraction and Synthesis Lab (IESL), UMass Amherst, studying non-autoregressive (NAR) language models — masked diffusion, insertion, and edit-based generation. The name is diffusion at IESL; the engine reference is on us.

Autoregressive models produce text strictly left to right. We are interested in models that can do better: generate tokens in flexible orders, fill in arbitrary-length gaps, revise prior decisions, and exploit pairwise relative-position structure rather than absolute positions. Concretely, we work on

  • Masked diffusion — sampling strategies, classifier-free guidance, and inference-time scaling.
  • Insertion language models — variable-length generation through token injection, derived from continuous-time Markov chains on sequences of variable length.
  • Edit / substitution models — generation as iterative refinement, with an API that resembles a stream of diff-style operations on the current draft.
  • Software for NAR research — see xlm-core, our modular framework for training and comparing NAR language models.

people

Person Affiliation
Andrew McCallum UMass Amherst
Dhruvesh Patel UMass Amherst
Benjamin Rozonoyer UMass Amherst
Avinash Amballa UMass Amherst
Neil Band Stanford
Joey Bose Imperial College London, Mila
Sai Sreenivas Chintha UMass Amherst
Soumitra Das UMass Amherst
Durga Prasad Maram UMass Amherst
Jacopo Minniti University of Toronto
Tahira Naseem IBM Research
Gaurav Pandey IBM Research
Tim G. J. Rudner University of Toronto, Vijil
Aishwarya Sahoo UMass Amherst
Md Arafat Sultan IBM Research
Ramón Fernandez Astudillo IBM Research

news

Mar 01, 2026 xLM is presented at EACL 2026 (System Demonstrations track) in Rabat, Morocco. Paper · Code
Feb 01, 2026 A Continuous Time Markov Chain Framework for Insertion Language Models accepted as a Spotlight at AISTATS 2026.
May 08, 2025 Released the Insertion Language Models preprint — sequence generation with arbitrary-position insertions.

selected publications

  1. Learned Relay Representations for Forward-Thinking Discrete Diffusion Models
    Benjamin Rozonoyer, Jacopo Minniti, Dhruvesh Patel, and 4 more authors
    In Structured Probabilistic Inference & Generative Modeling (SPIGM) and Frontiers in Generative AI (FoGen) Workshops at ICML, Jul 2026
    Accepted to SPIGM @ ICML and FoGen @ ICML
  2. EACL Demo
    xLM: A Python Package for Non-Autoregressive Language Models
    Dhruvesh Patel, Durga Prasad Maram, Sai Sreenivas Chintha, and 2 more authors
    In Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 3: System Demonstrations), Mar 2026
  3. AISTATS
    Spotlight
    A Continuous Time Markov Chain Framework for Insertion Language Models
    Dhruvesh Patel, Benjamin Rozonoyer, Soumitra Das, and 3 more authors
    In Proceedings of the 29th International Conference on Artificial Intelligence and Statistics (AISTATS), 2026
  4. Insertion Based Sequence Generation with Learnable Order Dynamics
    Dhruvesh Patel, Benjamin Rozonoyer, Gaurav Pandey, and 3 more authors
    In Proceedings of the 43rd International Conference on Machine Learning, Jul 2026
  5. Insertion Language Models: Sequence Generation with Arbitrary-Position Insertions
    Dhruvesh Patel, Aishwarya Sahoo, Avinash Amballa, and 3 more authors
    In Structured Probabilistic Inference & Generative Modeling Workshop at NeurIPS, 2025
  6. Improved Sampling from Masked Diffusion Models with Position Contrastive Guidance
    Dhruvesh Patel, Tahira Naseem, Gaurav Pandey, and 3 more authors
    In Structured Probabilistic Inference & Generative Modeling Workshop at NeurIPS, 2025