Dhruvesh Patel

dp.jpg

dhruveshpate@umass.edu

I am a Computer Science PhD Researcher at UMass Amherst, advised by Prof. Andrew McCallum at the Information Extraction and Synthesis Laboratory, and a Visiting Researcher at IBM Research. My research focuses on generative modeling for discrete sequences, especially alternatives to left-to-right language modeling. Before UMass, I completed my undergraduate and first master’s degree at IIT Madras, where I worked on robotics research with Prof. Sandipan Bandyopadhyay.

I have also been fortunate to work with collaborators across industry research labs, including Meta Reality Labs and Abridge AI. Before graduate school, I spent two years as a software engineer at MathWorks and a year collaborating with Prof. Partha Talukdar on applied NLP problems.

CV available at the bottom of this page.

research

Most language models generate text one token at a time, from left to right. I am interested in models that can draft, revise, infill, and reason over text in more flexible ways. My current work focuses on probabilistic models for non-autoregressive sequence generation, with an emphasis on making generation faster and more controllable.

I am especially interested in how to make these alternatives practical at scale: adapting pre-trained autoregressive LLMs, designing efficient non-autoregressive pre-training objectives, and improving sampling for discrete diffusion models.

Much of my earlier work studies the same question from a more fundamental angle: how should neural models represent, score, and search over structured discrete spaces? This includes structured prediction with energy-based models, geometric representations such as box embeddings, and models for label spaces, hierarchies, and relational structure.

Together with Benjamin Rozonoyer, I run dIESL, a reading and working group on non-autoregressive LLMs at IESL.

affiliations and internships

news

Jun 23, 2026 The new dIESL page is up — our reading and working group on non-autoregressive LLMs at IESL.
May 15, 2026 Learned Relay Representations for Forward-Thinking Discrete Diffusion Models was accepted at the ICML 2026 Workshop on Foundations of Deep Generative Models!
May 1, 2026 Insertion Based Sequence Generation with Learnable Order Dynamics was accepted at ICML 2026!
Mar 10, 2026 xLM: A Python Package for Non-Autoregressive Language Models was accepted at the EACL 2026 System Demonstrations track!
Jan 15, 2026 A Continuous-Time Markov Chain Framework for Insertion Language Models was accepted as a spotlight paper (top 6%) at AISTATS 2026!
Oct 1, 2025 I will be presenting Improved Sampling from Masked Diffusion Models with Position Contrastive Guidance at the Structured Probabilistic Inference and Generative Models workshop at NeurIPS 2025.
Jun 1, 2025 Work on Insertion Language Models (ILMs) is out on arXiv! It will be presented at the Structured Probabilistic Inference and Generative Models workshop at NeurIPS 2025.
Oct 1, 2024 Learning Representations for Hierarchies with Minimal Support was accepted at NeurIPS 2024!
Apr 1, 2024 Language Guided Exploration for RL Agents in Text Environments was accepted at NAACL (findings) 2024.
Aug 1, 2023 My work on Pre-trained language models for Visual Planning for Human Assistance, done as a research intern at Meta Reality Labs., has been accepted at ICCV 2023.
Sep 7, 2022 Super excited to start my internship at Meta Reality Labs!
Apr 25, 2022 Excited to present our work on multi-label classification using box embeddings at ICLR 2022!
Nov 1, 2020 Happy to announce that I will be starting my Ph.D. in Spring (January) 2021 at UMass Amherst with Prof. Andrew McCallum as my advisor.
Oct 1, 2020 Internship work done at Abridge AI is accepted at Clinical NLP workshop 2020.
Sep 30, 2020 Paper titled Reading Comprehension as Natural Language Inference: A Semantic Analysis is accepted at *SEM 2020.
May 10, 2020 Excited to start research internship at Abridge AI.
Jan 15, 2020 Paper titled “Representing Joint Hierarchies using Box Embeddings” is accepted at AKBC 2020.

mentors and collaborators

I have been fortunate to work with many amazing people over the years. Here are my mentors and collaborators.

Current

Joey Bose(2026)current
Gaurav Pandey(2025–2026)current
Ramon Astudillo(2025–2026)current
Tim Rudner(2024–2026)current
Tahira Naseem(2023–2026)current

Previous

Michael Boratko(2019–2025)
Keerthiram Murugesan(2022–2024)
Unnat Jain(2023)
Ruta Desai(2022–2023)
Kenneth Clarkson(2023)
Akash Srivastava(2023)
Jay-Yoon Lee(2020–2022)
Pavan Kapanipathi(2019)
Kartik Talamadupula(2019)
Partha Talukdar(2018)
Sandipan Bandyopadhyay(2016)

selected publications

2026

  1. ICMLWorkshop
    Learned Relay Representations for Forward-Thinking Discrete Diffusion Models
    Benjamin Rozonoyer, Jacopo Minniti, Dhruvesh Patel, Neil Band, Joey Bose, and 2 more authors
    In ICML 2026 Workshop on Foundations of Deep Generative Models: Understanding Memorization, Generalization, and Reasoning, 2026
  2. Insertion Based Sequence Generation with Learnable Order Dynamics
    Dhruvesh Patel, Benjamin Rozonoyer, Gaurav Pandey, Tahira Naseem, Ramón Fernandez Astudillo, and 1 more author
    In ICML, 2026
  3. A Continuous Time Markov Chain framework for Insertion Language Models
    Dhruvesh Patel, Benjamin Rozonoyer, Soumitra Das, Tahira Naseem, Tim G. J. Rudner, and 1 more author
    In AISTATS (Spotlight, Top 6%), 2026

2025

  1. NeurIPSWorkshop
    animated_bars2.gif
    Improved Sampling from Masked Diffusion Models with Position Contrastive Guidance
    Dhruvesh Patel, Tahira Naseem, Gaurav Pandey, Md Arafat Sultan, Andrew McCallum, and 1 more author
    In NeurIPS 2025 Workshop on Structured Probabilistic Inference & Generative Modeling, 2025
  2. NeurIPSWorkshop
    ilm_teaser.png
    Insertion Language Models: Sequence Generation with Arbitrary-Position Insertions
    Dhruvesh Patel, Aishwarya Sahoo, Avinash Amballa, Tahira Naseem, Tim G. J. Rudner, and 1 more author
    In NeurIPS 2025 Workshop on Structured Probabilistic Inference & Generative Modeling, 2025

2023

  1. Pretrained Language Models as Visual Planners for Human Assistance
    Dhruvesh PatelHamid EghbalzadehNitin Kamra, Michael Louis Iuzzolino, Unnat Jain, and 1 more author
    In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Oct 2023

2022

  1. Structured Energy Network As a Loss
    Jay-Yoon LeeDhruvesh Patel, Purujit Goyal, Wenlong Zhao, Zhiyang Xu, and 1 more author
    In Advances in Neural Information Processing Systems, Oct 2022
  2. Modeling Label Space Interactions in Multi-label Classification using Box Embeddings
    In The Tenth International Conference on Learning Representations , Oct 2022

services

reviewing

  • NeurIPS (2023, 2024, 2025): Reviewer; Top Reviewer (2023, 2024)
  • ICLR (2024, 2025): Reviewer
  • ICML (2024, 2025, 2026): Reviewer; Silver Reviewer (2026)
  • ARR (2023): Reviewer
  • AAAI (2025): Reviewer
  • TMLR (2026): Reviewer