Learning Tractable Distributions Of Language Model Continuations (bibtex)

by Gwen Yidou Weng, Ian Li, Anji Liu, Oliver Broadrick, Yuchen Cui, Guy Van den Broeck and Benjie Wang
Abstract:
Controlled generation imposes sequence-level constraints (syntax, style, safety) that depend on future tokens, making exact conditioning of an autoregressive LM intractable. Tractable surrogates such as HMMs can approximate continuation distributions and steer decoding, but standard surrogates are often weakly context-aware. We propose Learning to Look Ahead (LTLA), a hybrid method that uses base-LM embeddings to condition a globally learned tractable surrogate: a neural head predicts only a prefix-dependent latent prior, while a shared HMM answers continuation queries exactly. LTLA is designed to avoid two common efficiency traps when adding neural context. First, it avoids vocabulary-sized prefix rescoring (V extra LM evaluations) by scoring all next-token candidates via a single batched HMM forward update. Second, it avoids predicting a new HMM per prefix by learning one shared HMM and conditioning only the latent prior, which enables reuse of cached future-likelihood (backward) messages across decoding steps. Empirically, LTLA improves continuation likelihood over standard HMM surrogates, enables lookahead control for vision--language models by incorporating continuous context, achieves 100% syntactic constraint satisfaction, and improves detoxification while adding only a 14% decoding-time overhead.
Reference:
Gwen Yidou Weng, Ian Li, Anji Liu, Oliver Broadrick, Yuchen Cui, Guy Van den Broeck and Benjie Wang. Learning Tractable Distributions Of Language Model Continuations, In Arxiv, 2025.
Bibtex Entry:
@inproceedings{WengArxiv25,
  title     = {Learning Tractable Distributions Of Language Model Continuations},
  author    = {Weng, Gwen Yidou and Li, Ian and Liu, Anji and Broadrick, Oliver and Cui, Yuchen and Van den Broeck, Guy and Wang, Benjie},
  booktitle = {Arxiv},
  url       = "https://starai.cs.ucla.edu/papers/WengArxiv25.pdf",
  eprint    = {2511.16054},
  archivePrefix = {arXiv},
  primaryClass = {cs.CL},
  month     = 11,
  year      = {2025},
  keywords  = {techreport}
}
PDF Preview:
(PDF preview not available, download PDF instead)
Powered by bibtexbrowser