Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models

by Siyan Zhao, Daniel Israel, Guy Van den Broeck and Aditya Grover
Siyan Zhao, Daniel Israel, Guy Van den Broeck and Aditya Grover. Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models. In Proceedings of the 28th International Conference on Artificial Intelligence and Statistics (AISTATS), 2025.
Bibtex Entry:
  author    = {Zhao, Siyan and Israel, Daniel and Van den Broeck, Guy and Grover, Aditya},
  title     = {Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models}, 
  booktitle = {Proceedings of the 28th International Conference on Artificial Intelligence and Statistics (AISTATS)},
  month     = {may},
  year      = {2025},
  url       = "",
  keywords  = {conference,selective}