Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models

by Siyan Zhao, Daniel Israel, Guy Van den Broeck and Aditya Grover
Reference:
Siyan Zhao, Daniel Israel, Guy Van den Broeck and Aditya Grover. Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models. In Proceedings of the 28th International Conference on Artificial Intelligence and Statistics (AISTATS), 2025.
Bibtex Entry:
@inproceedings{ZhaoAISTATS25,
  author    = {Zhao, Siyan and Israel, Daniel and Van den Broeck, Guy and Grover, Aditya},
  title     = {Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models}, 
  booktitle = {Proceedings of the 28th International Conference on Artificial Intelligence and Statistics (AISTATS)},
  month     = may,
  year      = {2025},
  url       = "https://arxiv.org/pdf/2404.09529.pdf",
  keywords  = {conference,selective}
}