Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models (bibtex)
by Siyan Zhao, Daniel Israel, Guy Van den Broeck and Aditya Grover
View — Paper PDF
Reference:
Siyan Zhao, Daniel Israel, Guy Van den Broeck and Aditya Grover. Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models, In Proceedings of the 28th International Conference on Artificial Intelligence and Statistics (AISTATS), 2025.
Bibtex Entry:
@inproceedings{ZhaoAISTATS25, author = {Zhao, Siyan and Israel, Daniel and Van den Broeck, Guy and Grover, Aditya}, title = {Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models}, booktitle = {Proceedings of the 28th International Conference on Artificial Intelligence and Statistics (AISTATS)}, month = {may}, year = {2025}, url = "https://arxiv.org/pdf/2404.09529.pdf", keywords = {conference,selective} }
PDF Preview:
Powered by bibtexbrowser