October 10, 2024
Our work on self-speculative decoding has been accepted to the NeurIPS 2024 ENLSP workshop.