SLED: A Speculative LLM Decoding Framework for Efficient Edge Serving
Published in arXiv preprint, 2025
Recommended citation: Li, X., Spatharakis, D., Ghafouri, S., Fan, J., Vandierendonck, H., John, D., Ji, B., & Nikolopoulos, D. (2025). "SLED: A Speculative LLM Decoding Framework for Efficient Edge Serving." arXiv preprint arXiv:2506.09397.
Download Paper