ParvaGPU: Efficient Spatial GPU Sharing for Large-Scale DNN Inference in Cloud Environments
Published in International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2024
Recommended citation: Lee, M., Seong, S., Kang, M., Lee, J., Na, G.-J., Chun, I.-G., Nikolopoulos, D., & Hong, C.-H. (2024). "ParvaGPU: Efficient Spatial GPU Sharing for Large-Scale DNN Inference in Cloud Environments." In SC24: International Conference for High Performance Computing, Networking, Storage and Analysis, 1-14. https://doi.org/10.1109/SC41406.2024.00048