Optimizing GPU virtualization with address mapping and delayed submission | |
Wang, Xiaolin ; Wang, Hanbing ; Sang, Yan ; Wang, Zhenlin ; Luo, Yingwei | |
2014 | |
英文摘要 | The state-of-the-art GPU virtualization framework, gVirtuS, relies on an API remoting mechanism to set up a communication channel between a virtual machine and the host, so that a CUDA application in a virtual machine can be executed 'remotely' in the host. We observe that this API remoting mechanism often involves large-volume and frequent data transmissions between the host OS and the guest OS, which lead to a significant performance degradation. We present an address mapping scheme so the host can directly access the machine memory space of the guest and thus avoid data copying between the guest and the host. To reduce the frequency of data transmissions, we introduce a delayed submission scheme. We implement both address mapping and delayed submission in KVM. Our evaluation on a set of CUDA benchmarks shows that address mapping can improve over the original gVirtuS by up to 6.5 times. Delayed submission is able to further reduce the virtualization overhead by half in a pathological case. ? 2014 IEEE.; EI; 0 |
语种 | 英语 |
DOI标识 | 10.1109/HPCC.2014.70 |
内容类型 | 其他 |
源URL | [http://ir.pku.edu.cn/handle/20.500.11897/263031] |
专题 | 信息科学技术学院 |
推荐引用方式 GB/T 7714 | Wang, Xiaolin,Wang, Hanbing,Sang, Yan,et al. Optimizing GPU virtualization with address mapping and delayed submission. 2014-01-01. |
个性服务 |
查看访问统计 |
相关权益政策 |
暂无数据 |
收藏/分享 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论