在拥有GPU的服务器上运行docker,如果期望docker能够使用 --gpus all 的参数,需要额外安装nvidia-container-toolkit,安装方法如下:
wget <https://nvidia.github.io/nvidia-docker/gpgkey> --no-check-certificate
sudo apt-key add gpgkey
distribution=$(. /etc/os-release;echo $ID$VERSION_ID)
curl -s -L <https://nvidia.github.io/nvidia-docker/$distribution/nvidia-docker.list> | sudo tee /etc/apt/sources.list.d/nvidia-docker.list
sudo apt-get update
sudo apt-get install -y nvidia-container-toolkit
接下来验证安装并进行配置
nvidia-ctk --version
sudo nvidia-ctk runtime configure --runtime=docker
根据提示的要求,重启docker服务
sudo systemctl restart docker