The NVIDIA GPU Operator is based on the Operator Framework and automates the management of all NVIDIA software components needed to provision GPU worker nodes in a Kubernetes cluster: the driver, container runtime, device plugin, and monitoring.
The GPU Operator should run on nodes that are equipped with GPUs. To determine which nodes have GPUs, the operator relies on Node Feature Discovery (NFD) within Kubernetes.
https://developer.nvidia.com/blog/nvidia-gpu-operator-simplifying-gpu-management-in-kubernetes/
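In practice the operator is typically deployed with Helm. A minimal install sketch, assuming Helm 3 is available and using the nvidia chart repository (chart values such as driver version can be overridden per cluster):

$ helm repo add nvidia https://helm.ngc.nvidia.com/nvidia && helm repo update
$ helm install --wait --generate-name -n gpu-operator --create-namespace nvidia/gpu-operator

Once installed, the operator uses the NFD labels to find GPU nodes and rolls out the components listed above as pods on those nodes.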
NVIDIA Container Runtime is a GPU-aware container runtime, compatible with the Open Containers Initiative (OCI) specification used by container engines such as Docker and CRI-O.
https://developer.nvidia.com/nvidia-container-runtime
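For Docker specifically, the nvidia runtime is registered in /etc/docker/daemon.json. A minimal sketch, assuming the NVIDIA Container Toolkit packages are already installed and nvidia-container-runtime is on the default path:

{
    "runtimes": {
        "nvidia": {
            "path": "nvidia-container-runtime",
            "runtimeArgs": []
        }
    }
}

$ sudo systemctl restart docker

Setting "default-runtime": "nvidia" in the same file makes the nvidia runtime the default for all containers on that host.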
To start a GPU-enabled CUDA container, specify the nvidia runtime:
$ docker run --rm --runtime=nvidia --gpus=all nvcr.io/nvidia/cuda:latest nvidia-smi
GPUs can be specified to the Docker CLI using either the --gpus option (starting with Docker 19.03) or the environment variable NVIDIA_VISIBLE_DEVICES. This variable controls which GPUs will be made accessible inside the container.
https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/user-guide.html
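For example, to expose only the first two GPUs via the environment variable (a sketch assuming at least two GPUs on the host and the same CUDA image tag as above; the indices 0,1 are illustrative):

$ docker run --rm --runtime=nvidia -e NVIDIA_VISIBLE_DEVICES=0,1 nvcr.io/nvidia/cuda:latest nvidia-smi

The equivalent selection with the --gpus option is --gpus '"device=0,1"'.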