记录 Dynamo 学习实践 P/D 分离。
1 | git clone https://github.com/ai-dynamo/dynamo.git |
根据这个issue可知,需要 Ubuntu24.04。
Ubuntu 22.04 currently is not supported as it has glibc 2.35 which is below minimum required glibc 2.39
否则会报错:
1 | pip install ai-dynamo[all] |
CUDA 基础镜像为:
1 | ARG BASE_IMAGE="nvcr.io/nvidia/cuda-dl-base" |
打包镜像: 1
2
3
4
5
6
7
8https://catalog.ngc.nvidia.com/orgs/nvidia/containers/cuda-dl-base
$ docker pull nvcr.io/nvidia/cuda-dl-base:25.01-cuda12.8-devel-ubuntu24.04
$ docker images
REPOSITORY TAG IMAGE ID CREATED SIZE
nvcr.io/nvidia/cuda-dl-base 25.01-cuda12.8-devel-ubuntu24.04 583d3d06ed9f 2 months ago 11.9GB
$ docker save -o 25.01-cuda12.8-devel-ubuntu24.04.tar nvcr.io/nvidia/cuda-dl-base
/data/team/zhaodongyu/25.01-cuda12.8-devel-ubuntu24.04.tar
安装python3.10的必要性: 1
2ERROR: Ignored the following versions that require a different python version: 1.21.2 Requires-Python >=3.7,<3.11; 1.21.3 Requires-Python >=3.7,<3.11; 1.21.4 Requires-Python >=3.7,<3.11; 1.21.5 Requires-Python >=3.7,<3.11; 1.21.6 Requires-Python >=3.7,<3.11
ERROR: Could not find a version that satisfies the requirement pydantic>=2.9 (from ai-dynamo-vllm) (from versions: none)
1 | wget https://www.python.org/ftp/python/3.10.13/Python-3.10.13.tgz |
python 路径: /root/python310/bin/python3
由于无法访问nvidia官网,在 https://pypi.nvidia.com/ 手动下载
ai_dynamo_vllm-0.7.2-cp38-abi3-manylinux1_x86_64.whl
1 | apt-get update |
Dynamo CLI has the following 4 sub-commands.
- 🏃 dynamo run: quickly spin up a server to experiment with a specified model, input and output target.
- 🫴 dynamo serve: compose a graph of workers locally and serve.
- 🔨 (Experimental) dynamo build: containerize either the entire graph or parts of graph to multiple containers
- 🚀 (Experimental) dynamo deploy: deploy to K8 with helm charts or custom operators
- ☁️ (Experimental) dynamo cloud: interact with your dynamo cloud server