Skip to content

Quick alternative measurement of pod startup latency#1

Open
waltforme wants to merge 21 commits intodumb0002:vllm-testfrom
waltforme:vllm-test
Open

Quick alternative measurement of pod startup latency#1
waltforme wants to merge 21 commits intodumb0002:vllm-testfrom
waltforme:vllm-test

Conversation

@waltforme
Copy link
Copy Markdown

This PR is a quick alternative of the ClusterLoader-based method to measure vLLM server's pod startup latency. It can and should retire when the ClusterLoader-based method is readily working with OpenShift.

dumb0002 and others added 20 commits February 10, 2025 16:17
…OCP clusters)

Signed-off-by: Braulio Dumba <brauliodumba@gmail.com>

Signed-off-by: Braulio Dumba <brauliodumba@gmail.com>

Signed-off-by: Braulio Dumba <brauliodumba@gmail.com>
Signed-off-by: Braulio Dumba <brauliodumba@gmail.com>
Signed-off-by: Braulio Dumba <brauliodumba@gmail.com>
Signed-off-by: Braulio Dumba <brauliodumba@gmail.com>
Signed-off-by: Braulio Dumba <brauliodumba@gmail.com>
Signed-off-by: Braulio Dumba <brauliodumba@gmail.com>
Signed-off-by: Braulio Dumba <brauliodumba@gmail.com>
Signed-off-by: Braulio Dumba <brauliodumba@gmail.com>
Signed-off-by: Braulio Dumba <brauliodumba@gmail.com>
Signed-off-by: Braulio Dumba <brauliodumba@gmail.com>
Signed-off-by: Braulio Dumba <brauliodumba@gmail.com>
Signed-off-by: Braulio Dumba <brauliodumba@gmail.com>
Signed-off-by: Braulio Dumba <brauliodumba@gmail.com>
Signed-off-by: Braulio Dumba <brauliodumba@gmail.com>
Signed-off-by: Braulio Dumba <brauliodumba@gmail.com>
Signed-off-by: Braulio Dumba <brauliodumba@gmail.com>
Signed-off-by: Braulio Dumba <brauliodumba@gmail.com>
Signed-off-by: Braulio Dumba <brauliodumba@gmail.com>
Signed-off-by: Jun Duan <jun.duan.phd@outlook.com>
CLEANUP=${CLEANUP:-false}

# For MacOS, `brew install coreutils` can make gdate available, if necessary
START_TIME=$(gdate +%s%3N)
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

bash-3.2$ gdate
bash: gdate: command not found

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hi @MikeSpreitzer , this script is supposed to be used from a Mac (which establishes VPN connection to intranet), from where are you trying this?

@dumb0002
Copy link
Copy Markdown
Owner

dumb0002 commented Feb 14, 2025

I tested it but I am not sure that it run properly as I see some error messages:

error: the path "./perf-test/pod-startup/vllm-pvc-no-template.yaml" does not exist error: the path "./perf-test/pod-startup/vllm-deployment-no-template.yaml" does not exist pod/vllm-test-b7bc98c77-mqrs2 condition met Pod startup latency: 4.576 seconds

Maybe making it clear from which directory the script needs to be run from will fix this error?

Signed-off-by: Jun Duan <jun.duan.phd@outlook.com>
@waltforme
Copy link
Copy Markdown
Author

Sure, done in the latest commit.

I tested it but I am not sure that it run properly as I see some error messages:

error: the path "./perf-test/pod-startup/vllm-pvc-no-template.yaml" does not exist error: the path "./perf-test/pod-startup/vllm-deployment-no-template.yaml" does not exist pod/vllm-test-b7bc98c77-mqrs2 condition met Pod startup latency: 4.576 seconds

Maybe making it clear from which directory the script needs to be run from will fix this error?

@dumb0002 dumb0002 force-pushed the vllm-test branch 2 times, most recently from 1bd3d70 to 2b405bb Compare March 18, 2025 16:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants