[CI]cleanup e2e test #4800

MrZ20 · 2025-12-08T12:38:09Z

What this PR does / why we need it?

This PR refactors the E2E multicard test suite to improve test case identification and maintainability. Specifically, it renames various test functions to be more descriptive (explicitly indicating model families like Qwen/DeepSeek and parallelism strategies like DP/TP/PP/EP) and cleans up outdated or redundant test configurations in the offline distributed inference tests.

Key Changes:

Test Function Renaming (Standardization): Renamed multiple test functions across tests/e2e/multicard/ to include clear suffixes/prefixes regarding the model and parallel strategy. This helps differentiate test cases in CI logs and prevents naming collisions.

test_aclgraph_capture_replay.py:

test_aclgraph_capture_replay_dp2 -> test_aclgraph_capture_replay_metrics_dp2

test_data_parallel.py:

test_data_parallel_inference -> test_qwen_inference_dp2

test_data_parallel_tp2.py:

test_data_parallel_inference -> test_qwen_inference_dp2_tp2

test_expert_parallel.py:

test_e2e_ep_correctness -> test_deepseek_correctness_ep

test_external_launcher.py:

test_external_launcher -> test_qwen_external_launcher
test_moe_external_launcher -> test_qwen_moe_external_launcher_ep
test_external_launcher_and_sleepmode -> test_qwen_external_launcher_with_sleepmode
test_external_launcher_and_sleepmode_level2 -> test_qwen_external_launcher_with_sleepmode_level2
test_mm_allreduce -> test_qwen_external_launcher_with_matmul_allreduce

test_full_graph_mode.py:

test_models_distributed_Qwen3_MOE_TP2_WITH_FULL_DECODE_ONLY -> test_qwen_moe_with_full_decode_only
test_models_distributed_Qwen3_MOE_TP2_WITH_FULL -> test_qwen_moe_with_full

test_fused_moe_allgather_ep.py:

test_generate_with_allgather -> test_deepseek_moe_fused_allgather_ep
test_generate_with_alltoall -> test_deepseek_moe_fused_alltoall_ep

test_offline_weight_load.py:

test_offline_weight_load_and_sleepmode -> test_qwen_offline_weight_load_and_sleepmode

test_pipeline_parallel.py:

test_models -> test_models_pp2

Distributed Inference Cleanup (test_offline_inference_distributed.py):

Model List Changes:

QWEN_DENSE_MODELS = [
-     "vllm-ascend/Qwen3-8B-W8A8", "vllm-ascend/Qwen2.5-0.5B-Instruct-W8A8"
+     "vllm-ascend/Qwen3-0.6B-Instruct-W8A8",
]

- QWEN_W4A8_OLD_VERSION_MODELS = [
-    "vllm-ascend/Qwen3-8B-W4A8",
- ]

- QWEN_W4A8_NEW_VERSION_MODELS = [
-     "vllm-ascend/DeepSeek-V3-W4A8-Pruing",
-     "vllm-ascend/DeepSeek-V3.1-W4A8-puring",
- ]

+ DEEPSEEK_W4A8_MODELS = [
+      "vllm-ascend/DeepSeek-V3.1-W4A8-puring",
+ ]
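Read as a whole, the diff above appears to leave two model lists in the file. The sketch below shows that resulting state, assuming the `+`/`-` lines are complete. The helpers `short_id` and `all_model_ids` are hypothetical, added only to show how the lists might be consumed.

```python
# State of the model lists after this PR, as read from the diff above.
# Model identifiers are copied verbatim, including their spelling.
QWEN_DENSE_MODELS = [
    "vllm-ascend/Qwen3-0.6B-Instruct-W8A8",
]

DEEPSEEK_W4A8_MODELS = [
    "vllm-ascend/DeepSeek-V3.1-W4A8-puring",
]


def short_id(model: str) -> str:
    """Hypothetical helper: drop the org prefix to get a compact test ID."""
    return model.split("/", 1)[-1]


def all_model_ids() -> list[str]:
    """Hypothetical helper: compact IDs for every model still under test."""
    return [short_id(m) for m in QWEN_DENSE_MODELS + DEEPSEEK_W4A8_MODELS]


print(all_model_ids())
```

In a pytest suite, lists like these would typically be passed to `pytest.mark.parametrize`, so each entry becomes its own test case.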

Test Function Changes:

removed test_models_distributed_QwQ
removed test_models_distributed_Qwen3_W8A8
removed test_models_distributed_Qwen3_W4A8DYNAMIC_old_version
test_models_distributed_Qwen3_W4A8DYNAMIC_new_version -> test_models_distributed_Qwen3_W4A8DYNAMIC

Does this PR introduce any user-facing change?

How was this patch tested?

vLLM version: v0.12.0
vLLM main: vllm-project/vllm@ad32e3e

gemini-code-assist

Code Review

This pull request focuses on cleaning up the end-to-end tests. The changes primarily involve renaming test files and functions to be more descriptive and accurate, which improves the maintainability and clarity of the test suite. Additionally, several obsolete models and tests have been removed, and model lists for testing have been updated. The changes are well-aligned with the goal of cleaning up the test code. I have reviewed the pull request and found no critical or high-severity issues.

github-actions · 2025-12-08T13:02:55Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:

A PR should do only one thing; smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests to ensure it works and is not broken by other future PRs.
Write the commit message by filling in the PR description to help reviewers and future developers understand.

If CI fails, you can run linting and testing checks locally according to Contributing and Testing.

Signed-off-by: MrZ20 <[email protected]>

gemini-code-assist bot reviewed Dec 8, 2025


github-actions bot added the module:tests label Dec 8, 2025

MrZ20 force-pushed the vllm_modify128 branch from 253f481 to 9f52d54 Compare December 9, 2025 01:56

vllm-ascend-ci added the ready (read for review) and ready-for-test (start test by label for PR) labels Dec 9, 2025

MrZ20 added 5 commits December 10, 2025 17:38

clean CI

cd85b17

Signed-off-by: MrZ20 <[email protected]>

modify

b7da2aa

Signed-off-by: MrZ20 <[email protected]>

modify lint

a9aebb2

Signed-off-by: MrZ20 <[email protected]>

revert

d1e8ee8

Signed-off-by: MrZ20 <[email protected]>

modify

bb37125

Signed-off-by: MrZ20 <[email protected]>

MrZ20 force-pushed the vllm_modify128 branch from 23b2789 to bb37125 Compare December 10, 2025 09:39
