EAGLE3 automation triage chart for new models #1417
Conversation
Add hf_offline_eagle3.yaml configs in tools/launcher/examples/ for:

- DeepSeek/DeepSeek-V3.2 (685B MoE, 2 nodes, TP=8)
- GLM/GLM-5 (744B MoE, 2 nodes bench, TP=4/EP=2)
- MiniMax/MiniMax-M2.5 (230B MoE, TP=4/EP=4)
- Mistral/Ministral-3-8B (8B dense, TP=4)
- Mistral/Ministral-3-14B (14B dense, TP=4)
- MoonshotAI/Kimi-K2.5 (1T MoE, TP=4/EP=1)
- NVIDIA/Kimi-K2.5-NVFP4 (NVFP4 quant; tasks 1-2 use BF16 base)
- OpenAI/GPT-OSS-20B (20B dense, TP=4)
- Qwen/Qwen3.5-27B (27B dense VLM, TP=4)
- Qwen/Qwen3.5-35B-A3B (35B MoE, TP=4/EP=4)
- Qwen/Qwen3.5-9B (9B dense VLM, TP=4)
- StepFun/Step-3.5-Flash (197B MoE with SWA, TP=4/EP=4)

Each config follows the standard 4-step EAGLE3 offline pipeline: query → dump hidden states → train draft head → benchmark. Uses public slurm_factory and common/ script paths.

Co-Authored-By: Claude Opus 4.6 <[email protected]>
Signed-off-by: Ye Yu <[email protected]>
Adds tools/launcher/examples/EAGLE3_TRIAGE.md, a living document for tracking EAGLE3 pipeline failure modes across new models. Content:

- Mermaid decision-tree diagram mapping each pipeline step (query → dump → train → benchmark) to known failure modes and root causes
- Model test result table with pass/fail/timeout status for 7 models tested on the OCI-HSG cluster (3 remaining to be tested)
- 6 documented issues with symptoms, affected models, root causes, and fix recommendations:
  1. dump_offline_data_vllm.sh missing (universal, all models)
  2. offline_training.sh HF Hub upload bug (universal, all models)
  3. Task 0 time limit exceeded (5 models)
  4. GPT-OSS-20B tiktoken cache missing (model-specific)
  5. trust_remote_code not passed to benchmark (MiniMax-M2.5)
  6. DeepSeek-V3.2 task_1 OOM (model-specific)
- Update instructions for adding new models or failure modes

Co-Authored-By: Claude Opus 4.6 <[email protected]>
Signed-off-by: Ye Yu <[email protected]>
From the nmm-okr30 sandbox MR (b838b171):

- Add common/eagle3/dump_offline_data_hf.sh: HF-based (device_map=auto) hidden state extraction for models not supported by TRT-LLM. Handles VLMs, custom-code models, and architectures absent from TRT-LLM.
- Update task_1 for 8 models to use dump_offline_data_hf.sh: MiniMax-M2.5, Ministral-3-8B, Ministral-3-14B, GPT-OSS-20B, Qwen3.5-9B, Qwen3.5-27B, Qwen3.5-35B-A3B, Step-3.5-Flash. Models that retain the TRT-LLM dump: DeepSeek-V3.2, GLM-5, Kimi-K2.5, Kimi-K2.5-NVFP4 (all pure-text MoE with TRT-LLM support).
- Update EAGLE3_TRIAGE.md with actual test results from 7 models run on the OCI-HSG cluster on 2026-04-15, marking Issue 2 (HF upload bug) as FIXED and correcting Issue 1 status (hf script created as the fix).

Co-Authored-By: Claude Opus 4.6 <[email protected]>
Signed-off-by: Ye Yu <[email protected]>
📝 Walkthrough

This pull request adds HuggingFace-based offline EAGLE3 speculative decoding support through a new launcher script, diagnostic documentation, and pipeline configurations for 10 model variants. The shell wrapper enables distributed hidden state extraction; the triage guide documents common failure modes; and the YAML configs define reproducible 4-task workflows for each supported model.

Changes: EAGLE3 HuggingFace Offline Pipeline
Estimated code review effort: 🎯 3 (Moderate) | ⏱️ ~20 minutes

Important: Pre-merge checks failed. Please resolve all errors before merging. Addressing warnings is optional.

❌ Failed checks (1 error)
✅ Passed checks (5 passed)
Actionable comments posted: 20
🤖 Prompt for all review comments with AI agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.
Inline comments:
In `@tools/launcher/common/eagle3/dump_offline_data_hf.sh`:
- Around line 50-52: Add an explicit guard that checks the HF_MODEL_CKPT
environment variable is set and non-empty before running the Python command: if
HF_MODEL_CKPT is missing or empty, print a clear error mentioning HF_MODEL_CKPT
and exit with a non-zero status so the subsequent call to
compute_hidden_states_hf.py (the python3 invocation that uses --model
${HF_MODEL_CKPT} and --dp-rank ${TASK_ID}) never runs; keep the check
immediately above the python3 invocation.
- Line 34: The pip installation currently suppresses errors using "pip install
datasets 2>/dev/null || true" which hides failures while the script imports
datasets (at the top) and later calls load_dataset("json", ...); remove the
error suppression so installation failures are visible by replacing that
invocation with a plain failing install (e.g., "pip install datasets") or ensure
the environment pre-installs the dependency; update the launcher script around
the pip install line and any documentation/bootstrap steps accordingly so
ImportError/debug info surfaces immediately.
- Around line 36-48: The script uses unsafe parameter expansion and masks
errors; update checks to use quoted defaults like [ -z
"${SLURM_ARRAY_TASK_ID:-}" ] and [ -z "${SLURM_ARRAY_TASK_COUNT:-}" ] and assign
TASK_ID and TASK_COUNT from the quoted values (e.g.,
TASK_ID="${SLURM_ARRAY_TASK_ID}" ) to avoid "unary operator expected" errors;
forward arguments using "$@" instead of ${@} to preserve multi-word args; stop
silencing pip failures by removing `2>/dev/null || true` from the `pip install
datasets` invocation so installs surface errors; and add a validation check for
HF_MODEL_CKPT (e.g., [ -n "${HF_MODEL_CKPT:-}" ] with an explanatory error and
exit) before it is used.
In `@tools/launcher/examples/DeepSeek/DeepSeek-V3.2/hf_offline_eagle3.yaml`:
- Around line 33-34: The YAML global_vars block that defines hf_model (hf_model:
/hf-local/deepseek-ai/DeepSeek-V3.2) is missing the required environment
variables; set MLM_MODEL_CFG to the HuggingFace repo ID for this model (e.g.,
deepseek-ai/DeepSeek-V3.2 or the canonical repo string) and set QUANT_CFG to the
chosen quantization profile (e.g., NVFP4_DEFAULT_CFG or INT8_DEFAULT_CFG)
alongside hf_model in the same global_vars section so the launcher picks up
MLM_MODEL_CFG and QUANT_CFG for this config (apply same change for the other
blocks noted around lines 46-47, 64-66, 104-106).
In `@tools/launcher/examples/EAGLE3_TRIAGE.md`:
- Around line 11-37: Several fenced code blocks in EAGLE3_TRIAGE.md are missing
language tags and required surrounding blank lines; update the fences around the
"Model checkpoint (HuggingFace)" block, the "ValueError: The repository ..."
block, and the "Per-model results template" block to use explicit language
markers (e.g., ```text or ```markdown) and ensure there is a blank line before
and after each fence so markdownlint stops warning; specifically edit the fences
surrounding the literal text "Model checkpoint (HuggingFace)", the error snippet
starting "ValueError: The repository ... contains custom code...", and the
"Per-model results template" section to add the appropriate ```text/```markdown
tags and blank lines.
In `@tools/launcher/examples/GLM/GLM-5/hf_offline_eagle3.yaml`:
- Around line 33-35: The YAML config for the new GLM-5 model sets hf_model but
omits required environment wiring for MLM_MODEL_CFG and QUANT_CFG; update the
global_vars block to include MLM_MODEL_CFG with the HuggingFace repo id
(matching hf_model value) and a default QUANT_CFG (e.g., NVFP4_DEFAULT_CFG or
INT8_DEFAULT_CFG), and ensure each task's environment entries (the task-level
environment maps referenced in this file) also set MLM_MODEL_CFG and QUANT_CFG
so downstream launcher logic receives consistent model metadata.
- Around line 94-103: The task's args block is missing the required
--trust-remote-code flag which prevents models that use custom model/tokenizer
classes from loading; update the args list (the args block shown with
--speculative_algorithm EAGLE3 and --mtbench ...) to include --trust-remote-code
true (or just --trust-remote-code if CLI accepts flag style) so task_3 forwards
trust to remote code when launching the model.
In `@tools/launcher/examples/MiniMax/MiniMax-M2.5/hf_offline_eagle3.yaml`:
- Around line 31-32: The model config under global_vars only sets hf_model; add
the required environment variables MLM_MODEL_CFG and QUANT_CFG alongside
hf_model: set MLM_MODEL_CFG to the HuggingFace repo ID for this model (e.g.,
"MiniMaxAI/MiniMax-M2.5") and set QUANT_CFG to the chosen quantization profile
(e.g., "NVFP4_DEFAULT_CFG" or "INT8_DEFAULT_CFG"); update each affected
global_vars block (the one containing hf_model and the other similar blocks
referenced) so every launcher job has both MLM_MODEL_CFG and QUANT_CFG defined.
- Around line 89-105: task_3 in the YAML (the specdec benchmark using script
common/specdec_bench/quick_check.sh) is missing the required --trust-remote-code
flag for the MiniMax custom architecture; update the task_3 args list to include
- --trust-remote-code alongside the other flags (e.g., with
--speculative_algorithm EAGLE3) so the benchmark runs with trust_remote_code
enabled.
In `@tools/launcher/examples/Mistral/Ministral-3-14B/hf_offline_eagle3.yaml`:
- Around line 28-30: The new model block using global_vars hf_model:
/hf-local/mistralai/Ministral-3-14B-Instruct-2512-BF16 is missing the required
environment metadata; add MLM_MODEL_CFG and QUANT_CFG entries alongside hf_model
so launcher jobs get consistent model-id and quant-config metadata — set
MLM_MODEL_CFG to the HuggingFace repo ID for Ministral-3-14B (the canonical repo
string) and set QUANT_CFG to the appropriate quantization constant (e.g.,
NVFP4_DEFAULT_CFG or INT8_DEFAULT_CFG) in the same global_vars section; apply
the same addition for the other mentioned blocks (lines ~41-43, 59-60, 99-101)
where hf_model entries exist.
In `@tools/launcher/examples/Mistral/Ministral-3-8B/hf_offline_eagle3.yaml`:
- Around line 27-29: The new model YAML sets global_vars.hf_model but is missing
the required environment vars MLM_MODEL_CFG and QUANT_CFG; update the
global_vars block in hf_offline_eagle3.yaml (and the other instances referenced
around lines 40-42, 58-60, 98-100) to add MLM_MODEL_CFG with the HuggingFace
repo ID for the model and QUANT_CFG with the appropriate quantization profile
(e.g., NVFP4_DEFAULT_CFG or INT8_DEFAULT_CFG) so both variables are present
alongside hf_model for the new model config.
In `@tools/launcher/examples/MoonshotAI/Kimi-K2.5/hf_offline_eagle3.yaml`:
- Around line 34-35: Add the missing required environment metadata by setting
MLM_MODEL_CFG to the HuggingFace repo ID and QUANT_CFG to the chosen
quantization profile for each model block; specifically update the global_vars
block containing hf_model (/hf-local/moonshotai/Kimi-K2.5) to include
MLM_MODEL_CFG: moonshotai/Kimi-K2.5 and QUANT_CFG: <choose e.g.,
NVFP4_DEFAULT_CFG or INT8_DEFAULT_CFG>, and make the same additions to the other
model blocks referenced (around the lines noted: 47-48, 65-66, 105-107) so every
model config includes both MLM_MODEL_CFG and QUANT_CFG keys.
In `@tools/launcher/examples/NVIDIA/Kimi-K2.5-NVFP4/hf_offline_eagle3.yaml`:
- Around line 40-43: global_vars block for this model only defines hf_model and
hf_model_bf16 but is missing the required environment variables; add
MLM_MODEL_CFG and QUANT_CFG entries (e.g., MLM_MODEL_CFG:
"nvidia/Kimi-K2.5-NVFP4" and QUANT_CFG: "NVFP4_DEFAULT_CFG" or the appropriate
quant config) to the model's task environment or shared vars so the launcher
conventions are satisfied; update the same pattern for the other model blocks
referenced (the additional hf_model/hf_model_bf16 groups) by wiring
MLM_MODEL_CFG and QUANT_CFG via task environments or shared/global vars with
interpolation.
- Around line 102-110: The benchmark args block for this Kimi model family must
include the trust flag to allow remote tokenizer code; update the args list (the
same args block used by task_3/benchmark step) to add "--trust-remote-code" (or
"--trust-remote-code true") so the benchmark initialization for the Kimi
tokenizer will not fail; locate the args sequence in the YAML (the block
containing --draft_model_dir, --draft_length, etc.) and append the
--trust-remote-code flag.
In `@tools/launcher/examples/OpenAI/GPT-OSS-20B/hf_offline_eagle3.yaml`:
- Around line 28-30: The YAML is missing required environment variables: set
MLM_MODEL_CFG to the HuggingFace repo ID and QUANT_CFG to the chosen
quantization config (e.g., NVFP4_DEFAULT_CFG or INT8_DEFAULT_CFG) alongside the
existing hf_model; add these keys under global_vars (or in the
shared/interpolated env for all tasks) and ensure each task environment block
that references hf_model also includes MLM_MODEL_CFG and QUANT_CFG so the
launcher can pick them up consistently.
- Around line 91-104: task_3's runtime invocation is missing the required
--trust-remote-code flag and the TIKTOKEN_RS_CACHE_DIR environment export;
update the args array (the block containing --speculative_algorithm EAGLE3,
--engine VLLM, etc.) to include --trust-remote-code, and add a
TIKTOKEN_RS_CACHE_DIR entry to the environment list (alongside HF_LOCAL and
HF_MODEL_CKPT) pointing to the intended cache dir (e.g., /tiktoken_rs_cache or
the project's cache path) so the model runs with the same runtime settings as
earlier tasks.
In `@tools/launcher/examples/Qwen/Qwen3.5-27B/hf_offline_eagle3.yaml`:
- Around line 24-26: The pipeline config's global_vars block defines hf_model
but is missing the required environment variables MLM_MODEL_CFG and QUANT_CFG;
update the global_vars section (where hf_model is set) to add MLM_MODEL_CFG with
the HuggingFace repo ID for this model and QUANT_CFG with the appropriate
quantization profile (e.g., NVFP4_DEFAULT_CFG or INT8_DEFAULT_CFG) so the entry
for Qwen3.5-27B conforms to launcher metadata conventions; apply the same
addition to the other occurrences noted (around lines 36-38, 54-56, 94-97) to
keep all model configs consistent.
In `@tools/launcher/examples/Qwen/Qwen3.5-35B-A3B/hf_offline_eagle3.yaml`:
- Around line 29-31: The YAML global_vars block defines hf_model but is missing
the required environment variables MLM_MODEL_CFG and QUANT_CFG; add
MLM_MODEL_CFG with the HuggingFace repo ID for this model and set QUANT_CFG to
the appropriate quant config (e.g., NVFP4_DEFAULT_CFG or INT8_DEFAULT_CFG) in
the same global_vars section and ensure any places referencing this model (the
blocks around the existing hf_model entries) propagate these two env vars so the
launcher receives both model metadata and quant config.
In `@tools/launcher/examples/Qwen/Qwen3.5-9B/hf_offline_eagle3.yaml`:
- Around line 24-25: Add the mandatory environment entries MLM_MODEL_CFG and
QUANT_CFG next to the existing global_vars hf_model definition (hf_model:
/hf-local/Qwen/Qwen3.5-9B): set MLM_MODEL_CFG to the HuggingFace repo ID for
this model and set QUANT_CFG to the chosen quantization profile (e.g.,
NVFP4_DEFAULT_CFG or INT8_DEFAULT_CFG). Repeat the same addition for the other
hf_model blocks present later in the file (the other occurrences of global_vars
with hf_model) so all model configs include both required env vars.
In `@tools/launcher/examples/StepFun/Step-3.5-Flash/hf_offline_eagle3.yaml`:
- Around line 34-35: The new model config only sets hf_model but omits the
required environment metadata; add MLM_MODEL_CFG with the HuggingFace repo ID
(e.g., the same value as hf_model or the canonical HF repo string) and set
QUANT_CFG to the appropriate quantization profile (e.g., NVFP4_DEFAULT_CFG or
INT8_DEFAULT_CFG) inside the same global_vars block so launcher code sees
standardized envs; apply the same pattern for the other model blocks in this
file that define hf_model (the other global_vars entries referenced in the
comment).
ℹ️ Review info
⚙️ Run configuration
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Enterprise
Run ID: 76d008e0-290f-420a-9842-78a95a3f5988
📒 Files selected for processing (14)
- tools/launcher/common/eagle3/dump_offline_data_hf.sh
- tools/launcher/examples/DeepSeek/DeepSeek-V3.2/hf_offline_eagle3.yaml
- tools/launcher/examples/EAGLE3_TRIAGE.md
- tools/launcher/examples/GLM/GLM-5/hf_offline_eagle3.yaml
- tools/launcher/examples/MiniMax/MiniMax-M2.5/hf_offline_eagle3.yaml
- tools/launcher/examples/Mistral/Ministral-3-14B/hf_offline_eagle3.yaml
- tools/launcher/examples/Mistral/Ministral-3-8B/hf_offline_eagle3.yaml
- tools/launcher/examples/MoonshotAI/Kimi-K2.5/hf_offline_eagle3.yaml
- tools/launcher/examples/NVIDIA/Kimi-K2.5-NVFP4/hf_offline_eagle3.yaml
- tools/launcher/examples/OpenAI/GPT-OSS-20B/hf_offline_eagle3.yaml
- tools/launcher/examples/Qwen/Qwen3.5-27B/hf_offline_eagle3.yaml
- tools/launcher/examples/Qwen/Qwen3.5-35B-A3B/hf_offline_eagle3.yaml
- tools/launcher/examples/Qwen/Qwen3.5-9B/hf_offline_eagle3.yaml
- tools/launcher/examples/StepFun/Step-3.5-Flash/hf_offline_eagle3.yaml
```sh
# --input-data, --output-dir, --max-seq-len, etc.
###################################################################################################

pip install datasets 2>/dev/null || true
```
🧩 Analysis chain
🏁 Script executed:

```shell
cat -n tools/launcher/common/eagle3/dump_offline_data_hf.sh
```

Repository: NVIDIA/Model-Optimizer
Length of output: 2458

🏁 Script executed:

```shell
find . -name "compute_hidden_states_hf.py" -type f
```

Repository: NVIDIA/Model-Optimizer
Length of output: 148

🏁 Script executed:

```shell
cat -n ./examples/speculative_decoding/collect_hidden_states/compute_hidden_states_hf.py | head -100
```

Repository: NVIDIA/Model-Optimizer
Length of output: 3844

🏁 Script executed:

```shell
grep -n "load_dataset" ./examples/speculative_decoding/collect_hidden_states/compute_hidden_states_hf.py | head -5
```

Repository: NVIDIA/Model-Optimizer
Length of output: 230
Remove error suppression from pip install since datasets is a required dependency.
`pip install datasets 2>/dev/null || true` on line 34 masks installation failures completely. The Python script imports `datasets` at line 31 and uses it at lines 110+ (e.g., `load_dataset("json", ...)`). If installation fails silently, the job will fail later with an ImportError instead of revealing the actual installation error, making debugging difficult and behavior non-reproducible. Either ensure the dependency is pre-installed in the execution environment, or let the installation command fail visibly so errors are clear.
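A minimal fail-fast sketch, assuming `pip` and `python3` are on PATH in the job environment (where the bootstrap ultimately lives is a separate decision):

```sh
# Skip the install when the package is already importable; otherwise
# install it and abort loudly on failure, instead of deferring the
# error to a later ImportError inside compute_hidden_states_hf.py.
if ! python3 -c 'import datasets' 2>/dev/null; then
  pip install datasets || { echo "ERROR: failed to install 'datasets'" >&2; exit 1; }
fi
```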
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.
In `@tools/launcher/common/eagle3/dump_offline_data_hf.sh` at line 34, The pip
installation currently suppresses errors using "pip install datasets 2>/dev/null
|| true" which hides failures while the script imports datasets (at the top) and
later calls load_dataset("json", ...); remove the error suppression so
installation failures are visible by replacing that invocation with a plain
failing install (e.g., "pip install datasets") or ensure the environment
pre-installs the dependency; update the launcher script around the pip install
line and any documentation/bootstrap steps accordingly so ImportError/debug info
surfaces immediately.
```sh
if [ -z ${SLURM_ARRAY_TASK_ID} ]; then
  TASK_ID=0
else
  echo "SLURM_ARRAY_TASK_ID ${SLURM_ARRAY_TASK_ID}"
  TASK_ID=${SLURM_ARRAY_TASK_ID}
fi

if [ -z ${SLURM_ARRAY_TASK_COUNT} ]; then
  TASK_COUNT=1
else
  echo "SLURM_ARRAY_TASK_COUNT ${SLURM_ARRAY_TASK_COUNT}"
  TASK_COUNT=${SLURM_ARRAY_TASK_COUNT}
fi
```
🧩 Analysis chain
🏁 Script executed:

```shell
#!/bin/bash
rg -n '\[ -z \${SLURM_ARRAY_TASK_(ID|COUNT)\} \]|\$\{@\}' tools/launcher/common/eagle3/dump_offline_data_hf.sh
```

Repository: NVIDIA/Model-Optimizer
Length of output: 207

🏁 Script executed:

```shell
cat -n tools/launcher/common/eagle3/dump_offline_data_hf.sh
```

Repository: NVIDIA/Model-Optimizer
Length of output: 2458

🏁 Script executed:

```shell
# Search for the patterns mentioned in the review with corrected regex
rg -n '\[ -z \$' tools/launcher/common/eagle3/dump_offline_data_hf.sh -A 2
```

Repository: NVIDIA/Model-Optimizer
Length of output: 209

🏁 Script executed:

```shell
# Also search for ${@} pattern
rg -n '\$\{@\}' tools/launcher/common/eagle3/dump_offline_data_hf.sh
```

Repository: NVIDIA/Model-Optimizer
Length of output: 78
SLURM array defaults and argument forwarding have unsafe parameter expansion that will cause bash errors.
Unquoted `[ -z ${...} ]` will fail with "unary operator expected" when variables are unset. Unquoted `${@}` breaks multi-word argument handling. Use `[ -z "${VAR:-}" ]` and `"$@"` to make this deterministic and safe.
Also, `pip install datasets 2>/dev/null || true` masks installation failures, and `${HF_MODEL_CKPT}` is used without validation.
Proposed fix

```diff
-if [ -z ${SLURM_ARRAY_TASK_ID} ]; then
-  TASK_ID=0
-else
-  echo "SLURM_ARRAY_TASK_ID ${SLURM_ARRAY_TASK_ID}"
-  TASK_ID=${SLURM_ARRAY_TASK_ID}
-fi
+if [ -z "${SLURM_ARRAY_TASK_ID:-}" ]; then
+  TASK_ID=0
+else
+  echo "SLURM_ARRAY_TASK_ID ${SLURM_ARRAY_TASK_ID}"
+  TASK_ID="${SLURM_ARRAY_TASK_ID}"
+fi

-if [ -z ${SLURM_ARRAY_TASK_COUNT} ]; then
-  TASK_COUNT=1
-else
-  echo "SLURM_ARRAY_TASK_COUNT ${SLURM_ARRAY_TASK_COUNT}"
-  TASK_COUNT=${SLURM_ARRAY_TASK_COUNT}
-fi
+if [ -z "${SLURM_ARRAY_TASK_COUNT:-}" ]; then
+  TASK_COUNT=1
+else
+  echo "SLURM_ARRAY_TASK_COUNT ${SLURM_ARRAY_TASK_COUNT}"
+  TASK_COUNT="${SLURM_ARRAY_TASK_COUNT}"
+fi

 python3 modules/Model-Optimizer/examples/speculative_decoding/collect_hidden_states/compute_hidden_states_hf.py \
@@
-  ${@}
+  "$@"
```

📝 Committable suggestion
‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.
```sh
if [ -z "${SLURM_ARRAY_TASK_ID:-}" ]; then
  TASK_ID=0
else
  echo "SLURM_ARRAY_TASK_ID ${SLURM_ARRAY_TASK_ID}"
  TASK_ID="${SLURM_ARRAY_TASK_ID}"
fi

if [ -z "${SLURM_ARRAY_TASK_COUNT:-}" ]; then
  TASK_COUNT=1
else
  echo "SLURM_ARRAY_TASK_COUNT ${SLURM_ARRAY_TASK_COUNT}"
  TASK_COUNT="${SLURM_ARRAY_TASK_COUNT}"
fi
```
🧰 Tools
🪛 Shellcheck (0.11.0)
[info] 36-36: Double quote to prevent globbing and word splitting.
(SC2086)
[info] 43-43: Double quote to prevent globbing and word splitting.
(SC2086)
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.
In `@tools/launcher/common/eagle3/dump_offline_data_hf.sh` around lines 36 - 48,
The script uses unsafe parameter expansion and masks errors; update checks to
use quoted defaults like [ -z "${SLURM_ARRAY_TASK_ID:-}" ] and [ -z
"${SLURM_ARRAY_TASK_COUNT:-}" ] and assign TASK_ID and TASK_COUNT from the
quoted values (e.g., TASK_ID="${SLURM_ARRAY_TASK_ID}" ) to avoid "unary operator
expected" errors; forward arguments using "$@" instead of ${@} to preserve
multi-word args; stop silencing pip failures by removing `2>/dev/null || true`
from the `pip install datasets` invocation so installs surface errors; and add a
validation check for HF_MODEL_CKPT (e.g., [ -n "${HF_MODEL_CKPT:-}" ] with an
explanatory error and exit) before it is used.
```sh
python3 modules/Model-Optimizer/examples/speculative_decoding/collect_hidden_states/compute_hidden_states_hf.py \
  --model ${HF_MODEL_CKPT} \
  --dp-rank ${TASK_ID} \
```
Validate HF_MODEL_CKPT before launching Python.
This script documents HF_MODEL_CKPT as required, but never checks it. Add an explicit guard so failures are immediate and actionable.
Proposed fix

```diff
+if [ -z "${HF_MODEL_CKPT:-}" ]; then
+  echo "HF_MODEL_CKPT is required"
+  exit 1
+fi
+
 python3 modules/Model-Optimizer/examples/speculative_decoding/collect_hidden_states/compute_hidden_states_hf.py \
```

📝 Committable suggestion
‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.
```sh
if [ -z "${HF_MODEL_CKPT:-}" ]; then
  echo "HF_MODEL_CKPT is required"
  exit 1
fi

python3 modules/Model-Optimizer/examples/speculative_decoding/collect_hidden_states/compute_hidden_states_hf.py \
  --model ${HF_MODEL_CKPT} \
  --dp-rank ${TASK_ID} \
```
🧰 Tools
🪛 Shellcheck (0.11.0)
[info] 51-51: Double quote to prevent globbing and word splitting.
(SC2086)
[info] 52-52: Double quote to prevent globbing and word splitting.
(SC2086)
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.
In `@tools/launcher/common/eagle3/dump_offline_data_hf.sh` around lines 50 - 52,
Add an explicit guard that checks the HF_MODEL_CKPT environment variable is set
and non-empty before running the Python command: if HF_MODEL_CKPT is missing or
empty, print a clear error mentioning HF_MODEL_CKPT and exit with a non-zero
status so the subsequent call to compute_hidden_states_hf.py (the python3
invocation that uses --model ${HF_MODEL_CKPT} and --dp-rank ${TASK_ID}) never
runs; keep the check immediately above the python3 invocation.
```yaml
global_vars:
  hf_model: /hf-local/deepseek-ai/DeepSeek-V3.2
```
Include mandatory MLM_MODEL_CFG and QUANT_CFG env variables
This new model config does not set the required model repo-id and quant profile envs, which can break standardized launcher behavior across tasks.
As per coding guidelines, Set MLM_MODEL_CFG environment variable to the HuggingFace repo ID when adding a new model config and Set QUANT_CFG environment variable (e.g., NVFP4_DEFAULT_CFG, INT8_DEFAULT_CFG) when adding a new model config.
Also applies to: 46-47, 64-66, 104-106
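A minimal sketch of the requested shape — the repo ID and quant profile below are illustrative assumptions, not verified values for this checkpoint:

```yaml
global_vars:
  hf_model: /hf-local/deepseek-ai/DeepSeek-V3.2
  # Assumed values; confirm the canonical repo ID and the intended
  # quantization profile before committing.
  MLM_MODEL_CFG: deepseek-ai/DeepSeek-V3.2
  QUANT_CFG: NVFP4_DEFAULT_CFG
```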
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.
In `@tools/launcher/examples/DeepSeek/DeepSeek-V3.2/hf_offline_eagle3.yaml` around
lines 33 - 34, The YAML global_vars block that defines hf_model (hf_model:
/hf-local/deepseek-ai/DeepSeek-V3.2) is missing the required environment
variables; set MLM_MODEL_CFG to the HuggingFace repo ID for this model (e.g.,
deepseek-ai/DeepSeek-V3.2 or the canonical repo string) and set QUANT_CFG to the
chosen quantization profile (e.g., NVFP4_DEFAULT_CFG or INT8_DEFAULT_CFG)
alongside hf_model in the same global_vars section so the launcher picks up
MLM_MODEL_CFG and QUANT_CFG for this config (apply same change for the other
blocks noted around lines 46-47, 64-66, 104-106).
```text
Model checkpoint (HuggingFace)
        │
        ▼
┌──────────────────┐
│ Task 0: Query    │  vLLM server generates prompt/response pairs
│ (data synthesis) │  Script: common/vllm/query.sh
└────────┬─────────┘
         │ (afterany — downstream tasks run even if this times out)
         ▼
┌──────────────────┐
│ Task 1: Dump     │  Target model runs forward pass, saves hidden states
│ (hidden states)  │  Script: common/eagle3/dump_offline_data.sh (TRT-LLM)
└────────┬─────────┘  or dump_offline_data_hf.sh (HF/vLLM fallback)
         │
         ▼
┌──────────────────┐
│ Task 2: Train    │  Draft head trained on hidden states (Accelerate + FSDP)
│ (EAGLE3 head)    │  Script: common/eagle3/train_eagle.sh
└────────┬─────────┘
         │
         ▼
┌──────────────────┐
│ Task 3: Bench    │  Speculative decoding benchmark via vLLM
│ (benchmark)      │  Script: common/specdec_bench/quick_check.sh
└──────────────────┘
```
Fix fenced-block lint violations (language + blank-line spacing)
A few code fences are missing a language tag and/or required surrounding blank lines, which will keep markdownlint warnings active.
Suggested doc fix

````diff
-```
+```text
 Model checkpoint (HuggingFace)
@@
-```
+```

 **Symptom:**
+
-```
+```text
 ValueError: The repository ... contains custom code... Please pass trust_remote_code=True
@@
 Per-model results template:
+
 #### Model: <name>
@@
 - **New failure pattern:** Yes/No — <description if yes>
````
Also applies to: 181-183, 219-229
🧰 Tools

🪛 markdownlint-cli2 (0.22.1)

[warning] 11-11: Fenced code blocks should have a language specified
(MD040, fenced-code-language)
🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate.
In `@tools/launcher/examples/EAGLE3_TRIAGE.md` around lines 11 - 37, Several fenced code blocks in EAGLE3_TRIAGE.md are missing language tags and required surrounding blank lines; update the fences around the "Model checkpoint (HuggingFace)" block, the "ValueError: The repository ..." block, and the "Per-model results template" block to use explicit language markers (e.g., ```text or ```markdown) and ensure there is a blank line before and after each fence so markdownlint stops warning; specifically edit the fences surrounding the literal text "Model checkpoint (HuggingFace)", the error snippet starting "ValueError: The repository ... contains custom code...", and the "Per-model results template" section to add the appropriate ```text/```markdown tags and blank lines.
```yaml
args:
  - --draft_model_dir /scratchspace/export
  - --draft_length 3
  - --output_length 4096
  - --engine VLLM
  - --tp_size 4
  - --ep_size 1
  - --speculative_algorithm EAGLE3
  - --mtbench /hf-local/HuggingFaceH4/mt_bench_prompts/raw/question.jsonl
  - --concurrency 1
environment:
  - HF_LOCAL: /hf-local
  - HF_MODEL_CKPT: <<global_vars.hf_model>>
slurm_config:
```
Benchmark step omits required GPT-OSS runtime settings.
task_3 is missing --trust-remote-code and does not export TIKTOKEN_RS_CACHE_DIR, even though both are required for this model family in earlier steps/comments.
Proposed fix

```diff
 task_3:
   args:
     - --draft_model_dir /scratchspace/export
     - --draft_length 3
     - --output_length 4096
     - --engine VLLM
+    - --trust-remote-code
     - --tp_size 4
     - --ep_size 1
@@
   environment:
     - HF_LOCAL: /hf-local
     - HF_MODEL_CKPT: <<global_vars.hf_model>>
+    - TIKTOKEN_RS_CACHE_DIR: /hf-local/tiktoken_cache
```

🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.
In `@tools/launcher/examples/OpenAI/GPT-OSS-20B/hf_offline_eagle3.yaml` around
lines 91 - 104, task_3's runtime invocation is missing the required
--trust-remote-code flag and the TIKTOKEN_RS_CACHE_DIR environment export;
update the args array (the block containing --speculative_algorithm EAGLE3,
--engine VLLM, etc.) to include --trust-remote-code, and add a
TIKTOKEN_RS_CACHE_DIR entry to the environment list (alongside HF_LOCAL and
HF_MODEL_CKPT) pointing to the intended cache dir (e.g., /tiktoken_rs_cache or
the project's cache path) so the model runs with the same runtime settings as
earlier tasks.
```yaml
global_vars:
  hf_model: /hf-local/Qwen/Qwen3.5-27B
```
Required MLM_MODEL_CFG and QUANT_CFG are missing in this new model pipeline config.
Please add both env vars so this config matches launcher metadata conventions used by downstream tooling.
As per coding guidelines, Set MLM_MODEL_CFG environment variable to the HuggingFace repo ID when adding a new model config and Set QUANT_CFG environment variable (e.g., NVFP4_DEFAULT_CFG, INT8_DEFAULT_CFG) when adding a new model config.
Also applies to: 36-38, 54-56, 94-97
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.
In `@tools/launcher/examples/Qwen/Qwen3.5-27B/hf_offline_eagle3.yaml` around lines
24 - 26, The pipeline config's global_vars block defines hf_model but is missing
the required environment variables MLM_MODEL_CFG and QUANT_CFG; update the
global_vars section (where hf_model is set) to add MLM_MODEL_CFG with the
HuggingFace repo ID for this model and QUANT_CFG with the appropriate
quantization profile (e.g., NVFP4_DEFAULT_CFG or INT8_DEFAULT_CFG) so the entry
for Qwen3.5-27B conforms to launcher metadata conventions; apply the same
addition to the other occurrences noted (around lines 36-38, 54-56, 94-97) to
keep all model configs consistent.
```yaml
global_vars:
  hf_model: /hf-local/Qwen/Qwen3.5-35B-A3B
```
New model config is missing MLM_MODEL_CFG and QUANT_CFG.
Please define and propagate both env vars in this YAML so the launcher has required model and quant metadata.
As per coding guidelines, Set MLM_MODEL_CFG environment variable to the HuggingFace repo ID when adding a new model config and Set QUANT_CFG environment variable (e.g., NVFP4_DEFAULT_CFG, INT8_DEFAULT_CFG) when adding a new model config.
Also applies to: 42-44, 59-61, 99-102
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.
In `@tools/launcher/examples/Qwen/Qwen3.5-35B-A3B/hf_offline_eagle3.yaml` around
lines 29 - 31, The YAML global_vars block defines hf_model but is missing the
required environment variables MLM_MODEL_CFG and QUANT_CFG; add MLM_MODEL_CFG
with the HuggingFace repo ID for this model and set QUANT_CFG to the appropriate
quant config (e.g., NVFP4_DEFAULT_CFG or INT8_DEFAULT_CFG) in the same
global_vars section and ensure any places referencing this model (the blocks
around the existing hf_model entries) propagate these two env vars so the
launcher receives both model metadata and quant config.
```yaml
global_vars:
  hf_model: /hf-local/Qwen/Qwen3.5-9B
```
Set mandatory MLM_MODEL_CFG and QUANT_CFG in this new config
Please add both required env vars so this model config conforms to launcher standards and avoids downstream metadata gaps.
As per coding guidelines, Set MLM_MODEL_CFG environment variable to the HuggingFace repo ID when adding a new model config and Set QUANT_CFG environment variable (e.g., NVFP4_DEFAULT_CFG, INT8_DEFAULT_CFG) when adding a new model config.
Also applies to: 36-37, 54-55, 94-96
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.
In `@tools/launcher/examples/Qwen/Qwen3.5-9B/hf_offline_eagle3.yaml` around lines
24 - 25, Add the mandatory environment entries MLM_MODEL_CFG and QUANT_CFG next
to the existing global_vars hf_model definition (hf_model:
/hf-local/Qwen/Qwen3.5-9B): set MLM_MODEL_CFG to the HuggingFace repo ID for
this model and set QUANT_CFG to the chosen quantization profile (e.g.,
NVFP4_DEFAULT_CFG or INT8_DEFAULT_CFG). Repeat the same addition for the other
hf_model blocks present later in the file (the other occurrences of global_vars
with hf_model) so all model configs include both required env vars.
```yaml
global_vars:
  hf_model: /hf-local/stepfun-ai/Step-3.5-Flash
```
Add required MLM_MODEL_CFG and QUANT_CFG for this new model config
This config is missing the required model/quant env metadata, so downstream launcher logic that depends on standardized envs can break or become inconsistent.
Suggested patch shape

```diff
 global_vars:
   hf_model: /hf-local/stepfun-ai/Step-3.5-Flash
+  mlm_model_cfg: stepfun-ai/Step-3.5-Flash
+  quant_cfg: <SET_QUANT_CFG>

 task_0:
@@
   environment:
     - HF_LOCAL: /hf-local
+    - MLM_MODEL_CFG: <<global_vars.mlm_model_cfg>>
+    - QUANT_CFG: <<global_vars.quant_cfg>>

 task_1:
@@
   environment:
     - HF_MODEL_CKPT: <<global_vars.hf_model>>
+    - MLM_MODEL_CFG: <<global_vars.mlm_model_cfg>>
+    - QUANT_CFG: <<global_vars.quant_cfg>>

 task_3:
@@
   environment:
     - HF_LOCAL: /hf-local
     - HF_MODEL_CKPT: <<global_vars.hf_model>>
+    - MLM_MODEL_CFG: <<global_vars.mlm_model_cfg>>
+    - QUANT_CFG: <<global_vars.quant_cfg>>
```

As per coding guidelines, Set MLM_MODEL_CFG environment variable to the HuggingFace repo ID when adding a new model config and Set QUANT_CFG environment variable (e.g., NVFP4_DEFAULT_CFG, INT8_DEFAULT_CFG) when adding a new model config.
Also applies to: 47-48, 65-66, 105-107
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.
In `@tools/launcher/examples/StepFun/Step-3.5-Flash/hf_offline_eagle3.yaml` around
lines 34 - 35, The new model config only sets hf_model but omits the required
environment metadata; add MLM_MODEL_CFG with the HuggingFace repo ID (e.g., the
same value as hf_model or the canonical HF repo string) and set QUANT_CFG to the
appropriate quantization profile (e.g., NVFP4_DEFAULT_CFG or INT8_DEFAULT_CFG)
inside the same global_vars block so launcher code sees standardized envs; apply
the same pattern for the other model blocks in this file that define hf_model
(the other global_vars entries referenced in the comment).
Codecov Report

✅ All modified and coverable lines are covered by tests.

Additional details and impacted files

```diff
@@           Coverage Diff            @@
##             main    #1417    +/-   ##
========================================
  Coverage   76.74%   76.75%
========================================
  Files         476      476
  Lines       51307    51483    +176
========================================
+ Hits        39377    39516    +139
- Misses      11930    11967     +37
```

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
Summary
This PR delivers the EAGLE3 automation triage work (OKR-30): testing the 4-step EAGLE3 offline pipeline against 10 new models to reveal failure modes, and providing a living triage document for ongoing tracking.
Contents
New script:

- `tools/launcher/common/eagle3/dump_offline_data_hf.sh` — HF-based (device_map=auto) hidden state extraction for models not supported by TRT-LLM (VLMs, custom-code models, architectures absent from TRT-LLM); see the invocation sketch below

12 new `hf_offline_eagle3.yaml` launcher configs (one per model listed in the commits above).

`EAGLE3_TRIAGE.md` — living triage document with the Mermaid decision tree, the model results table, and the documented issues.

Failure modes found (7 models on OCI-HSG, 2026-04-15):

- `dump_offline_data_vllm.sh` missing — universal task_1 failure (`_hf.sh` added as the fix)
- `offline_training.sh` HF Hub upload on local path
- `trust_remote_code` not forwarded to benchmark
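A hypothetical invocation of the new dump script, for orientation only — the flag names come from the script's header comment (`--input-data`, `--output-dir`, `--max-seq-len`) and the paths are placeholders, so treat the whole command as a sketch rather than a verified CLI:

```sh
# One shard of a SLURM array job; TASK_ID/TASK_COUNT fall back to 0/1
# when the SLURM_ARRAY_* variables are unset, and HF_MODEL_CKPT must
# point at the target model checkpoint.
HF_MODEL_CKPT=/hf-local/Qwen/Qwen3.5-9B \
SLURM_ARRAY_TASK_ID=0 SLURM_ARRAY_TASK_COUNT=4 \
bash tools/launcher/common/eagle3/dump_offline_data_hf.sh \
  --input-data /scratchspace/conversations.jsonl \
  --output-dir /scratchspace/hidden_states \
  --max-seq-len 4096
```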
Test plan

- `uv run launch.py --yaml examples/<org>/<model>/hf_offline_eagle3.yaml --dryrun --yes -v`
- `EAGLE3_TRIAGE.md` Mermaid diagram renders in GitHub

🤖 Generated with Claude Code
Summary by CodeRabbit
New Features
Documentation