Log metrics horizontally in W&B to simplify comparison to future runs #494

arcticfly · 2025-12-19T21:04:23Z

Utility function to simplify comparison of constant model's metrics against those of newly trained models.

In the screenshot below, the golden line is the error rate of a comparison model.

Example:

await log_constant_metrics_wandb(
    model=model,
    num_steps=num_horizontal_run_steps,
    split_metrics={
        "val": metrics_dict,
        "val_failures": {
            k: v for k, v in metrics_dict.items() if "failed" in k
        },
    },
)

arcticfly added 3 commits December 19, 2025 13:02

Log metrics horizontally in W&B to simplify comparison to future runs

e912b02

Fix lint checks

48618be

Accept split_metrics

d48b0f6

arcticfly requested a review from bradhilton December 19, 2025 21:20

bradhilton approved these changes Dec 19, 2025

View reviewed changes

arcticfly merged commit 8bfb8b8 into main Dec 19, 2025
2 checks passed

arcticfly deleted the horizontal-run branch December 19, 2025 21:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Log metrics horizontally in W&B to simplify comparison to future runs #494

Log metrics horizontally in W&B to simplify comparison to future runs #494

Uh oh!

arcticfly commented Dec 19, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Log metrics horizontally in W&B to simplify comparison to future runs #494

Log metrics horizontally in W&B to simplify comparison to future runs #494

Uh oh!

Conversation

arcticfly commented Dec 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

arcticfly commented Dec 19, 2025 •

edited

Loading