Fix load_weights with strict=False to filter extra weights before update by gmin7 · Pull Request #3214 · ml-explore/mlx

gmin7 · 2026-03-06T18:45:47Z

Proposed changes

When loading weights from a checkpoint that contains more layers than the model (e.g., loading a full
model's safetensors into a model instantiated with num_hidden_layers=1), load_weights(..., strict=False)
raises an IndexError: list index out of range. This happens because indexed keys like layers.1.weight
pass through tree_unflatten unchanged, and Module.update then tries to index into the model's layers
list at positions that don't exist.
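The failure mode can be illustrated without MLX. The following is a minimal pure-Python sketch, not the library's code: naive_update, load_naive, model_params, and checkpoint are hypothetical names, and the nested dict/list stands in for a model whose layers list holds a single layer.

```python
def naive_update(params, key, value):
    """Walk a dotted key into nested dicts/lists and assign the leaf value."""
    parts = key.split(".")
    node = params
    for part in parts[:-1]:
        # Numeric components index into lists, like a model's layers list.
        node = node[int(part)] if isinstance(node, list) else node[part]
    last = parts[-1]
    if isinstance(node, list):
        node[int(last)] = value
    else:
        node[last] = value


def load_naive(params, weights):
    """Apply every checkpoint entry without checking it exists in the model."""
    for key, value in weights.items():
        naive_update(params, key, value)


# Stand-in for a model built with a single layer.
model_params = {"layers": [{"weight": 0.0}]}
# Stand-in for a checkpoint saved from a two-layer model.
checkpoint = {"layers.0.weight": 1.0, "layers.1.weight": 2.0}

try:
    load_naive(model_params, checkpoint)
    error = None
except IndexError as exc:
    # "layers.1.weight" indexes position 1 of a one-element list.
    error = exc
```

In this sketch the first key is applied normally and the second one fails, which mirrors the situation described above: the extra key is only a problem once something tries to index with it.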

This restores the filtering of weight keys to only those present in the model's parameters when
strict=False, so extra weights are silently dropped before reaching update.
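The filtering idea can be sketched in plain Python as follows. This is an illustration under assumptions, not the code from this PR: flatten and filter_extra_weights are hypothetical helpers, and the nested dict/list stands in for the model's parameter tree.

```python
def flatten(tree, prefix=""):
    """Flatten nested dicts/lists into a {dotted_key: leaf} dict."""
    if isinstance(tree, dict):
        items = tree.items()
    elif isinstance(tree, list):
        items = enumerate(tree)
    else:
        return {prefix: tree}
    flat = {}
    for key, value in items:
        child = f"{prefix}.{key}" if prefix else str(key)
        flat.update(flatten(value, child))
    return flat


def filter_extra_weights(weights, model_params):
    """Keep only the weights whose keys exist in the model's parameters."""
    known = set(flatten(model_params))
    return {k: v for k, v in weights.items() if k in known}


# Stand-in for a model built with a single layer.
model_params = {"layers": [{"weight": 0.0}]}
# Stand-in for a checkpoint saved from a two-layer model.
checkpoint = {"layers.0.weight": 1.0, "layers.1.weight": 2.0}

filtered = filter_extra_weights(checkpoint, model_params)
```

Here filtered contains only layers.0.weight, so nothing downstream ever sees an index the model does not have. The trade-off of dropping keys silently is that a misspelled key is also ignored, which is why this behavior is tied to strict=False.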

Checklist

Put an x in the boxes that apply.

I have read the CONTRIBUTING document
I have run pre-commit run --all-files to format my code / installed pre-commit prior to committing changes
I have added tests that prove my fix is effective or that my feature works
I have updated the necessary documentation (if needed)

angeloskath

Thanks for the catch and fix!

Michelle DiMarco added 3 commits March 6, 2026 09:49

If strict=False and only a subset of the model is loaded, then ensure the weights dict prunes out the unused weight keys to avoid idx error during tree_unflatten

1e22bed

Add unit test

dad96e0

Move fix to update(), where we check if the index of the param is out of bounds, and if so we ignore it on strict=False

40a0f2e
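The last commit describes a bounds check inside update() rather than pre-filtering the keys. A rough sketch of that idea, assuming a simplified list-only update, is shown here. update_layers is a hypothetical helper, and raising IndexError under strict=True is an assumption of this sketch, not a statement about what Module.update raises.

```python
def update_layers(current, new_values, strict=True):
    """Copy new_values into current by position.

    Entries whose index is past the end of current raise under
    strict=True and are skipped under strict=False.
    Returns the list of skipped indices.
    """
    skipped = []
    for i, value in enumerate(new_values):
        if i >= len(current):
            if strict:
                raise IndexError(
                    f"index {i} out of range for {len(current)} layers"
                )
            # Non-strict mode: ignore the extra entry and keep going.
            skipped.append(i)
            continue
        current[i] = value
    return skipped


layers = ["old0"]
skipped = update_layers(layers, ["new0", "new1", "new2"], strict=False)
```

Compared with filtering the keys up front, checking at the point of indexing keeps the decision next to the code that would otherwise fail, at the cost of threading the strict flag through the update path.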

angeloskath approved these changes Mar 6, 2026

angeloskath merged commit d2702a4 into ml-explore:main Mar 10, 2026
29 of 32 checks passed

angeloskath merged 3 commits into ml-explore:main from gmin7:mdimarco/lazy_load_idx_fix
