How to load model with load_state_dict ? #18097

aymuos15 · 2024-12-07T17:45:45Z

Search before asking

I have searched the Ultralytics YOLO issues and discussions and found no similar questions.

Question

weights = f"{HOME}/runs/detect/train70/weights/best.pt"
model = YOLO(weights)

This works fine! But I am in a situation where I cannot use it.

The alternative I found: ultralytics/yolov5#1441 (comment)

But this does not work. A few layers throw a mismatch error:


model = YOLO('yolo11n.yaml')
model.nc = 8 
model.model.load_state_dict(torch.load(weights)['model'].state_dict())

How do I go about this?

Additional

No response


UltralyticsAssistant · 2024-12-07T17:46:13Z

👋 Hello @aymuos15, thank you for your interest in Ultralytics 🚀! We recommend a visit to the Docs for new users where you can find many Python and CLI usage examples and where many of the most common questions may already be answered.

If this is a 🐛 Bug Report, please provide a minimum reproducible example to help us debug it.

If this is a ❓ Question, please provide as much information as possible, including a detailed explanation of your use case and full error logs, as well as verifying compatibility with the latest ultralytics version. See the suggested upgrade command below.

For this specific inquiry regarding model loading using load_state_dict and its associated errors, it is important to ensure that:

The model architecture (yolo11n.yaml) and the weights (best.pt) are compatible.
Any custom modifications, such as altering model.nc, are correctly integrated and do not cause layer mismatches.

Feel free to share exact error messages or provide details about how these layers mismatch to help us gain further insight into the issue. An Ultralytics engineer will also assist you soon to provide more guidance 🙂.

Join the Ultralytics community where it suits you best. For real-time chat, head to Discord 🎧. Prefer in-depth discussions? Check out Discourse. Or dive into threads on our Subreddit to share knowledge with the community.

Upgrade

Upgrade to the latest ultralytics package, including all requirements, in a Python>=3.8 environment with PyTorch>=1.8 to ensure your issue is not already resolved in the latest version:

pip install -U ultralytics

Environments

YOLO may be run in any of the following up-to-date verified environments (with all dependencies including CUDA/CUDNN, Python and PyTorch preinstalled):

Notebooks with free GPU:
Google Cloud Deep Learning VM. See GCP Quickstart Guide
Amazon Deep Learning AMI. See AWS Quickstart Guide
Docker Image. See Docker Quickstart Guide

Status

If this badge is green, all Ultralytics CI tests are currently passing. CI tests verify the correct operation of all YOLO Modes and Tasks on macOS, Windows, and Ubuntu every 24 hours and on every commit.

aymuos15 · 2024-12-07T17:53:57Z

model.model.load_state_dict(torch.load(weights)['model'].state_dict(), strict=False)

Even this throws the same error
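A note for readers on why strict=False may not help here: in PyTorch, strict=False relaxes the check for missing or unexpected keys, but a tensor whose name matches and whose shape differs still raises a RuntimeError. The sketch below reproduces that with two stand-in layers. The layer type and sizes are assumptions chosen for illustration (8 echoes the model.nc = 8 above, 80 is an assumed default class count), not values read from the actual checkpoint.

```python
import torch.nn as nn

# Stand-ins for a detection head: same parameter names, different class counts.
# The sizes are illustrative assumptions, not taken from a real YOLO checkpoint.
checkpoint_head = nn.Linear(16, 8)   # plays the role of the saved weights
fresh_head = nn.Linear(16, 80)       # plays the role of the freshly built model


def try_load(strict):
    """Return the error text raised by load_state_dict, or None on success."""
    try:
        fresh_head.load_state_dict(checkpoint_head.state_dict(), strict=strict)
    except RuntimeError as err:
        return str(err)
    return None


error_strict = try_load(strict=True)
error_relaxed = try_load(strict=False)

# Both calls fail, because the shapes differ even though the keys line up.
print(error_relaxed)
```

If this matches the situation in the thread, the fix is to make the shapes agree (or to skip the mismatched tensors explicitly), not to relax strictness.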

glenn-jocher · 2024-12-07T22:24:01Z

@aymuos15 it seems the weight file might not completely match the model architecture. Instead, consider using the load method provided by the Ultralytics YOLO class: model = YOLO('yolo11n.yaml').load(weights). This will handle any mismatched parameters more gracefully. For further details, refer to the Ultralytics model documentation.

Dec 7, 2024 · 2024-12-07T22:36:19Z

Simply changing model.nc wouldn't work since the model is already created with the nc in the yaml. You will have to change nc in the yaml.
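As a rough illustration of "change nc in the yaml", the sketch below rewrites the nc entry in the text of a model YAML. It assumes the file has a top-level line of the form `nc: <number>`; the helper name set_num_classes and the sample excerpt are hypothetical and not part of Ultralytics. The patched text would then be saved to a copy of the YAML and that copy passed to YOLO(...).

```python
import re


def set_num_classes(yaml_text: str, nc: int) -> str:
    """Rewrite the top-level `nc:` entry of a model YAML, keeping any trailing comment."""
    pattern = re.compile(r"^nc:\s*\d+", flags=re.MULTILINE)
    new_text, count = pattern.subn(f"nc: {nc}", yaml_text, count=1)
    if count == 0:
        raise ValueError("no top-level 'nc:' entry found")
    return new_text


# Hypothetical excerpt of a model YAML, used only to exercise the helper.
sample = (
    "# hypothetical excerpt\n"
    "nc: 80 # number of classes\n"
    "scales:\n"
    "  n: [0.50, 0.25, 1024]\n"
)

patched = set_num_classes(sample, 8)
print(patched)
```

Editing the file by hand works just as well; the helper is only convenient when the class count has to be set from a script.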

aymuos15 · 2024-12-09T06:27:05Z

Thanks a lot for the suggestions.

Could you please answer the following question:

When I run model = YOLO('yolo11n.yaml').load(weights), I get: Transferred 448/499 items from pretrained weights

But I do not get this message when I run model = YOLO(weights).

It is the exact same weights file.

Dec 9, 2024 · 2024-12-09T08:09:03Z

Does the nc in your yaml match the weights?
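One way to read the "Transferred 448/499" message: as far as I understand it, .load() copies only the checkpoint tensors whose name and shape both match the model built from the yaml, and skips the rest, whereas YOLO(weights) rebuilds the model from the checkpoint itself, so nothing needs to be skipped. The sketch below mimics that filtering on plain shape tuples. The function name, parameter names, and shapes are all hypothetical, and this is not the Ultralytics implementation.

```python
def intersect_by_shape(checkpoint_shapes, model_shapes):
    """Keep checkpoint entries whose name exists in the model with an identical shape."""
    return {
        name: shape
        for name, shape in checkpoint_shapes.items()
        if name in model_shapes and model_shapes[name] == shape
    }


# Hypothetical parameter names and shapes, for illustration only.
checkpoint_shapes = {
    "backbone.conv.weight": (16, 3, 3, 3),
    "backbone.conv.bias": (16,),
    "head.cls.weight": (8, 64, 1, 1),   # trained with 8 classes
    "head.cls.bias": (8,),
}
model_shapes = {
    "backbone.conv.weight": (16, 3, 3, 3),
    "backbone.conv.bias": (16,),
    "head.cls.weight": (80, 64, 1, 1),  # built from a yaml with a different nc
    "head.cls.bias": (80,),
}

transferable = intersect_by_shape(checkpoint_shapes, model_shapes)
print(f"Transferred {len(transferable)}/{len(model_shapes)} items")
```

In this toy case only the backbone entries transfer. Once the class counts agree, every entry matches and the count becomes complete.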

aymuos15 · 2024-12-09T08:49:41Z

This fixed my error. Thanks a lot!

"Simply changing model.nc wouldn't work since the model is already created with the nc in the yaml. You will have to change nc in the yaml."

I double checked now!

glenn-jocher · 2024-12-09T22:56:17Z

You're welcome! Glad it worked for you. Let us know if you have any further questions or run into any issues. Happy experimenting with YOLO!

aymuos15 added the "question" label Dec 7, 2024

aymuos15 closed this as completed Dec 9, 2024
