-
-
Notifications
You must be signed in to change notification settings - Fork 6.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add support for loading and inferencing on aws-neuron and aws-neuronx #18199
base: main
Are you sure you want to change the base?
Conversation
Support aws neuronx
fix: NeuronYOLO base class
fix: support python < 3.10
Feature/support inf1
Sync fork
🔍 Existing Issues For ReviewYour pull request is modifying functions with the following pre-existing issues: 📄 File: ultralytics/cfg/init.py
Did you find this useful? React with a 👍 or 👎 |
I have read the CLA Document and I sign the CLA 2 out of 3 committers have signed the CLA. |
I have read the CLA Document and I sign the CLA |
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #18199 +/- ##
==========================================
- Coverage 78.52% 71.52% -7.01%
==========================================
Files 128 134 +6
Lines 17138 17785 +647
==========================================
- Hits 13458 12721 -737
- Misses 3680 5064 +1384
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Sentry. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hey @takipipo , the AWS-Neuron inference integration you're working on looks very promising. Thank you for your effort and contributions!
That said, the PR requires some adjustments and cleanup to align with our codebase.
-
Duplicate Code: You've introduced a new exporter.py and autobackend.py, which results in a lot of duplicate code. Instead of creating new files, please integrate your changes into the existing exporter.py and autobackend.py to maintain consistency and reduce redundancy.
-
NeuronYOLO Model: There’s no need to create a separate NeuronYOLO model for this integration. Both inference and export functionality should be handled within autobackend.py and exporter.py as part of the existing framework.
Unfortunately, I’m unable to test your PR in its current state. Once you've made these changes I’ll take another look.
Thanks again for your contributions!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why did you simplify the docstrings here?
@@ -614,7 +614,7 @@ def export_mnn(self, prefix=colorstr("MNN:")): | |||
|
|||
@try_export | |||
def export_ncnn(self, prefix=colorstr("NCNN:")): | |||
"""YOLO NCNN export using PNNX https://github.com/pnnx/pnnx.""" | |||
"""YOLOv8 NCNN export using PNNX https://github.com/pnnx/pnnx.""" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
"""YOLOv8 NCNN export using PNNX https://github.com/pnnx/pnnx.""" | |
"""YOLO NCNN export using PNNX https://github.com/pnnx/pnnx.""" |
Given that the current Ultralytics does not support AWS-Neuron, AWS-NeuronX, which is one of the unsupported options by Ultralytics, AWS-NeuronX is a cost-effective way to productionize the YOLO models.
Here is the Proposal method for exporting the Ultralytics model to AWS-Neuron.
Here is the Proposal method for exporting the Ultralytics model to AWS-NeuronX
cc: @nirattisai-t @luangtatipsy