-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Qwen 3 support #3199
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Hi could you please test with latest, this should now be fixed with #3200 |
Also tried awq format and that does not work either with the latest commits. Seems like its defaulting to AutoModel where its not supported. |
Mine is defaulting to automodel as well |
Hi @mht-sharma |
@maziyarpanahi clone and run from that commit. |
I have confident that commit has fixed the issue, we just need it to be officially released so we can use it in production. (custom builds are not allowed in production, they become messy) |
I pulled the latest commit when testing and built the image manually and it was not working with that commit. I think the model may be working without quant with this commit but quants are not working yet. |
I'm having the same issue |
+1 |
Model description
Hi,
It seems Qwen3 models are not supported:
I am getting
invalid operation: cannot convert number to u64 (in <string>:18)
error which might be connected to the warning before, or there is something in the chat template that raise an exception insideapply_chat_template
.Open source status
Provide useful links for the implementation
https://huggingface.co/collections/Qwen/qwen3-67dd247413f0e2e4f653967f
The text was updated successfully, but these errors were encountered: