We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
请问是否有计划下一步加入Speculative Decoding?MNN框架下的LLM拥有高度的一致性,相对大内存与较低的算力更适合双模型同时加载。
The text was updated successfully, but these errors were encountered:
正在实现中,预计本月会支持
Sorry, something went wrong.
No branches or pull requests
请问是否有计划下一步加入Speculative Decoding?MNN框架下的LLM拥有高度的一致性,相对大内存与较低的算力更适合双模型同时加载。
The text was updated successfully, but these errors were encountered: