Skip to content

Will MNN Chat app support ”Speculative Decoding“? #3412

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
BeetSoup128 opened this issue May 1, 2025 · 1 comment
Open

Will MNN Chat app support ”Speculative Decoding“? #3412

BeetSoup128 opened this issue May 1, 2025 · 1 comment
Labels
llm llm export error or use error Suggestion

Comments

@BeetSoup128
Copy link

请问是否有计划下一步加入Speculative Decoding?MNN框架下的LLM拥有高度的一致性,相对大内存与较低的算力更适合双模型同时加载。

@jxt1234
Copy link
Collaborator

jxt1234 commented May 3, 2025

正在实现中,预计本月会支持

@jxt1234 jxt1234 added Suggestion llm llm export error or use error labels May 3, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
llm llm export error or use error Suggestion
Projects
None yet
Development

No branches or pull requests

2 participants