-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Issues: huggingface/text-generation-inference
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Gemma3: CUDA error: an illegal memory access was encountered
#3227
opened May 14, 2025 by
sebastianliebscher
2 of 4 tasks
Strange output when using Structured output with Gemma 3 12b it
#3214
opened May 8, 2025 by
zacksiri
2 of 4 tasks
Different result between
/chat/completions
and /generate
endpoint
#3203
opened May 1, 2025 by
ibndias
2 of 4 tasks
Failure to install https://huggingface.co/meta-llama/Llama-3.2-11B-Vision-Instruct
#3195
opened Apr 27, 2025 by
vishnuchalla
3 of 4 tasks
Extremely high calculated token count for VLM (Qwen 2.5 VL
#3191
opened Apr 24, 2025 by
boris-lok-pentadoc
2 of 4 tasks
Token count discrepancy when using Qwen2.5-VL with multiple images
#3177
opened Apr 15, 2025 by
JjjFangg
4 tasks
RuntimeError on CUDA capture with FP8 when deploying Llama-4-Maverick on TGI 3.2.3 with using H100 GPUS
#3175
opened Apr 15, 2025 by
nskpro-cmd
Extended queuing time when using guidance (structured generation) and grammar
#3173
opened Apr 15, 2025 by
jazken
2 of 4 tasks
BatchPrefillWithPagedKVCacheWrapper.plan() got an unexpected keyword argument 'head_dim'
#3165
opened Apr 11, 2025 by
ruckc
Tokenizer loading fails for mistralai/Ministral-8B-Instruct-2410 using TGI on GCP Vertex AI
#3163
opened Apr 10, 2025 by
pavlonator
2 of 4 tasks
Previous Next
ProTip!
Adding no:label will show everything without a label.