-
Notifications
You must be signed in to change notification settings - Fork 127
Pull requests: microsoft/onnxruntime-genai
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Catch ort exception when trying to allocate key value cache
#1058
opened Nov 11, 2024 by
baijumeswani
Loading…
Add option to disable symmetric INT4 quantization
#1053
opened Nov 8, 2024 by
kunal-vaishnavi
Loading…
Add Quantized_model + float LoRA model scenario to model builder
#1043
opened Nov 7, 2024 by
apsonawane
Loading…
Added functionality to choose dml adapter by luid
#1041
opened Nov 6, 2024 by
DavitGrigoryan132
Loading…
Add an IChatClient implementation to OnnxRuntimeGenAI
#987
opened Oct 16, 2024 by
stephentoub
Loading…
Make Microsoft.ML.OnnxRuntimeGenAI.Tokenizer a Microsoft.ML.Tokenizers.Tokenizer
#970
opened Oct 11, 2024 by
stephentoub
Loading…
ProTip!
What’s not been updated in a month: updated:<2024-10-14.