Replies: 1 comment
-
Can you give more information about what you tried? What model were you using? This error is coming from onnxruntime, vs genai itself. It appears the model expects input_ids to be 4096 in size and 6 is passed in. Is this expecting input_ids to be padded out to 4096? |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I was looking for libraries that can prune in addition to quantizing and it appears that SparseML can do that. However, I tried taking the onnx weights exported by the library and loading it to genai. I keep getting errors like the one below. I'm guessing that maybe pruning isn't supported by this library yet?
Beta Was this translation helpful? Give feedback.
All reactions