-
Notifications
You must be signed in to change notification settings - Fork 443
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
TypeError: QDQQuantizer.__init__() missing 1 required positional argument: 'mode' #2019
Comments
Hi, I exported the model (with cli and opset > 13), ran your script and it worked as expected. from functools import partial
from transformers import AutoTokenizer
from optimum.onnxruntime import ORTModelForSequenceClassification, ORTQuantizer
from optimum.onnxruntime.configuration import AutoCalibrationConfig, AutoQuantizationConfig
model_id = "distilbert-base-uncased-finetuned-sst-2-english"
tokenizer = AutoTokenizer.from_pretrained(model_id)
onnx_model = ORTModelForSequenceClassification.from_pretrained("./onnx_model")
quantizer = ORTQuantizer.from_pretrained(onnx_model)
qconfig = AutoQuantizationConfig.tensorrt(per_channel=True)
def preprocess_fn(ex, tokenizer):
return tokenizer(ex["sentence"])
calibration_dataset = quantizer.get_calibration_dataset(
"glue",
dataset_config_name="sst2",
preprocess_function=partial(preprocess_fn, tokenizer=tokenizer),
num_samples=50,
dataset_split="train",
)
calibration_config = AutoCalibrationConfig.minmax(calibration_dataset)
ranges = quantizer.fit(
dataset=calibration_dataset,
calibration_config=calibration_config,
operators_to_quantize=qconfig.operators_to_quantize,
)
model_quantized_path = quantizer.quantize(
save_dir="./output_opt",
calibration_tensors_range=ranges,
quantization_config=qconfig,
) |
I installed it the official way |
Package Version aiohappyeyeballs 2.4.0 |
yeah we should probably pin a minimum onnxruntime version, please update onnxruntime and onnx with |
System Info
Who can help?
No response
Information
Tasks
examples
folder (such as GLUE/SQuAD, ...)Reproduction (minimal, reproducible, runnable)
Expected behavior
Save a quantified model
The text was updated successfully, but these errors were encountered: