🚘 Auto opt cli #1343

Open · wants to merge 13 commits into main
Conversation

@trajepl (Contributor) commented Sep 3, 2024

Describe your changes

Auto opt CLI.

E.g., for a BERT model, we can optimize the model from Hugging Face and produce an ONNX model with:

olive auto-opt --model Intel/bert-base-uncased-mrpc --data_config_path data_config.json --task text-classification

olive auto-opt --model Intel/bert-base-uncased-mrpc --data_config_path data_config.json --task text-classification --precision int4 --providers CPU

olive auto-opt --model Intel/bert-base-uncased-mrpc --data_config_path data_config.json --task text-classification --precision fp16 --providers CUDA

# use model builder
olive auto-opt --model microsoft/Phi-3-mini-4k-instruct --precision fp16 --providers CUDA --use_model_builder

Checklist before requesting a review

  • Add unit tests for this change.
  • Make sure all tests can pass.
  • Update documents if necessary.
  • Lint and apply fixes to your code by running lintrunner -a
  • Is this a user-facing change? If yes, give a description of this change to be included in the release notes.
  • Is this PR including examples changes? If yes, please remember to update example documentation in a follow-up PR.

(Optional) Issue link


search_strategy_group = sub_parser.add_argument_group("search strategy options")
search_strategy_group.add_argument(
    "--num-samples", type=int, default=5, help="Number of samples for search algorithm"
)
Contributor
Do we really need to expose this in the CLI?

Contributor Author
I was thinking that if the search takes a long time, the user can reduce num-samples to stop the search sooner.
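
(For context, a minimal sketch of how the flag could bound the search budget, assuming the search strategy config follows Olive's documented num_samples/seed shape; the exact wiring in this PR may differ.)

    # Illustrative sketch only: thread a --num-samples value into a search
    # strategy config so a smaller value stops the search sooner.
    def build_search_strategy(num_samples: int, seed: int = 0) -> dict:
        return {
            "execution_order": "joint",
            "search_algorithm": "tpe",
            "search_algorithm_config": {"num_samples": num_samples, "seed": seed},
        }

    # e.g. reduce the budget from the default of 5 to 3
    search_strategy = build_search_strategy(num_samples=3)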

@devang-ml (Contributor) left a comment

Let's add "auto-opt" needed packages list under extra dependencies so that user can do pip install olive-ai[auto-opt]

@trajepl (Contributor Author) commented Sep 4, 2024

Let's add "auto-opt" needed packages list under extra dependencies so that user can do pip install olive-ai[auto-opt]

I think it might be hard to define a unified extra-dependency list for auto-opt, since auto-opt may be used on different devices, which would cause conflicts between the different onnxruntime packages.

I think we have a feature to dynamically get_required_packages based on the given accelerators, which could be used in auto-opt.
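
(A hypothetical sketch of that idea, not Olive's actual get_required_packages helper: map the requested execution providers to the onnxruntime package that provides them, which is also why a single static extras list conflicts across devices.)

    # Hypothetical sketch, not Olive's actual helper: different execution
    # providers are served by different onnxruntime packages.
    EP_TO_ORT_PACKAGE = {
        "CPUExecutionProvider": "onnxruntime",
        "CUDAExecutionProvider": "onnxruntime-gpu",
        "TensorrtExecutionProvider": "onnxruntime-gpu",
        "DmlExecutionProvider": "onnxruntime-directml",
    }

    def required_ort_packages(providers):
        return {EP_TO_ORT_PACKAGE.get(p, "onnxruntime") for p in providers}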

@devang-ml (Contributor)

We can exclude OnnxRuntime and IHV toolkits. But we should be able to install other required packages using pip install olive-ai[auto-opt].

@trajepl (Contributor Author) commented Sep 5, 2024

We can exclude OnnxRuntime and IHV toolkits. But we should be able to install other required packages using pip install olive-ai[auto-opt].

Updated. Basically, pip install olive-ai is adequate to run the auto-opt CLI. Furthermore, to simplify the conversion, since Olive currently uses Optimum to get the ONNX model, I added optimum to olive-ai[auto-opt].

I also tested that, after installing olive-ai[auto-opt], the user only needs to install the corresponding onnxruntime or onnxruntime-genai package (the latter only if --use_model_builder is set) for their device (CPU/GPU, etc.), and the olive auto-opt CLI produces reasonable results.
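
(For reference, a hedged sketch of the kind of extras entry described above, in setup.py style; the actual package list in the PR may differ.)

    # Hedged sketch: an extras entry so `pip install olive-ai[auto-opt]`
    # also pulls in optimum; the PR's real list may contain more packages.
    extras_require = {
        "auto-opt": ["optimum"],
    }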

@samuel100 (Contributor)

What models will be supported by auto-opt? When I tried the auto optimizer with:

{
    "input_model":{
        "type": "HfModel",
        "model_path": "microsoft/phi-3.5-mini-instruct",
        "task": "text-generation"
    },
    "systems": {
        "local_system": {
            "type": "LocalSystem",
            "accelerators": [
                {
                    "device": "cpu",
                    "execution_providers": [
                        "CPUExecutionProvider"
                    ]
                }
            ]
        }
    },
    "auto_optimizer_config": {
        "opt_level": 0,
        "disable_auto_optimizer": false,
        "precision": "int4"
    },
    "host": "local_system",
    "target": "local_system",
    "cache_dir": "cache",
    "output_dir" : "models"
}

I get an error message from the OrtTransformersOptimization pass:

ValueError: Unsupported model type: phi3, please select one from [bart, bert, bert_tf, bert_keras, clip, gpt2, gpt2_tf, gpt_neox, swin, tnlr, t5, unet, vae, vit, conformer, phi] which need to be set under OrtTransformersOptimization.config

If these are the only models supported, then it is a little underwhelming because they are pretty outdated. It is also a bit odd because the optimizer works for Optimum models when I run olive finetune.

As a general rule, a user should be able to plug in:

  • A wide range of model architectures (for example, if Llama4 is released and it has the same architecture as Llama3, then I'd expect the tool to work).
  • A device target.
  • A precision.

The user should not need to worry about adding data.

@devang-ml (Contributor)

The list and the error message are from ONNX Runtime's transformer optimizer.
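
(For reference, the override the error message points to would look roughly like the sketch below in an Olive config. This is a hedged sketch following Olive's documented pass layout; setting model_type to "phi" may not cover phi3-specific fusions.)

    # Hedged sketch: explicitly set the transformer-optimization model type,
    # as the error message suggests; whether the "phi" fusions apply cleanly
    # to phi3 is a separate question.
    passes_override = {
        "passes": {
            "transformers_optimization": {
                "type": "OrtTransformersOptimization",
                "config": {"model_type": "phi"},
            }
        }
    }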


# output options
output_group = sub_parser.add_argument_group("output options")
output_group.add_argument(
Contributor

Not related to this PR, but we could gather this into base.py as well.

Contributor Author

Good call. But the arguments have different attributes: some are required, some are not, and they have different default values.

Can we update it in a follow-up PR?

device = (
    "gpu"
    if self.args.providers
    and any(p[: -(len("ExecutionProvider"))] in ["CUDA", "Tensorrt", "Dml"] for p in self.args.providers)
Contributor

        system_group.add_argument(
            "--providers",
            type=str,
            nargs="*",
            choices=["CPU", "CUDA", "Tensorrt", "Dml", "VitisAI", "Qnn"],
            help="List of execution providers to use for optimization",
        )

without ExecutionProvider?

Contributor

Should device be set to cpu if VitisAI or Qnn is provided here?

Contributor Author

The ExecutionProvider suffix is added automatically by this CLI.
[screenshot]
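
(Roughly what that normalization could look like; an illustrative sketch, not necessarily the PR's exact code.)

    # Illustrative sketch: append the ExecutionProvider suffix to the short
    # names accepted by --providers; the PR's actual implementation may differ.
    providers = [
        p if p.endswith("ExecutionProvider") else f"{p}ExecutionProvider"
        for p in (self.args.providers or [])
    ]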

@trajepl (Contributor Author) commented Sep 6, 2024

Should device be set to cpu if VitisAI or Qnn is provided here?

I think yes. For VitisAI/QNN, we can run quantization on CPU and then run inference with the corresponding EP.

    and any(p[: -(len("ExecutionProvider"))] in ["CUDA", "Tensorrt", "Dml"] for p in self.args.providers)
    else "cpu"
)
providers = self.args.providers or ["CPUExecutionProvider"] if device == "cpu" else ["CUDAExecutionProvider"]
Contributor

Same as above
