
Conversation

@mengniwang95 (Contributor) commented Oct 11, 2025

User description

Type of Change

update example

Description

Update the Llama4 AutoRound quantization example: add a main.py driver script, call it from run_quant.sh, and add the neural-compressor dependency.

Expected Behavior & Potential Risk

the expected behavior that triggered by this PR

How has this PR been tested?

how to reproduce the test (including hardware information)

Dependency Change?

neural-compressor is added to the example's requirements.txt.


PR Type

Enhancement


Description

  • Added new main.py script for Llama4 quantization

  • Updated run_quant.sh to use main.py

  • Added neural-compressor dependency


Diagram Walkthrough

flowchart LR
  A["Add main.py"] -- "Quantization script" --> B["Update run_quant.sh"]
  B -- "Use main.py" --> C["Add neural-compressor dependency"]

File Walkthrough

Relevant files
Enhancement
main.py: Add Llama4 Quantization Script

examples/pytorch/multimodal-modeling/quantization/auto_round/llama4/main.py

  • Added script for Llama4 quantization using AutoRoundConfig
  • Included argument parsing for model, scheme, device, etc.
  • Implemented model preparation and conversion (a minimal sketch follows after this entry)
+95/-0   
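
The bullets above describe the new script; the snippet below is a minimal sketch of the general AutoRound flow with neural-compressor's PyTorch API, not a copy of the PR's main.py. The model id, the scheme value, the model/tokenizer loading, and the prepare/convert calls are assumptions about typical INC 3.x usage; only the AutoRoundConfig arguments shown elsewhere in this PR (tokenizer, scheme, layer_config, export_format, is_mllm, output_dir) come from the change itself.

from transformers import AutoModelForCausalLM, AutoTokenizer
from neural_compressor.torch.quantization import AutoRoundConfig, convert, prepare

# Illustrative model id and scheme; the real script takes these from the CLI.
model_name = "meta-llama/Llama-4-Scout-17B-16E-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_name)
# A multimodal model class/processor may be needed for Llama4; AutoModelForCausalLM
# is used here only to keep the sketch short.
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype="auto")

layer_config = {}  # e.g. pin selected fp_layers to {"bits": 16, "act_bits": 16}

qconfig = AutoRoundConfig(
    tokenizer=tokenizer,
    scheme="MXFP4",                  # placeholder for args.scheme
    layer_config=layer_config,
    export_format="llm_compressor",  # see the export_format discussion below
    is_mllm=True,
    output_dir="./quantized_model",
)

# prepare wraps the model for AutoRound; convert runs the tuning and exports the
# quantized model to output_dir in the requested format. Depending on the
# configuration, a calibration run may be required between the two calls.
model = prepare(model, qconfig)
model = convert(model)

With a script along these lines, run_quant.sh only needs to forward its arguments to python main.py instead of invoking the auto_round CLI directly, which is the run_quant.sh change described next.
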
run_quant.sh: Update run_quant.sh to Use main.py

examples/pytorch/multimodal-modeling/quantization/auto_round/llama4/run_quant.sh

  • Modified to call main.py instead of auto_round
+5/-6     
Dependencies
requirements.txt: Add neural-compressor Dependency

examples/pytorch/multimodal-modeling/quantization/auto_round/llama4/requirements.txt

  • Added neural-compressor dependency
+1/-0     

Signed-off-by: Mengni Wang <[email protected]>
@PRAgent4INC (Collaborator)

PR Reviewer Guide 🔍

Here are some key observations to aid the review process:

⏱️ Estimated effort to review: 3 🔵🔵🔵⚪⚪
🧪 No relevant tests
🔒 No security concerns identified
⚡ Recommended focus areas for review

Possible Issue

The layer_config keys are module names taken from model.named_modules(), but entries are selected by substring matching (name in n), so a short fp_layers entry can match unrelated modules and silently pin them to 16 bits. Verify that the matching is as strict as intended before the qconfig is used (a possible tightening is sketched after the snippet).

for n, m in model.named_modules():
    if not isinstance(m, (torch.nn.Linear)):
        continue
    for name in fp_layers:
        if name in n:
            layer_config[n] = {"bits": 16, "act_bits": 16}
            break
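
One possible tightening, sketched here and not part of the PR, is to match fp_layers against exact module names or dotted-name suffixes instead of arbitrary substrings:

# Sketch only: pin fp_layers to 16 bits using exact names or dotted-name suffixes,
# so a short entry cannot accidentally match unrelated modules.
fp_layer_names = {name for name in fp_layers if name}
for n, m in model.named_modules():
    if not isinstance(m, torch.nn.Linear):
        continue
    if n in fp_layer_names or any(n.endswith("." + name) for name in fp_layer_names):
        layer_config[n] = {"bits": 16, "act_bits": 16}
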
Hardcoded Value

The export format is already exposed as a command-line flag (--export_format, default "llm_compressor") in setup_parser, but the AutoRoundConfig call hardcodes export_format="llm_compressor" instead of passing args.export_format, so the flag has no effect. See the first code suggestion below.

self.add_argument("--export_format", default="llm_compressor", type=str,
                  help="the format to save the model"
                  )

@PRAgent4INC (Collaborator)

PR Code Suggestions ✨

Explore these optional code suggestions:

Category: General
Align export format

Ensure export_format consistency between arguments.

examples/pytorch/multimodal-modeling/quantization/auto_round/llama4/main.py [82-89]

 qconfig = AutoRoundConfig(
     tokenizer=tokenizer,
     scheme=args.scheme,
     layer_config=layer_config,
-    export_format="llm_compressor",
+    export_format=args.export_format,
     is_mllm=True,
     output_dir=args.output_dir,
 )
Suggestion importance[1-10]: 8


Why: Ensuring export_format consistency between arguments prevents potential mismatches and enhances reliability.

Impact: Medium
Optimize layer configuration

Use a set for fp_layers to improve lookup efficiency.

examples/pytorch/multimodal-modeling/quantization/auto_round/llama4/main.py [72-81]

-if len(fp_layers) > 0:
-    for n, m in model.named_modules():
-        if not isinstance(m, (torch.nn.Linear)):
-            continue
-        for name in fp_layers:
-            if name in n:
-                layer_config[n] = {"bits": 16, "act_bits": 16}
-                break
+fp_layers_set = set(fp_layers)
+for n, m in model.named_modules():
+    if not isinstance(m, (torch.nn.Linear)):
+        continue
+    if any(name in n for name in fp_layers_set):
+        layer_config[n] = {"bits": 16, "act_bits": 16}
Suggestion importance[1-10]: 6


Why: Converting fp_layers to a set can improve lookup efficiency, which is beneficial for larger models.

Impact: Low
Simplify string trimming

Use rstrip to remove trailing slashes more efficiently.

examples/pytorch/multimodal-modeling/quantization/auto_round/llama4/main.py [65-66]

-if model_name[-1] == "/":
-    model_name = model_name[:-1]
+model_name = model_name.rstrip("/")
Suggestion importance[1-10]: 5


Why: Using rstrip improves readability and efficiency slightly, but the impact is minimal.

Impact: Low

Signed-off-by: Mengni Wang <[email protected]>
@mengniwang95 (Contributor, Author)

@chensuyue please check the updated example results

@chensuyue chensuyue added this to the 3.6 milestone Oct 13, 2025
@chensuyue chensuyue merged commit ebddfee into master Oct 16, 2025
11 checks passed
@chensuyue chensuyue deleted the mengni/scout_update branch October 16, 2025 01:05