
[quantization] Resolve bugs and warnings in example script#546

Merged
mhs4670go merged 1 commit into Samsung:main from mhs4670go:mo
Mar 10, 2026

Conversation

@mhs4670go (Contributor)

This commit resolves bugs and warnings in example scripts.

TICO-DCO-1.0-Signed-off-by: seongwoo <mhs4670go@naver.com>
@mhs4670go mhs4670go requested a review from a team March 10, 2026 08:02
 from tico.utils.utils import SuppressWarning

-name = "meta-llama/Llama-3.2-1B-Instruct"
+name = "Maykeye/TinyLLama-v0"
@mhs4670go (Contributor, Author) commented:

I accidentally changed this part of code before.

     args.model,
     trust_remote_code=args.trust_remote_code,
     token=args.hf_token,
+    legacy=False,
@mhs4670go (Contributor, Author) commented:

This resolves the following warning:

You are using the default legacy behaviour of the <class 'transformers.models.llama.tokenization_llama_fast.LlamaTokenizerFast'>. This is expected, and simply means that the `legacy` (previous) behavior will be used so nothing changes for you. If you want to use the new behaviour, set `legacy=False`. This should only be set if you understand what it means, and thoroughly read the reason why this was added as explained in https://github.com/huggingface/transformers/pull/24565 - if you loaded a llama tokenizer from a GGUF file you can ignore this message.

 AutoModelForCausalLM.from_pretrained(
     args.model,
-    torch_dtype=dtype,
+    dtype=dtype,
@mhs4670go (Contributor, Author) commented:

This resolves the following warning:

`torch_dtype` is deprecated! Use `dtype` instead!
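As a version-compatibility note, a tiny helper (hypothetical, not part of this PR) can forward the deprecated keyword to the new one, so callers keep working across transformers versions that are on either side of the rename:

```python
def normalize_dtype_kwarg(kwargs: dict) -> dict:
    """Map the deprecated `torch_dtype` kwarg to `dtype`.

    Hypothetical shim: recent transformers releases deprecate
    `torch_dtype` in `from_pretrained(...)` in favor of `dtype`.
    An explicit `dtype` always wins over a stale `torch_dtype`.
    """
    out = dict(kwargs)
    if "torch_dtype" in out:
        value = out.pop("torch_dtype")
        # Only fill in `dtype` if the caller did not already pass it.
        out.setdefault("dtype", value)
    return out
```

The normalized dict can then be splatted into `AutoModelForCausalLM.from_pretrained(args.model, **kwargs)` without triggering the deprecation warning.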

return {m: n for n, m in root.named_modules()}


def extract_tensor(output: Any) -> Optional[torch.Tensor]:
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This helper is added to extract tensors from transformers.modeling_outputs.BaseModelOutputWithPast outputs.
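For context, a minimal sketch of how such a helper might unwrap a model output (this is an assumption about the shape of the logic, not the PR's actual implementation; the attribute names `last_hidden_state` and `logits` are the usual fields on transformers ModelOutput containers):

```python
from typing import Any, Optional

import torch


def extract_tensor(output: Any) -> Optional[torch.Tensor]:
    """Best-effort extraction of a single tensor from a model output.

    Sketch only: handles plain tensors, tuples/lists, and objects that
    expose tensors as attributes, as ModelOutput subclasses such as
    BaseModelOutputWithPast do.
    """
    if isinstance(output, torch.Tensor):
        return output
    if isinstance(output, (tuple, list)):
        # Return the first tensor found among the elements.
        for item in output:
            t = extract_tensor(item)
            if t is not None:
                return t
        return None
    # ModelOutput containers expose their tensors as named attributes.
    for attr in ("last_hidden_state", "logits"):
        t = getattr(output, attr, None)
        if isinstance(t, torch.Tensor):
            return t
    return None
```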

@stamalakhov (Contributor) left a comment:

LGTM! Thank you!

@mhs4670go mhs4670go merged commit d4f0186 into Samsung:main Mar 10, 2026
7 checks passed
@mhs4670go mhs4670go deleted the mo branch March 10, 2026 09:16