
[quantization] Add QuantLlamaModel #525

Merged
mhs4670go merged 1 commit into Samsung:main from stamalakhov:llama_model_PR
Feb 26, 2026


@stamalakhov
Contributor

This PR adds a wrapper for LlamaModel, along with unit tests for it.

`./ccex test -k quantization.wrapq.wrappers.llama.test_quant_model`

```
RUN unit tests with -k quantization.wrapq.wrappers.llama.test_quant_model ...

Required transformers not installed — skipping GELUTanh tests
test_forward_diff (quantization.wrapq.wrappers.llama.test_quant_model.TestQuantLlamaModel) ... ok
test_mode_transitions (quantization.wrapq.wrappers.llama.test_quant_model.TestQuantLlamaModel) ... ok

----------------------------------------------------------------------
Ran 2 tests in 0.156s

OK
```
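For readers unfamiliar with this kind of test, here is a minimal, self-contained sketch of what tests named `test_mode_transitions` and `test_forward_diff` might check. It is an illustration only and does not use the TICO API. `ToyQuantWrapper`, `Mode`, `enable_calibration`, and `freeze_qparams` are hypothetical names invented for this sketch, and the wrapped "model" is a plain Python function rather than a LlamaModel.

```python
import unittest
from enum import Enum, auto


class Mode(Enum):
    """Hypothetical wrapper states (assumed for this sketch)."""
    NO_QUANT = auto()
    CALIB = auto()
    QUANT = auto()


class ToyQuantWrapper:
    """Hypothetical stand-in for a quantization wrapper; not the TICO API."""

    def __init__(self, fn, bits=8):
        self.fn = fn
        self.levels = 2 ** bits - 1
        self.mode = Mode.NO_QUANT
        self.lo = float("inf")
        self.hi = float("-inf")

    def enable_calibration(self):
        self.mode = Mode.CALIB

    def freeze_qparams(self):
        if self.mode is not Mode.CALIB:
            raise RuntimeError("calibrate before freezing")
        self.mode = Mode.QUANT

    def _fake_quant(self, y):
        # Snap y onto a uniform grid spanning the observed [lo, hi] range.
        span = self.hi - self.lo
        if span == 0:
            return self.lo
        scale = span / self.levels
        clamped = min(max(y, self.lo), self.hi)
        return self.lo + round((clamped - self.lo) / scale) * scale

    def __call__(self, xs):
        ys = [self.fn(x) for x in xs]
        if self.mode is Mode.CALIB:
            # Record the output range seen during calibration.
            self.lo = min([self.lo] + ys)
            self.hi = max([self.hi] + ys)
        elif self.mode is Mode.QUANT:
            ys = [self._fake_quant(y) for y in ys]
        return ys


class TestToyQuantWrapper(unittest.TestCase):
    def setUp(self):
        self.wrapper = ToyQuantWrapper(lambda x: 0.5 * x + 1.0)
        self.inputs = [i / 10 for i in range(-20, 21)]

    def test_mode_transitions(self):
        self.assertIs(self.wrapper.mode, Mode.NO_QUANT)
        self.wrapper.enable_calibration()
        self.assertIs(self.wrapper.mode, Mode.CALIB)
        self.wrapper(self.inputs)
        self.wrapper.freeze_qparams()
        self.assertIs(self.wrapper.mode, Mode.QUANT)

    def test_forward_diff(self):
        reference = self.wrapper(self.inputs)
        self.wrapper.enable_calibration()
        self.wrapper(self.inputs)
        self.wrapper.freeze_qparams()
        quantized = self.wrapper(self.inputs)
        max_diff = max(abs(a - b) for a, b in zip(reference, quantized))
        # Quantization should change the output, but only slightly.
        self.assertGreater(max_diff, 0.0)
        self.assertLess(max_diff, 0.01)
```

Saved as a module, the sketch can be run with `python -m unittest`. The thresholds in `test_forward_diff` are specific to this toy function and say nothing about the tolerances used in the actual test suite.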

Draft: #495

TICO-DCO-1.0-Signed-off-by: s.malakhov <s.malakhov@partner.samsung.com>

@stamalakhov stamalakhov self-assigned this Feb 26, 2026
@stamalakhov stamalakhov requested a review from a team February 26, 2026 11:00
Contributor

@mhs4670go mhs4670go left a comment

LGTM

@mhs4670go mhs4670go merged commit 822b6c5 into Samsung:main Feb 26, 2026
7 checks passed
@stamalakhov stamalakhov deleted the llama_model_PR branch February 26, 2026 12:53