Refactor Mamba2 to use standardized output tracing by huyxdang · Pull Request #44087 · huggingface/transformers

huyxdang · 2026-02-17T11:30:25Z

Summary

Refactors the Mamba2 model to use the standardized output collection interface as part of #43979.

Changes

Standardized Output Mapping: Added _can_record_outputs to Mamba2PreTrainedModel mapping hidden_states → Mamba2Block.
Base Model Refactor: Added @capture_outputs and @merge_with_config_defaults decorators to Mamba2Model.forward.
Head Model Refactor: Added @can_return_tuple decorator to Mamba2ForCausalLM.forward to handle automated tuple/dict packaging.
Boilerplate Removal: Removed manual output_hidden_states and return_dict parameter resolution and manual collection loops in both Mamba2Model and Mamba2ForCausalLM.
Architecture Simplification: Simplified Mamba2Block.forward to return hidden_states directly as a single tensor.
Bug Fix: Fixed a TypeError in src/transformers/integrations/hub_kernels.py where integer version numbers in the kernel mapping caused a crash during loading.

Technical Context

Unlike traditional Transformer models which utilize attention mechanisms, Mamba2 is a State Space Model (SSM). It doesn't generate attention weights and thus the refractor focuses only on capturing hidden_states.

Migrate Mamba2Model and Mamba2ForCausalLM to use the PreTrainedModel output tracing decorators (@capture_outputs and @can_return_tuple). This removes manual boilerplate for collecting hidden states and packing return tuples, aligning the implementation with the library standard. Also fix a crash in hub_kernels.py where integer version numbers in the kernel mapping caused a TypeError during loading. Fixes huggingface#43979

github-actions · 2026-02-17T13:37:57Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: mamba2

huyxdang mentioned this pull request Feb 17, 2026

Call to contributions: refactor output tracing in transformers #43979

Open

Merge branch 'main' into mamb2-refractor-output-tracing

d689671

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor Mamba2 to use standardized output tracing#44087

Refactor Mamba2 to use standardized output tracing#44087
huyxdang wants to merge 2 commits intohuggingface:mainfrom
huyxdang:mamb2-refractor-output-tracing

huyxdang commented Feb 17, 2026

Uh oh!

github-actions bot commented Feb 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

huyxdang commented Feb 17, 2026

Summary

Changes

Technical Context

Uh oh!

github-actions bot commented Feb 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant