HF backend support for Qwen3.5 model family

Using the benchmarking [code](https://github.com/z-lab/dflash/blob/main/benchmark.py) and [requirements](https://github.com/z-lab/dflash/blob/main/requirements.txt) specification from the official [Github repo](https://github.com/z-lab/dflash) will cause a failure claiming Qwen3.5 is unsupported on the transformers v4.57.3. And by forcing the mainline transformers to be installed as stated by the Qwen 3.5 [codebase](https://huggingface.co/Qwen/Qwen3.5-35B-A3B#hugging-face-transformers), some random code error complaining about DynamicCache not supporting a specific method will be raised:
```
File ".venv/lib/python3.12/site-packages/transformers/models/qwen3_5/modeling_qwen3_5.py", line 1387, in _update_linear_attn_mask
 if (past_key_values is not None and past_key_values.has_previous_state) or (
                                     ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^              
 AttributeError: 'DynamicCache' object has no attribute 'has_previous_state'
```
Is it planed to add transformers support to this model, at least when transformers release a stable version with Qwen3.5 MoE model family support?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HF backend support for Qwen3.5 model family #44

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

HF backend support for Qwen3.5 model family #44

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions