models/tiiuae/Falcon3-7B-Instruct/t3000/functional (1x8 mesh port)

Port this t3000 model from a 2x4 mesh to a 1x8 mesh (TP=8).

Target: models/tiiuae/Falcon3-7B-Instruct/t3000/functional

Tasks:
- Update models/tiiuae/Falcon3-7B-Instruct/t3000/functional/model.py to set MESH_SHAPE = (1, 8) and adjust any mesh-axis/sharding assumptions for a 1x8 mesh.
- Keep architecture, dtypes, and cache behavior unchanged.
- Run demo + long eval:
  python demo.py models/tiiuae/Falcon3-7B-Instruct/t3000/functional/model.py
  python eval.py models/tiiuae/Falcon3-7B-Instruct/t3000/functional/model.py --model tiiuae/Falcon3-7B-Instruct --prompt_file prompts/bringup_eval_long.txt --max_new_tokens 100 --max_seq_len <current seq len>
- Update MODELS.md for the t3000 row and save demo.log/eval.log under models/tiiuae/Falcon3-7B-Instruct/t3000/functional/.
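
One way to surface the mesh-axis assumptions mentioned in the first task is a small sanity check, sketched below. It is not taken from model.py: the helper names `tp_axis_and_degree` and `shard_size` are hypothetical, choosing the largest mesh axis as the tensor-parallel axis is an assumption, and the dimension passed to `shard_size` is an illustrative value rather than a Falcon3 config value.

```python
from math import prod

OLD_MESH_SHAPE = (2, 4)  # layout before the port
MESH_SHAPE = (1, 8)      # target layout from this issue (TP=8)


def tp_axis_and_degree(mesh_shape):
    """Return (axis index, axis size) of the largest mesh axis.

    Assumption: the largest axis is the one used for tensor parallelism.
    """
    axis = max(range(len(mesh_shape)), key=lambda i: mesh_shape[i])
    return axis, mesh_shape[axis]


def shard_size(dim, degree):
    """Return the per-device slice of `dim`, failing loudly if it does not divide evenly."""
    if dim % degree != 0:
        raise ValueError(f"dimension {dim} is not divisible by TP degree {degree}")
    return dim // degree


# Both layouts must cover the same number of devices.
assert prod(OLD_MESH_SHAPE) == prod(MESH_SHAPE) == 8

axis, degree = tp_axis_and_degree(MESH_SHAPE)
# Illustrative dimension only; substitute the model's real sharded dimensions.
per_device = shard_size(64, degree)
```

Running a check like this against each sharded dimension (heads, KV heads, MLP width) should show which of them divided evenly under the old TP degree but need attention at TP=8.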

Notes:
- Keep paged attention / paged KV cache behavior unchanged.
- If you hit a TT-Metal cache error, set TT_METAL_CACHE=/tmp/tt-metal-cache and TT_METAL_RUNTIME_ROOT=/proj_sw/user_dev/moconnor/tt-runtime-root before rerunning.
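
The demo/eval runs and the cache workaround could be scripted roughly as follows. This is a sketch, not an existing script in the repo: the helper names (`cache_workaround_env`, `demo_cmd`, `eval_cmd`, `run_logged`) are hypothetical, and `max_seq_len` is left as a parameter because the issue does not state the current sequence length.

```python
import os
import subprocess
from pathlib import Path

MODEL_DIR = Path("models/tiiuae/Falcon3-7B-Instruct/t3000/functional")


def cache_workaround_env(base=None):
    """Copy the environment and add the two variables named in the Notes."""
    env = dict(os.environ if base is None else base)
    env["TT_METAL_CACHE"] = "/tmp/tt-metal-cache"
    env["TT_METAL_RUNTIME_ROOT"] = "/proj_sw/user_dev/moconnor/tt-runtime-root"
    return env


def demo_cmd():
    """Command line for the demo run, as given in the Tasks list."""
    return ["python", "demo.py", str(MODEL_DIR / "model.py")]


def eval_cmd(max_seq_len):
    """Command line for the long eval; max_seq_len must be supplied by the caller."""
    return [
        "python", "eval.py", str(MODEL_DIR / "model.py"),
        "--model", "tiiuae/Falcon3-7B-Instruct",
        "--prompt_file", "prompts/bringup_eval_long.txt",
        "--max_new_tokens", "100",
        "--max_seq_len", str(max_seq_len),
    ]


def run_logged(cmd, log_name, env=None):
    """Run cmd, writing stdout and stderr to MODEL_DIR/log_name; return the exit code."""
    with open(MODEL_DIR / log_name, "w") as log:
        return subprocess.run(cmd, stdout=log, stderr=subprocess.STDOUT, env=env).returncode
```

For example, `run_logged(demo_cmd(), "demo.log")` would cover the first run, with `env=cache_workaround_env()` passed only if the cache error appears.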

models/tiiuae/Falcon3-7B-Instruct/t3000/functional (1x8 mesh port) #209
