Skip to content

nested matmul implementation #53

@ZhennanQin

Description

@ZhennanQin
  • Support FP32 and BF16.
  • Block layout first, then extend to plain layout.
  • Default config to cover shapes used in LLAMA2.

Metadata

Metadata

Assignees

Labels

enhancementNew feature or request

Type

No type
No fields configured for issues without a type.

Projects

No projects

Relationships

None yet

Development

No branches or pull requests

Issue actions