Commit 3d57d0c

Author: plotnikov.v10
docs zenDNN update: one more explanation clarified
1 parent 738a887 commit 3d57d0c

1 file changed

Lines changed: 1 addition & 1 deletion

docs/backend/ZenDNN.md

@@ -200,7 +200,7 @@ For detailed profiling and logging options, refer to the [ZenDNN Logging Documen

 - **Limited operation support**: Currently matrix multiplication (MUL_MAT) and expert-based matrix multiplication (MUL_MAT_ID) are accelerated via ZenDNN. Other operations fall back to the standard CPU backend. Future updates may expand supported operations.
 - **BF16 support**: BF16 operations require AMD Zen 4 or Zen 5 architecture (EPYC 9004/9005 series). On older CPUs, operations will use FP32.
-- **Q8_0 support scope**: Q8_0 acceleration is available for supported matrix multiplication paths. Other quantization formats may still fall back to the standard CPU backend.
+- **Q8_0 support scope**: Q8_0 acceleration is available for supported matrix multiplication paths. Other quantization formats still fall back to the standard CPU backend.
 - **NUMA awareness**: For multi-socket systems, manual NUMA binding may be required for optimal performance.

 ## Q&A