arozanov

🎯

Focusing

Anton Rozanov arozanov

Software Developer. Node.js, React.

5 followers · 7 following

Popular repositories

turboquant-mlx Public

TurboQuant KV cache compression for MLX with fused Metal kernels. 4.6x compression at 98% FP16 speed.

Python · 28 stars · 5 forks
mlx-lm Public

Forked from ml-explore/mlx-lm

Run LLMs with MLX

Python 1
mlx Public

Forked from ml-explore/mlx

MLX: An array framework for Apple silicon

C++
ggml-ane Public

Objective-C++