qixiang-99

qixiang-99

Achievements

TensorRT-LLM TensorRT-LLM Public

Forked from NVIDIA/TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++
LazyVim LazyVim Public

Forked from LazyVim/LazyVim

Neovim config for the lazy

Lua
nvim_setup nvim_setup Public

Neovim automatically setup with bash script and etc

Shell
nvim_config nvim_config Public template

Forked from LazyVim/starter

Starter template for LazyVim

Lua
flash-attention flash-attention Public

Forked from vllm-project/flash-attention

Fast and memory-efficient exact attention

Python
vllm vllm Public

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python