zenjieli/Panda-LLM

Panda WebUI

Overview

Panda WebUI is a lightweight, user-friendly web interface for running Large Language Models (LLMs) with the Hugging Face transformers library. In addition to standard transformers models, it supports GGUF models and several popular Vision-Language Models (VLMs).

Installation

  • Create a new environment:
conda create -n panda python=3.11
conda activate panda
  • Install PyTorch:
pip install torch==2.4.1 torchvision==0.19.1 --index-url https://download.pytorch.org/whl/cu121
  • For llama-cpp (GGUF inference) and deepspeed (training), install the CUDA toolkit:
conda install -y -c "nvidia/label/cuda-12.1.1" cuda

If training is not needed, the CUDA runtime alone is sufficient:

conda install -y -c "nvidia/label/cuda-12.1.1" cuda-runtime
  • Install the packages in requirements.txt:
pip install -r requirements.txt
    • If auto-gptq fails to install, try installing it separately:
    pip install auto-gptq --no-build-isolation
    • If flash-attention fails to install, try installing it from the system terminal instead
  • Create a symbolic link to the model weights directory in the repository root:
ln -s /path_to_weights .

Models

To use models from Hugging Face, point the Hugging Face cache to a custom location by setting the HF_HOME environment variable in .bashrc:

export HF_HOME=/path/to/cache

Then reload .bashrc to apply the change:

source ~/.bashrc
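To check that the cache location resolves as expected, the lookup can be sketched in Python. This is a simplified re-implementation for illustration, not the library's actual code: it assumes the hub's documented behavior of honoring HF_HOME when set and falling back to ~/.cache/huggingface otherwise.

```python
import os
from pathlib import Path

def hf_cache_dir(env=None):
    """Resolve the Hugging Face cache directory (simplified sketch):
    use HF_HOME if set, otherwise fall back to ~/.cache/huggingface."""
    env = os.environ if env is None else env
    custom = env.get("HF_HOME")
    if custom:
        return Path(custom)
    return Path.home() / ".cache" / "huggingface"

# With HF_HOME set, downloads land under the custom path
print(hf_cache_dir({"HF_HOME": "/data/hf-cache"}))  # /data/hf-cache
# Without it, the default user cache is used
print(hf_cache_dir({}))
```

Because webui.py reads the environment at startup, HF_HOME must be exported before launching the app (hence setting it in .bashrc rather than inside a Python session).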

Shell script

The following shell script launches Panda WebUI from the terminal.

#!/bin/bash

# Change to the repository directory
cd ~/path/to/Panda-LLM/ || { echo "Failed to change directory"; exit 1; }

# Activate the conda environment named 'panda'
source ~/miniforge3/etc/profile.d/conda.sh
conda activate panda

# Check if the conda environment 'panda' is active
if [ "$CONDA_DEFAULT_ENV" = "panda" ]; then
    echo "Successfully activated conda environment 'panda'."
else
    echo "Failed to activate conda environment 'panda'."
    exit 1
fi

# Run the Python script
python webui.py

Getting started with Panda Web

In Models, use the Download feature to download models from Hugging Face. To use a downloaded model, refresh the model list on the left side of the Models page and load the model. Once the model is loaded, you can start chatting with it in Main.

FLOP estimation

Inference-time FLOP estimation assumes a single high-definition image (1920×1080 resolution), 128 padded input tokens, and 1 output token. The list of models supported by this feature can be found here.
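The repository does not spell out its estimation method, but a common back-of-the-envelope rule for decoder-only transformers is roughly 2 FLOPs per parameter per processed token. The sketch below uses that approximation with the section's token budget (128 input tokens, 1 output token); the function name and the rule itself are assumptions for illustration, and the actual tool may account for vision-encoder FLOPs and architecture details differently.

```python
def inference_flops(n_params, n_input_tokens=128, n_output_tokens=1):
    """Rough forward-pass FLOP estimate for a decoder-only transformer.

    Uses the common ~2 FLOPs per parameter per token approximation
    (an assumption; Panda WebUI's exact method may differ).
    """
    return 2 * n_params * (n_input_tokens + n_output_tokens)

# Example: a 7B-parameter model with 128 input tokens and 1 output token
print(inference_flops(7_000_000_000))  # ~1.8 TFLOPs
```

This ignores the image-encoding cost: for a VLM, the 1920×1080 image is first turned into additional vision tokens by the image encoder, which adds FLOPs on top of this estimate.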
