miniGPT

A framework for training GPT models with mixed precision, gradient accumulation, and other features.

Features

Mixed Precision Training: Leverage FP16 and BFloat16 for faster training and reduced memory usage.
Gradient Accumulation: Simulate larger batch sizes without running out of GPU memory.
Optimized Data Loading: Efficient data loading using multiple workers to fully utilize CPU resources.
Checkpoint Management: Save and load model and optimizer states for easy training resumption.
Flexible Configuration: Easily adjust batch size, accumulation steps, and other parameters to suit your training setup.
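
To illustrate why gradient accumulation can stand in for a larger batch, here is a minimal framework-free sketch. It is not the code in main.py: the one-parameter model, the helper names `mean_grad` and `accumulated_grad`, and the sample data are all assumptions made for illustration. It shows that summing micro-batch gradients, each scaled by the number of accumulation steps, reproduces the gradient of the full batch.

```python
# Pure-Python illustration of gradient accumulation on a one-parameter
# least-squares model y ~ w * x. All names and data here are hypothetical.

def mean_grad(w, batch):
    """Gradient of the mean squared error over `batch` with respect to w."""
    return sum(2 * (w * x - y) * x for x, y in batch) / len(batch)


def accumulated_grad(w, batch, micro_batch_size):
    """Accumulate scaled micro-batch gradients into one full-batch gradient."""
    # This sketch assumes the batch splits into equal-sized micro-batches.
    assert len(batch) % micro_batch_size == 0
    steps = len(batch) // micro_batch_size
    total = 0.0
    for i in range(steps):
        micro = batch[i * micro_batch_size:(i + 1) * micro_batch_size]
        # Dividing by the number of steps keeps the result a mean,
        # not a sum, over the whole effective batch.
        total += mean_grad(w, micro) / steps
    return total


data = [(1.0, 2.0), (2.0, 4.5), (3.0, 5.5), (4.0, 8.5)]
full = mean_grad(0.5, data)                               # one batch of 4
accum = accumulated_grad(0.5, data, micro_batch_size=2)   # two micro-batches of 2
print(full, accum)
```

In a real training loop the same scaling is usually applied to the loss before the backward pass, and the optimizer steps only once per group of micro-batches, so only one micro-batch needs to fit in GPU memory at a time.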

Getting Started

Prerequisites

Python 3.6 or higher
PyTorch
Transformers
tqdm

Installation

Clone the repository:

git clone https://github.com/SimonVutov/miniGPT.git

Download and install the following:

CUDA Toolkit 12.5.0 (May 2024)
cuDNN v8.9.7 (December 5th, 2023), for CUDA 12.x

Install the required packages:

pip install torch torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/cu121
pip install datasets transformers torch tqdm

Usage

Training: Run the training script to start training the GPT model.
```
python main.py
```
Generate Text: Use the generate_text function to generate text using the trained model.
```
from main import generate_text
generate_text("Your input text here")
```

Example

An example of training output and generated text:

Device: cuda

Model and optimizer loaded from checkpoint 'gpt2_epoch_1.pt'

Epoch 1, Batch 200, Loss: 4.9928, Tokens/sec: 18374.01, Time Elapsed: 55.16 sec

Epoch 1, Batch 400, Loss: 4.4935, Tokens/sec: 18840.11, Time Elapsed: 74.05 sec
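
If you want to track loss or throughput over time, progress lines in the format shown above can be parsed with a short script. This is a sketch that assumes the log format matches the sample lines exactly; the pattern and the `parse_log_line` helper are hypothetical and not part of the repository.

```python
import re

# Pattern written against the sample output lines; adjust it if the
# actual log format differs.
LOG_PATTERN = re.compile(
    r"Epoch (?P<epoch>\d+), Batch (?P<batch>\d+), Loss: (?P<loss>[\d.]+), "
    r"Tokens/sec: (?P<tps>[\d.]+), Time Elapsed: (?P<elapsed>[\d.]+) sec"
)


def parse_log_line(line):
    """Return a dict of metrics for a progress line, or None for other lines."""
    match = LOG_PATTERN.search(line)
    if match is None:
        return None
    return {
        "epoch": int(match["epoch"]),
        "batch": int(match["batch"]),
        "loss": float(match["loss"]),
        "tokens_per_sec": float(match["tps"]),
        "elapsed_sec": float(match["elapsed"]),
    }


sample = [
    "Device: cuda",
    "Epoch 1, Batch 200, Loss: 4.9928, Tokens/sec: 18374.01, Time Elapsed: 55.16 sec",
    "Epoch 1, Batch 400, Loss: 4.4935, Tokens/sec: 18840.11, Time Elapsed: 74.05 sec",
]
# Keep only the lines that matched the progress format.
records = [r for r in map(parse_log_line, sample) if r is not None]
for record in records:
    print(record["batch"], record["loss"])
```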

Other

Create a torch virtual environment:

python -m venv torch-env

Activate it (Windows):

torch-env\Scripts\activate
