Collatz Conjecture AI Research System

Advanced AI system for analyzing the Collatz Conjecture using Deep Learning and parallel brute-force search

🎯 Overview

This project combines AI-guided pattern recognition with parallel brute-force search to investigate the Collatz Conjecture, one of mathematics' most famous unsolved problems.

Key Features

🧠 Transformer-based Neural Network for sequence prediction
🔍 Multi-threaded C++ Loop Searcher (Floyd's Cycle Detection)
⚡ Native C++ Data Engine for maximum performance
📊 Real-time Discord Integration for monitoring
🎓 Curriculum Learning with "Hard Mode" candidates (n > 2^68)
🔬 Advanced Optimizations: Mixed Precision (AMP), Cosine Annealing LR, Gradient Clipping

🚀 Quick Start

Prerequisites

GPU: NVIDIA GPU with 6GB+ VRAM (tested on RTX 3070 Ti)
CPU: Multi-core processor (tested on Ryzen 5900X)
RAM: 16GB+ recommended
OS: Linux (tested on Arch Linux)
Python: 3.13+
CUDA: 12.8+

Installation

# Clone the repository
git clone https://github.com/yourusername/collatz-ai.git
cd collatz-ai

# Run setup (creates venv, installs dependencies, compiles C++ modules)
chmod +x run.sh
./run.sh

Usage

# Start training
./run.sh

# Interactive commands during training:
# - Type 'stop' to save and exit
# - Type 'status' for current progress
# - Ctrl+C for graceful shutdown

🏗️ Architecture

AI Model (GPU)

Input: Parity Vector [0, 1, 0, 1, 1, ...]
  ↓
Embedding Layer (3 → 128d)
  ↓
Positional Encoding
  ↓
Transformer Encoder (4 layers, 4 heads)
  ↓
Dual Heads:
  ├─ Stopping Time Prediction (Regression, Log-Space)
  └─ Next Step Prediction (Classification)

Specifications:

Model Size: 128d, 4 layers, 4 attention heads
Batch Size: 512
Optimizer: AdamW with Cosine Annealing
Loss: Huber Loss (stopping time) + CrossEntropy (sequence)

Loop Searcher (CPU)

// Parallel brute-force search using Floyd's algorithm
// 22 threads × 1M numbers = 22M candidates per run
// Target: n > 2^68, n ≡ 3 (mod 4)

Features:

Multi-threaded C++ implementation
128-bit integer support (__int128)
Detects non-trivial cycles
Runs in background during training

📊 Performance

Training Metrics (100k steps)

Metric	Value
Final Loss	0.3698
Stopping Time Error	0.0003 (log-space)
Sequence Accuracy	~70%
Training Speed	~27s / 100 steps
GPU Utilization	~90% (7.2GB / 8GB)
CPU Utilization	~85% (20 workers)

Loop Search Results

Numbers Checked: 22,000,000 per run
Range: [2^68, 2^68 + 22M]
Non-trivial Cycles Found: 0 (as expected)

🛠️ Technical Details

Optimizations

Mixed Precision Training (AMP)
- Reduces VRAM usage by ~40%
- Increases training speed by ~30%
Native C++ Engine
- 20-30% faster data generation
- Supports numbers > 2^64 (128-bit)
Curriculum Learning
- 50% "Normal" data (sequential numbers)
- 50% "Hard" data (n > 2^68, special patterns)
Learning Rate Scheduling
- Cosine Annealing: 1e-4 → 1e-6
- Smooth convergence, prevents oscillation

File Structure

collatz_ai/
├── src/
│   ├── train.py              # Main training script
│   ├── model.py              # Transformer architecture
│   ├── engine.py             # Numba-optimized data generation
│   ├── dataset.py            # PyTorch Dataset/DataLoader
│   ├── analyze.py            # Model analysis & visualization
│   ├── discord_bot.py        # Discord webhook integration
│   ├── collatz_core.cpp      # C++ data engine
│   ├── loop_searcher.cpp     # C++ parallel loop searcher
│   ├── native_engine.py      # Python bindings (ctypes)
│   └── loop_search.py        # Loop searcher wrapper
├── checkpoints/              # Model checkpoints (auto-saved)
├── requirements.txt          # Python dependencies
├── run.sh                    # Setup & run script
└── README.md                 # This file

📈 Results & Insights

What the Model Learned

Stopping Time Prediction: Near-perfect accuracy (99.97%)
Parity Patterns: Strong recognition of even/odd sequences
Anomaly Detection: Identifies numbers with unusual stopping times

Known Anomalies

The model struggles with these numbers (unusually short stopping times):

Number: 1249, Actual: 176, Predicted: 233, Error: 57
Number: 1695, Actual: 179, Predicted: 236, Error: 57
Number: 1742, Actual: 179, Predicted: 235, Error: 56

🔬 Research Applications

For Mathematicians

Analyze stopping time distributions
Identify exceptional numbers
Visualize sequence embeddings (PCA)

For ML Researchers

Benchmark for sequence prediction
Study curriculum learning effects
Explore transformer behavior on mathematical sequences

For Collatz Enthusiasts

Automated large-scale verification
Pattern discovery in high ranges (> 2^68)
Real-time progress monitoring via Discord

🚧 Future Work

Distributed training across multiple GPUs
Larger model (256d, 6 layers) from scratch
GPU-accelerated loop detection
Extended search range (2^100 - 2^120)
Hybrid LSTM+Transformer architecture

📝 Configuration

Edit src/train.py to customize:

BATCH_SIZE = 512          # Adjust for your VRAM
NUM_WORKERS = 20          # CPU threads for data loading
STEPS = 1000000           # Training duration
D_MODEL = 128             # Model dimension
NUM_LAYERS = 4            # Transformer layers
NHEAD = 4                 # Attention heads

🤝 Contributing

Contributions welcome! Areas of interest:

Model architecture improvements
Faster loop detection algorithms
Better anomaly detection
Visualization enhancements

📄 License

GNU General Public License v3.0

This project is licensed under GPL v3, which means:

✅ You CAN:

Use for any purpose (personal, commercial, research)
Modify and improve the code
Distribute original or modified versions
Sell modified versions

⚠️ You MUST:

Share source code of any modifications
Use the same GPL v3 license
State significant changes
Include copyright and license notices

🎯 Mission: Help humanity solve the Collatz Conjecture through open collaboration!

See LICENSE file for full details.

🙏 Acknowledgments

Collatz Conjecture: Lothar Collatz (1937)
PyTorch Team: For the amazing framework
Numba Team: For JIT compilation magic
Community: For mathematical insights

📧 Contact

Issues: GitHub Issues
Discussions: GitHub Discussions

⚠️ Disclaimer: This is a research project. Finding a non-trivial cycle would disprove the Collatz Conjecture, which is highly unlikely but mathematically possible.

🎯 Goal: Advance our understanding of the Collatz Conjecture through AI-guided analysis and exhaustive verification.

Made with ❤️ for mathematics and machine learning

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
.github		.github
docs		docs
src		src
tests		tests
.flake8		.flake8
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
README_HF.md		README_HF.md
blockchain_config.example.json		blockchain_config.example.json
config.example.py		config.example.py
config.yaml		config.yaml
docker-compose.yml		docker-compose.yml
environment.yml		environment.yml
generate_docs_plots.py		generate_docs_plots.py
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
requirements.txt		requirements.txt
run.sh		run.sh
upload_to_hf.py		upload_to_hf.py
verify_search.py		verify_search.py

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Collatz Conjecture AI Research System

🎯 Overview

Key Features

🚀 Quick Start

Prerequisites

Installation

Usage

🏗️ Architecture

AI Model (GPU)

Loop Searcher (CPU)

📊 Performance

Training Metrics (100k steps)

Loop Search Results

🛠️ Technical Details

Optimizations

File Structure

📈 Results & Insights

What the Model Learned

Known Anomalies

🔬 Research Applications

For Mathematicians

For ML Researchers

For Collatz Enthusiasts

🚧 Future Work

📝 Configuration

🤝 Contributing

📄 License

🙏 Acknowledgments

📧 Contact

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 1

Sponsor this project

Uh oh!

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages