Flower Classification with Transfer Learning

An image classification model that identifies 5 types of flowers using transfer learning with MobileNet V2. Built as a capstone project for ML Zoomcamp 2025.

Problem Description

Flower identification is a common challenge for gardeners, botanists, and nature enthusiasts. This project builds an AI-powered classifier that can identify flowers from photographs, making it easier to:

Identify unknown flowers while hiking or gardening
Assist in botanical research and cataloging
Power mobile apps for plant identification

The model classifies images into 5 flower categories:

🌼 Daisy
🌻 Sunflower
🌷 Tulip
🌹 Rose
🌾 Dandelion

Dataset

Source: TensorFlow Flowers Dataset

Total Images: 3,670 labeled photos
Classes: 5 flower types
Image Format: JPEG, various sizes
Train/Val Split: 80/20

Class	Total	Training	Validation
Daisy	633	526	107
Dandelion	898	707	191
Roses	641	522	119
Sunflowers	699	564	135
Tulips	799	617	182
Total	3,670	2,936	734

Class Distribution

The dataset is relatively balanced, with a class imbalance ratio of ~1.42 between the largest class (dandelion, 898 images) and the smallest (daisy, 633 images).
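
As a quick check, the imbalance ratio and the split sizes can be recomputed from the table above. The sketch below simply hardcodes the per-class counts from that table; the variable names are illustrative.

```python
# Per-class (training, validation) image counts, copied from the table above.
COUNTS = {
    "daisy": (526, 107),
    "dandelion": (707, 191),
    "roses": (522, 119),
    "sunflowers": (564, 135),
    "tulips": (617, 182),
}

totals = {name: train + val for name, (train, val) in COUNTS.items()}

# Imbalance ratio: largest class divided by smallest class.
imbalance_ratio = max(totals.values()) / min(totals.values())

# Share of each class that ended up in the validation set.
val_share = {name: val / totals[name] for name, (_, val) in COUNTS.items()}
overall_val_share = sum(val for _, val in COUNTS.values()) / sum(totals.values())

print(f"imbalance ratio: {imbalance_ratio:.2f}")
print(f"overall validation share: {overall_val_share:.1%}")
for name, share in val_share.items():
    print(f"{name:<11} validation share: {share:.1%}")
```

The per-class validation shares range from about 17% (daisy) to about 23% (tulips), which suggests the 80/20 split was drawn at random over all images rather than stratified by class.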

Project Structure

flower-classification-capstone/
├── README.md                 # Project documentation
├── notebooks/
│   └── eda_and_training.ipynb  # EDA + model experiments
├── src/
│   ├── download_data.py      # Dataset download script
│   ├── train.py              # Model training script
│   └── predict.py            # Flask prediction service
├── models/                   # Saved model artifacts
│   ├── flower_classifier.keras
│   └── class_names.txt
├── data/                     # Dataset (downloaded separately)
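├── docker/
│   └── Dockerfile            # Container definition
├── kubernetes/               # Kubernetes manifests and deploy script
│   ├── deployment.yaml
│   ├── service.yaml
│   ├── hpa.yaml
│   └── deploy-k8s.bat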
├── tests/
│   └── test_service.py       # API test script
├── requirements.txt          # Python dependencies
└── Pipfile                   # Pipenv dependencies

Model Approach

Baseline: Simple CNN

A basic convolutional neural network built from scratch:

3 Conv2D + MaxPooling blocks
Dense layer with dropout
Result: Severe overfitting (96% training accuracy vs. 67% validation accuracy)

Transfer Learning: MobileNet V2

Using a pre-trained MobileNet V2 (ImageNet weights) as a feature extractor:

Frozen base model + custom classification head
Data augmentation (flip, rotation, zoom)
GlobalAveragePooling + Dropout + Dense(5)
Result: 88% validation accuracy
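
The architecture listed above could be assembled in Keras roughly as follows. This is a minimal sketch, not the code in src/train.py: the function name, the augmentation strengths (0.1), the dropout rate (0.2), the learning rate (1e-3), the softmax output, and the loss are all assumptions chosen for illustration.

```python
from tensorflow import keras


def build_transfer_model(num_classes=5, weights="imagenet", dropout_rate=0.2):
    """Frozen MobileNetV2 feature extractor plus a small classification head."""
    base = keras.applications.MobileNetV2(
        input_shape=(224, 224, 3), include_top=False, weights=weights
    )
    base.trainable = False  # keep the pre-trained weights fixed

    inputs = keras.Input(shape=(224, 224, 3))
    # Data augmentation (active only during training); strengths are assumed.
    x = keras.layers.RandomFlip("horizontal")(inputs)
    x = keras.layers.RandomRotation(0.1)(x)
    x = keras.layers.RandomZoom(0.1)(x)
    # MobileNetV2 expects pixel values scaled from [0, 255] to [-1, 1].
    x = keras.layers.Rescaling(1.0 / 127.5, offset=-1.0)(x)
    # training=False keeps the BatchNorm layers of the base in inference mode.
    x = base(x, training=False)
    x = keras.layers.GlobalAveragePooling2D()(x)
    x = keras.layers.Dropout(dropout_rate)(x)
    outputs = keras.layers.Dense(num_classes, activation="softmax")(x)

    model = keras.Model(inputs, outputs)
    model.compile(
        optimizer=keras.optimizers.Adam(learning_rate=1e-3),  # assumed value
        loss="sparse_categorical_crossentropy",  # assumes integer class labels
        metrics=["accuracy"],
    )
    return model
```

Calling build_transfer_model() downloads the ImageNet weights on first use; passing weights=None builds the same graph with random weights, which is handy for a quick shape check.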

Fine-Tuning

Unfreezing the last 30 layers of MobileNet V2 for fine-tuning:

Lower learning rate (1e-5)
5 additional epochs
Result: 88.3% validation accuracy (best model)
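
The unfreezing step can be sketched as a small helper that works on any Keras model exposing a layers list. The helper name is hypothetical; the 30 layers, the 1e-5 learning rate, and the 5 epochs come from the list above, while train_ds and val_ds in the comments are placeholder dataset names.

```python
def unfreeze_last_layers(base_model, n_trainable=30):
    """Make only the last `n_trainable` layers of `base_model` trainable.

    Returns the number of layers left trainable.
    """
    # In Keras the parent flag must be True for per-layer flags to take effect.
    base_model.trainable = True
    cutoff = max(len(base_model.layers) - n_trainable, 0)
    for index, layer in enumerate(base_model.layers):
        layer.trainable = index >= cutoff
    return sum(1 for layer in base_model.layers if layer.trainable)


# Typical follow-up with Keras (not executed here):
#   unfreeze_last_layers(base, 30)
#   model.compile(optimizer=keras.optimizers.Adam(learning_rate=1e-5),
#                 loss="sparse_categorical_crossentropy", metrics=["accuracy"])
#   model.fit(train_ds, validation_data=val_ds, epochs=5)
```

Recompiling after changing the trainable flags matters: Keras only picks up the new set of trainable weights when the model is compiled again.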

Training Results

Model	Train Accuracy	Val Accuracy	Notes
Baseline CNN	96%	67%	Overfitting
MobileNetV2 (frozen)	90%	88%	Transfer learning
MobileNetV2 (fine-tuned)	90%	88.3%	Selected model

Per-Class Performance (Final Model)

Class	Precision	Recall	F1-Score	Support
Daisy	0.83	0.95	0.89	107
Dandelion	0.95	0.91	0.93	191
Roses	0.77	0.92	0.84	119
Sunflowers	0.93	0.84	0.89	135
Tulips	0.90	0.82	0.86	182
Overall	0.88	0.88	0.88	734
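
The columns of this table are related: F1 is the harmonic mean of precision and recall, and the support-weighted recall equals overall accuracy. The sketch below recomputes both from the rounded values in the table; the names are illustrative.

```python
# (precision, recall, reported F1, support) per class, copied from the table above.
REPORT = {
    "daisy": (0.83, 0.95, 0.89, 107),
    "dandelion": (0.95, 0.91, 0.93, 191),
    "roses": (0.77, 0.92, 0.84, 119),
    "sunflowers": (0.93, 0.84, 0.89, 135),
    "tulips": (0.90, 0.82, 0.86, 182),
}


def f1_score(precision, recall):
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


for name, (precision, recall, reported_f1, _) in REPORT.items():
    computed = f1_score(precision, recall)
    print(f"{name:<11} F1 from P/R: {computed:.3f} (reported {reported_f1:.2f})")

total_support = sum(support for *_, support in REPORT.values())
weighted_recall = (
    sum(recall * support for _, recall, _, support in REPORT.values())
    / total_support
)
print(f"support-weighted recall: {weighted_recall:.3f}")
```

Because the table values are rounded to two decimals, a recomputed F1 can differ from the reported one by about 0.01 (sunflowers, for example). The support-weighted recall comes out near 0.88, in line with the 88.3% validation accuracy of the selected model.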

Key Observations

Dandelion has the highest precision (95%), likely because of its distinctive yellow color and shape
Roses have the lowest precision (77%): other flowers, most plausibly tulips with similar colors, are sometimes predicted as roses
Daisy has the highest recall (95%), likely because white petals around a yellow center are easy to identify
Tulips have the lowest recall (82%), likely because their varied colors cause some confusion with roses

How to Run

1. Clone the Repository

git clone https://github.com/HighviewOne/flower-classification-capstone.git
cd flower-classification-capstone

2. Set Up Environment

Option A: Using Conda (recommended for this project)

conda create -n MLZoomCamp_env python=3.11  # skip if the environment already exists
conda activate MLZoomCamp_env
pip install -r requirements.txt

Option B: Using Pipenv

pip install pipenv
pipenv install
pipenv shell

Option C: Using pip with venv

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
pip install -r requirements.txt

3. Download the Dataset

python src/download_data.py

Or manually:

cd data
curl -O http://download.tensorflow.org/example_images/flower_photos.tgz
tar -xzf flower_photos.tgz
cd ..  # return to the repository root for the next steps

4. Train the Model (Optional)

The trained model is already included. To retrain:

python src/train.py

This will:

Load and preprocess the dataset
Train the MobileNetV2 transfer learning model
Save the model to models/flower_classifier.keras

5. Run the Web Service

python src/predict.py

The API will start at http://localhost:9696

6. Test a Prediction

Using an image URL:

curl -X POST http://localhost:9696/predict \
  -H "Content-Type: application/json" \
  -d '{"image_url": "https://upload.wikimedia.org/wikipedia/commons/thumb/4/40/Sunflower_sky_backdrop.jpg/800px-Sunflower_sky_backdrop.jpg"}'

Using a local file:

curl -X POST http://localhost:9696/predict \
  -F "image=@path/to/flower.jpg"

Expected Response:

{
  "prediction": "sunflowers",
  "confidence": 0.9823,
  "probabilities": {
    "daisy": 0.0012,
    "dandelion": 0.0034,
    "roses": 0.0045,
    "sunflowers": 0.9823,
    "tulips": 0.0086
  }
}
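
The same JSON request can be sent from Python using only the standard library. This is a minimal client sketch, assuming the service is running locally as in step 5 and returns the response shape shown above; the helper names (build_request, top_prediction, classify) are illustrative and not part of the repository.

```python
import json
import urllib.request

SERVICE_URL = "http://localhost:9696/predict"


def build_request(image_url, service_url=SERVICE_URL):
    """Build the POST request carrying {"image_url": ...} as JSON."""
    body = json.dumps({"image_url": image_url}).encode("utf-8")
    return urllib.request.Request(
        service_url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def top_prediction(response):
    """Return (label, probability) for the most likely class in a response."""
    probabilities = response["probabilities"]
    label = max(probabilities, key=probabilities.get)
    return label, probabilities[label]


def classify(image_url):
    """Send the request to the running service and decode the JSON reply."""
    with urllib.request.urlopen(build_request(image_url), timeout=30) as reply:
        return json.load(reply)
```

With the service up, top_prediction(classify(url)) gives the winning label and its probability; in the expected response above these match the "prediction" and "confidence" fields.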

7. Run the Test Suite

python tests/test_service.py

Docker

Build the Container

docker build -t flower-classifier -f docker/Dockerfile .

Run the Container

docker run -it -p 9696:9696 flower-classifier

Test the Containerized Service

curl -X POST http://localhost:9696/predict \
  -H "Content-Type: application/json" \
  -d '{"image_url": "https://upload.wikimedia.org/wikipedia/commons/thumb/4/40/Sunflower_sky_backdrop.jpg/800px-Sunflower_sky_backdrop.jpg"}'

Kubernetes Deployment (Bonus)

Deploy the service to a local Kubernetes cluster using kind.

Prerequisites

# Install kind (Kubernetes in Docker)
winget install Kubernetes.kind

# Install kubectl
winget install Kubernetes.kubectl

Quick Deploy (Windows)

kubernetes\deploy-k8s.bat

Manual Deployment Steps

# 1. Create a kind cluster
kind create cluster --name flower-cluster

# 2. Load the Docker image into the cluster (build it first; see "Build the Container")
kind load docker-image flower-classifier:latest --name flower-cluster

# 3. Apply Kubernetes manifests
kubectl apply -f kubernetes/deployment.yaml
kubectl apply -f kubernetes/service.yaml
kubectl apply -f kubernetes/hpa.yaml

# 4. Wait for deployment to be ready
kubectl rollout status deployment/flower-classifier

# 5. Check pod status
kubectl get pods -l app=flower-classifier

Access the Service

# Port forward to access the service
kubectl port-forward service/flower-classifier 9696:80

Then test:

curl http://localhost:9696/health
python tests/test_service.py

Kubernetes Features

Deployment: Manages pod lifecycle with rolling updates
Service: LoadBalancer type, exposes the API on port 80 (accessed locally via the port-forward shown above)
HPA: Horizontal Pod Autoscaler scales between 1 and 3 replicas based on CPU usage
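
The autoscaler's replica count follows the standard Kubernetes HPA rule: desired = ceil(current replicas × current metric / target metric), clamped to the configured minimum and maximum. The sketch below illustrates that rule with the 1 to 3 replica range from this project. The 50% CPU target is an assumed value (the real target is set in kubernetes/hpa.yaml), and the controller's tolerance band and stabilization window are left out.

```python
import math


def desired_replicas(current_replicas, current_cpu, target_cpu=50,
                     min_replicas=1, max_replicas=3):
    """Replica count the HPA rule would ask for, clamped to [min, max].

    `current_cpu` and `target_cpu` are average CPU utilization percentages;
    target_cpu=50 is an assumption for illustration.
    """
    raw = math.ceil(current_replicas * current_cpu / target_cpu)
    return max(min_replicas, min(max_replicas, raw))


print(desired_replicas(1, 90))   # one busy pod: scale up
print(desired_replicas(2, 120))  # would need more than 3, capped at the maximum
print(desired_replicas(3, 10))   # mostly idle: scale down to the minimum
```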

Cleanup

# Delete the cluster when done
kind delete cluster --name flower-cluster

API Endpoints

Endpoint	Method	Description
`/predict`	POST	Classify a flower image
`/health`	GET	Health check (returns 200 OK)

Request Formats

The /predict endpoint accepts three input formats:

JSON with image URL:

{"image_url": "https://example.com/flower.jpg"}

JSON with base64-encoded image:
```
{"image_base64": "iVBORw0KGgo..."}
```

Multipart form with file upload:

curl -F "image=@flower.jpg" http://localhost:9696/predict
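
For the base64 format, the payload can be built from raw image bytes as sketched below. This assumes the service expects plain base64 text (no data: URI prefix) under the image_base64 key shown above; the helper name is illustrative.

```python
import base64
import json


def base64_payload(image_bytes):
    """Return the JSON body for /predict with the image encoded as base64."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return json.dumps({"image_base64": encoded})


# Example with a local file (the path is a placeholder):
#   with open("flower.jpg", "rb") as handle:
#       body = base64_payload(handle.read())
```

The resulting string is sent with the Content-Type: application/json header, the same way as the image URL request.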

Notebooks

EDA and Training: Complete exploratory data analysis, model experimentation, and hyperparameter tuning

EDA Highlights

Image sizes vary from 240×180 to 4000×3000 pixels (resized to 224×224 for training)
Aspect ratios are mostly close to 1.0 (square-ish images)
Data augmentation (random flip, rotation, zoom) helps prevent overfitting

Technologies Used

Python 3.11
TensorFlow 2.20 / Keras - Deep learning framework
MobileNet V2 - Pre-trained CNN for transfer learning
Flask - Web service framework
Docker - Containerization
NumPy, Pandas - Data manipulation
Matplotlib, Seaborn - Visualization
scikit-learn - Metrics and evaluation

Limitations & Future Work

Current Limitations

Model trained on only 5 flower types
Performance may degrade on low-quality or blurry images and on photos taken from unusual angles
Flowers with similar colors (roses/tulips) can be confused

Future Improvements

Expand to more flower species (10-20 classes)
Add confidence thresholding to reject uncertain predictions
Implement model versioning and A/B testing
Deploy to cloud (AWS/GCP) with auto-scaling
Build a mobile app with camera integration
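
The confidence thresholding idea from the list above could look like the sketch below, applied to the probabilities dictionary that /predict already returns. It is not implemented in the project; the function name and the 0.6 threshold are assumptions, and a real threshold would need to be tuned on validation data.

```python
def predict_or_reject(probabilities, threshold=0.6):
    """Return (label, confidence), with label None when confidence is too low."""
    label = max(probabilities, key=probabilities.get)
    confidence = probabilities[label]
    if confidence < threshold:
        return None, confidence  # uncertain: let the caller ask for another photo
    return label, confidence


confident = {"daisy": 0.0012, "dandelion": 0.0034, "roses": 0.0045,
             "sunflowers": 0.9823, "tulips": 0.0086}
ambiguous = {"daisy": 0.03, "dandelion": 0.02, "roses": 0.48,
             "sunflowers": 0.02, "tulips": 0.45}

print(predict_or_reject(confident))
print(predict_or_reject(ambiguous))
```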

Author

Michael - ML Zoomcamp 2025 Capstone Project

GitHub: @HighviewOne

License

This project is licensed under the MIT License.

Acknowledgments

DataTalks.Club for the ML Zoomcamp course
TensorFlow team for the flowers dataset and MobileNet V2
MobileNetV2 paper - Sandler et al., 2018
