Organizations

@African-Center-for-Data-Science

josephgitau/readme.md

LinkedIn Portfolio Work Zindi WhatsApp Profile Views


Jump to: About • Zindi • Focus • Competencies • Tech • Competitions • Projects • GitHub • Connect


👋 About Me

  • 🔭 Building and sharing ML, Data Science, and Data Engineering work at africdsa.com
  • 🏆 Zindi competitor, consistently placing in top rankings across African AI challenges
  • 🌱 Currently deepening expertise in NLP, Speech Recognition, and LLM fine-tuning
  • 🚀 Building full-stack data products: FastAPI + Next.js + Supabase pipelines
  • 💬 Ask me about Python, ML workflows, data pipelines, geospatial ML, or real estate analytics
  • 🌍 Proudly coding from Nairobi, Kenya, solving African problems with AI
  • ♟️ Chess: daily tactics, endgames, and openings

📈 Live Zindi Stats

Rank
Points
Best Rank
Country


🔗 View full profile on Zindi →

Last updated: 2026-04-12 06:32:55 UTC


🔥 Current Focus

  • Building: Nairobi Property Pricing, a full-stack real estate intelligence platform (Next.js + FastAPI + Supabase)
  • Competing: Active Zindi challenges in NLP, computer vision, and time-series forecasting
  • Exploring: LLM fine-tuning and speech recognition for low-resource African languages
  • Sharing: ML tutorials and open-source African data science at africdsa.com

🧠 My Core Competencies

I build practical, production-minded ML and data systems β€” from ingestion and transformation to modeling, evaluation, and deployment.

Machine Learning
Machine Learning Skills Chart
Data Science
Data Science Skills Chart
Data Engineering
Data Engineering Skills Chart

🛠️ Tech Arsenal

Core ML & Data

Python PyTorch TensorFlow scikit-learn Pandas NumPy OpenCV

Web & Backend

FastAPI Next.js TypeScript Supabase

Tools & Platforms

Git Docker VS Code Jupyter GitHub Actions


🏆 Competition Results

| Competition | Platform | Position | Solution |
| --- | --- | --- | --- |
| Barbados Lands & Surveys Plot Automation | Zindi | 🏅 Top Finish | Repo |

🔗 See all competitions on Zindi →

⭐ Star these repos if they help you. It really means a lot! 🙏


🚀 Featured Projects

🗺️ Nairobi Property Pricing Platform

Full-stack real estate intelligence platform for Nairobi, with an automated data pipeline from scraping to an interactive affordability dashboard.

What makes it strong

  • Automated scraping pipeline: daily GitHub Actions workflow scrapes live listings and pushes to Supabase
  • Intelligent parsing: extracts bedroom counts from messy titles/URLs ("2br", "two bed", etc.)
  • Affordability analytics: price-per-bedroom metrics, tier segmentation, location summaries
  • Interactive frontend: Next.js + TypeScript dashboard with geographic affordability map
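
The parsing and affordability bullets above can be illustrated with a minimal sketch. This is not the project's actual code: the function names `parse_bedrooms` and `price_per_bedroom`, the word-to-number map, and the regular expression are all assumptions chosen for illustration, covering only patterns like "2br", "two bed", and "3-bedroom".

```python
import re
from typing import Optional

# Hypothetical word-to-number map; a real pipeline would likely cover more cases.
_WORD_NUMBERS = {"one": 1, "two": 2, "three": 3, "four": 4, "five": 5, "six": 6}

# Matches a count ("2", "two") followed by "br", "bed", "beds", "bedroom", or
# "bedrooms", optionally separated by spaces, hyphens, or underscores.
_BEDROOM_RE = re.compile(
    r"(?<![a-z0-9])(\d{1,2}|" + "|".join(_WORD_NUMBERS) + r")"
    r"[\s\-_]*(?:br|bed(?:room)?s?)(?![a-z])",
    re.IGNORECASE,
)


def parse_bedrooms(text: str) -> Optional[int]:
    """Return the bedroom count found in a listing title or URL, or None."""
    match = _BEDROOM_RE.search(text)
    if match is None:
        return None
    token = match.group(1).lower()
    return int(token) if token.isdigit() else _WORD_NUMBERS[token]


def price_per_bedroom(price: float, bedrooms: Optional[int]) -> Optional[float]:
    """Price divided by bedroom count; None when the count is missing or zero."""
    if not bedrooms:
        return None
    return price / bedrooms
```

Returning `None` instead of guessing keeps unparseable listings out of the per-bedroom metrics, so they can be reviewed separately rather than skewing location summaries.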

🔗 Backend: josephgitau/nairobi_property_pricing
🔗 Frontend: josephgitau/nairobi-property-pricing-frontend
🌍 Live: josephgitau.me


📍 Barbados Lands and Surveys Plot Automation Challenge

End-to-end geospatial digitization pipeline for cadastral survey maps: parcel boundary detection + polygon extraction + OCR to produce structured, searchable outputs ready for indexing and downstream GIS use.

What makes it strong

  • Boundary segmentation: UNet++ with EfficientNet-B7 encoder for precise parcel detection from raster maps
  • Production-grade post-processing: polygon cleaning (hole removal, simplification, smoothing) for valid GIS geometries
  • Robust OCR strategy: Qwen3-VL-30B (vision-language) for noisy map text, relying on zero/few-shot generalization instead of risky fine-tuning
  • Reproducible deliverables: training + inference notebooks, checkpoints, and final merged outputs (polygons + OCR text)
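
To make the polygon-cleaning bullet concrete, here is a small pure-Python sketch of two of the named steps: dropping small holes and simplifying a boundary. It is an illustration under assumptions, not the repository's implementation. The source does not say which tooling was used (a geometry library such as Shapely would be a natural choice), and the function names, the `min_area` and `tolerance` parameters, and the use of Ramer-Douglas-Peucker simplification are all hypothetical. Smoothing is omitted.

```python
import math
from typing import List, Tuple

Point = Tuple[float, float]
Ring = List[Point]


def ring_area(ring: Ring) -> float:
    """Absolute area of a ring via the shoelace formula."""
    total = 0.0
    for (x1, y1), (x2, y2) in zip(ring, ring[1:] + ring[:1]):
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0


def drop_small_holes(holes: List[Ring], min_area: float) -> List[Ring]:
    """Keep only interior rings whose area is at least min_area (assumed threshold)."""
    return [hole for hole in holes if ring_area(hole) >= min_area]


def _point_segment_distance(p: Point, a: Point, b: Point) -> float:
    """Distance from point p to the segment a-b."""
    (px, py), (ax, ay), (bx, by) = p, a, b
    dx, dy = bx - ax, by - ay
    if dx == 0 and dy == 0:
        return math.hypot(px - ax, py - ay)
    t = ((px - ax) * dx + (py - ay) * dy) / (dx * dx + dy * dy)
    t = max(0.0, min(1.0, t))
    return math.hypot(px - (ax + t * dx), py - (ay + t * dy))


def simplify_path(points: List[Point], tolerance: float) -> List[Point]:
    """Ramer-Douglas-Peucker simplification of an open path."""
    if len(points) < 3:
        return list(points)
    start, end = points[0], points[-1]
    index, worst = 0, 0.0
    for i in range(1, len(points) - 1):
        distance = _point_segment_distance(points[i], start, end)
        if distance > worst:
            index, worst = i, distance
    if worst <= tolerance:
        return [start, end]
    left = simplify_path(points[: index + 1], tolerance)
    right = simplify_path(points[index:], tolerance)
    return left[:-1] + right  # avoid duplicating the split point
```

Removing tiny holes before simplification avoids spending vertices on segmentation noise, which is one plausible ordering; a real pipeline would also need to re-validate geometries after each step.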

Results: Public 0.965006861 • Private 0.970242006

🔗 Repo: josephgitau/Barbados-Lands-and-Surveys-Plot-Automation-Challenge
📒 Data prep: Open in Colab



🧠 Sentiment Story Generation Bot

NLP-powered bot that generates contextual stories based on detected sentiment.

🔗 Repo: josephgitau/Sentiment_Story_Generation_Bot


📊 GitHub Dashboard

Joseph's GitHub Stats Top Languages

GitHub Streak

Trophies

Activity Graph


🎸 Fun Bits

  • β™ŸοΈ Chess is both relaxation and mental training β€” tactics, endgames, and openings every day.
  • 🌍 Passionate about applying AI to solve African problems β€” housing, agriculture, language.
  • 🧩 Love clean, elegant solutions to messy real-world data.
  • πŸš€ Always building something new β€” the best model is the next one.

🀝 Let's Connect

LinkedIn Portfolio Zindi WhatsApp africdsa


"Build from data. Compete with purpose. Ship it."


Popular repositories

  1. The-African-Trust-Safety-LLM-Challenge (Public)

    Python 9 2

  2. Barbados-Lands-and-Surveys-Plot-Automation-Challenge (Public)

    Barbados Lands and Surveys Plot Automation Challenge

    Jupyter Notebook 8 2

  3. employee_leave_predictionn (Public)

    Jupyter Notebook 2

  4. Sentiment_Story_Generation_Bot (Public)

    Python 2

  5. nairobi-property-pricing-frontend (Public)

    TypeScript 2 1

  6. DEVCLASS (Public)

    1