Skip to content
View sweNNN-svg's full-sized avatar
:octocat:
same.
:octocat:
same.

Block or report sweNNN-svg

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
sweNNN-svg/README.md

Hi, I'm Emre πŸ‘‹

profile views

About Me

I'm a Data & AI Engineer based in Trabzon, Turkey. I build production-grade LLM systems, RAG pipelines, and data infrastructure β€” with a focus on observability, security, and systems that are defensible at every layer.

  • πŸ”­ Current Focus: LLM integration, RAG pipelines, on-premise AI infrastructure with DLP layers
  • πŸ› οΈ Core Stack: Python, FastAPI, Apache Airflow, dbt, Snowflake, PostgreSQL, Docker
  • πŸ€– AI/LLM: Anthropic API, LangChain, Qdrant, sentence-transformers, RAG, prompt engineering
  • πŸ“Š Observability: Grafana, Zabbix, OpenTelemetry β€” applied to both data pipelines and LLM systems
  • πŸŽ“ Education: M.Sc. Entrepreneurship & Innovation Management, Karadeniz Technical University (ongoing)

Featured Projects

Production-grade Turkish RAG pipeline with automated quality scoring

  • End-to-end pipeline: PDF/DOCX β†’ chunker β†’ Qdrant β†’ LangChain β†’ Claude β†’ RAGAS β†’ PostgreSQL β†’ Grafana
  • Every query automatically scored on Faithfulness and Answer Relevancy via RAGAS (reference-free, no ground truth needed)
  • Benchmarked two chunking strategies on identical documents: fixed (faithfulness 0.33, ~3.8s) vs semantic (0.42, ~10s) β€” quantified trade-off instead of assuming
  • Eval results persisted in PostgreSQL for longitudinal analysis: drift detection, model version comparison, Grafana alerts on quality degradation
  • Qdrant chosen over Chroma for production-grade filtering, payload indexing, and horizontal scaling

Hybrid classification system: Rule Engine β†’ TF-IDF β†’ LLM fallback

  • Three-layer architecture: rule engine handles known TCODEs at 100% confidence with zero API cost, TF-IDF covers familiar patterns offline, Claude Haiku fallback handles only ambiguous tickets β€” minimizing both latency and API spend
  • Prompt engineered for deterministic JSON output with temperature=0.1
  • Covers 10 SAP modules (FI/CO, MM, SD, HR, PP, PM, QM, Basis, Authorization, E-Solutions)
  • Built from real experience managing 250+ SAP BW/4HANA process chains at enterprise scale

100% on-premise LLM usage with DLP layer

  • Intercepts and masks sensitive data (PII, credit card info) before it leaves the local network
  • KVKK/GDPR compliant, runs in isolated Docker environments
  • Designed for enterprises that need LLM capabilities without cloud data exposure

End-to-end telemetry platform with decoupled microservice architecture

  • Four-service Docker Compose stack: FastAPI ingestion β†’ PostgreSQL (raw + analytics layers) β†’ Python ETL worker β†’ Next.js dashboard
  • Decoupled ingestion from processing so API latency stays low under load while ETL scales independently (horizontal scaling)
  • Idempotent ETL via DELETE β†’ INSERT pattern β€” same time window can be reprocessed 100x with identical results; safe against retries and partial failures
  • Solved service startup race conditions with Docker healthcheck + depends_on + in-service retry logic for self-healing resilience
  • Resolved cross-container CORS/networking by separating server-side vs client-side request paths

Tech Stack

AI / LLM Anthropic LangChain Qdrant HuggingFace

Data Engineering Python Apache Airflow dbt Snowflake PostgreSQL

Infrastructure & Observability Docker FastAPI Grafana OpenTelemetry Linux


Certifications

  • πŸ“œ Snowflake Data Engineering β€” Snowflake (2026)
  • πŸ“œ Apache Airflow 3 Fundamentals β€” Astronomer (2026)
  • πŸ“œ dbt Fundamentals β€” dbt Labs (2026)
  • πŸ“œ IBM Data Engineering Professional (v2) β€” IBM (2024)
  • πŸ“œ PostgreSQL for Everybody Specialization β€” University of Michigan

Connect

LinkedIn Email

Pinned Loading

  1. turkish-rag-eval turkish-rag-eval Public

    RAG for Turkish documents that measures how well it works β€” automatic RAGAS evaluation, PostgreSQL persistence, Grafana dashboards

    Python

  2. data-ingestion-analytics-platform data-ingestion-analytics-platform Public

    High-throughput event tracking platform: FastAPI ingestion + PostgreSQL + async ETL worker + Next.js dashboard, fully containerized

    Go

  3. DevOps_School_Workshop_TR DevOps_School_Workshop_TR Public

    Open-source DevOps study notes: Linux, Networking, Docker, Terraform, Ansible, Jenkins, and monitoring β€” organized weekly with labs and resources

    HCL 3

  4. currency-modern-elt-pipeline currency-modern-elt-pipeline Public

    Modern ELT pipeline using Airbyte, dbt, PostgreSQL, and Grafana to ingest, transform, and visualize hourly currency exchange rates with Docker Compose orchestration.

  5. secure-llm-gateway-onprem secure-llm-gateway-onprem Public

    Secure Enterprise GenAI Gateway (On-Prem, DLP-enabled)

    Python