ML Engineer & AI Systems Architect — designing and deploying production AI systems: from LLM fine-tuning and inference optimization to high-load microservice platforms with monitoring and SLAs.
5+ years · 16 commercial projects · 35+ production services · 10M audio min/day ASR platform · 99.982% uptime
| Domain | Technologies |
|---|---|
| LLM & Inference | PyTorch · HuggingFace · vLLM · Ollama · QLoRA · Unsloth · TensorRT · Triton Inference Server · RAG · LangChain |
| Computer Vision | YOLOv8 · OpenCV · EfficientNet · ResNet-50 · Roboflow |
| Backend | FastAPI · asyncio · gRPC · WebSocket · PostgreSQL · Redis · Elasticsearch · pgvector · FAISS · SQLAlchemy · Alembic |
| MLOps & Infra | Docker · GitLab CI · GitHub Actions · Ray Serve · ClearML · Grafana · Prometheus · Loki · Traefik |
| Frontend | Vue 3 · Nuxt 3 · TypeScript · Tailwind CSS |
| Other | CatBoost · aiogram 3 · Fish Speech TTS · Supabase |


