Skip to content
View uahroberto's full-sized avatar

Block or report uahroberto

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please donโ€™t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
uahroberto/README.md

Hi there ๐Ÿ‘‹, I'm Roberto

Software Engineer | Systems Architecture | Open Source

Architecting reliable software systems through rigorous engineering. Implementing clean architecture, performance optimization, and strict testing to ensure long-term maintainability and system reliability.

๐Ÿ”ญ Featured Project

๐Ÿ“ผ Video RAG Pro

A Multimodal AI Engine that enables users to "chat" with video content by analyzing both audio speech and visual screen context (OCR).

Unlike simple text-wrappers, this project is engineered for depth and reliability:

  • ๐Ÿ‘๏ธ Visual RAG: Implements an OCR pipeline to "read" code and slides from video frames, capturing context that is shown but not spoken.
  • ๐Ÿง  Hybrid Search: Uses Qdrant to combine semantic embeddings with keyword search (BM25) for precise technical retrieval.
  • โšก Performance: Achieved 0.14 RTF on CPU using custom int8 quantization.
  • ๐Ÿ›ก๏ธ Engineering: Fully Dockerized ETL pipeline with strict type safety guarantees.

๐ŸŒฑ Open Source Contributions

I believe in improving the tools I use daily.

  • roboflow/supervision: Led the strict typing migration for the metrics module (+700 lines refactored) and ensured full compatibility with NumPy 2.0. Refactored core calculation logic in MeanAveragePrecision to align the library with production-grade engineering standards.
  • qdrant/qdrant-client: Enhanced Developer Experience (DX) by standardizing type hints across the client interface, streamlining hybrid search (Dense + Sparse) integration for gRPC and HTTP users.
  • SYSTRAN/faster-whisper: Proposing architectural patterns for Dynamic Batching and request queuing to solve latency bottlenecks in high-concurrency GPU environments.
  • pyDeprecate: Contributed to core design discussions on decorator deprecation lifecycles, advocating for "Fail Fast" import-time warnings to prevent silent technical debt in production systems.

๐Ÿ› ๏ธ Tech Stack

  • Core & Backend: Python 3.12 (Strict Typing), Docker, FastAPI/Streamlit.
  • AI & Data: OpenAI, Faster-Whisper, RapidOCR, Sentence-Transformers.
  • Vector Search: Qdrant (Hybrid Search: Dense + Sparse).
  • Quality & DevOps: Mypy, Ruff, Pre-commit hooks, GitHub Actions.

Pinned Loading

  1. roboflow/supervision roboflow/supervision Public

    We write your reusable computer vision tools. ๐Ÿ’œ

    Python 38.1k 3.4k

  2. qdrant/qdrant-client qdrant/qdrant-client Public

    Python client for Qdrant vector search engine

    Python 1.3k 215

  3. video-rag-pro video-rag-pro Public

    Dreading that 2-hour mandatory lecture? ๐Ÿ˜ด Load it into this Multimodal RAG Engine, let it process the audio & visuals, and simply ask for what you need to know. Skip the watch time, get the answers.

    Python 1

  4. LocalMind LocalMind Public

    Tu espacio de trabajo de IA privado y local. RAG seguro con Docker en minutos. Open Source.

    Python