Skip to content
View RaphaelFakhri's full-sized avatar

Highlights

  • Pro

Block or report RaphaelFakhri

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
RaphaelFakhri/README.md

Hi, I'm Raphael πŸ‘‹

Python engineer focused on web scraping, data extraction, and LLM-assisted automation β€” MSc in Computer Science, AUB.

I care about the unglamorous parts of data work: dynamic / JS-rendered sources, anti-bot handling, validation and cross-source consistency, and pipelines that fail loudly instead of quietly shipping bad data. Recent builds, all live:

  • scrapehub β€” async scraping framework (BeautifulSoup + Playwright)
  • llm-extract β€” LLM extraction with a pydantic + cross-source consistency gate
  • apify-aws-actor β€” self-monitoring Apify actor, AWS-deployable

Previously: contributor to Apache Kafka and the Slimefun Minecraft plugin.

πŸ“« fakhriraphael@gmail.com Β· Book a call

Pinned Loading

  1. apify-aws-actor apify-aws-actor Public

    Production-shaped Apify actor: scrape β†’ bounded-concurrency batching β†’ OpenRouter enrichment β†’ schema-drift monitoring/alerting β†’ AWS Fargate/Lambda deployable. Live demo: https://actor.symbai.dev

    Python

  2. llm-extract llm-extract Public

    LLM-assisted extraction pipeline: messy HTML β†’ OpenRouter β†’ normalize β†’ pydantic v2 validate β†’ cross-source consistency β†’ verification gate. Live demo: https://extract.symbai.dev

    Python

  3. scrapehub scrapehub Public

    Async, headless web-scraping framework for sanctioned sources β€” BeautifulSoup + Playwright, rotating proxies/UAs, polite rate-limiting, retry/backoff, async batching, pydantic-validated CSV/JSON. L…

    Python