Skip to content

Add Firecrawl integration#398

Open
leonardogrig wants to merge 1 commit intodeepset-ai:mainfrom
firecrawl:add-firecrawl-integration
Open

Add Firecrawl integration#398
leonardogrig wants to merge 1 commit intodeepset-ai:mainfrom
firecrawl:add-firecrawl-integration

Conversation

@leonardogrig
Copy link

@leonardogrig leonardogrig commented Feb 13, 2026

I'm Leo, dev rel at Firecrawl. Haystack's integration ecosystem is solid, especially for data ingestion. Adding Firecrawl here so Haystack users can scrape, crawl, search, and map the web using the firecrawl-py SDK.


What this adds

  • integrations/firecrawl.md - Integration page with usage examples
  • logos/firecrawl.png - Firecrawl logo

The integration uses the existing firecrawl-py SDK to scrape pages, search the web, and crawl sites, then convert results into Haystack Documents for use in pipelines.

Examples included

  • Single page scraping to Haystack Document
  • Web search with Document conversion
  • Site crawling with Document conversion
  • Full RAG pipeline (Firecrawl scrape + OpenAI embeddings + retrieval + generation)

Follows repo conventions

  • Based on the draft-integration.md template
  • Frontmatter matches required fields (layout, name, description, authors, pypi, repo, type)
  • Type set to "Data Ingestion" (consistent with Apify, Bright Data)
  • Python examples include all imports and are runnable
  • Purely additive, no existing files changed

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@leonardogrig leonardogrig requested a review from a team as a code owner February 13, 2026 15:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant