Skip to content

Add cudfgrep: GPU-accelerated grep utility with tests and docs#22206

Open
Pritiks23 wants to merge 1 commit intorapidsai:mainfrom
Pritiks23:feature/cudfgrep
Open

Add cudfgrep: GPU-accelerated grep utility with tests and docs#22206
Pritiks23 wants to merge 1 commit intorapidsai:mainfrom
Pritiks23:feature/cudfgrep

Conversation

@Pritiks23
Copy link
Copy Markdown

Description

This PR introduces cudfgrep, a GPU-accelerated grep utility using cuDF/nvtext.

Supports plain text files (one string per line, string only)
Outputs all regex matches per line (not just matching lines)
CLI matches grep (-e, -i, -c, --gds)
GDS enablement via CLI flag or environment variable
Comprehensive unit tests and benchmarking instructions included
Ready for future: column mode for CSV/Parquet, etc.
Closes #21078.

Checklist

  • [ Y] I am familiar with the Contributing Guidelines.
  • [ Y] New or existing tests cover these changes.
  • [Y ] The documentation is up to date with these changes.

@Pritiks23 Pritiks23 requested a review from a team as a code owner April 18, 2026 02:43
@Pritiks23 Pritiks23 requested review from mroeschke and vyasr April 18, 2026 02:43
@copy-pr-bot
Copy link
Copy Markdown

copy-pr-bot bot commented Apr 18, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@github-actions github-actions bot added Python Affects Python cuDF API. cudf.pandas Issues specific to cudf.pandas labels Apr 18, 2026
@GPUtester GPUtester moved this to In Progress in cuDF Python Apr 18, 2026
@davidwendt
Copy link
Copy Markdown
Contributor

Was this intended as a new PR? Would you close #21942 or this one?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

cudf.pandas Issues specific to cudf.pandas Python Affects Python cuDF API.

Projects

Status: In Progress

Development

Successfully merging this pull request may close these issues.

[FEA] write a grep utility with cuDF

3 participants