Greq = Grep + Query

Greq is a CLI tool that searches files and returns the most relevant sections. It’s designed to provide concise, high-signal context for AI agents, helping reduce token usage while preserving the information that matters.

Unlike grep, which performs exact pattern matching, Greq applies linguistic ranking (BM25 & Text Embeddings) to score and sort file sections based on how relevant they are to your query. In practice, it behaves like a small search engine in your shell.

Greq works best with natural-language or multi-word queries. For simple single-keyword searches, grep is usually the faster and more appropriate tool.

🚀 Quick Install (Recommended)

Install with a single command:

curl -sSL https://raw.githubusercontent.com/KlausSchaefers/greq/main/install.sh | bash

🔍 How it Works

Greq uses the BM25 (Best Matching 25) ranking algorithm to score text relevance:

Document Chunking: Files are split into overlapping chunks for better context
BM25 Scoring: Each chunk is scored based on term frequency and document frequency
Semantic Scoring: Each chunk is transformed with a local LLM into a semantic embedding vector
Context Expansion: Results include surrounding chunks for better context
Ranking: Results are sorted by relevance score

This makes Greq particularly effective for:

✅ Multi-word queries
✅ Semantic searches
✅ Research and documentation search

Use grep for:

❌ Single keyword searches
❌ Exact pattern matching
❌ Regular expressions

Examples

# Basic search with metadata and highlighting
greq "machine learning" test/data -m -l

# Search with top 5 results and larger context
greq "rust programming" . --n 5 -C 2

greq "south america" test/data -w 0.7  # Uses text embeddings to match "south america to brazil"

# JSON output for scripting
greq "error handling" src/ -f json

# Search specific file types
greq "function" . --extensions "rs,py,js"

# Different chunk sizes for better context
greq "algorithms" . -s 300

# Fuzzy matching with sub-tokens (great for partial words)
greq "capo" test/data --sub-token 4     # Finds "capoeira", "capon", etc.
greq "config" . -t 5                # Short form: finds "configuration", "configure"

# Regular vs fuzzy search comparison
greq "capo" test/data                   # No matches (exact word search)
greq "capo" test/data -t 4              # Finds "capoeira" (fuzzy sub-token search)

# Pipe command output  
echo "some text content" | greq 'text'
cat file.txt | greq 'my query'

Options

    <QUERY>                  Search query
    [PATH]                   Directory or file to search [default: .]

  -n, --n <N>               Number of results [default: 3]
  -C, --context <CONTEXT>   Context chunks around matches [default: 1]
  -s, --chunk-size <SIZE>   Chunk size in characters [default: 200]
  -f, --format <FORMAT>     Output format: text or json [default: text]
  -m, --show-meta           Show metadata (filename, score, position)
  -l, --highlight           Enable highlighting of search terms
  -t, --sub-token <LENGTH>  Sub-token length for fuzzy matching (>3 enables fuzzy search) [default: 0]
  -w, --embedding-weight <WEIGHT>  Weight for combining BM25 and embedding scores (0.0-1.0) [default: 0]
  -h, --help                Print help
  -V, --version             Print version

Fuzzy Search with Sub-tokens

Greq supports fuzzy matching using overlapping sub-tokens. This is particularly useful when:

You remember only part of a word
Searching for compound words or technical terms
Dealing with typos or variations

Note: This will be slower and use more memory.

How it works:

Words are split into overlapping sub-sequences of specified length
Example: "capoeira" with --sub-token 4 becomes ["capo", "apoe", "poei", "oeir", "eira"]
Your search term "capo" will match because it appears as a sub-token

Usage:

# Enable fuzzy search with 4-character sub-tokens
greq "capo" test/data --sub-token 4

# Works with any sub-token length > 3
greq "config" . -t 5    # Finds "configuration", "configure", etc.

Hybrid Search (BM25 + Embeddings)

Greq supports hybrid search that combines traditional BM25 text ranking with semantic embeddings for enhanced search quality. This is particularly powerful for finding conceptually related content beyond exact keyword matches. Greq uses the all-MiniLM-L6-v2 model.

How it works:

BM25 handles exact term matching and statistical relevance
Embeddings capture semantic similarity and context
Results are combined using a weighted average

Usage:

# Enable hybrid search with 50% embedding weight
greq "machine learning algorithms" test/data --embedding-weight 0.5

# Heavier emphasis on semantic similarity
greq "error handling" src/ -w 0.8

# Light semantic enhancement (mostly BM25)
greq "rust programming" . -w 0.2

Weight values:

0.0 (default): Pure BM25 search
0.5: Balanced hybrid (50% BM25, 50% embeddings)
1.0: Pure embedding search

When to use hybrid search:

✅ Conceptual or semantic queries
✅ Finding related content beyond exact matches
✅ Research and exploratory search
❌ Simple keyword searches (BM25 alone is faster)

Note: Hybrid search requires downloading embedding models on first use and is slower than BM25-only search. The files will be downloaded in your user directory in the .greq-cache folder. You can change this behavior by setting the GREQ_CACHE_DIR ENV variable.

Manual Download

Alternatively, download manually from the Releases page:

Linux x86_64: greq
Linux ARM64: greq
Windows: greq.exe
macOS Intel: greq
macOS Apple Silicon: greq

After downloading, make the binary executable (Linux/macOS):

chmod +x greq

Uninstall

To remove greq from your system:

Quick uninstall (Recommended):

curl -sSL https://raw.githubusercontent.com/KlausSchaefers/greq/main/uninstall.sh | bash

Manual removal:

# Remove the binary
sudo rm /usr/local/bin/greq

# For Windows (if applicable)
# rm /usr/local/bin/greq.exe

macOS Security Notice

On macOS, you may see "cannot be verified" error. This is because the binary isn't signed with an Apple Developer certificate.

Option 1: Remove quarantine attribute (Recommended)

xattr -d com.apple.quarantine greq

Option 2: Right-click method

Right-click the greq binary in Finder
Select "Open"
Click "Open" in the security dialog

Option 3: System Settings

Try to run ./greq (it will fail)
Go to System Settings > Privacy & Security
Click "Allow Anyway" next to the greq message
Try running ./greq again and confirm

Build from Source

Debug

cargo build

Release

cargo build --release

Test

cargo run

Thanks

We use the fastembed-rs lib from Anush008. Please give him a star!

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
.cargo		.cargo
.github/workflows		.github/workflows
src		src
tests		tests
.env		.env
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
Cargo.toml		Cargo.toml
LICENSE		LICENSE
README.md		README.md
install.sh		install.sh
uninstall.sh		uninstall.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Greq = Grep + Query

🚀 Quick Install (Recommended)

🔍 How it Works

Examples

Options

Fuzzy Search with Sub-tokens

Hybrid Search (BM25 + Embeddings)

Manual Download

Uninstall

macOS Security Notice

Build from Source

Thanks

About

Uh oh!

Releases 18

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Greq = Grep + Query

🚀 Quick Install (Recommended)

🔍 How it Works

Examples

Options

Fuzzy Search with Sub-tokens

Hybrid Search (BM25 + Embeddings)

Manual Download

Uninstall

macOS Security Notice

Build from Source

Thanks

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 18

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages