needle-in-a-haystack

Here are 3 public repositories matching this topic...

Counting-Stars (★)

[NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Language Models

Semantically hard multi-needle long-context data generator. Stop testing LLMs with random-password needles.

python benchmark synthetic-data rag llm long-context llm-evaluation needle-in-a-haystack

Add a description, image, and links to the needle-in-a-haystack topic page so that developers can more easily learn about it.

To associate your repository with the needle-in-a-haystack topic, visit your repo's landing page and select "manage topics."