Counting-Stars (★)
Updated Nov 24, 2025 - Jupyter Notebook
[NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Language Models
Semantically hard multi-needle long-context data generator. Stop testing LLMs with random-password needles.