Idea: Live Hallucination Detector

While the RAG system is generating tokens, we buffer a small batch (20 tokens)
and predict hallucinations on each batch online, as generation proceeds.
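
To make the idea concrete, here is a rough sketch of the buffering loop I have in mind. Everything in it is an assumption for illustration: `detect_online`, `toy_score`, the `window` and `threshold` parameters, and the scoring rule are all hypothetical names and stand-ins, not part of any existing project. The toy scorer just measures how many buffered tokens do not appear in the retrieved context; a real detector would replace it.

```python
from typing import Callable, Iterable, Iterator, List, Tuple


def toy_score(context: str, tokens: List[str]) -> float:
    """Hypothetical stand-in scorer: fraction of tokens absent from the context."""
    known = set(context.lower().split())
    unseen = [t for t in tokens if t.lower() not in known]
    return len(unseen) / len(tokens)


def detect_online(
    token_stream: Iterable[str],
    score_fn: Callable[[str, List[str]], float],
    context: str,
    window: int = 20,
    threshold: float = 0.5,
) -> Iterator[Tuple[List[str], float, bool]]:
    """Buffer `window` tokens at a time and score each batch as it fills.

    Yields (batch, score, flagged) so the caller can react mid-generation.
    """
    buffer: List[str] = []
    for token in token_stream:
        buffer.append(token)
        if len(buffer) == window:
            score = score_fn(context, buffer)
            yield buffer, score, score >= threshold
            buffer = []  # start a fresh batch; the yielded list is not reused
    if buffer:
        # Score the final, possibly shorter, batch when the stream ends.
        score = score_fn(context, buffer)
        yield buffer, score, score >= threshold


if __name__ == "__main__":
    retrieved = "paris is the capital of france"
    generated = ["paris", "is", "the", "capital", "of", "germany"]
    for batch, score, flagged in detect_online(
        generated, toy_score, retrieved, window=3, threshold=0.3
    ):
        print(batch, round(score, 2), "FLAGGED" if flagged else "ok")
```

Note that in this sketch each batch is scored in isolation, so the detector only ever sees short, partial answers. That is the reason for the training question below.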

For this method, do we have to train the hallucination detector in a special way?
Does your project support it?