while the Rag system is generating the tokens, we buffer a small batch (20 tokens) and we predict hallucinations online. for this method, do we have to train the Hallucination Detector in an special way? does your project support it?
while the Rag system is generating the tokens, we buffer a small batch (20 tokens)
and we predict hallucinations online.
for this method, do we have to train the Hallucination Detector in an special way?
does your project support it?