LLMs Corrupt Your Documents When You Delegate: Our large-scale experiment with 19 LLMs reveals that […] even frontier models corrupt an average of 25% of document content by the end of long workflows

Arthur Besse@lemmy.ml · 1 month ago

LLMs Corrupt Your Documents When You Delegate: Our large-scale experiment with 19 LLMs reveals that […] even frontier models corrupt an average of 25% of document content by the end of long workflows

kingofras@lemmy.world · 1 month ago

it doesn’t matter. the principle is that if x is the length of your context window, then at 0.4x the chance of hallucinations start increasing exponentially. we’re now at token windows of 1M, and all it does is shift that hallucination window further away, so the model ‘feels’ stronger because it takes longer before it hallucinates, but eventually it always does.