LLMs Corrupt Your Documents When You Delegate: Our large-scale experiment with 19 LLMs reveals that […] even frontier models corrupt an average of 25% of document content by the end of long workflows

Arthur Besse@lemmy.ml · 16 days ago

LLMs Corrupt Your Documents When You Delegate: Our large-scale experiment with 19 LLMs reveals that […] even frontier models corrupt an average of 25% of document content by the end of long workflows

kingofras@lemmy.world · 16 days ago

By the end of long workflows

Yes, this has been known for 10 years.

Arthur Besse@lemmy.ml · 15 days ago

By the end of long workflows

Yes, this has been known for 10 years.

huh? the kind of “long workflows” this paper is discussing didn’t exist two years ago much less 10

kingofras@lemmy.world · 14 days ago

it doesn’t matter. the principle is that if x is the length of your context window, then at 0.4x the chance of hallucinations start increasing exponentially. we’re now at token windows of 1M, and all it does is shift that hallucination window further away, so the model ‘feels’ stronger because it takes longer before it hallucinates, but eventually it always does.