Deep research benchmark shows how poor LLMs are at writing accurate reports
youtube.comIf not for limited title characters the full title should be
"Deep research benchmark shows how poor current LLMs are at writing accurate financial reports"
Timestamped to the results.
The tables are very interesting.
The best LLM in their real world test, had a maximum 44% accuracy when creating the financial report.