
BullshitBench v2: Which LLMs Push Back on Nonsense?
via Medium Programming, by Dr. Leon Eversberg
Results from a new benchmark that evaluates more than 80 LLMs on whether they challenge or accept plausible-sounding nonsense prompts. The full article appears in AI Advances.



