BullshitBench, created by Peter Gostev, evaluates AI models' ability to detect nonsense. One AI company did way better than ...