The Four Fatal Flaws of Benchmarking