The best AI models can't yet beat the engineers they’re supposed to replace at fixing real-world problems, a new benchmark suggests.