Bridging the 'Pass at 1' vs 'Pass at 100' Gap in AI Models