The real lesson is that these are just random results and all models fail at all...

The real lesson is that these are just random results and all models fail at all kinds of things all the time and other times get things right in all kind of questions.

Problem is the models have zero idea wether they are right or wrong and always believe they are right. Which makes them useful for anything were either you do not care if the answer is actually right or where somehow it is hard to come up with the right answer but very easy to verify it the answer is right and kind of useless for everything else.