Using Occam's razor, that is less probable than the model picking up on statistical regularities in human language, especially since that's what they are trained to do.
That's hard to conclude from Occam's razor here. Or, "statistical regularities" may have less explanatory power than you think, especially if the simplest statistical regularity is itself a fully predictive understanding of the concept of temperature.