Never heard that claim before, only that a certain subset of re-enforced learning may have used ChatGPT to grade responses. Is there more detail about it being allegedly a distilled OpenAI model?
There are many sources and discussions on this. Also DeepSeek recently changed their responses to hide references to various OpenAI things after all this came out, which is weird.