lots of coding tasks, discussions about physics/QM. I find that it produces better quality answers than 4o, which often will have subtle but simple mistakes.
Even writing, where it is supposed to be worse than 4O, I feel that is does better/has a more solid understanding of provided documents.
Interesting, could you share an example of this where it provides something of value? I've tried asking a few different LLMs to explain renormalization group theory, and it always goes off the rails in five questions or less.
Even writing, where it is supposed to be worse than 4O, I feel that is does better/has a more solid understanding of provided documents.