I don't think I am, and for context here I have built my own DQNs from scratch to learn to play games like Snake.
I'd argue that if you consider the size of the input and output space here, it's not as complex as you're implying.
To refer back to my example, telling the difference between four-legged creatures is complicated because there's a huge number of possible outputs and the visual input space is both large and complex. Learning how to detect patterns in raw image data is hard, which is why we and other animals are preloaded with the neurological structures to do it. It's also why we often use pretrained models when training models to label new outputs – simply learning how to detect basic patterns in visual data is difficult enough, so if this step can be skipped it often makes sense to skip it.
In contrast, the inputs to Minecraft are relatively simple – you have a handful of buttons which can be pressed, and those buttons can be pressed for different durations. Similarly, the output space here, while large, is relatively simple, and presumably detecting that an action like holding a button results in a state change shouldn't be that complex to learn... I mean, it's already learning that pressing a button results in a state change, so I think you'd need to explain to me why adding a tiny bit of extra complexity here is so unreasonable. Maybe I'm missing something.
> I think you'd need to explain to me why adding a tiny bit of additional complexity here is so unreasonable
As far as I understand, DreamerV3 doesn't employ intrinsic rewards (as in novelty-based exploration). It relies on stochastic exploration, which makes it practically impossible to reach rewards that require consistently repeating an action with no intermediate rewards.
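To make the intuition concrete, here's a back-of-the-envelope sketch (the action-set size and repeat length are made-up illustrative numbers, not DreamerV3's actual setup) of why undirected random exploration almost never stumbles on a reward gated behind a long run of one repeated action:

```python
# Hypothetical numbers: a uniform random policy over n_actions discrete
# actions, and a reward that only fires after repeating one specific
# action repeat_len steps in a row (e.g. holding "attack" to break a block).
n_actions = 10
repeat_len = 60

# Probability of sampling that exact run by chance, starting at a given step
p_single_run = (1 / n_actions) ** repeat_len
print(f"P(one specific {repeat_len}-step run) = {p_single_run:.1e}")
```

The probability collapses exponentially in the repeat length, which is the gap intrinsic-reward schemes try to bridge with intermediate novelty bonuses.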
And finding intrinsic rewards that work well across diverse domains is a complex problem in itself.
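For anyone unfamiliar with what's meant by intrinsic rewards here, a minimal count-based novelty bonus looks something like this (the `beta` coefficient and the choice of state abstraction are exactly the per-domain tuning knobs that make this hard to get right everywhere; the function name and values are my own illustration):

```python
from collections import Counter
import math

# Count-based novelty bonus: r_int(s) = beta / sqrt(N(s)),
# where N(s) is how many times state s has been visited so far.
visit_counts = Counter()

def intrinsic_reward(state, beta=0.1):
    visit_counts[state] += 1
    return beta / math.sqrt(visit_counts[state])

# A freshly seen state pays the full bonus; repeat visits decay toward zero.
print(intrinsic_reward("new_room"))   # first visit: 0.1
print(intrinsic_reward("new_room"))   # second visit: ~0.0707
```

In raw-pixel environments like Minecraft, the hard part is that "state" has to be some learned abstraction of the observation, and a `beta` that drives useful exploration in one game can drown out the task reward in another.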
Example: When humans play Minecraft, they already know object permanence from the real world. I did not see anywhere that AI got trained to learn object permanence. Yet it is required for basics like searching for your mineshaft after turning around.