drubs's comments

drubs · 2025-03-08T21:45:43 1741470343

Wouldn't make much sense. We generally train with 288 environments simultaneously. I've been thinking about ways to nicely stream all 288 environments though.

drubs · 2025-03-07T17:21:27 1741368087

Really excited to be a part of the team!

drubs · 2025-03-06T14:52:42 1741272762

Sounds cool to me.

drubs · 2025-03-06T00:36:33 1741221393

It's silly, but signs were a way to incentivize the agent to explore deeper into the Safari Zone among other areas.

drubs · 2025-03-05T23:32:27 1741217547

My first version of this project 5 years ago involved a python-lua named pipe using Bizhawk actually. No clue where that code went

drubs · 2025-03-05T22:24:52 1741213492

There's a ton of applications for AI. Back when I was at Spotify, I co-authored Basic Pitch (https://basicpitch.spotify.com/), an audio-to-midi library. There are a ton of uses for AI outside of what's heavily publicized.

drubs · 2025-03-05T19:32:32 1741203152

There's an entire section on how the decompilations were used :)

mclau156 · 2025-03-05T20:02:20 1741204940

Ok sorry I thought maybe there was a chance that the decomp project could edited in a way that would create a ROM that allowed RL to be done easier, but it seems like it just came in handy for looking up values along with the GB ASM tutorial, the alternative of my thought process is re-creating pokemon red in a modern language which you also mentioned

drubs · 2025-03-05T19:31:16 1741203076

Wrote about this in the results section. I think there is a way to mix the two and simplify the rewards in the process. A lot of the magic behind getting the agent to teach and use cut probably could have been handled by an LLM.

drubs · 2025-03-05T19:04:04 1741201444

The environments wouldn't concentrate enough in the Rocket Hideout beneath Celadon Game Corner. The agent would have the player wander the world reward hacking. With wild battles enabled, the environments would end up in Lavender Tower fighting Gastly.

> (and how on earth did you port Pokémon red to a RL environment? O.o)

Read and find out :)

bubblyworld · 2025-03-06T05:27:12 1741238832

Thanks haha, I kept reading =D I see, so it's not just that you have to visit the key areas, they need to show up in the episodes enough to provide a signal for training.

drubs · 2025-03-06T05:46:26 1741239986

drubs · 2025-03-05T18:42:02 1741200122

Thanks for the heads up. I just pushed a fix.

worble · 2025-03-05T18:55:22 1741200922

I think you fixed the one below the puffer.ai image, but not the one above Authors.

drubs · 2025-03-05T19:01:08 1741201268

and...fixed!

xinpw8 · 2025-03-05T19:15:57 1741202157

i am sorry for my awful qa on the site :((((((((((((