Hacker News new | past | comments | ask | show | jobs | submit | drubs's comments login

Wouldn't make much sense. We generally train with 288 environments simultaneously. I've been thinking about ways to nicely stream all 288 environments though.


Really excited to be a part of the team!


Sounds cool to me.


It's silly, but signs were a way to incentivize the agent to explore deeper into the Safari Zone among other areas.


My first version of this project 5 years ago involved a python-lua named pipe using Bizhawk actually. No clue where that code went


There's a ton of applications for AI. Back when I was at Spotify, I co-authored Basic Pitch (https://basicpitch.spotify.com/), an audio-to-midi library. There are a ton of uses for AI outside of what's heavily publicized.


There's an entire section on how the decompilations were used :)


Ok sorry I thought maybe there was a chance that the decomp project could edited in a way that would create a ROM that allowed RL to be done easier, but it seems like it just came in handy for looking up values along with the GB ASM tutorial, the alternative of my thought process is re-creating pokemon red in a modern language which you also mentioned


Wrote about this in the results section. I think there is a way to mix the two and simplify the rewards in the process. A lot of the magic behind getting the agent to teach and use cut probably could have been handled by an LLM.


The environments wouldn't concentrate enough in the Rocket Hideout beneath Celadon Game Corner. The agent would have the player wander the world reward hacking. With wild battles enabled, the environments would end up in Lavender Tower fighting Gastly.

> (and how on earth did you port Pokémon red to a RL environment? O.o)

Read and find out :)


Thanks haha, I kept reading =D I see, so it's not just that you have to visit the key areas, they need to show up in the episodes enough to provide a signal for training.


Yup!


Thanks for the heads up. I just pushed a fix.


I think you fixed the one below the puffer.ai image, but not the one above Authors.


and...fixed!


i am sorry for my awful qa on the site :((((((((((((


Consider applying for YC's Summer 2025 batch! Applications are open till May 13

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: