We have a standard harness for each of the model's that we test. Each prompt includes the rules, access to memory, and a lookup of the complete ruleset. The prompt adapts adding legal actions per turn and guidance depending on the stage of the game (updated based on the technological progress of the player).
Unlike RL algorithms these LLMs wouldn't learn quick enough without the prior knowledge the harness provides
note that I also have a system where if the temperature seems outlier compared to direct neighbors it averages the 3 nearest neighbors. this usually occurs in neighborhoods with a single sensor that can skew the results heavily at certain times of the day, etc.
I use Mr Chilly to demonstrate to non-SF folks how many microclimates SF (and the Bay Area has).
Only suggestion: separate Inner and Outer Sunset since there can be a massive difference between near Ocean Beach and near Irving/9th Ave in autumn (ie. SF's hottest season).
Edit: nevermind, just saw both inner_sunset and outer_sunset in /neighborhoods. I'd assumed it was merged based on the human readable list on the landing page. Thanks for the fun API!
or just click the buttons that accomplish the same thing. The point is someone at PurpleAir is asleep at the wheel if such an obvious default configuration isn't being set. If they can't get such a basic thing right, why do we trust anything else from them? "Anything else" specifically including "running their software on a raspberry pi inside my home network".
reply