I just realized there is a crossover with fuzzing and Reinforcement learning. Both are trying to find a sparse signal in and environment. Both need to find the right balance between exploration vs exploitation. Both need to perform better than random/brute force.