jomobro117 t1_jdljb97 wrote

Thanks for sharing! Just a couple of questions. Is the evolutionary algorithm you use similar to PBT or fundamentally different in some way? And is there a plan to implement distributed training and HPO (similar to Ray RLlib with PBT from Tune)?

nicku_a OP t1_jdllrfv wrote

Hey! Yes, there are similarities to PBT, but there are a few key differences. Firstly, the mutations implemented in AgileRL are much more dynamic. Rather than mutating only hyperparameters, we allow any part of the algorithm/model to mutate - hyperparameters, network architecture (layers and nodes), activation functions, and even the network weights themselves. We also train the whole population in one go, and make learning more efficient by sharing experience within the population.
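To make the idea concrete, here is a minimal, hypothetical sketch of that kind of mutation step - it is not AgileRL's actual API, just an illustration of how a single mutation can touch a hyperparameter, the architecture, the activation function, or the weights of an agent:

```python
import random

def mutate(agent, rng):
    """Return a mutated copy of an agent, represented here as a plain dict
    of hyperparameters, architecture, activation, and weights (illustrative
    only - not AgileRL's real data structures)."""
    child = {
        "lr": agent["lr"],
        "hidden": list(agent["hidden"]),
        "activation": agent["activation"],
        "weights": list(agent["weights"]),
    }
    kind = rng.choice(["hp", "arch", "act", "weights"])
    if kind == "hp":        # perturb a hyperparameter (PBT-style)
        child["lr"] *= rng.choice([0.5, 2.0])
    elif kind == "arch":    # grow or shrink a hidden layer
        layer = rng.randrange(len(child["hidden"]))
        child["hidden"][layer] = max(1, child["hidden"][layer] + rng.choice([-16, 16]))
    elif kind == "act":     # swap the activation function
        child["activation"] = rng.choice(["relu", "tanh", "gelu"])
    else:                   # add Gaussian noise to the network weights
        child["weights"] = [w + rng.gauss(0, 0.01) for w in child["weights"]]
    return child

rng = random.Random(0)
parent = {"lr": 1e-3, "hidden": [64, 64], "activation": "relu",
          "weights": [0.1, -0.2, 0.3]}
population = [mutate(parent, rng) for _ in range(4)]
```

The contrast with vanilla PBT is that only the first branch (perturbing a scalar hyperparameter) exists there; the other three branches mutate the model itself.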

nicku_a OP t1_jdlluja wrote

And yes, the plan is to offer distributed training! As you can imagine, there are about a million things we want/need to add. If you would like to get involved in the project and help out, please do!
