Open Issues Need Help
View All on GitHubAI Summary: Implement several popular policy gradient algorithms (PPO, RPO, DDPG, TD3) within the existing RelayRL distributed reinforcement learning framework, ensuring modularity and compatibility with the framework's design patterns. This involves writing the algorithm implementations in Python, integrating them with the existing Rust backend, and adding appropriate tests and documentation.
Complexity:
4/5
enhancement good first issue
Unstable Single-Agent Distributed Reinforcement Learning Framework written in Rust & exported to Python
Rust
#client-server#distributed-ml#python-bindings#pytorch#reinforcement-learning#tch-rs