RT Conference Proceedings A1 Miloš Stanković
A1 Miloš Beko
A1 Miloš Pavlović
A1 Ilija Popadić
A1 Srđan Stanković T1 Distributed On-Policy Actor-Critic Reinforcement Learning AD Sinteza 2022 - International Scientific Conference on Information Technology and Data Related Research YR 2022 NO doi: 10.15308/Sinteza-2022-389-393 SP 389 OP 393