1. CAGE. 2021. CAGE Challenge 1. In IJCAI-21 1st International Workshop on Adaptive Cyber Defense.
2. L. Espeholt, H. Soyer, R. Munos, K. Simonyan, V. Mnih, T. Ward, Y. Doron, V. Firoiu, T. Harley, I. Dunning, S. Legg, and K. Kavukcuoglu. 2018. IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures. arXiv:1802.01561 [cs].
3. OpenAI et al. 2019. Dota 2 with Large Scale Deep Reinforcement Learning. arXiv:1912.06680 [cs stat].
4. M. Feng and H. Xu. 2017. Deep reinforecement learning based optimal defense for cyber-physical system in presence of unknown cyber-attack. In 2017 IEEE Symposium Series on Computational Intelligence (SSCI). IEEE.
5. D. Horgan, J. Quan, D. Budden, G. Barth-Maron, M. Hessel, H. van Hasselt, and D. Silver. 2018. Distributed Prioritized Experience Replay. arXiv:1803.00933 [cs].