Your selections:
A multi-agent cooperative reinforcement learning model using a hierarchy of consultants, tutors and workers
- Abed-Alguni, Bilal H., Chalup, Stephan K., Henskens, Frans A., Paul, David J.
Boosting Exploration in Actor-Critic Algorithms by Incentivizing Plausible Novel States
- Banerjee, Chayan, Chen, Zhiyong, Noman, Nasimul
Cooperative reinforcement learning for independent learners
- Abed-Alguni, Bilal Hashem Kalil
Improving sample efficiency in deep reinforcement learning based control of dynamic systems
Learning nursery rhymes using adaptive parameter neurodynamic programming
- Walker, Josiah, Chalup, Stephan K.
Modelling railway traffic management through multi-agent systems and reinforcement learning
- Bretas, A., Mendes, A., Chalup, S., Jackson, M., Clement, R., Sanhueza, C.
NENYA: Cascade Reinforcement Learning for Cost-Aware Failure Mitigation at Microsoft 365
- Wang, Lu, Zhao, Pu, Zhang, Hongyu, Rajmohan, Saravan, Zhang, Dongmei, Du, Chao, Luo, Chuan, Su, Mengna, Yang, Fangkai, Liu, Yudong, Lin, Qingwei, Wang, Min, Dang, Yingnong
Of matchers and maximizers: how competition shapes choice under risk and uncertainty
- Schulze, Christin, van Ravenzwaaij, Don, Newell, Ben R.
Optimal Actor-Critic Policy With Optimized Training Datasets
- Banerjee, Chayan, Chen, Zhiyong, Noman, Nasimul, Zamani, Mohsen
Physics Informed Intrinsic Rewards in Reinforcement Learning
- Jiang, Jiazhou, Fu, Minyue, Chen, Zhiyong
- Hou, Jian, Wang, Fangyuan, Wang, Lili, Chen, Zhiyong
Reinforcement learning for constrained energy trading games with incomplete information
- Wang, Huiwei, Huang, Tingwen, Liao, Xiaofeng, Abu-Rub, Haitham, Chen, Guo
Reinforcement learning using expectation maximization based guided policy search for stochastic dynamics
- Mallick, Prakash, Chen, Zhiyiong, Zamani, Mohsen
Robot emotions generated and modulated by visual features of the environment
- Wong, Aaron S. W., Nicklin, Steven, Hong, Kenny, Chalup, Stephan K., Walla, Peter
Stochastic Optimal Control for Multivariable Dynamical Systems Using Expectation Maximization
- Mallick, Prakash, Chen, Zhiyong
- Zhang, Haoxi, Sanín, Cesar, Szczerbicki, Edward, Zhu, Ming
Are you sure you would like to clear your session, including search history and login status?