KOREATECH (Korea University of Technology and Education) LINK Lab

June 11 (Wed) Paper Seminar – 아셀
Peng Xu, Xiatian Zhu, and David A. Clifton. "Multimodal Learning With Transformers: A Survey." IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023.

June 5 (Thu) Paper Seminar – 석영준
Fayyaz, Anoosha, et al. "On Selecting Paths for End-to-End Entanglement Creation in Quantum Networks." arXiv preprint arXiv:2505.02283 (2025).

May 28 (Wed) Paper Seminar – 김민준
Black, Kevin, et al. "π0: A vision-language-action flow model for general robot control." arXiv, 2024.

May 14 (Wed) Paper Seminar – 지창훈
Garg, Divyansh, et al. "Iq-learn: Inverse soft-q learning for imitation." Advances in Neural Information Processing Systems 34 (2021): 4028-4039.

April 30 (Wed) Paper Seminar – 석영준
Roik, J., Bartkiewicz, K., Černoch, A. et al. "Routing in quantum communication networks using reinforcement machine learning". Quantum Inf Process 23, 89 (2024).

April 23 (Wed) Paper Seminar – 최요한
Agarwal, Ananye, et al. "Legged locomotion in challenging terrains using egocentric vision." Conference on robot learning (CoRL), 2022.

April 16 (Wed) Paper Seminar – 아셀
Qinqing Zheng, Amy Zhang, and Aditya Grover, "Online Decision Transformer." ICML, 2022.

March 26 (Wed) Paper Seminar – 김민준
I Made Aswin Nahrendra, et al. "DreamWaQ: Learning Robust Quadrupedal Locomotion With Implicit Terrain Imagination via Deep Reinforcement Learning." ICRA, 2023.

March 14 (Fri) Paper Seminar – 지창훈
Yu, Wenhao, et al. "Language to rewards for robotic skill synthesis." Conference on Robot Learning (CoRL), 2023.

February 25 (Tue) Paper Seminar – 석영준
Tabish Rashid, Mikayel Samvelyan, et al. "QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning." Journal of Machine Learning Research, 2020.

February 12 (Wed) Paper Seminar – 최요한
Hoeller, David, et al. "Anymal parkour: Learning agile navigation for quadrupedal robots." Science Robotics, 2024.

February 4 (Tue) Paper Seminar – 아셀
I. Adamski, R. Adamski, T. Grel, A. Jędrych, K. Kaczmarek, H. Michalewski, et al. "Distributed Deep Reinforcement Learning: Learn how to play Atari games in 21 minutes." High Performance Computing (ISC High Performance) 2018 – 33rd International Conference, 2018.

January 21 (Tue) Paper Seminar – 김민준
Gabriel B. Margolis, Pulkit Agrawal. "Walk These Ways: Tuning Robot Control for Generalization with Multiplicity of Behavior." Conference on Robot Learning (CoRL), 2022.

January 15 (Wed) Paper Seminar – 지창훈
Rocha, Lidia, et al. "Enhancing Safety via Deep Reinforcement Learning in Trajectory Planning for Agile Flights in Unknown Environments." IROS, 2024.

December 30 (Mon) Paper Seminar – 최요한
Kumar, Ashish, et al. "RMA: Adapting rapid motor adaptation for bipedal robots." 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2022.

December 23 (Mon) Paper Seminar – 아셀
A. Nair, P. Srinivasan, S. Blackwell, C. Alcicek, R. Fearon, A. De Maria, V. Panneershelvam, M. Suleyman, C. Beattie, S. Petersen, S. Legg, V. Mnih, K. Kavukcuoglu, D. Silver, et al. "Massively Parallel Methods for Deep Reinforcement Learning." International Conference on Machine Learning, 2015.

December 13 (Fri) Paper Seminar – 김주봉
Alin-Bogdan Popa, Pantelimon George Popescu, et al. "The Future of QKD Networks." arXiv:2407.00877v1, 2024.
Johann T., Kühl S., & Pachnicke S., et al. "Deep Reinforcement Learning based Decentralized Routing and Load-Balancing in Meshed QKD-Networks." ECOC, 2024.

December 3 (Tue) Paper Seminar – 최호빈
CX. Gao, C. Wu, M. Cao, R. Kong, Z. Zhang, Y. Yu, et al. "ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning." AAAI, 2024.

November 26 (Tue) Paper Seminar – 석영준
Z. Zheng, C. Zhou, X. Tong, M. Yuan, Z. Wang, et al. "UDC: A Unified Neural Divide-and-Conquer Framework for Large-Scale Combinatorial Optimization Problems." NIPS, 2024.

November 19 (Tue) Paper Seminar – 최요한
Zhuang, Ziwen, et al. "Robot parkour learning." CoRL, 2023.

November 12 (Tue) Paper Seminar – 지창훈
Yu, Wenhao, et al. "Pathrl: An end-to-end path generation method for collision avoidance via deep reinforcement learning," ICRA, 2024.

November 5 (Tue) Paper Seminar – 최호빈
Wang, Yuanfu, et al., "Critic-guided decision transformer for offline reinforcement learning," AAAI, Vol. 38, No. 14, 2024.

October 29 (Tue) Paper Seminar – 아셀
H.-K. Lim, J.-B. Kim, I. Ullah, J.-S. Heo, and Y.-H. Han, "Federated Reinforcement Learning Acceleration Method for Precise Control of Multiple Devices", IEEE, 2021.

October 22 (Tue) Paper Seminar – 석영준
T. Islam, M. Arifuzzaman and E. Arslan, "Reinforcement Learning Based Proactive Entanglement Swapping for Quantum Networks", IEEE, 2024.

October 15 (Tue) Paper Seminar – 최요한
Cheng, Xuxin and Shi, Kexin and Agarwal, Ananye and Pathak, Deepak, "Extreme Parkour with Legged Robots", ICRA, 2024.

October 8 (Tue) Paper Seminar – 한연희
Aviral Kumar, Aurick Zhou, George Tucker, Sergey Levine, "Conservative Q-Learning for Offline Reinforcement Learning," NIPS, 2020.

October 1 (Tue) Paper Seminar – 지창훈
Allevato, Adam, et al. "Tunenet: One-shot residual tuning for system identification and sim-to-real robot task transfer." Conference on Robot Learning. PMLR, 2020.
Du, Yuqing, et al. "Auto-tuned sim-to-real transfer." ICRA, 2021.

September 23 (Mon) Paper Seminar – 최호빈
Xie, Zhihui, et al., "Future-conditioned unsupervised pretraining for decision transformer," ICML, 2023.

September 10 (Tue) Paper Seminar – 최요한
Luo, Fu, et al. "Neural combinatorial optimization with heavy decoder: Toward large scale generalization." Advances in Neural Information Processing Systems 36 (2023): 8845-8864.

August 20 (Tue) Paper Seminar – 최요한
Sferrazza, Carmelo, et al. "Body Transformer: Leveraging Robot Embodiment for Policy Learning." RSS Workshop, 2024.

August 14 (Wed) Paper Seminar – 지창훈
Yang, Zhao, et al. "Continuous episodic control." 2023 IEEE Conference on Games (CoG). IEEE, 2023.
Kuznetsov, Igor, and Andrey Filchenkov. "Solving continuous control with episodic memory." IJCAI, 2021.

August 6 (Tue) Paper Seminar – 최호빈
Ma, Xiao, and Li, Wu-Jun, "Weighting Online Decision Transformer with Episodic Memory for Offline-to-Online Reinforcement Learning," ICRA, 2024.

July 30 (Tue) Paper Seminar – 석영준
Y. Gao, S. Yang, F. Li and X. Fu, "Adaptive and Efficient Qubit Allocation Using Reinforcement Learning in Quantum Networks," in IEEE Network, vol. 36, no. 5, pp. 48-54, September/October 2022

July 15 (Mon) Paper Seminar – 아셀
Irwan Bello, Hieu Pham, Quoc V. Le, Mohammad Norouzi, Samy Bengio. "Neural Combinatorial Optimization with Reinforcement Learning", ICLR, 2017.

July 11 (Thu) Paper Seminar – 김주봉
Yang Zhang, Chenjia Bai, Bin Zhao, Junchi Yan, Xiu Li, Xuelong Li, "Decentralized Transformers with Centralized Aggregation are Sample-Efficient Multi-Agent World Models," arXiv:2406.15836v1, 2024.

July 10 (Wed) Paper Seminar – 아셀
Oriol Vinyals, Meire Fortunato, Navdeep Jaitly. "Pointer Networks", NIPS, 2015.
Irwan Bello, Hieu Pham, Quoc V. Le, Mohammad Norouzi, Samy Bengio. "Neural Combinatorial Optimization with Reinforcement Learning", ICLR, 2017.

July 2 (Tue) Paper Seminar – 이재원
Seungeun Rho, Laura Smith, Tianyu Li, Sergey Levine, Xue Bin Peng, Sehoon Ha. "Language Guided Skill Discovery," arXiv preprint, 2024.

June 17 (Mon) Paper Seminar – 최요한
Rudin, N., Hoeller, D., Reist, P., & Hutter, M. "Learning to walk in minutes using massively parallel deep reinforcement learning," CoRL, 2022.

June 11 (Tue) Paper Seminar – 김주봉
L. Le, T. N. Nguyen, A. Lee and B. Dumba, "Entanglement Routing For Quantum Networks: A Deep Reinforcement Learning Approach," ICC 2022 - IEEE International Conference on Communications, 2022.

June 4 (Tue) Paper Seminar – 지창훈
Liu, Yi, et al. "Efficient preference-based reinforcement learning using learned dynamics models," ICRA, 2023.

May 30 (Thu) Paper Seminar – 최호빈
Wu, Yueh-Hua, Xiaolong Wang, and Masashi Hamaya, "Elastic decision transformer," NIPS, 2024.

May 21 (Tue) Paper Seminar – 석영준
Pace, Roy, "Reinforcement Learning in a Quantum Network" (2022). Honors Theses. 1144.

May 14 (Tue) Paper Seminar – 이재원
Yecheng Jason Ma, William Liang, Hung-Ju Wang, Sam Wang, Yuke Zhu, Linxi "Jim" Fan, Osbert Bastani, Dinesh Jayaraman. "DrEureka: Language Model Guided Sim-To-Real Transfer," arXiv preprint, 2024.

April 23 (Tue) Paper Seminar – 지창훈
Romero, Angel, Yunlong Song, and Davide Scaramuzza. "Actor-critic model predictive control," ICRA, 2024.

April 16 (Tue) Paper Seminar – 최호빈
Zheng, Qinqing, Amy Zhang, and Aditya Grover, "Online decision transformer," ICML, PMLR, 2022.

April 9 (Tue) Paper Seminar – 이재원
Timothy P. Lillicrap, Jonathan J. Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, Daan Wierstra, "Continuous control with deep reinforcement learning," ICLR, 2016.

April 2 (Tue) Paper Seminar – 최요한
Laura Smith, Ilya Kostrikov, Sergey Levine. "Demonstrating a walk in the park: Learning to walk in 20 minutes with model-free reinforcement learning." Robotics: Science and Systems, 2023.

March 26 (Tue) Paper Seminar – 김주봉
Mingyang Wang, Zhenshan Bing, Xiangtong Yao, Shuai Wang, Huang Kai, Hang Su, Chenguang Yang, Alois Knoll. "Meta-Reinforcement Learning Based on Self-Supervised Task Representation Learning," AAAI, 2023.

March 19 (Tue) Paper Seminar – 석영준
Jin, Yan, et al. "Pointerformer: Deep reinforced multi-pointer transformer for the traveling salesman problem." Proceedings of the AAAI Conference on Artificial Intelligence. Vol. 37. No. 7. 2023.

March 14 (Thu) Paper Seminar – 이재원
Yuqing Du, Olivia Watkins, et al., "Guiding Pretraining in Reinforcement Learning with Large Language Models," ICML, 2023.

March 5 (Tue) Paper Seminar – 지창훈
Ahmadian, Arash, et al. "Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs." arXiv preprint, 2024.

February 29 (Thu) Paper Seminar – 최호빈
Yuan, Minghai, et al., "Research on flexible job shop scheduling problem with AGV using double DQN," Journal of Intelligent Manufacturing, 2023.

February 20 (Tue) Paper Seminar – 최요한
Tianwei Ni, Michel Ma, Benjamin Eysenbach, Pierre-Luc Bacon. "When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment." Advances in Neural Information Processing Systems, 2023.

February 13 (Tue) Paper Seminar – 김주봉
Jacob Beck, Matthew Jackson, Risto Vuorio, Shimon Whiteson. "Hypernetworks in Meta-Reinforcement Learning," CoRL, 2022.

January 30 (Wed) Paper Seminar – 최요한
Tianwei Ni, Benjamin Eysenbach, Ruslan Salakhutdinov. "Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs." International Conference on Machine Learning, 2022.

January 23 (Tue) Paper Seminar – 이재원, 석영준
Pouryousef, Shahrooz, et al. "Quantum Network Planning for Utility Maximization." Proceedings of the 1st Workshop on Quantum Networks and Distributed Quantum Computing. 2023.
Cicconetti, Claudio, Marco Conti, and Andrea Passarella. "Request scheduling in quantum networks." IEEE Transactions on Quantum Engineering 2 (2021): 2-17.

January 16 (Wed) Paper Seminar – 최호빈
Zhang, Lixiang, Yan Yan, and Yaoguang Hu, "Deep reinforcement learning for dynamic scheduling of energy-efficient automated guided vehicles," Journal of Intelligent Manufacturing, 2023.

January 10 (Wed) Paper Seminar – 지창훈
Chebotar, Yevgen, et al. "Q-transformer: Scalable offline reinforcement learning via autoregressive q-functions." Conference on Robot Learning. PMLR, 2023.

January 2 (Tue) Paper Seminar – 김주봉
Carlos Betancourt, Wen-Hui Chen. "Deep reinforcement learning for portfolio management of markets with a dynamic number of assets," Expert Systems With Applications, 2020.

December 26 (Tue) Paper Seminar – 석영준
Mai, Xuan, Quanzhi Fu, and Yi Chen. "Packet routing with graph attention multi-agent reinforcement learning." 2021 IEEE Global Communications Conference (GLOBECOM). IEEE, 2021.
Zuo, Yingmin, et al. "Reinforcement learning-based resource allocation in quantum key distribution networks." Asia Communications and Photonics Conference. Optica Publishing Group, 2020.

December 19 (Tue) Paper Seminar – 이재원
Tianbao Xie, Siheng Zhao, et al. "Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning," arXiv, 2023.

December 12 (Tue) Paper Seminar – 지창훈
Lee, Kyowoon, Seongun Kim, and Jaesik Choi, "Adaptive and Explainable Deployment of Navigation Skills via Hierarchical Deep Reinforcement Learning," arXiv, 2023.

December 5 (Tue) Paper Seminar – 최호빈
Nachum, Ofir, et al., "Why does hierarchy (sometimes) work so well in reinforcement learning?," arXiv preprint arXiv:1909.10618, 2019.

November 28 (Tue) Paper Seminar – 김주봉
Yun-Hsuan Lien, Yuan-Kui Li, Yu-Shuen Wang. "Contrastive Learning and Reward Smoothing for Deep Portfolio Management," IJCAI, 2023.

November 21 (Tue) Paper Seminar – 지창훈
Anonymous authors. "TD-MPC2: Scalable, Robust World Models for Continuous Control," Under review as a conference paper at ICLR 2024.

November 14 (Tue) Paper Seminar – 석영준
Ayush Jain, Norio Kosaka, Kyung-Min Kim, Joseph J Lim. "Know Your Action Set: Learning Action Relations for Reinforcement Learning," ICLR Conference, 2022.

November 7 (Tue) Paper Seminar – 이재원
Yecheng Jason Ma, William Liang, et al. "Eureka: Human-Level Reward Design via Coding Large Language Models," arXiv, 2023.

October 31 (Tue) Paper Seminar – 최요한
Xuanhao Pan, Yan Jin, et al. "H-TSP: Hierarchically Solving the Large-Scale Travelling Salesman Problem." AAAI, 2023.

October 24 (Tue) Paper Seminar – 최호빈
Cheng-En Wu and Hsiao-Ping Tsai, "Selecting Subgoal for Social AGV Path Planning by Using Reinforcement Learning," 2022 23rd IEEE International Conference on Mobile Data Management (MDM), IEEE, 2022.

October 10 (Tue) Paper Seminar – 최요한
Wan, Shanchuan, et al. "DEIR: Efficient and Robust Exploration through Discriminative-Model-Based Episodic Intrinsic Rewards." IJCAI, 2023.

September 26 (Tue) Paper Seminar – 울라 이산
Y. Liu, H. Yu, S. Xie and Y. Zhang, "Deep Reinforcement Learning for Offloading and Allocation in Vehicle Edge Computing and Networks," IEEE Transactions on Vehicular Technology, vol. 68, no. 11, November 2019.

September 13 (Wed) Paper Seminar – 석영준
Yuan Cao, Yongli Zhao, et al. "Multi-Tenant Provisioning for Quantum Key Distribution Networks With Heuristics and Reinforcement Learning: A Comparative Study," IEEE Transactions on Network and Service Management, vol. 17, no. 2, 2020.

August 30 (Wed) Paper Seminar – 임현교
Liu, Chenyi, et al. "DRL-OR: Deep Reinforcement Learning-based Online Routing for Multi-type Service Requirements," IEEE INFOCOM 2021 - IEEE Conference on Computer Communications, 2021.

August 23 (Wed) Paper Seminar – 지창훈
Qi, Wen, et al. "An adaptive reinforcement learning-based multimodal data fusion framework for human–robot confrontation gaming," Neural Networks, 2023.

August 8 (Tue) Paper Seminar – 최호빈
Ma, Yi, et al., "A hierarchical reinforcement learning based optimization framework for large-scale dynamic pickup and delivery problems," NIPS, 2021.

August 2 (Wed) Paper Seminar – 최요한
Yeong-Dae Kwon, Jinho Choo, et al., "Matrix encoding networks for neural combinatorial optimization," NIPS, 2021.

July 19 (Wed) Paper Seminar – 김주봉
Luisa Zintgraf, Kyriacos Shiarlis, Maximilian Igl, Sebastian Schulze, Yarin Gal, Katja Hofmann, Shimon Whiteson, "VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning," ICLR, 2020.

July 11 (Tue) Paper Seminar – 김주봉
Kate Rakelly, Aurick Zhou, Deirdre Quillen, Chelsea Finn, Sergey Levine, "Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables," ICML, 2019.

July 5 (Wed) Paper Seminar – 최요한
Daniel J. Mankowitz, Andrea Michi, et al. "Faster sorting algorithms discovered using deep reinforcement learning," Nature, 2023.

June 20 (Tue) Paper Seminar – 지창훈
Fujimoto, Scott, et al. "For SALE: State-Action Representation Learning for Deep Reinforcement Learning," arXiv, 2023.

June 13 (Wed) Paper Seminar – 석영준
Yeong-Dae Kwon, Jinho Choo, et al. "POMO: Policy Optimization with Multiple Optima for Reinforcement Learning," NeurIPS, 2020.

May 31 (Wed) Paper Seminar – 최호빈
Hu, Hongtao, et al. "Anti-conflict AGV path planning in automated container terminals based on multi-agent reinforcement learning," International Journal of Production Research, 2023.

May 23 (Tue) Paper Seminar – 임현교
Rasool Fakoor, Pratik Chaudhari, et al. "Meta-Q-Learning," ICLR, 2020.

May 16 (Tue) Paper Seminar – 지창훈
Edward S. Hu, Richard Chang, et al. "Planning goals for exploration," ICLR, 2023.

April 18 (Tue) Paper Seminar – 최호빈
Vezhnevets, Alexander Sasha, et al., "Feudal networks for hierarchical reinforcement learning," International Conference on Machine Learning (ICML), PMLR, 2017.

April 11 (Tue) Paper Seminar – 임현교
Z. Yuan, G. Li, Z. Wang, J. Sun and R. Cheng, "RL-CSL: A Combinatorial Optimization Method Using Reinforcement Learning and Contrastive Self-Supervised Learning," in IEEE Transactions on Emerging Topics in Computational Intelligence, 2022.

April 4 (Tue) Paper Seminar – 울라 이산
L. Wang, K. Wang, C. Pan, W. Xu, N. Aslam and A. Nallanathan, "Deep Reinforcement Learning Based Dynamic Trajectory Control for UAV-Assisted Mobile Edge Computing," in IEEE Transactions on Mobile Computing, 2022.

March 28 (Tue) Paper Seminar – 최요한
Chen, Lili, et al. "Decision transformer: Reinforcement learning via sequence modeling." NIPS, 2021.

March 21 (Tue) Paper Seminar – 석영준
Andrea Skolik, Sofiene Jerbi, Vedran Dunjko, "Quantum agents in the Gym: a variational quantum algorithm for deep Q-learning," Quantum 6, 720, 2022.

March 14 (Tue) Paper Seminar – 지창훈
H. Huang, Y. Yang, H. Wang, Z. Ding, H. Sari and F. Adachi, "Deep Reinforcement Learning for UAV Navigation Through Massive MIMO Technique," IEEE Transactions on Vehicular Technology, 2020.
Yun, Won Joon, et al. "Distributed deep reinforcement learning for autonomous aerial eVTOL mobility in drone taxi applications," ICT Express, 2021.

February 15 (Wed) Paper Seminar – 석영준
Miralem Mehic, Marcin Niemiec, "Quantum Key Distribution: A Networking Perspective," ACM Computing Surveys, Volume 53, Issue 5, Article No. 96, pp. 1–41, 2020.

February 1 (Wed) Paper Seminar – 지창훈
Y. Song, M. Steinweg, E. Kaufmann and D. Scaramuzza, "Autonomous Drone Racing with Deep Reinforcement Learning," IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021.

January 25 (Wed) Paper Seminar – 최호빈
Tang, Hengliang, et al., "A novel hierarchical soft actor-critic algorithm for multi-logistics robots task allocation," IEEE Access, 2021.

January 25 (Wed) Paper Seminar – 지창훈
Zhou, Xin, et al. "Ego-planner: An esdf-free gradient-based local planner for quadrotors." IEEE Robotics and Automation Letters, 2020.

January 12 (Thu) Paper Seminar – 울라 이산
Junyoung Park, Jaehyeong Chun, Sang Hun Kim, Youngkook Kim, Jinkyoo Park, "Learning to schedule job-shop problems: Representation and policy learning using graph neural network and reinforcement learning," International Journal of Production Research, 2021.

December 28 (Wed) Paper Seminar – 석영준
Vinyals, Oriol and Fortunato, Meire and Jaitly, Navdeep, "Pointer Networks," NIPS, 2015.
Irwan Bello, et al., "Neural Combinatorial Optimization with Reinforcement Learning," ICLR, 2017.

December 21 (Wed) Paper Seminar – 최요한
Finn, Chelsea, Pieter Abbeel, and Sergey Levine, "Model-agnostic meta-learning for fast adaptation of deep networks," International conference on machine learning. PMLR, 2017.

December 15 (Thu) Paper Seminar – 지창훈
A. Kumar, et al. "Conservative q-learning for offline reinforcement learning," NIPS, 2020.

December 9 (Fri) Paper Seminar – 지창훈
A. Kumar, et al. "Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes," arXiv, 2022.

November 3 (Thu) Paper Seminar – 최요한
Wang, Jane X., et al., "Learning to Reinforcement Learn," 2017.

October 27 (Thu) Paper Seminar – 석영준
Ruben Solozabal, "Constrained Combinatorial Optimization with Reinforcement Learning," 2020.

October 20 (Thu) Paper Seminar – 최호빈
Chenghao Li, et al., "Celebrating Diversity in Shared Multi-Agent Reinforcement Learning," NIPS, 2021.

October 13 (Thu) Paper Seminar – 임현교
J. Cheng, Y. Wu, Y. Lin, Y. E, F. Tang and J. Ge, "VNE-HRL: A Proactive Virtual Network Embedding Algorithm Based on Hierarchical Reinforcement Learning," IEEE Transactions on Network and Service Management, 2021.

September 30 (Fri) Seminar – 석영준
Windy maze environment of Ray

September 22 (Thu) Paper Seminar – 지창훈
Lucy Xiaoyang Shi, Joseph J. Lim, Youngwoon Lee, "Skill-based Model-based Reinforcement Learning," arXiv, 2022.

September 1 (Thu) Paper Seminar – 석영준
Ryan Lowe, Yi Wu, Aviv Tamar, Jean Harb, Pieter Abbeel, Igor Mordatch, "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments," NeurIPS, 2017

August 24 (Wed) Paper Seminar – 김주봉
Nicklas Hansen, Rishabh Jangir, Yu Sun, Guillem Alenya, Pieter Abbeel, Alexei A Efros, Lerrel Pinto, Xiaolong Wang, "Self-Supervised Policy Adaptation During Deployment," ICLR, 2021.

August 10 (Wed) Paper Seminar – 석영준
Mohammadreza Nazari, Afshin Oroojlooy, Lawrence V. Snyder, Martin Takáč, "Reinforcement Learning for Solving the Vehicle Routing Problem," NeurIPS, 2018.

August 4 (Thu) Paper Seminar – 최요한
Luiz A. Celiberto Jr., et al., "Using Transfer Learning to Speed-Up Reinforcement Learning: a Cased-Based Approach." 2010 Latin American Robotics Symposium and Intelligent Robotics Meeting. IEEE, 2010.
Samuel Barrett, Matthew E. Taylor, and Peter Stone, "Transfer Learning for Reinforcement Learning on a Physical Robot," AAMAS, 2010.

July 19 (Tue) Paper Seminar – 지창훈
Jagdeep Singh Bhatia, Holly Jackson, Yunsheng Tian, Jie Xu, Wojciech Matusik, "Evolution Gym: A Large-Scale Benchmark for Evolving Soft Robots," NIPS, 2021.

July 13 (Wed) Paper Seminar – 최호빈
Wang, Yihan, et al., "Dop: Off-policy multi-agent decomposed policy gradients," ICLR, 2020.

July 6 (Wed) Paper Seminar – 최요한
Meng Fang, et al., "Curriculum-guided Hindsight Experience Replay," Advances in Neural Information Processing Systems 32, 2019.

June 16 (Thu) Paper Seminar – 석영준
Yang Yang, Yulin Hu, M. Cenk Gursoy, "Deep Reinforcement Learning and Optimization Based Green Mobile Edge Computing," IEEE 18th Annual Consumer Communications & Networking Conference (CCNC), 2021.
Denis Yarats, Rob Fergus, Alessandro Lazaric, Lerrel Pinto, "Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning," arXiv:2107.09645, 2022.

June 8 (Wed) Paper Seminar – 울라 이산
Youn J, Han Y-H, "Intelligent Task Dispatching and Scheduling Using a Deep Q-Network in a Cluster Edge Computing System," Sensors, 2022.

June 3 (Fri) Paper Seminar – 최호빈
Hyeoksoo Lee, Jiwoo Hong, and Jongpil Jeong, "MARL-Based Dual Reward Model on Segmented Actions for Multiple Mobile Robots in Automated Warehouse Environment," Applied Sciences, 2022.

May 27 (Fri) Paper Seminar – 장용연
Alexander C. Li, Lerrel Pinto, Pieter Abbeel, "Generalized Hindsight for Reinforcement Learning," arXiv, 2020.

May 18 (Wed) Paper Seminar – 허주성
Stephan Zheng, Alexander Trott, Sunil Srinivasa, David C. Parkes, Richard Socher, "The AI Economist: Optimal Economic Policy Design via Two-level Deep Reinforcement Learning," https://arxiv.org/abs/2108.02755.

May 4 (Wed) Paper Seminar – 지창훈, 김주봉
1) Aravind Srinivas, et al., "CURL: Contrastive Unsupervised Representations for Reinforcement Learning," arXiv, 2020.
2) Yuri Burda, et al., "Exploration by Random Network Distillation," ICLR, 2019.

April 27 (Wed) Paper Seminar – 김주봉, 장용연
3) New Paper Idea
4) Marcin Andrychowicz, et al., "Hindsight Experience Replay," NIPS, 2017.

April 20 (Wed) Paper Seminar – 울라 이산 (external seminar)
A. Qadeer and M. J. Lee, "DDPG-Edge-Cloud: A Deep-Deterministic Policy Gradient based Multi-Resource Allocation in Edge-Cloud System," 2022 International Conference on Artificial Intelligence in Information and Communication (ICAIIC), 2022.

April 13 (Wed) Paper Seminar – 지창훈
Nicklas Hansen, Xiaolong Wang, et al. "Temporal Difference Learning for Model Predictive Control." arXiv, 2022.

April 6 (Wed) Paper Seminar – 석영준
Sur, Giwon, et al. "A Deep Reinforcement Learning-Based Scheme for Solving Multiple Knapsack Problems." Applied Sciences 12.6 (2022): 3068.

March 30 (Wed) Paper Seminar – 최요한
Yeo Jin Kim, Min Chi, “Time-Aware Q-Networks: Resolving Temporal Irregularity for Deep Reinforcement Learning,” arXiv, 2021.

March 23 (Wed) Paper Seminar – 임현교
Refaei Afshar, R., Zhang, Y., Firat, M., and Kaymak, U., “A State Aggregation Approach for Solving Knapsack Problem with Deep Reinforcement Learning,” arXiv, 2020.

March 16 (Wed) Paper Seminar – 최호빈
Chenghao Li, et al., "Celebrating Diversity in Shared Multi-Agent Reinforcement Learning," NIPS, 2021.

March 10 (Thu) Paper Seminar – 허주성
Stephan Zheng, et al., "The AI Economist: Improving Equality and Productivity with AI-Driven Tax Policies," https://arxiv.org/abs/2004.13332.

March 3 (Thu) Paper Seminar – 김주봉
Irwan Bello, et al., "Neural Combinatorial Optimization with Reinforcement Learning," ICLR, 2017.
Thomas D. Barrett, et al., "Exploratory Combinatorial Optimization with Reinforcement Learning," AAAI, 2020.
Ofir Nachum, et al., "Data-Efficient Hierarchical Reinforcement Learning," NIPS, 2018.

February 24 (Thu) Paper Seminar – 최요한
Marcin Andrychowicz, et al., "Hindsight Experience Replay," NIPS, 2017.

February 15 (Tue) Paper Seminar – 울라 이산
F. Qi, L. Zhuo and C. Xin, "Deep Reinforcement Learning Based Task Scheduling in Edge Computing Networks," 2020 IEEE/CIC International Conference on Communications in China (ICCC), pp. 835-840, 2020.

February 8 (Tue) Paper Seminar – 석영준
RL — Policy Gradient Explained (Part I & II) Presentation

January 25 (Tue) Paper Seminar – 임현교
H. Huang et al., "Scalable Orchestration of Service Function Chains in NFV-Enabled Networks: A Federated Reinforcement Learning Approach," in IEEE Journal on Selected Areas in Communications, vol. 39, no. 8, pp. 2558-2571, Aug. 2021.

January 18 (Tue) Paper Seminar – 지창훈
Weirui Ye, et al., "Mastering Atari Games with Limited Data," NIPS, 2021.

January 4 (Tue) Paper Seminar – 최요한
Junhyuk Oh, Satinder Singh, and Honglak Lee, "Value Prediction Network," NIPS, 2017.

December 28 (Tue) Paper Seminar – 울라 이산
Taihui Li, et al., "An End-to-End Network Slicing Algorithm Based on Deep Q-Learning for 5G Network," IEEE Access, July 2020.

December 21 (Tue) Paper Seminar – 최호빈
Gupta, Tarun, et al., "Uneven: Universal value exploration for multi-agent reinforcement learning," International Conference on Machine Learning. PMLR, 2021.

December 14 (Tue) Paper Seminar – 지창훈
Cameron Browne et al., "A Survey of Monte Carlo Tree Search Methods," IEEE Transactions on Computational Intelligence and AI in Games, Vol. 4, No. 1, March 2012.

December 7 (Tue) Paper Seminar – 김주봉
Woojun Kim, Jongeui Park, Youngchul Sung, "Communication in Multi-Agent Reinforcement Learning-Intention Sharing," ICLR, 2021.

November 30 (Tue) Paper Seminar – 지창훈
Levine, Sergey, et al., "Offline reinforcement learning: Tutorial, review, and perspectives on open problems," arXiv preprint, 2020.

November 23 (Tue) Paper Seminar – 울라 이산
Lu Zhang, et al., "Task Offloading and Trajectory Control for UAV-Assisted Mobile Edge Computing Using Deep Reinforcement Learning," IEEE ACCESS, 2021.

November 16 (Tue) Paper Seminar – 최요한
Marc G. Bellemare, Will Dabney, and Remi Munos, "A Distributional Perspective on Reinforcement Learning," ICML, 2017.

October 26 (Tue) Paper Seminar – 최호빈
Lowe, Ryan, et al., "Multi-agent actor-critic for mixed cooperative-competitive environments," NIPS, 2017.

October 19 (Tue) Paper Seminar – 김주봉
Jianhao Wang, Zhizhou Ren, Terry Liu, Yang Yu, Chongjie Zhang, "QPLEX: Duplex Dueling Multi-Agent Q-Learning," ICLR, 2021.

October 12 (Tue) Paper Seminar – 지창훈
Richard S. Sutton. “Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming.” Machine learning proceedings 1990. Morgan Kaufmann, 1990.
Richard S. Sutton. “Dyna, an Integrated Architecture for Learning, Planning, and Reacting.” ACM Sigart Bulletin, 1991.

October 5 (Tue) Paper Seminar – 울라 이산
P. Zhang, et al., "Dynamic Virtual Network Embedding Algorithm based on Graph Convolution Neural Network and Reinforcement Learning," IEEE Internet of Things Journal, 2021.

September 28 (Tue) Paper Seminar – 최요한
Hanjun Dai, et al., "Learning Combinatorial Optimization Algorithms over Graphs," NIPS, 2017.

September 14 (Tue) Paper Seminar – 최요한
Ziyu Wang, et al., "Dueling Network Architectures for Deep Reinforcement Learning." International Conference on Machine Learning. PMLR, 2016.

September 7 (Tue) Paper Seminar – 김주봉
Chao Yu, Akash Velu, Eugene Vinitsky, Yu Wang, Alexandre Bayen, "The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games," arXiv preprint arXiv:2103.01955, 2021.

August 30 (Mon) Paper Seminar – 지창훈
Hamid Ali, Hammad Majeed, Imran Usman, Khaled A. Almejalli. “Reducing Entropy Overestimation in Soft Actor Critic Using Dual Policy Network.” Hindawi, 2021.

August 23 (Mon) Paper Seminar – 울라 이산
Yi, Mengjie, Xijun Wang, Juan Liu, Yan Zhang and B. Bai. “Deep Reinforcement Learning for Fresh Data Collection in UAV-assisted IoT Networks.” IEEE INFOCOM, pp. 716-721, 2020.

August 9 (Mon) Paper Seminar – 임현교
N. Navid, F. Hung, S. Soleyman and D. Khosla. “Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning.” ArXiv abs/2010.04740, 2020.

August 2 (Mon) Paper Seminar – 김주봉
T. Wang, T. Gupta, A. Mahajan, B. Peng, S. Whiteson, C. Zhang, "RODE: Learning Roles to Decompose Multi-Agent Tasks," arXiv preprint arXiv:2010.01523, 2020.

July 26 (Mon) Paper Seminar – 울라 이산
Omar Bouhamed et al., "A UAV-Assisted Data Collection for Wireless Sensor Networks: Autonomous Navigation and Scheduling," IEEE Access, 2020.

July 19 (Mon) Paper Seminar – 최호빈
Huang, Shengyi, and Santiago Ontañón, "A closer look at invalid action masking in policy gradient algorithms," arXiv preprint arXiv:2006.14171, 2020.

July 12 (Mon) Paper Seminar – 지창훈
Kulkarni, Tejas D., et al., "Hierarchical deep reinforcement learning: Integrating temporal abstraction and intrinsic motivation," Advances in neural information processing systems, 2016.

June 28 (Mon) Paper Seminar – 최요한
Richard S. Sutton, et al., "Policy Gradient Methods for Reinforcement Learning with Function Approximation," NIPS, 1999.

June 21 (Mon) Paper Seminar – 지창훈
Pathak, Deepak, et al., "Curiosity-driven exploration by self-supervised prediction," International Conference on Machine Learning. PMLR, 2017.

June 14 (Mon) Paper Seminar – 임현교
A. Rkhami, et al., "On the Use of Graph Neural Networks for Virtual Network Embedding," 2020 International Symposium on Networks, Computers and Communications (ISNCC), 2020.

May 31 (Mon) Paper Seminar – 김주봉
Qiang Ma, et al., "Combinatorial Optimization by Graph Pointer Networks and Hierarchical Reinforcement Learning," arXiv:1911.04936, 2019.
Vaswani A., et al., "Attention Is All You Need," NIPS, 2017.
Vinyals, Oriol and Fortunato, Meire and Jaitly, Navdeep, "Pointer Networks," NIPS, 2015.

May 24 (Mon) Paper Seminar – 최호빈
Tang, Hengliang, et al., "A novel hierarchical soft actor-critic algorithm for multi-logistics robots task allocation," IEEE Access, 2021.

May 20 (Thu) Paper Seminar – 임현교
Wang, Cong, et al. "Modeling on Virtual Network Embedding Using Reinforcement Learning," Concurrency and Computation: Practice and Experience, 2020.
H. Yao, S. Ma, J. Wang, P. Zhang, C. Jiang and S. Guo, "A Continuous-Decision Virtual Network Embedding Scheme Relying on Reinforcement Learning," in IEEE Transactions on Network and Service Management, 2020.

May 12 (Wed) Paper Seminar – 지창훈
Junta Wu and Huiyun Li, "Deep Ensemble Reinforcement Learning with Multiple Deep Deterministic Policy Gradient Algorithm," Hindawi, 2020.

May 3 (Mon) Paper Seminar – 김주봉
Shariq Iqbal, et al., "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning," arXiv:2006.04222, 2020.

April 5 (Mon) Paper Seminar – 임현교
H. Yao, X. Chen, P. Zhang and L. Wang, "A novel reinforcement learning algorithm for virtual network embedding," Neurocomputing, 2018.

March 22 (Mon) Paper Seminar – 김주봉
Tabish Rashid, Gregory Farquhar, Bei Peng, Shimon Whiteson, "Weighted QMIX: Expanding Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning," NIPS, 2020.

March 15 (Mon) Paper Seminar – 최호빈
Christianos, Filippos, Lukas Schäfer, and Stefano V. Albrecht, "Shared Experience Actor-Critic for Multi-Agent Reinforcement Learning," arXiv preprint arXiv:2006.07169, 2020.

March 8 (Mon) Paper Seminar – 지창훈
Scott Fujimoto, et al., "Addressing Function Approximation Error in Actor-Critic Methods", International Conference on Machine Learning, 2018.

March 2 (Tue) Paper Seminar – 임현교
Z. Yan, et al., "Automatic Virtual Network Embedding: A Deep Reinforcement Learning Approach With Graph Convolutional Networks," IEEE Journal on Selected Areas in Communications, vol. 38, no. 6, June 2020.

February 8 (Mon) Paper Seminar – 울라 이산
Khoa Nguyen, et al., "Efficient Virtual Network Embedding with Node Ranking and Intelligent Link Mapping," IEEE 9th International Conference on Cloud Networking (CloudNet), November, 2020.

February 1 (Mon) Paper Seminar – 김주봉
Yong Liu, et al., "Multi-Agent Game Abstraction via Graph Attention Neural Network," AAAI, 2020.

January 25 (Mon) Paper Seminar – 최호빈
Enright, John J., and Peter R. Wurman, "Optimization and coordinated autonomy in mobile fulfillment systems," Workshops at the twenty-fifth AAAI conference on artificial intelligence, 2011.

January 27 (Wed) Paper Seminar – 울라 이산
Min Feng, et al., "Virtual Network Embedding based on Modified Genetic Algorithm," Peer-to-Peer Networking and Applications, October 2019.

January 18 (Mon) Paper Seminar – 지창훈
Mohammed Hossny, et al., "Refined Continuous Control of DDPG Actors via Parametrised Activation," arXiv:2006.02818, June 2020.

January 11 (Mon) Paper Seminar – 임현교
Mosharaf Chowdhury, et al., "ViNEYard: Virtual Network Embedding Algorithms With Coordinated Node and Link Mapping," IEEE/ACM TRANSACTIONS ON NETWORKING, VOL. 20, NO. 1, FEBRUARY 2012.

January 4 (Mon) Paper Seminar – 황규영
Tuomas Haarnoja, et al., "Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor," ICML 2018.

December 21 (Tue) Paper Seminar – 최호빈, 김주봉
Anuj Mahajan, et al., "MAVEN: Multi-Agent Variational Exploration," NeurIPS, 7611-7622, 2019.

December 14 (Mon) Paper Seminar – 울라 이산
M. Feng, et al., "Topology-Aware Virtual Network Embedding Through the Degree," National Doctoral Academic Forum on Information and Communications Technology 2013, Aug, 2013.

November 16 (Mon) Paper Seminar – 임현교
Fengsheng Wei et al., “Network Slice Reconfiguration by Exploiting Deep Reinforcement Learning with Large Action Space,” IEEE Transactions on Network and Service Management, 2020.

November 9 (Mon) Paper Seminar – 김주봉
Kyunghwan Son et al., “QTRAN: Learning to Factorize with Transformation for Cooperative Multi-Agent Reinforcement Learning,” Proceedings of the 36th International Conference on Machine Learning, 2019.

November 2 (Mon) Paper Seminar – 울라 이산
Haozhe Wang et al., “Data-driven dynamic resource scheduling for network slicing: A Deep reinforcement learning approach,” Information Sciences, 498, pp. 106-116, 2019.

October 19 (Mon) Paper Seminar – 지창훈
S. Vassilaras et al., “Applying Deep Learning and Reinforcement Learning to Traveling Salesman Problem,” in IEEE International Conference on Systems, Man, and Cybernetics (SMC), Aug. 2018.

October 5 (Mon) Paper Seminar – 임현교
S. Vassilaras et al., “The Algorithmic Aspects of Network Slicing,” in IEEE Communications Magazine, vol. 55, no. 8, pp. 112-119, Aug. 2017.

September 28 (Mon) Paper Seminar – 김주봉
Yuki Miyashita and Toshiharu Sugawara, “Analysis of coordinated behavior structures with multi-agent deep reinforcement learning,” Applied Intelligence, 2020.

September 21 (Mon) Paper Seminar – 지창훈
Galina L. Rogova, Jyotsna Kasturi, "Reinforcement Learning Neural Network For Distributed Decision Making", 2002. https://www.researchgate.net/publication/228559072_Reinforcement_learning_neural_network_for_distributed_decision_making

September 14 (Mon) Paper Seminar – 임현교
J. Du, X. Huang, F. Wu and S. Leng, "Reinforcement Learning Empowered QoS-aware Adaptive Q-Routing in Ad-hoc Networks," 2020 International Wireless Communications and Mobile Computing (IWCMC), Limassol, Cyprus, pp. 551-556, 2020. https://ieeexplore.ieee.org/document/9148532

September 7 (Mon) Paper Seminar – 최호빈
Yali Du, Lei Han, Meng Fang, Tianhong Dai, Ji Liu, Dacheng Tao, "LIIR: Learning Individual Intrinsic Reward in Multi-Agent Reinforcement Learning," NIPS, 2019.

July 30 (Thu) Paper Seminar – 임현교
Q. Fu, E. Sun, K. Meng, M. Li and Y. Zhang, "Deep Q-Learning for Routing Schemes in SDN-Based Data Center Networks," in IEEE Access, vol. 8, pp. 103491-103499, 2020.

July 23 (Thu) Paper Seminar – 황규영
Ian Osband, Charles Blundell, Alexander Pritzel, Benjamin Van Roy, “Deep Exploration via Bootstrapped DQN,” Advances in Neural Information Processing Systems 29 (NIPS 2016).

February 17 (Mon) Paper Seminar – 최호빈
Zäzilia Seibold, Thomas Stoll, Kai Furmans, “Layout-optimized sorting of goods with decentralized controlled conveying modules,” 2013 IEEE International Systems Conference (SysCon), Apr. 2013.

February 11 (Tue) Paper Seminar – 김주봉
T. Eccles et al., “Biases for Emergent Communication in Multi-agent Reinforcement Learning,” NIPS, 2019.

February 3 (Mon) Textbook Seminar – 황규영
Richard S. Sutton and Andrew G. Barto, “Reinforcement Learning: An Introduction,” second edition, MIT Press, Cambridge, MA, 2018.

January 20 (Mon) Paper Seminar – 임현교
- Paper: C. Nadiger, A. Kumar and S. Abdelhak, “Federated Reinforcement Learning for Fast Personalization,” 2019 IEEE Second International Conference on Artificial Intelligence and Knowledge Engineering (AIKE), Sardinia, Italy, pp. 123-127, 2019.

January 17 (Fri) Paper Seminar – 황규영
Matteo Hessel, Joseph Modayil, Hado van Hasselt, Tom Schaul, Georg Ostrovski, Will Dabney, Dan Horgan, Bilal Piot, Mohammad Azar, David Silver, “Rainbow: Combining Improvements in Deep Reinforcement Learning,” AAAI, 2018.

January 6 (Mon) Paper Seminar – 최호빈
- Paper: Sanmit Narvekar, Peter Stone, “Learning Curriculum Policies for Reinforcement Learning,” arXiv:1812.00285, 2018.

December 30 (Mon) Paper Seminar – 김주봉
David Silver, Guy Lever, Nicolas Heess, Thomas Degris, Daan Wierstra, Martin Riedmiller, “Deterministic Policy Gradient Algorithms,” JMLR, 2014. & Appendix
Timothy P. Lillicrap, Jonathan J. Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver & Daan Wierstra, “Continuous Control with Deep Reinforcement Learning,” ICLR, 2016.

December 23 (Mon) Paper Seminar – 임현교
Horgan, Dan, John Quan, David Budden, Gabriel Barth-Maron, Matteo Hessel, Hado van Hasselt and David Silver. “Distributed Prioritized Experience Replay.” ArXiv abs/1803.00933, 2018.

December 16 (Mon) Paper Seminar – 임현교
- Paper: Nair, Arun, Praveen Srinivasan, Sam Blackwell, Cagdas Alcicek, Rory Fearon, Alessandro De Maria, Vedavyas Panneershelvam, Mustafa Suleyman, Charles Beattie, Stig Petersen, Shane Legg, Volodymyr Mnih, Koray Kavukcuoglu and David Silver. “Massively Parallel Methods for Deep Reinforcement Learning.” ArXiv abs/1507.04296, 2015.

November 18 (Mon) Paper Seminar – 임현교
- Paper: Ramy E. Ali, Bilgehan Erman, Ejder Baştuğ and Bruce Cilli, “Hierarchical Deep Double Q-Routing,” arXiv:1910.04041, 9 Oct. 2019.
November 11 (Mon) Paper Seminar – 최호빈
- Paper: Yoshua Bengio, Jerome Louradour, Ronan Collobert, Jason Weston, “Curriculum Learning,” ICML, 2009.
November 4 (Mon) Paper Seminar – 김주봉
- Paper: Zhang-Wei Hong, Shih-Yang Su, Tzu-Yun Shann, Yi-Hsiang Chang, Chun-Yi Lee, “A Deep Policy Inference Q-Network for Multi-Agent Systems,” arXiv:1712.07893, 9 Apr. 2018.

October 14 (Mon) Paper Seminar – 황규영
Efe Camci, Erdal Kayacan, “End-to-End Motion Planning of Quadrotors Using Deep Reinforcement Learning,” arXiv:1909.13599v1, 30 Sep 2019.

October 7 (Mon) Paper Seminar – 황규영
Iñaki Iturrate, Ricardo Chavarriaga, Luis Montesano, Javier Minguez, José del R. Millán, “Teaching brain-machine interfaces as an alternative paradigm to neuroprosthetics control,” Nature Scientific Reports, 2015.

September 30 (Mon) Paper Seminar – 김주봉
- Paper: Bowen Baker, “EMERGENT TOOL USE FROM MULTI-AGENT AUTOCURRICULA,” arXiv:1909.07528v1, 2019.
- Presentation materials
September 4 (Wed) Paper Seminar – 김주봉
- Paper: Jonathan Ho, Stefano Ermon, “Generative Adversarial Imitation Learning,” 30th Conference on Neural Information Processing Systems (NIPS 2016), Barcelona, Spain.
- Presentation materials
June 25 (Tue) Paper Seminar – 최호빈
- James C. Chen, et al., “Solving a Sortation Conveyor Layout Design Problem with Simulation-optimization Approach,” 2019 IEEE 6th International Conference on Industrial Engineering and Applications (ICIEA), 2019
- A. Jayaraman, et al., “A Sortation System Model,” Winter Simulation Conference Proceedings, 1997
- Fu-bin Pan, “Simulation Design of Express Sorting System—Example of SF’s Sorting Center,” The Open Cybernetics & Systemics Journal, 8, 1116-1122, 2014
- Samuel Chenatti, et al., “Deep Reinforcement Learning in Robotics Logistic Task Coordination,” 2018 Latin American Robotic Symposium, 2018
June 17 (Mon) Paper Seminar – 김주봉
- Paper: J. Foerster, “Learning to Communicate with Deep Multi-Agent Reinforcement Learning,” 30th Conference on Neural Information Processing Systems (NIPS 2016), Barcelona, Spain.
- Presentation materials
June 10 (Mon) Paper Seminar – 허주성
Hyun-Joo Kim, “Design and Implementation of an Efficient Web Services Data Processing Using Hadoop-Based Big Data Processing Technique,” Journal of the Korea Academia-Industrial Cooperation Society, Vol. 16, No. 1 pp. 726-734, 2015
May 27 (Mon) Paper Seminar – 임현교
- Paper: Almuthanna T. Nassar, Yasin Yilmaz, “Reinforcement Learning-based Resource Allocation in Fog RAN for IoT with Heterogeneous Latency Requirements,” arXiv:1806.04582, 2018
- Presentation materials
May 13, 20 (Mon) Paper Seminar – 권도형
- Paper: H. Yin and S. J. Pan, “Knowledge Transfer for Deep Reinforcement Learning with Hierarchical Experience Replay,” Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence
- Presentation materials

May 8 (Wed) Paper Seminar – 황규영
François Chollet, “Xception: Deep Learning with Depthwise Separable Convolutions”, arXiv:1610.02357v3 [cs.CV] 4 Apr 2017.

April 29 (Mon) Paper Seminar – 최호빈
- Paper: Samuel L. Smith, Pieter-Jan Kindermans, Chris Ying, and Quoc V. Le, “Don’t Decay The Learning Rate, increase The Batch Size,” ICLR 2018.
- Presentation materials
April 8 (Mon) Paper Seminar – 임현교
- Paper: Quang Tran Anh Pham, Yassine Hadjadj-Aoul, and Abdelkader Outtagarts, “Deep Reinforcement Learning based QoS-aware Routing in Knowledge-defined networking,” Qshine 2018 – 14th EAI International Conference on Heterogeneous Networking for Quality, Reliability, Security and Robustness, pp. 1-13, Dec, 2018.
- Presentation materials
April 1 (Mon) Paper Seminar – 허주성
- Paper: Heejin Jang, Yeonghun Chae, Sangwon Lee, Jinyong Jo, “Automatic Object Extraction from Electronic Documents Using Deep Neural Network,” KIPS Transactions on Software and Data Engineering, Vol. 7, No. 11, pp. 411-418, Nov. 2018.
- Presentation materials
March 25 (Mon) Paper Seminar – 최호빈
- Paper: Igor Adamski, Robert Adamski, Tomasz Grel, Adam Jędrych, Kamil Kaczmarek, Henryk Michalewski, “Distributed Deep Reinforcement Learning: Learn how to play Atari games in 21 minutes”, arXiv:1801.02852 [cs.AI], Jan 2018.
- Presentation materials

March 18 (Mon) Paper Seminar – 황규영
G. Sartoretti, Y. Wu, W. Paivine, T. K. S. Kumar, S. Koenig, and H. Choset, “Distributed Reinforcement Learning for Multi-Robot Decentralized Collective Construction”, DARS 2018.

March 11 (Mon) Paper Seminar – 임현교
- Paper: J. Schulman, F. Wolski, P. Dhariwal, A. Radford and O. Klimov, “Proximal Policy Optimization Algorithms,” arXiv:1707.06347v2 [cs.LG], Aug. 2017.
- Presentation materials
March 4 (Mon) Paper Seminar – 김주봉
- Paper: J. Schulman, S. Levine, P. Moritz, M. Jordan and P. Abbeel, “Trust Region Policy Optimization,” arXiv:1502.05477v5 [cs.LG], Apr. 2017.
- Presentation materials
February 25 (Mon) Paper Seminar – 김주봉
- Paper: M. Egorov, “Multi-Agent Deep Reinforcement Learning,” Stanford University
- Presentation materials
February 15 (Fri) Paper Seminar – 권도형
- Paper: V. Mnih, A.P. Badia, M. Mirza, A. Graves, T.P. Lillicrap, T. Harley, D. Silver, and K. Kavukcuoglu, “Asynchronous Methods for Deep Reinforcement Learning,” Proceedings of The 33rd International Conference on Machine Learning, vol. 48, pp. 1928-1937, 2016.
- Presentation materials
February 7 (Thu) Paper Seminar – 임현교
- Paper: Chin-Feng Lai, Wei-Che Chien, Laurence T. Yang, Weizhong Qiang, “LSTM and Edge Computing for Big Data Feature Recognition of Industrial Electrical Equipment,” IEEE Transactions on Industrial Informatics. PP. 1-1, 2019.
- Presentation materials
January 31 (Thu) Paper Seminar – 김주봉
- Paper: S. James, P. Wohlhart, M. Kalakrishnan, D. Kalashnikov, A. Irpan, J. Ibarz, S. Levine, R. Hadsell, and K. Bousmalis, “Sim-to-Real via Sim-to-Sim: Data-efficient Robotic Grasping via Randomized-to-Canonical Adaptation Networks,” arXiv:1812.07252 [cs.RO], Dec. 2018.
- Presentation materials

December 20 (Thu) Paper Seminar – 임현교
M. Spryn, A. Sharma, D. Parkar, and M. Shrimal, "Distributed Deep Reinforcement Learning on the Cloud for Autonomous Driving," ACM/IEEE 1st International Workshop on Software Engineering for AI in Autonomous Systems, 2018.

UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning ICML 2021

Communication in Multi-Agent Reinforcement Learning-Intention Sharing ICLR 2021

Succinct and Robust Multi-Agent Communication With Temporal Message Control NIPS 2020

Multi-agent active perception with prediction rewards NIPS 2020

Promoting Coordination through Policy Regularization in Multi-Agent Deep Reinforcement Learning NIPS 2020

Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal Sample Complexity NIPS 2020

Learning Implicit Credit Assignment for Cooperative Multi-Agent Reinforcement Learning NIPS 2020

Learning Multi-Agent Communication through Structured Attentive Reasoning NIPS 2020

Shared Experience Actor-Critic for Multi-Agent Reinforcement Learning NIPS 2020

Robust Multi-Agent Reinforcement Learning with Model Uncertainty NIPS 2020

Weighted QMIX: Expanding Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning NIPS 2020

Scalable Multi-Agent Reinforcement Learning for Networked Systems with Average Reward NIPS 2020

Contextual Games: Multi-Agent Learning with Side Information NIPS 2020

Learning Individually Inferred Communication for Multi-Agent Cooperation NIPS 2020

EvolveGraph: Multi-Agent Trajectory Prediction with Dynamic Relational Reasoning NIPS 2020

Joint Policy Search for Multi-agent Collaboration with Imperfect Information NIPS 2020

"Attention, Learn to Solve Routing Problems!" ICLR 2019.

"DeepViNE: Virtual Network Embedding with Deep Reinforcement Learning," IEEE INFOCOM 2019.