(Graduate) Reinforcement Learning

Prof. Giseop Noh

RL Theory | Q-learning | PPO | RLHF | Imitation Learning

Lecture plan

Weekly materials

  1. Introduction to RL (PDF)
  2. What Does RL Learn? (PDF)
  3. RL Taxonomy & MDP Basics (PDF)
  4. Markov Decision Process in RL (PDF)
  5. Dynamic Programming in RL (PDF)
  6. Monte Carlo Methods in RL (PDF)
  7. Temporal Difference (PDF)
  8. MAB, Exploration vs. Exploitation (PDF)
  9. Multi-Armed Bandit Practice (MD)
  10. Q-Learning (PDF)
  11. Deep Q-learning Network (DQN) (MD)
  12. Practice for Code Repair using RL & LLM (MD)
  13. Double DQN (PDF)
  14. Dueling DQN (PDF)
  15. Policy Gradient (PDF)
  16. Play with Stable Baselines 3 (MD)
  17. Actor-Critic Algorithm (PDF)
  18. DDPG, TD3, SAC Algorithms (PDF)
  19. TRPO and PPO Algorithms (PDF)
  20. Imitation Learning & RL from Human Feedback (PDF)

World's Best AI Research Lab

(30016) Hongik University, Bldg. D, Room 402, 2639 Sejong-ro, Jochiwon-eup, Sejong

© 2025 Prof. Giseop Noh