Skip to content

frankthetank91/CS294_Study

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

11 Commits
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

AI Robotics KR - CS294 μŠ€ν„°λ”” Repository

image_link

μŠ€ν„°λ”” μ†Œκ°œ:

  • λͺ©μ : UC Berkeley의 2018년도 CS294 κ°•μ˜ μžλ£Œμ™€ "파이썬과 μΌ€λΌμŠ€λ‘œ λ°°μš°λŠ” κ°•ν™”ν•™μŠ΅" 자료λ₯Ό 기반으둜 μ‹¬μΈ΅κ°•ν™”ν•™μŠ΅μ— λŒ€ν•΄ κ³΅λΆ€ν•˜κΈ°.
  • κΈ°κ°„: 2019λ…„ 8μ›” ~ 2020λ…„ 1μ›” (μ’…λ£ŒμΌ λ―Έν™•μ •)
  • μ°Έμ—¬μž: μ–‘μ •μ•„, κΉ€μŠΉμ›, 김좩희, κΉ€ν•œμ€€, 이동섭, 이정연, 이해쀑, μ „νš¨μ •, μ΅œμ‹œν˜„, 졜윀규, μ΅œμ›μš°, ν™©νƒœμ€€
  • 기획 κ·Έλ£Ή: AI Robotics KR

μŠ€ν„°λ”” μ§„ν–‰ 방법 / Repository μ‚¬μš©λ²•:

  • κ°•μ˜ 리뷰 : 3인 1νŒ€ ꡬ성, 1μ£Ό 1κ°•μ”© κ°•μ˜ 리뷰 λ°œν‘œ
  • μ½”λ”© : κ°•μ˜ 진도에 λ§žμΆ°μ„œ 예제 μ½”λ“œ μ§„ν–‰
  • 리뷰 λ°œν‘œ 자료, 질의 응닡 κ΄€λ ¨λœ λ‚΄μš©μ€ 당일 λ°œν‘œ νŒ€μ—μ„œ μ •λ¦¬ν•΄μ„œ κΉƒν—ˆλΈŒμ— μ—…λ‘œλ“œ

νŒ€ ꡬ성

  • pytorch

    • μ΅œμ‹œν˜„, 이동섭, μ–‘μ •μ•„
    • ν™©νƒœμ€€, 이정연, 김좩희
  • tensorflow & keras

    • μ΅œμ›μš°, μ „νš¨μ •, κΉ€μŠΉμ›
    • 이해쀑, 졜윀규, κΉ€ν•œμ€€

무단 결석 & 지각 벌금

Deposit : 3λ§Œμ› 무단 결석(당일 μ·¨μ†Œ) μ‹œ λ§Œμ› 차감

지각

  • 10λΆ„ : 2000원
  • 30λΆ„ : 3000원
  • 1μ‹œκ°„ 이후 : 5000원

μŠ€ν„°λ”” μ§„λ„ν‘œ

μŠ€ν„°λ”” λ‚΄μš© λ‚ μ§œμ™€ μ‹œκ°„ λ°œν‘œμž
Lecture 2: Supervised Learning and Imitation 19/08/29 이해쀑, 졜윀규, κΉ€ν•œμ€€
Lecture 4: Reinforcement Learning Introduction 19/09/05 ν™©νƒœμ€€, 이정연, 김좩희
Lecture 5: Policy Gradients Introduction 19/09/19 μ΅œμ›μš°, μ „νš¨μ •, κΉ€μŠΉμ›
Lecture 6: Actor-Critic Introduction 19/09/26 μ΅œμ‹œν˜„, 이동섭, μ–‘μ •μ•„
Lecture 7: Value Functions and Q-Learning 19/10/03 이해쀑, 졜윀규, κΉ€ν•œμ€€
Lecture 8: Advanced Q-Learning Algorithms 19/10/10 ν™©νƒœμ€€, 이정연, 김좩희
Lecture 9: Advanced Policy Gradients 19/10/17 μ΅œμ›μš°, μ „νš¨μ •, κΉ€μŠΉμ›
Lecture 10: Optimal Control and Planning 19/10/24 μ΅œμ‹œν˜„, 이동섭, μ–‘μ •μ•„
Lecture 11: Model-Based Reinforcement Learning 19/10/31 이해쀑, 졜윀규, κΉ€ν•œμ€€
Lecture 12: Advanced Model Learning and Images 19/11/07 ν™©νƒœμ€€, 이정연, 김좩희
Lecture 13: Learning Policies by Imitating Other Policies 19/11/14 μ΅œμ›μš°, μ „νš¨μ •, κΉ€μŠΉμ›
Lecture 14: Probability and Variational Inference Primer 19/11/21 μ΅œμ‹œν˜„, 이동섭, μ–‘μ •μ•„
Lecture 15: Connection between Inference and Control 19/11/28 이해쀑, 졜윀규, κΉ€ν•œμ€€
Lecture 16: Inverse Reinforcement Learning 19/12/05 ν™©νƒœμ€€, 이정연, 김좩희
Lecture 17: Exploration: Part 1 19/12/12 μ΅œμ›μš°, μ „νš¨μ •, κΉ€μŠΉμ›
Lecture 18: Exploration: Part 2 19/12/19 μ΅œμ‹œν˜„, 이동섭, μ–‘μ •μ•„
Lecture 19: Transfer Learning and Multi-Task Learning 19/12/26 이해쀑, 졜윀규, κΉ€ν•œμ€€
Lecture 20: Meta-Learning 19/01/02 ν™©νƒœμ€€, 이정연, 김좩희
Lecture 21: Parallelism and RL System Design 19/01/09 μ΅œμ›μš°, μ „νš¨μ •, κΉ€
Lecture 22: Advanced Imitation Learning and Open Problems 19/01/16 μ΅œμ‹œν˜„, 이동섭, μ–‘μ •μ•„

CS294 μŠ€ν„°λ”” μ•ˆλ‚΄

- μ˜€ν”„λΌμΈμœΌλ‘œ λ§€μ£Ό λͺ©μš”일 19μ‹œ 30λΆ„~ 21μ‹œ 30뢄에 μ§„ν–‰λ©λ‹ˆλ‹€. (2019.08 ~ 2020.01)
- μŠ€ν„°λ”” ν˜•μ‹: λ§€μ£Ό λŒμ•„κ°€λ©΄μ„œ κ·Έ 주의 μ‘°κ°€ μ€€λΉ„ν•œ 자료λ₯Ό μ°Έκ³ ν•΄μ„œ 이둠 곡뢀 + μ‹€μŠ΅ + μ§ˆμ˜μ‘λ‹΅ 및 ν† λ‘  
- 이둠 곡뢀 자료: 
- [Lecture Slides](http://rail.eecs.berkeley.edu/deeprlcourse-fa18/)
- 질의 응닡: κ°•μ˜ λ‚΄μš©κ³Ό κ΄€λ ¨ν•΄ μ„œλ‘œ μ§ˆλ¬Έν•˜κ³  μ˜κ²¬μ„ κ³΅μœ ν•©λ‹ˆλ‹€.

μœ μš©ν•œ 링크 λͺ¨μŒ

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published