Skip to content

Commit a03040e

Browse files
Update README with Chapter 8 integration
1 parent e63f068 commit a03040e

1 file changed

Lines changed: 6 additions & 3 deletions

File tree

README.md

Lines changed: 6 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -7,6 +7,7 @@
77
[![ch5](https://github.com/srikanthbaride/Reinforcement-Learning-Explained-Code/actions/workflows/ch5.yml/badge.svg)](https://github.com/srikanthbaride/Reinforcement-Learning-Explained-Code/actions/workflows/ch5.yml)
88
[![ch6](https://github.com/srikanthbaride/Reinforcement-Learning-Explained-Code/actions/workflows/ch6.yml/badge.svg)](https://github.com/srikanthbaride/Reinforcement-Learning-Explained-Code/actions/workflows/ch6.yml)
99
[![ch7](https://github.com/srikanthbaride/Reinforcement-Learning-Explained-Code/actions/workflows/ch7.yml/badge.svg)](https://github.com/srikanthbaride/Reinforcement-Learning-Explained-Code/actions/workflows/ch7.yml)
10+
[![ch8](https://github.com/srikanthbaride/Reinforcement-Learning-Explained-Code/actions/workflows/ch8.yml/badge.svg)](https://github.com/srikanthbaride/Reinforcement-Learning-Explained-Code/actions/workflows/ch8.yml)
1011

1112
---
1213

@@ -22,6 +23,7 @@ It provides clean, minimal, and well-tested implementations of key reinforcement
2223
- [Chapter 5: Monte Carlo Methods](./ch5_monte_carlo)
2324
- [Chapter 6: Temporal-Difference Learning](./ch6_td_learning)
2425
- [Chapter 7: TD Control — SARSA and Q-Learning](./ch7_td_control)
26+
- [Chapter 8: Eligibility Traces and TD(λ)](./ch8_td_lambda)
2527

2628
---
2729

@@ -36,7 +38,7 @@ It provides clean, minimal, and well-tested implementations of key reinforcement
3638
| 5 | Monte Carlo Methods | ✅ Complete | Prediction, Control, On/Off-Policy |
3739
| 6 | Temporal-Difference Learning | ✅ Complete | TD(0), n-step TD, prediction examples |
3840
| 7 | TD Control | ✅ Complete | SARSA, Q-learning, Cliff-Walking, exploration |
39-
| 8+ | Advanced TD / Function Approx. | ⏳ In Progress | Eligibility Traces, Deadly Triad, etc. |
41+
| 8 | Eligibility Traces and TD(λ) | ✅ Complete | TD(λ), SARSA(λ), True Online TD(λ), gridworld demos |
4042

4143
---
4244

@@ -50,6 +52,7 @@ rl-fundamentals-code/
5052
├─ ch5_monte_carlo/ # Chapter 5
5153
├─ ch6_td_learning/ # Chapter 6
5254
├─ ch7_td_control/ # Chapter 7
55+
├─ ch8_td_lambda/ # Chapter 8
5356
├─ utils/
5457
└─ .github/workflows/
5558
```
@@ -64,10 +67,10 @@ To run all tests:
6467
python -m pytest -q
6568
```
6669

67-
Run only Chapter 7 tests:
70+
Run only Chapter 8 tests:
6871

6972
```bash
70-
python -m pytest -q tests/test_ch7_td_control.py
73+
python -m pytest -q ch8_td_lambda/tests
7174
```
7275

7376
---

0 commit comments

Comments
 (0)