Skip to content

Promote hand-written 2017 notes out of Archive/ into notes/#7

Open
0bserver07 wants to merge 1 commit into
masterfrom
claude/promote-original-notes
Open

Promote hand-written 2017 notes out of Archive/ into notes/#7
0bserver07 wants to merge 1 commit into
masterfrom
claude/promote-original-notes

Conversation

@0bserver07
Copy link
Copy Markdown
Owner

You were right — the structure had the trusted, hand-written CS294 notes filed under Archive/ (which sounds like a museum) while the unreviewed AI-drafted lectures occupied notes/ (the "main" content). For someone learning RL from this repo, that's exactly backwards: the part they can actually trust is buried, the part they should treat with skepticism is featured.

This PR fixes it. Not merging — you review and tell me to merge or change something.

What moved

  • Archive/2017-Course-Notes/CS294-DeepRL-Berkeley/notes/cs294-2017/ (with imgs/ intact, image links unchanged)
  • Archive/2017-Course-Notes/Elements-Of-RL/notes/sutton-barto-digest/
  • Archive/README.md deleted (it was a wrapper; no content lost)
  • Archive/ directory gone

Both moved files got a <!-- status: hand-written --> header so they're explicitly marked as trusted.

What changed in the docs

  • readme.md "What's here" now leads with Trusted, hand-written (CS294 notes, Sutton & Barto digest, curated talks/books/courses, tested exercises), then AI-drafted, useful as scaffold (treat with skepticism) (the 19-lecture series, cheat sheets, diagrams, scraped paper lists, tools).
  • readme.md "Start here" reordered: talks/books/courses first → exercises → drafts → AGENTS.md.
  • notes/README.md rewritten the same way — leads with the hand-written content, the lecture-series table sits below clearly framed as drafts.
  • AGENTS.md and CLAUDE.md updated: the layout table now points at notes/cs294-2017/ and notes/sutton-barto-digest/ as the trusted, frozen, never-reword material instead of Archive/. The voice rule that referenced "the 2017 archive" updated to point at the new path.

What I did separately (already live, not in this PR)

GitHub topics refreshed: dropped guideline and study; added rlhf, llm-alignment, dpo, grpo, ppo, rlvr, agentic-rl, lecture-notes, study-notes, deepseek-r1, constitutional-ai, policy-gradient, q-learning, sutton-barto. Description sharpened to "RL study guide — foundations through RLHF, DPO, GRPO, RLVR, agentic RL, and offline RL. Hand-written CS294 notes, 19 lecture drafts, 5 tested exercises, citations that resolve."

🤖 Generated with Claude Code

The trusted, hand-written CS294 notes were sitting in a folder called
Archive — which sounds like "old/dead" — while the unreviewed AI-drafted
lecture series occupied notes/. Backwards from a learner's perspective.

- Archive/2017-Course-Notes/CS294-DeepRL-Berkeley/ -> notes/cs294-2017/
  (with imgs/ intact)
- Archive/2017-Course-Notes/Elements-Of-RL/ -> notes/sutton-barto-digest/
- Both files got <!-- status: hand-written --> headers
- Archive/ directory deleted (Archive/README.md was just a wrapper)

readme.md restructured: "What's here" now leads with trusted hand-written
content (the CS294 notes, the Sutton & Barto digest, the curated
talks/books/courses, the tested exercises). The AI-drafted lecture
series is clearly demoted as "scaffold, treat with skepticism." "Start
here" reordered: talks/books -> exercises -> drafts.

notes/README.md rewritten in the same spirit. AGENTS.md and CLAUDE.md
updated to point at notes/cs294-2017/ and notes/sutton-barto-digest/
as the trusted, frozen, never-reword material.

GitHub topics refreshed separately (not in this commit): dropped
`guideline` and `study`; added `rlhf`, `llm-alignment`, `dpo`, `grpo`,
`ppo`, `rlvr`, `agentic-rl`, `lecture-notes`, `study-notes`,
`deepseek-r1`, `constitutional-ai`, `policy-gradient`, `q-learning`,
`sutton-barto`. Description sharpened.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant