07_reinforce and 08_reinforce_with_baseline appear to be duplicates? #5

@TimHo0331

Shouldn't 07_reinforce.py be the REINFORCE algorithm that uses a Monte Carlo estimate of the return?
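
For reference, here is a minimal sketch of the distinction being asked about. It is not taken from the repository's scripts: the function names are hypothetical, and the mean-return baseline is only one simple assumed choice (a learned value function is another common one). Plain REINFORCE weights each log-probability gradient by the Monte Carlo return G_t, while the baseline variant weights it by G_t minus a baseline.

```python
def discounted_returns(rewards, gamma=0.99):
    """Monte Carlo returns G_t = r_t + gamma * G_{t+1} for one finished episode."""
    returns = []
    g = 0.0
    # Walk the episode backwards so each return reuses the next one.
    for r in reversed(rewards):
        g = r + gamma * g
        returns.append(g)
    returns.reverse()
    return returns


def reinforce_weights(rewards, gamma=0.99):
    """Plain REINFORCE: weight each log-prob gradient by the raw return G_t."""
    return discounted_returns(rewards, gamma)


def reinforce_with_baseline_weights(rewards, gamma=0.99):
    """Baseline variant: weight by G_t - b.

    Assumption: b is the mean return of the episode, used here only
    for illustration.
    """
    returns = discounted_returns(rewards, gamma)
    baseline = sum(returns) / len(returns)
    return [g - baseline for g in returns]
```

If both scripts compute the same weights for the policy gradient, they would indeed be duplicates; under this sketch the two should differ only by the subtracted baseline.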
