Skip to content

Chapter 19, Page 681 #213

@drei34

Description

@drei34

Hi,

I am not sure but I think there is an error for the example on page 681. The episode is BBCCCCBAT -> Fail so then we have S0=B, S1=B, ..., S8=T. So we have rewards R1, R2, ..., R8. So G0 = R1 + g*R2 + .. + g^{7} * R8 and R8=+1. So are we off by 1 in time?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions