Open
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
04_dqn.py中的代码应该属于目标网络06_target_network.py)06_doubledqn.py。在之前的double dqn 中,action 是由target_Q决定的,这是target nework的特点?我的理解是double dqn 中action 还是由Q 决定,然后使用target_Q 计算 \hat{q}_{j+1}06_dueling_network.py)其他小的更改:
DQNclass 里面的attribute name, 使用Q和target_Q.device我是RL 的初学者,很喜欢这本书,希望能有所贡献,谢谢!