The number of parameters does not match the setting in paper

In retnet-3b/config.json, according to the experimental settings of the paper 
https://arxiv.org/pdf/2307.08621.pdf , set decoder_ffn_embed_dim and decoder_value_embed_dim to twice the size of decoder_embed_dim. With the parameters in nn.Embedding, the model size is 3.2B, not 2.7B. If the number of parameters  of nn.Embedding is subtracted, the number of parameters is about 3B, which does not match the 2.7B mentioned in the paper.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

The number of parameters does not match the setting in paper #34

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

The number of parameters does not match the setting in paper #34

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions