This repository was archived by the owner on Oct 31, 2023. It is now read-only.

About dataset generation #238

@Heisenberg-Yin

I am new to dense retrieval, and I have a question about the dataset, in which each example consists of a question, positives, hard negatives, and negatives.

I am not sure how the positives, hard negatives, and negatives are obtained. As I understand it, each query already comes with its positives, the hard negatives are retrieved with BM25, and the negatives are selected at random.
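
To make that guess concrete, here is a minimal sketch of the procedure as described in the question. It is an illustration under assumptions, not the repository's actual pipeline: `tokenize`, `bm25_scores`, `build_example`, the output keys, and the BM25 parameters `k1` and `b` are all hypothetical names and values chosen for this example, and the BM25 scorer is a plain textbook implementation rather than the retriever the authors may have used.

```python
import math
import random
from collections import Counter


def tokenize(text):
    # Hypothetical tokenizer: lowercase and split on whitespace.
    return text.lower().split()


def bm25_scores(query, passages, k1=1.5, b=0.75):
    """Score every passage against the query with a plain Okapi BM25."""
    docs = [tokenize(p) for p in passages]
    n = len(docs)
    avg_len = sum(len(d) for d in docs) / n
    # Document frequency: in how many passages each term occurs.
    df = Counter(term for d in docs for term in set(d))
    query_terms = set(tokenize(query))
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in query_terms:
            if term not in tf:
                continue
            idf = math.log(1 + (n - df[term] + 0.5) / (df[term] + 0.5))
            norm = tf[term] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


def build_example(question, positive_ids, passages,
                  num_hard=1, num_random=1, seed=0):
    """Assemble one training example following the guess in the question.

    positive_ids: indices of passages already labeled as positives.
    Hard negatives: top BM25 passages that are not positives.
    Negatives: passages sampled at random from whatever remains.
    """
    scores = bm25_scores(question, passages)
    ranked = sorted(range(len(passages)),
                    key=lambda i: scores[i], reverse=True)
    positives = set(positive_ids)
    # Keep only passages with some lexical overlap, best match first.
    hard = [i for i in ranked
            if i not in positives and scores[i] > 0][:num_hard]
    rest = [i for i in range(len(passages))
            if i not in positives and i not in hard]
    rng = random.Random(seed)
    sampled = rng.sample(rest, min(num_random, len(rest)))
    return {
        "question": question,
        "positives": [passages[i] for i in positive_ids],
        "hard_negatives": [passages[i] for i in hard],
        "negatives": [passages[i] for i in sampled],
    }
```

For a toy corpus in which passage 0 is the labeled positive, `build_example("what is the capital of France", [0], passages)` would pick the highest-scoring non-positive passage as the hard negative and draw the plain negative at random from the rest.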

Am I right or not?

Best Wishes.
