Gold format for speaker diarization #96

@keighrim

Description

New Feature Summary

It seems that we could relatively easily create gold evaluation data for the speaker diarization (SD) problem by combining the time-sync annotations with the speaker turn markers in our "gold" transcript files.
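To make the idea concrete, here is a rough sketch of the combination step. It assumes the time-sync annotation can be read as a start/end time per utterance and that each utterance already carries a speaker label; the `Utterance` record and `merge_turns` function are hypothetical names for illustration, not existing code in our repos.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Utterance:
    # Hypothetical record: one time-synced utterance with its speaker label.
    speaker: str
    start: float  # seconds
    end: float    # seconds


def merge_turns(utterances: List[Utterance]) -> List[Tuple[str, float, float]]:
    """Collapse consecutive utterances by the same speaker into one turn.

    Returns (speaker, turn_start, turn_end) tuples in input order.
    """
    turns: List[Tuple[str, float, float]] = []
    for utt in utterances:
        if turns and turns[-1][0] == utt.speaker:
            # Same speaker keeps talking: extend the current turn's end time.
            speaker, start, _ = turns[-1]
            turns[-1] = (speaker, start, utt.end)
        else:
            # Speaker changed (or first utterance): open a new turn.
            turns.append((utt.speaker, utt.start, utt.end))
    return turns
```

The resulting (speaker, start, end) turns could then be serialized into whatever gold format we settle on.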

Related

There's the "cleaner" code that removes the speaker markers (clamsproject/clams-utils#2). We should be able to "reverse" that functionality to extract the speaker markers instead, and then associate each speaker with the time frames of their series of utterances.
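As a sketch of what "reversing" the cleaner might look like: instead of stripping the markers, capture them and carry the current speaker forward over the lines that follow. The marker pattern below (an upper-case name followed by a colon at the start of a line) is only an assumption about the transcript format, and `label_lines` is a hypothetical helper, not the actual cleaner code.

```python
import re
from typing import List, Optional, Tuple

# Assumed marker shape: upper-case name plus colon at line start, e.g. "HOST: ...".
# The real transcripts (and the existing cleaner) may use a different convention.
SPEAKER_MARKER = re.compile(r"^([A-Z][A-Z .'\-]*):\s*(.*)$")


def label_lines(lines: List[str]) -> List[Tuple[Optional[str], str]]:
    """Pair each non-empty transcript line with the most recent speaker marker.

    Lines seen before any marker get None as their speaker.
    """
    labeled: List[Tuple[Optional[str], str]] = []
    current: Optional[str] = None
    for line in lines:
        line = line.strip()
        if not line:
            continue
        match = SPEAKER_MARKER.match(line)
        if match:
            # New speaker turn: remember the name, keep only the spoken text.
            current = match.group(1).strip()
            text = match.group(2)
        else:
            # Continuation line: same speaker as before.
            text = line
        labeled.append((current, text))
    return labeled
```

Each labeled line would still need to be matched to its time frame from the time-sync annotation, which depends on how those annotations are keyed.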

Alternatives

No response

Additional context

No response

Metadata

Assignees

Labels

✨ New feature or request

Type

No type

Projects

Status

Todo

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests
