Skip to content

Handle PTB trees with Unicode words in them #8

@dmcc

Description

@dmcc

Either by fixing the encoding issues or temporarily replacing them with dummy ASCII words.

Thanks to Karin M. Sim Smith for the report.

Temporary workaround: If possible, don't pass trees with Unicode words in them. This should be safe since Stanford Dependencies generally don't care about the words in the trees and the few words that it does care about are in ASCII.

Metadata

Metadata

Assignees

No one assigned

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions