Description
One of the STET source languages that’s been worked on over the past
several months is Tok Pisin. Sorry if I’ve already asked about this…
Tok Pisin has fewer total words than most languages. At least one or
two of maybe two dozen central words (among them bilong, gutpela,
long, man, mekim, nogat, pasin, samting, and wanpela) seem to be in
every single sentence.
So it won’t be as easy as just giving you an input file with the 4th
column showing which words should show up in bold anytime they appear
within those verses, because the same simple word or two-word phrase
can have so many different meanings depending on how it’s combined
with the other simple words around it.
There might also be a whole phrase that’s needed to convey one
spiritual term, but then one word or two words in the middle might not
belong to that phrase and instead be part of the meaning of a
different part of the sentence!
Or words from that same little phrase might show up later in the verse
and not have anything to do with the term.
In this kind of case, can you customize super-exactly to each verse
what shows up as bold in the source text?
This is made up, but let’s say the bolded words in these two verses
mean “adopted.”
Rom 8:23
I no ol dispela samting tasol. Nogat. Yumi tu i bin i gat namba wan
pikinini kaikai bilong Spirit yumi tu i save krai insait long bel na
wet i stap long God bai kisim mipela olsem pikinini bilong em na kisim
bek dispela bodi bilong bagarap.
Rom 9:4
Em ol lain Israel. Ol i gat pasin bilong papa i kisim ol bek, na ol i
gat glori na ol i gat kontrak na ol i gat presen bilong lo na ol i gat
pasin bilong lotu long God na ol i gat ol promis.
So you see that there’s a big gap between the words in the first
verse.
And in the second verse, some words are bold but then they appear
later in the verse in the same sequence but don’t have anything to do
with the term when they appear that second time.
Can you think of a way to handle this kind of thing, maybe by a
differently formatted input file with very specifically bolded words
in the source text column?
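To make that last idea concrete, here is a rough sketch of how explicitly marked source text could be read. Everything in it is an assumption for illustration: the asterisk marker convention, the `parse_marked_verse` name, and the placement of the markers in the sample verses are hypothetical, and nothing here reflects how STET actually reads its input file. The point is only that marking the exact words in each verse yields exact positions to bold, so a gap between marked words, or a later repeat of the same words, is not a problem.

```python
import re

# Hypothetical convention: in the source-text column, the words to bold
# for that verse are wrapped in *asterisks*.
MARKER = re.compile(r"\*(.+?)\*")


def parse_marked_verse(marked: str) -> tuple[str, list[tuple[int, int]]]:
    """Return the plain verse text and the (start, end) character spans to bold."""
    plain_parts: list[str] = []
    spans: list[tuple[int, int]] = []
    pos = 0      # read position in the marked text
    out_len = 0  # length of the plain text built so far
    for m in MARKER.finditer(marked):
        before = marked[pos:m.start()]
        plain_parts.append(before)
        out_len += len(before)
        inner = m.group(1)
        # Record where the marked words land in the plain text.
        spans.append((out_len, out_len + len(inner)))
        plain_parts.append(inner)
        out_len += len(inner)
        pos = m.end()
    plain_parts.append(marked[pos:])
    return "".join(plain_parts), spans


# Marker placement below is illustrative only.
# A phrase whose opening words recur later in the verse: only the marked
# occurrence is recorded.
repeat_text, repeat_spans = parse_marked_verse(
    "Ol i gat *pasin bilong papa i kisim ol bek*, na ol i gat pasin bilong lotu long God"
)

# A term split by an unrelated word: two separate spans are recorded.
gap_text, gap_spans = parse_marked_verse(
    "God bai *kisim* mipela *olsem pikinini bilong em* na kisim bek"
)

for text, spans in ((repeat_text, repeat_spans), (gap_text, gap_spans)):
    print([text[start:end] for start, end in spans])
```

Because the spans are character positions rather than words to search for, the second "pasin bilong" and the second "kisim" stay unbolded without any special handling.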