Important Notice: Our web hosting provider recently started charging us for additional visits, which was unexpected. In response, we're seeking donations. Depending on the situation, we may explore different monetization options for our Community and Expert Contributors. It's crucial to provide more returns for their expertise and offer more Expert Validated Answers or AI Validated Answers. Learn more about our hosting issue here.

Do MONK processes ouch the source document?

0
Posted

Do MONK processes ouch the source document?

0

To the question whether MONK processes ‘touch’ the source document the answer is both ‘yes’ and ‘no’. If you think of the document as an ordered sequence of words, primitively rendered as a raw text file, the answer is ‘no’. The range of MONK tasks decidedly does not include the editing of texts. It does not even include the correction of gross errors–although we should certainly report them to the owners of source files. On the other hand, I do not see how we can avoid a fair amount of “up-tagging” and “down-tagging.” By “up-tagging” I understand the addition of elements whose attributes carry various kinds of metadata. Thus the results of tokenization, sentence splitting and part of speech are most unambiguously represented by enclosing the tokenized string with a element, adding attribute values, and adding sentence tags of some sort. Uptagging of some sort will also be necessary if a raw text file is to become part of a MONK collection. By “down-tagging” I understand the proce

Related Questions

What is your question?

*Sadly, we had to bring back ads too. Hopefully more targeted.