Segmentation Algorithms

The input text given to the algorithms is assumed to be prepared as a sequence of space-separated tokens, one utterance per line (see Data Preparation).
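
As a rough illustration of that input format, the sketch below checks that each line is a non-empty run of tokens separated by single spaces. The helper name `is_prepared` and the toy utterances are hypothetical and are not part of wordseg; the check only mirrors the description above and is not the package's own validation.

```python
def is_prepared(lines):
    """Return True if every line is a non-empty sequence of tokens
    separated by single spaces (hypothetical helper, not wordseg API)."""
    for line in lines:
        tokens = line.split(' ')
        # An empty line, or an empty token caused by a leading, trailing
        # or doubled space, means the line is not in the expected format.
        if not line or any(token == '' for token in tokens):
            return False
    return True


# Toy utterances (invented for illustration): one utterance per line,
# tokens separated by spaces, no word boundaries marked.
utterances = [
    "dh ax d ao g",
    "y uw w aa n t ih t",
]

print(is_prepared(utterances))
```

Lines with doubled or leading spaces are rejected by this check, since splitting them on a single space yields empty tokens.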

The following word segmentation algorithms are provided with wordseg: