Total tokens count: 138784
Token type WORD: 63925
Token type SPACE: 62605
Token type DOT: 4416
Token type COMMA: 3443
Token type OTHER: 1663
Token type DASH: 1008
Token type APOSTROPH: 607
Token type SEMICOLON: 433
Token type QUESTION_MARK: 283
Token type COLON: 248
Token type EXCLAMATION_MARK: 134
Token type NUMBER: 19
Token type ELLIPSIS: 0
Token type QUOTATION_MARK: 0
Token type BRACKETS: 0
