-
Couldn't load subscription status.
- Fork 93
Open
Description
We have been using a fast TokenBuffer API to speed up for various tokenizers in WordTokenizers.jl.
Referring to #141 #140, I think it might be beneficial to extend the TokenBuffer API for Documents and Corpus that TextAnalysis.jl offers (excluding NGramDocument and TokenDocument).
This can then be used to improve the performance for preprocessing.jl.
aviks
Metadata
Metadata
Assignees
Labels
No labels