Rustling#
Rustling is a blazingly fast library for computational linguistics. It is written in Rust, with Python bindings.
Performance#
Benchmarked against pure-Python implementations from NLTK and wordseg. See benchmarks/ for full details and reproduction scripts.
| Component | Task | Speedup | vs. |
|---|---|---|---|
| Language Models | Fit | 11x | NLTK |
| | Score | 2x | NLTK |
| | Generate | 25–39x | NLTK |
| Word Segmentation | LongestStringMatching | 14x | wordseg |
| | RandomSegmenter | 12x | wordseg |
| POS Tagging | Training | 5x | NLTK |
| | Tagging | 6x | NLTK |
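
For readers unfamiliar with how a figure such as "11x" is usually derived, the sketch below shows one common way to compute a speedup factor: time the same task in two implementations and divide the baseline time by the candidate time. It is an illustration only, not the project's benchmark harness (that lives in benchmarks/). The functions `slow_sum` and `fast_sum` are hypothetical stand-ins and do not call Rustling, NLTK, or wordseg.

```python
import time
from statistics import median


def median_runtime(func, repeats=5):
    """Return the median wall-clock time (seconds) over `repeats` calls of func()."""
    timings = []
    for _ in range(repeats):
        start = time.perf_counter()
        func()
        timings.append(time.perf_counter() - start)
    return median(timings)


def speedup(baseline_seconds, candidate_seconds):
    """Speedup factor: how many times faster the candidate is than the baseline."""
    return baseline_seconds / candidate_seconds


# Hypothetical stand-ins for "the same task in two implementations".
def slow_sum():
    total = 0
    for i in range(200_000):
        total += i
    return total


def fast_sum():
    return sum(range(200_000))


if __name__ == "__main__":
    ratio = speedup(median_runtime(slow_sum), median_runtime(fast_sum))
    print(f"speedup: {ratio:.1f}x")
```

Using the median of several runs rather than a single measurement makes the ratio less sensitive to one-off system noise. The exact value printed varies by machine.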
Installation#
pip install rustling
Source Code#
The source code is available on GitHub.