Rustling#

Rustling is a blazingly fast library for computational linguistics. It is written in Rust, with Python bindings.

Performance#

Benchmarked against pure Python implementataions from NLTK and wordseg. See benchmarks/ for full details and reproduction scripts.

Component

Task

Speedup

vs.

Language Models

Fit

11x

NLTK

Score

2x

NLTK

Generate

25–39x

NLTK

Word Segmentation

LongestStringMatching

14x

wordseg

RandomSegmenter

12x

wordseg

POS Tagging

Training

5x

NLTK

Tagging

6x

NLTK

Installation#

pip install rustling

Sections#

Source Code#

The source code is available on GitHub.