GitHub
A lightweight, extensible toolkit for preparing, indexing, and querying textual data for semantic search and downstream NLP tasks.
Snapshot: April 2026