textacy
https://github.com/chartbeat-labs/textacy
Python
higher-level NLP built on spaCy
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported5 Subscribers
Add a CodeTriage badge to textacy
Help out
- Issues
- Add more code examples / tutorials
- Help: sort keywords into pre-defined topics / categories
- True-casing words and sentences
- Compute character index mapping for before `preprocess.normalize_whitespace`
- Anaphora resolution?
- Phrase models vs n-grams in pre-processing
- more, better, and interactive(?) data viz
- Question: Is it possible to save a Corporus to disk and then append or delete docs from it?
- Extracted topics make no sense; might have something to do with unicodes
- more, better example corpora
- Docs
- Python not yet supported