ExtractThinker is a Document Intelligence library for LLMs
Extract schema, statistics and entities from datasets
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models
The Classical Language Toolkit
Public opinion analysis system
Toolkit for conversational AI
A natural language interface for computers
A Repo For Document AI
Stanford NLP Python library for many human languages
Obsei is a low code AI powered automation tool
Resources, corpora, and tools for Chinese natural language processing
fastNLP: A Modularized and Extensible NLP Framework
InferSent sentence embeddings
AiLearning, data analysis plus machine learning practice
We describe a simple XML format to share text documents and annotation
Lexicon and rule-based sentiment analysis tool
TextBlob is a Python library for processing textual data