ExtractThinker is a Document Intelligence library for LLMs
Text mining using tidy tools
Extract schema, statistics and entities from datasets
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models
Public opinion analysis system
The Classical Language Toolkit
Toolkit for conversational AI
A Repo For Document AI
A natural language interface for computers
Stanford NLP Python library for many human languages
State-of-the-Art Natural Language Processing
Modular Suite of NLP Tools
Obsei is a low-code, AI-powered automation tool
Resources, corpora, and tools for Chinese natural language processing
Unicode XML TEI text analysis platform
Common Resource Grep
fastNLP: A Modularized and Extensible NLP Framework
Converting text to a structured representation
InferSent sentence embeddings
Text Analytics Platform
AiLearning, data analysis plus machine learning practice
We describe a simple XML format to share text documents and annotations
NLP tool for statistical analysis of words, sentences, and documents
Lexicon- and rule-based sentiment analysis tool
This project presents a new corpus for news text analysis in Persian