Module for automatic summarization of text documents and HTML pages
A coding-free framework built on PyTorch
Toolkit for conversational AI
Industrial-strength Natural Language Processing (NLP)
A natural language interface for computers
The Classical Language Toolkit
Stanford NLP Python library for many human languages
A Repo For Document AI
Semantic search and workflows for medical/scientific papers
Underthesea - Vietnamese NLP Toolkit
ExtractThinker is a Document Intelligence library for LLMs
Superlinked is a Python framework for AI Engineers
ReFT: Representation Finetuning for Language Models
Trained models & code to predict toxic comments
The no-nonsense RAG chunking library
Extract schema, statistics and entities from datasets
The most accurate natural language detection library for Python
A Heterogeneous Benchmark for Information Retrieval
Han Language Processing
Large Language Model Text Generation Inference
Persian NLP Toolkit
WikiChat is an improved RAG
Haystack is an open source NLP framework to interact with your data
Easy-to-use and high-performance NLP and LLM framework
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models