A natural language interface for computers
OpenAI-style API for open large language models
Industrial-strength Natural Language Processing (NLP)
Build AI-powered semantic search applications
Stanford NLP Python library for many human languages
Semantic search and workflows for medical/scientific papers
Data and tools for generating and inspecting OLMo pre-training data
ExtractThinker is a Document Intelligence library for LLMs
The no-nonsense RAG chunking library
Efficient Retrieval Augmentation and Generation Framework
A Heterogeneous Benchmark for Information Retrieval
An LLM-powered knowledge curation system that researches topics
Large Language Model Text Generation Inference
Han Language Processing
Neural Network Compression Framework for enhanced OpenVINO
The Classical Language Toolkit
Data processing for and with foundation models
ReFT: Representation Finetuning for Language Models
Haystack is an open-source NLP framework to interact with your data
Efficient few-shot learning with Sentence Transformers
Bring the notion of Model-as-a-Service to life
The library to build & auto-optimize LLM applications
Extract schema, statistics and entities from datasets
A full spaCy pipeline and models for scientific/biomedical documents
Trained models & code to predict toxic comments