Large Language Model Text Generation Inference
Bring the notion of Model-as-a-Service to life
Sparsity-aware deep learning inference runtime for CPUs
Efficient few-shot learning with Sentence Transformers
Libraries for applying sparsification recipes to neural networks
Neural Network Compression Framework for enhanced OpenVINO
Openai style api for open large language models
A natural language interface for computers
A Unified Library for Parameter-Efficient Learning
An easy-to-use LLMs quantization package with user-friendly apis
KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)
A PyTorch-based knowledge distillation toolkit
A model library for exploring state-of-the-art deep learning
A natural language modeling framework based on PyTorch
Basic Utilities for PyTorch Natural Language Processing (NLP)
InferSent sentence embeddings