Retrieval and Retrieval-augmented LLMs
Qwen2.5-VL is the multimodal large language model series
High accuracy RAG for answering questions from scientific documents
Build your own Cowork, AI Scientist and other SoTA Agents
Leveraging BERT and c-TF-IDF to create easily interpretable topics
Bringing BERT into modernity via both architecture changes and scaling
Natural language workflows for AI agents
Document Image Parsing via Heterogeneous Anchor Prompting”
A very simple framework for state-of-the-art NLP
Generative AI reference workflows
From Paper to Presentation in One Click
Retrieval Augmented Generation (RAG) framework
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Did you say you like data?
Visual tool for building, testing, and deploying AI agent workflows
Ready-to-run cloud templates for RAG
SimpleMem: Efficient Lifelong Memory for LLM Agents
Scalable data pre processing and curation toolkit for LLMs
Web Scraping Framework
Python library to compile, build & package AWS Lambda functions
Running large language models on a single GPU
SQL-Driven RAG Engine
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
95% token savings. 155x faster queries. 16 languages
A best practices guide for day 2 operations