Audience

Developers and AI researchers seeking a solution offering a vision-language model that balances size and capability, ideal for building multimodal agents, document/image analysis tools, or GUI-based automation workflows

About GLM-4.1V

GLM-4.1V is a vision-language model, providing a powerful, compact multimodal model designed for reasoning and perception across images, text, and documents. The 9-billion-parameter variant (GLM-4.1V-9B-Thinking) is built on the GLM-4-9B foundation and enhanced through a specialized training paradigm using Reinforcement Learning with Curriculum Sampling (RLCS). It supports a 64k-token context window and accepts high-resolution inputs (up to 4K images, any aspect ratio), enabling it to handle complex tasks such as optical character recognition, image captioning, chart and document parsing, video and scene understanding, GUI-agent workflows (e.g., interpreting screenshots, recognizing UI elements), and general vision-language reasoning. In benchmark evaluations at the 10 B-parameter scale, GLM-4.1V-9B-Thinking achieved top performance on 23 of 28 tasks.

Pricing

Starting Price:
Free
Free Version:
Free Version available.

Integrations

API:
Yes, GLM-4.1V offers API access

Ratings/Reviews

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Company Information

Zhipu AI
Founded: 2023
China
chat.z.ai/

Videos and Screen Captures

GLM-4.1V Screenshot 1
Other Useful Business Software
Respond 100x faster, more accurately, and improve your documentation Icon
Respond 100x faster, more accurately, and improve your documentation

Designed for forward-thinking security, sales, and compliance teams

Slash response times for questionnaires, audits, and RFPs by up to 90%. OptiValue.ai automates the heavy lifting, freeing your team to focus on strategic priorities with intuitive tools for seamless review and validation.
Learn More

Product Details

Platforms Supported
Cloud
Windows
Mac
Linux
On-Premises
Training
Documentation
Support
Online

GLM-4.1V Frequently Asked Questions

Q: What kinds of users and organization types does GLM-4.1V work with?
Q: What languages does GLM-4.1V support in their product?
Q: What kind of support options does GLM-4.1V offer?
Q: What other applications or services does GLM-4.1V integrate with?
Q: Does GLM-4.1V have an API?
Q: What type of training does GLM-4.1V provide?
Q: How much does GLM-4.1V cost?

GLM-4.1V Product Features

GLM-4.1V Additional Categories