DeepSeek-VL2 is a sophisticated vision-language model designed to address complex multimodal tasks with remarkable efficiency and precision. Built on a new mixture-of-experts (MoE) architecture, this ...
Chinese AI startup Zhipu AI, also known as Z.ai, has released its GLM-4.6V series, a new generation of open-source vision-language models (VLMs) optimized for multimodal reasoning, frontend automation, and ...
Multimodal large language models have shown powerful abilities to understand and reason across text and images, but their ...
Just when you thought the pace of change in AI models couldn’t get any faster, it accelerates yet again. In the popular news media, the introduction of DeepSeek in January 2025 created a moment that ...
As I highlighted in my last article, two decades after the DARPA Grand Challenge, the autonomous vehicle (AV) industry is still waiting for breakthroughs—particularly in addressing the “long tail ...
ETRI, South Korea’s leading government-funded research institute, is establishing itself as a key research entity for ...
A research team affiliated with UNIST has unveiled a novel AI system capable of grading and providing detailed feedback on ...
Llama has evolved beyond a simple language model into a multimodal AI framework with safety features, code generation, and multilingual support. Llama, a family of sort-of open-source large language ...
A multi-university research team, including the University of Michigan in Ann Arbor, has developed A11yShape, a new tool designed to help blind and low-vision programmers independently create, inspect ...