Wednesday, February 12, 2025

Alibaba Cloud launches open source Large Vision Language Model Qwen-VL · TechNode


On August 25, Alibaba Cloud launched an open-source Large Vision Language Model (LVLM) named Qwen-VL. The LVLM is based on Qwen-7B, Alibaba Cloud’s 7-billion-parameter foundational language model. In addition to capabilities such as image-text recognition, description, and question answering, Qwen-VL introduces new features including visual location recognition and image-text comprehension, the company said in a statement. These functions enable the model to identify locations in pictures and to provide users with guidance based on the information extracted from images, the firm added. The model can be applied in various scenarios, including image- and document-based question answering, image caption generation, and fine-grained visual recognition. Currently, both Qwen-VL and its visual AI assistant Qwen-VL-Chat are available for free and commercial use on Alibaba’s “Model as a Service” platform ModelScope. [Alibaba Cloud statement, in Chinese]
