On August 25,how much eroticism is discourse related Alibaba Cloud launched an open-source Large Vision Language Model (LVLM) named Qwen-VL. The LVLM is based on Alibaba Cloud’s 7 billion parameter foundational language model Qwen-7B. In addition to capabilities such as image-text recognition, description, and question answering, Qwen-VL introduces new features including visual location recognition and image-text comprehension, the company said in a statement. These functions enable the model to identify locations in pictures and to provide users with guidance based on the information extracted from images, the firm added. The model can be applied in various scenarios including image and document-based question answering, image caption generation, and fine-grained visual recognition. Currently, both Qwen-VL and its visual AI assistant Qwen-VL-Chat are available for free and commercial use on Alibaba’s “Model as a Service” platform ModelScope. [Alibaba Cloud statement, in Chinese]
Related Articles
2025-06-26 19:51
230 views
Nvidia Pascal Goes Mobile: GeForce GTX 1080, 1070 & 1060 Preview
Last week we were in Bangkok to attend Nvidia's special media event. The product to be unveiled was
Read More
2025-06-26 19:11
1329 views
Shop the Roku Smart TV for $150 off at Amazon
SAVE $151.99:Shop the Roku 55-inch 4K QLED Smart TV for just $348 at Amazon. That saves you $151.99
Read More
2025-06-26 19:05
614 views
Best Beats deal: Save $50 on Beats Pill
SAVE $50:As of May 15, the Beats Pill is on sale for $99.95 at Amazon. That's a saving of 33% on lis
Read More