On August 25,shoe eroticism Alibaba Cloud launched an open-source Large Vision Language Model (LVLM) named Qwen-VL. The LVLM is based on Alibaba Cloud’s 7 billion parameter foundational language model Qwen-7B. In addition to capabilities such as image-text recognition, description, and question answering, Qwen-VL introduces new features including visual location recognition and image-text comprehension, the company said in a statement. These functions enable the model to identify locations in pictures and to provide users with guidance based on the information extracted from images, the firm added. The model can be applied in various scenarios including image and document-based question answering, image caption generation, and fine-grained visual recognition. Currently, both Qwen-VL and its visual AI assistant Qwen-VL-Chat are available for free and commercial use on Alibaba’s “Model as a Service” platform ModelScope. [Alibaba Cloud statement, in Chinese]
Related Articles
2025-06-26 22:25
336 views
Sony PULSE Elite PS5 headset open
SAVE $60:As of April 2, Sony PULSE Elite headset for PS5 is available in open-box fair condition at
Read More
2025-06-26 21:59
668 views
Thousands told to jump into the ocean as Australia's raging fires approached
The sun didn't rise on New Year's Eve. The summer morning in a small beach town on the east coast of
Read More
2025-06-26 21:42
2363 views
Filipino President Duterte says he 'doesn't discriminate' but condemns same
Same-sex marriage has been dealt a blow in the Philippines, with its president dismissing the possib
Read More