NG Solution Team
Technology

Can DeepSeek’s new AI vision transform the industry?

The Chinese AI start-up DeepSeek has introduced multimodal capabilities to its main chatbot, enabling it to process images and videos alongside text. This development aligns it with competitors already offering similar functions. The feature is currently available to select users for beta testing, following the release of DeepSeek’s new flagship model V4 and significant price reductions. The enhancement was announced by Chen Xiaokang, leader of the multimodal team, who highlighted the addition of an image recognition mode to the chat interface. This update is seen as essential for advancing beyond basic text interactions into more complex applications. Despite gaining international recognition in January 2025 for its model’s reasoning abilities and cost efficiency, DeepSeek had been criticized for lacking a multimodal offering.

Related posts

What’s new in the Galaxy S23 One UI 8 beta update?

James Smith

What Are the Latest Innovations and Acquisitions in Event Technology?

Jessica Williams

How is Rhuna revolutionizing stablecoin payments in the entertainment industry?

Jessica Williams

This website uses cookies to improve your experience. We assume you agree, but you can opt out if you wish. Accept More Info

Privacy & Cookies Policy