Chinese AI start-up DeepSeek has added multimodal capabilities to its main chatbot, enabling it to process images and videos alongside text and bringing it in line with competitors that already offer similar functions. The feature is available to select users for beta testing and follows the release of DeepSeek's new flagship model, V4, and significant price cuts.

Chen Xiaokang, who leads the company's multimodal team, announced the update, highlighting the addition of an image recognition mode to the chat interface. The enhancement is seen as essential for moving beyond basic text interactions into more complex applications. Although DeepSeek gained international recognition in January 2025 for its model's reasoning abilities and cost efficiency, it had been criticized for lacking a multimodal offering.

