NG Solution Team
Technology

Can DeepSeek’s new AI vision transform the industry?

The Chinese AI start-up DeepSeek has added multimodal capabilities to its main chatbot, enabling it to process images and videos alongside text. The move brings it in line with competitors that already offer similar functions. The feature is currently available to select users for beta testing and follows the release of DeepSeek’s new flagship model, V4, and significant price reductions. Chen Xiaokang, leader of the multimodal team, announced the enhancement and highlighted the addition of an image recognition mode to the chat interface. The update is seen as essential for moving beyond basic text interactions into more complex applications. Although DeepSeek gained international recognition in January 2025 for its model’s reasoning abilities and cost efficiency, it had been criticized for lacking a multimodal offering.
