Multimodal Search: From Mobile to the 5G+ Intelligent Era – Baidu’s Voice and Visual Search Technologies
This article reviews Baidu's multimodal search advancements, covering the evolution of voice and visual search, technical architectures, algorithmic improvements, and future prospects such as the DuXiaoxiao app that integrates speech, image, and text AI for immersive user experiences.