Tagged articles
4 articles
Page 1 of 1
DataFunTalk
DataFunTalk
Apr 1, 2020 · Artificial Intelligence

Knowledge Graph‑Based Multimodal Semantic Understanding at Baidu

This article outlines Baidu's large‑scale knowledge graph applications in AI, detailing the need for multimodal semantic understanding, challenges in text and video comprehension, and the technical solutions including entity annotation, conceptization, knowledge networks, and multimodal fusion for enhanced search, recommendation, and visual question answering.

Visual Question Answeringconceptualizationentity annotation
0 likes · 15 min read
Knowledge Graph‑Based Multimodal Semantic Understanding at Baidu
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 26, 2019 · Artificial Intelligence

How Decomposed Linguistic Representations Overcome Language Priors in VQA

This article reviews a AAAI 2020 paper that introduces a language‑attention based Visual Question Answering model which decomposes questions into type, object, and concept expressions to mitigate language bias, explains its modular architecture, and demonstrates superior performance on VQA‑CP v2 through extensive experiments and ablations.

Attention MechanismMultimodal LearningVQA-CP
0 likes · 14 min read
How Decomposed Linguistic Representations Overcome Language Priors in VQA