Tag

visual reasoning

0 views collected around this technical thread.

AntTech
AntTech
Oct 29, 2024 · Artificial Intelligence

Three Ant Group Papers Featured at EMNLP 2024: Dynamic Transformers, Plug‑and‑Play Visual Reasoner, and Efficient Fine‑Tuning of Large Language Models

This announcement introduces three Ant Group papers accepted at EMNLP 2024—Mixture‑of‑Modules for dynamic Transformer assembly, a plug‑and‑play visual reasoning framework built via data synthesis, and a layer‑wise importance‑aware efficient fine‑tuning method for large language models—highlighting their innovations and upcoming live presentations.

EMNLP 2024Large Language ModelsParameter-efficient Fine-tuning
0 likes · 6 min read
Three Ant Group Papers Featured at EMNLP 2024: Dynamic Transformers, Plug‑and‑Play Visual Reasoner, and Efficient Fine‑Tuning of Large Language Models
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Jun 22, 2024 · Artificial Intelligence

Claude 3.5 Sonnet: Performance Review and Real‑World Tests

Claude 3.5 Sonnet, Anthropic’s latest large language model, is evaluated across a range of Chinese‑language tasks, visual reasoning, coding, and game creation, showing faster, cheaper, and often superior results compared to GPT‑4o, while also revealing occasional failures in simple games and math problems.

AI modelAnthropicClaude 3.5
0 likes · 8 min read
Claude 3.5 Sonnet: Performance Review and Real‑World Tests