Tag

visual AI

DataFunTalk
Dec 5, 2024 · Artificial Intelligence

VAR: Scalable Image Generation via Next‑Scale Prediction Wins NeurIPS 2024 Best Paper

VAR (Visual AutoRegressive modeling) introduces a multi‑scale “next‑scale prediction” paradigm that generates images from coarse to fine. The article reports that it improves image generation efficiency and quality, surpasses diffusion models, and exhibits scaling laws in vision; the paper won the Best Paper award at NeurIPS 2024.

NeurIPS 2024 · autoregressive models · image generation
0 likes · 7 min read
360 Tech Engineering
Jun 25, 2023 · Artificial Intelligence

Visual Capability as a Fundamental Requirement for AGI and the SEEChat Multimodal Dialogue Model

The article reviews why visual ability is essential for artificial general intelligence and compares two integration approaches: native multimodal training and expert stitching. It details the architectures of models such as KOSMOS‑1, PaLM‑E, Flamingo, BLIP‑2, LLaVA, and MiniGPT‑4, then introduces the SEEChat project, which fuses CLIP vision encoders with ChatGLM‑6B via a projection layer, and presents its training pipeline, experimental results, and future directions.

AGI · Model Fusion · SEEChat
0 likes · 13 min read
DataFunTalk
Dec 20, 2018 · Artificial Intelligence

How to Build World-Class Visual AI Technology

This presentation outlines the fundamentals of computer vision and discusses four key factors: algorithm research, large‑scale training platforms, intelligent data processing, and hardware optimization. It also shares DeepGlint's practical experience building a world‑class visual AI system and applying it in real‑world settings.

Hardware Optimization · computer vision · data pipeline
0 likes · 23 min read