Tagged articles
11 articles
Baidu Tech Salon
Oct 10, 2025 · Artificial Intelligence

Navigating the 2025 AI Model Boom: Practical Evaluation Strategies

This article examines the rapid surge of large AI models in 2024‑2025, critiques the reliability of public leaderboards, and presents a business‑focused evaluation framework—including dataset construction, metric selection, automation, and LLM‑as‑judge techniques—to help developers choose the right model for real‑world applications.

AI Performance, AI benchmarks, LLM-as-judge
17 min read
Sohu Tech Products
Apr 16, 2025 · Artificial Intelligence

Comprehensive Guide to Building AI Datasets: From Source Collection to Data Augmentation and Validation

This guide walks readers through every stage of building high‑quality AI training datasets—from locating open‑source data and defining goals, through collection, annotation, cleaning, large‑scale processing, optional augmentation, and splitting, to validation—using a medical QA example for fine‑tuning DeepSeek‑R1.

AI fine-tuning, Python, data augmentation
18 min read
AI Frontier Lectures
Mar 25, 2025 · Artificial Intelligence

What Drives Alignment in Multimodal Large Language Models? A Comprehensive Review

This article provides an in‑depth review of alignment algorithms for multimodal large language models, covering application scenarios, dataset construction methods, evaluation benchmarks, current challenges, and future research directions, while summarizing contributions from leading academic institutions.

AI research, alignment algorithms, dataset construction
22 min read
Architect
Mar 24, 2025 · Artificial Intelligence

How Multimodal Alignment Is Shaping the Future of Large Language Models

This article provides a systematic review of recent advances in multimodal alignment for large language models, covering key contributions, application scenarios, dataset construction, evaluation benchmarks, future challenges, and insights from LLM alignment research to guide both academia and industry.

AI Safety, MLLM, dataset construction
26 min read
AIWalker
Mar 13, 2025 · Artificial Intelligence

VideoPainter: Plug‑and‑Play Video Inpainting and Editing Achieves 8 SOTA Benchmarks

VideoPainter introduces a plug‑and‑play dual‑branch framework with a lightweight context encoder and ID‑resampling adapter, built on the massive VPData/VPBench dataset, and demonstrates state‑of‑the‑art performance across eight video restoration and editing metrics, while supporting flexible model integration and long‑video consistency.

Dual-Branch Architecture, ID Consistency, Plug-and-Play
18 min read
Architect
Feb 22, 2025 · Artificial Intelligence

How Open‑Source Projects Reproduced DeepSeek‑R1 and Pushed LLM Limits

This article reviews the most notable open‑source reproductions of DeepSeek‑R1, including Open R1, OpenThoughts, LIMO and DeepScaleR. It details their data pipelines, training steps, reinforcement‑learning strategies, dataset construction, and benchmark results, which suggest that models trained on small, high‑quality datasets can rival those trained on massive‑scale data.

AI research, DeepSeek-R1, Model Scaling
26 min read
DaTaobao Tech
Jun 5, 2024 · Artificial Intelligence

Automated Quality Assessment for AIGC Image Generation: Recent Research Advances

The article reviews recent advances in automated quality assessment for AIGC image generation: an aesthetic scoring framework built on the APDD dataset and the AANSPS network; the HPD v2 human‑preference dataset and the HPS v2 scoring model, which the article reports as outperforming IS and FID; and the Pick‑Score model trained on user‑driven Pick‑a‑Pic data. Together these enable faster, unbiased evaluation, cost savings, and more effective model iteration. The article also describes ongoing work in home‑improvement AI.

AIGC, Aesthetic Evaluation, Human Preference
15 min read
Sohu Tech Products
Apr 24, 2024 · Artificial Intelligence

Domain-Specific Large Model Construction Guide

The guide explains why generic LLMs struggle with enterprise tasks and outlines two remedies—retrieval‑augmented generation and domain‑specific fine‑tuning—detailing dataset creation, training strategies (full‑parameter, LoRA, Q‑LoRA), validation methods, hardware benchmarks, and practical tips such as supervised fine‑tuning, 30% domain data, and a stepwise tuning pipeline.

AI, dataset construction, domain-specific LLM
16 min read
DataFunTalk
Apr 21, 2024 · Artificial Intelligence

Guidelines for Building Domain-Specific Large Models: Dataset Construction, Training Methods, Evaluation, and Hardware Benchmarking

This article presents a comprehensive guide on constructing domain-specific large language models, covering the differences from general models, how to build high‑quality domain datasets, selecting appropriate training methods, designing validation sets, evaluating model capabilities, and benchmarking domestic hardware performance.

AI, Model Evaluation, dataset construction
20 min read
Tencent Music Tech Team
Jun 1, 2021 · Artificial Intelligence

TDQA: A No-Reference Deep Learning Based Video Quality Assessment Algorithm for Live Streaming

TDQA is a no‑reference, deep‑learning video quality assessment algorithm designed for live‑streaming, built on a large subjectively annotated dataset and an end‑to‑end architecture with fine‑tuned backbones, achieving state‑of‑the‑art accuracy and sub‑second inference for real‑time quality monitoring and pipeline optimization.

Deep Learning, Model Training, No-Reference
15 min read