Tagged articles
19 articles
Page 1 of 1
Data Party THU
Data Party THU
Apr 21, 2026 · Industry Insights

What the 2026 AI Index Reveals About the Global AI Landscape

The 2026 AI Index report shows a dramatic shift toward industry‑driven AI breakthroughs, widening US‑China gaps, soaring carbon footprints of large models, narrowing performance gaps among top systems, booming AI investment, and growing societal concerns about responsible AI and its impact on jobs, education, and public perception.

AI IndexAI investmentAI policy
0 likes · 20 min read
What the 2026 AI Index Reveals About the Global AI Landscape
ZhongAn Tech Team
ZhongAn Tech Team
Apr 13, 2026 · Industry Insights

This Week’s AI Pulse: GPT‑4o’s Exit, Full‑Duplex Voice, Open‑World AI & More

The weekly roundup analyzes OpenAI’s GPT‑4o leadership change, ByteDance’s Seeduplex full‑duplex voice breakthrough, JD.com and Meituan’s internal AI restrictions, Anthropic’s Claude Mythos leak and Glasswing response, Sam Altman’s AI‑society contract proposal, Anthropic’s token‑usage controversy, Google’s strategic outlook, AI‑driven marketing platforms, a 48 GB GPU performance comparison of Gemma and GPT‑OSS models, SentiAvatar’s 3D digital‑human innovation, and the launch of the low‑cost AI open‑world Elseland.

3D AvatarAIAnthropic
0 likes · 33 min read
This Week’s AI Pulse: GPT‑4o’s Exit, Full‑Duplex Voice, Open‑World AI & More
SuanNi
SuanNi
Mar 8, 2026 · Artificial Intelligence

PinchBench Reveals Real‑World Performance of LLMs on OpenClaw Tasks

PinchBench, a rigorous benchmark that turns large language models into digital employees, measures success rate, execution speed, and per‑call cost across dozens of realistic office tasks, providing developers with concrete data to choose the most efficient model for their workloads.

AIBenchmarkLLM evaluation
0 likes · 10 min read
PinchBench Reveals Real‑World Performance of LLMs on OpenClaw Tasks
SuanNi
SuanNi
Feb 25, 2026 · Artificial Intelligence

How SkillsBench Reveals the Real Impact of Agent Skills on LLM Performance

The SkillsBench benchmark systematically evaluates how professionally crafted Skills boost large language model agents across 84 complex tasks, revealing significant performance gains, domain‑specific effects, and the trade‑offs of skill size and model scale.

Agent SkillsBenchmarkLLM
0 likes · 11 min read
How SkillsBench Reveals the Real Impact of Agent Skills on LLM Performance
Programmer's Advance
Programmer's Advance
Jan 12, 2026 · Artificial Intelligence

DeepSeek V4 Review: Open‑Source 1‑Trillion‑Parameter Model That Beats Claude & GPT for Developers

DeepSeek V4, the upcoming open‑source 1‑trillion‑parameter coding model, claims to surpass Claude and GPT with innovations like mHC, DSA and MoE, offering 1 M‑plus token context, 10× faster inference, and dramatically lower API costs—making it a game‑changer for most developers while reserving local deployment for only a few large enterprises.

AI coding modelAPI vs local deploymentDeepSeek-V4
0 likes · 19 min read
DeepSeek V4 Review: Open‑Source 1‑Trillion‑Parameter Model That Beats Claude & GPT for Developers
AI2ML AI to Machine Learning
AI2ML AI to Machine Learning
Sep 28, 2025 · Artificial Intelligence

Core Metrics for Enterprise Large‑Model Engineering

The article outlines the five essential engineering domains—application, model, compute, knowledge, and data—in the era of large models, and details concrete scale, efficiency, service, value, quality, and security metrics that enterprises should track to drive intelligent outcomes.

AI Engineeringbusiness valuecompute metrics
0 likes · 7 min read
Core Metrics for Enterprise Large‑Model Engineering
AI Info Trend
AI Info Trend
Aug 11, 2025 · Industry Insights

What Q2 2025 Reveals About the AI Landscape: Key Trends and Model Rankings

The Q2 2025 State of AI Highlights Report analyzes benchmark data, model performance, and market dynamics, revealing five major industry trends, the rise of AI agents, rapid advances in language, vision, and speech models, and shifting hardware acceleration strategies that shape the future of artificial intelligence.

AIAI agentsBenchmark
0 likes · 11 min read
What Q2 2025 Reveals About the AI Landscape: Key Trends and Model Rankings
DataFunTalk
DataFunTalk
Jul 6, 2025 · Artificial Intelligence

Why DeepSeek’s Low‑Cost Tokenomics Are Losing Market Share to Anthropic and OpenAI

The article analyses DeepSeek’s unconventional low‑price, high‑latency strategy, its token‑pricing and KPI trade‑offs, and compares its performance, hardware choices, and market share with Anthropic, OpenAI, Google and other AI providers, while also discussing the rise of inference‑as‑a‑service and rumors about DeepSeek R2.

AI modelsDeepSeekTokenomics
0 likes · 14 min read
Why DeepSeek’s Low‑Cost Tokenomics Are Losing Market Share to Anthropic and OpenAI
Ops Development & AI Practice
Ops Development & AI Practice
Apr 4, 2025 · Industry Insights

Are Open‑Source LLMs Closing the Gap with Closed‑Source Giants?

A recent leaderboard analysis of top LLMs reveals that while closed‑source models like Gemini‑2.5‑Pro and ChatGPT‑4o still lead overall, open‑source models such as DeepSeek‑V3 and Llama are rapidly narrowing the performance gap, especially in specialized tasks like coding, driven by faster tech diffusion, public datasets, community collaboration, and reduced compute costs.

AI competitionindustry trendslarge language models
0 likes · 8 min read
Are Open‑Source LLMs Closing the Gap with Closed‑Source Giants?
AI Code to Success
AI Code to Success
Jan 26, 2025 · Industry Insights

How DeepSeek‑R1 Is Challenging OpenAI’s o1 and Shaping the AI Landscape

DeepSeek‑R1 achieved a 1357‑point Arena score, ranking third overall and tying OpenAI o1 for first in StyleCtrl, while its open‑source MIT‑licensed release—including distilled variants—and low‑cost API service aim to democratize advanced AI inference for developers worldwide.

AI competitionArena benchmarkDeepSeek
0 likes · 5 min read
How DeepSeek‑R1 Is Challenging OpenAI’s o1 and Shaping the AI Landscape
21CTO
21CTO
Mar 5, 2024 · Artificial Intelligence

Claude 3 Unveiled: Faster, More Accurate AI with File Upload Capability

Anthropic has launched Claude 3, offering three model variants—Opus, Sonnet, and the upcoming Haiku—each delivering faster response times, higher accuracy, advanced reasoning, and the ability to process uploaded files such as images, PDFs, and code, positioning it as a strong competitor to ChatGPT and Gemini.

Claude AIartificial intelligencefile upload
0 likes · 5 min read
Claude 3 Unveiled: Faster, More Accurate AI with File Upload Capability
Baidu Tech Salon
Baidu Tech Salon
Aug 8, 2023 · Artificial Intelligence

Tsinghua University Report Ranks Baidu Wenxin Yiyan First Among Chinese Large Language Models

A Tsinghua University evaluation of seven large language models found Baidu’s Wenxin Yiyan topping the domestic rankings with the highest overall score across 20 metrics—especially Chinese semantic understanding and safety—surpassing ChatGPT and tying GPT‑4, while also demonstrating rapid training, inference speed, and broad industry adoption.

AI EvaluationBaidu WenxinChinese NLP
0 likes · 4 min read
Tsinghua University Report Ranks Baidu Wenxin Yiyan First Among Chinese Large Language Models
php Courses
php Courses
Aug 2, 2023 · Artificial Intelligence

Stanford and UC Berkeley Study Finds Significant Decline in GPT-4 Capabilities Across Math, Coding, and Visual Reasoning

A joint Stanford and UC Berkeley study reveals that GPT‑4’s performance on mathematics, code generation, and visual‑reasoning tasks sharply declined between March and June 2023, with accuracy dropping from 97.6% to 2.4% on a prime‑checking benchmark and executable code rates falling from 52% to 10%.

AI EvaluationGPT-4machine learning
0 likes · 3 min read
Stanford and UC Berkeley Study Finds Significant Decline in GPT-4 Capabilities Across Math, Coding, and Visual Reasoning
Baobao Algorithm Notes
Baobao Algorithm Notes
Feb 14, 2022 · Artificial Intelligence

Mastering Feature Engineering: From AutoML Dictionaries to Business‑Driven Insights

This article presents a comprehensive, practical methodology for feature engineering that combines brute‑force AutoML‑style dictionary searches, business‑logic‑driven feature creation, and feature‑importance‑guided refinement, illustrating each approach with real Kaggle competition examples and concrete code snippets.

AutoMLKaggledata preprocessing
0 likes · 12 min read
Mastering Feature Engineering: From AutoML Dictionaries to Business‑Driven Insights
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 7, 2020 · Artificial Intelligence

How Active Learning Can Cut Labeling Costs and Boost Model Performance

This article explains active learning techniques that let models select valuable training samples, reducing annotation costs and improving performance, and describes business‑specific adaptations, experiments, and results that demonstrate its effectiveness in content‑safety applications.

active learningbatch samplingdata annotation
0 likes · 14 min read
How Active Learning Can Cut Labeling Costs and Boost Model Performance
JD Tech Talk
JD Tech Talk
Mar 27, 2020 · Artificial Intelligence

Understanding Federated Learning: Origins, Applications, and Privacy Protection Techniques

This article explains the rapid rise of federated learning, its technical foundations combining machine learning, distributed computing, and privacy protection, practical use cases, intuitive privacy examples, and empirical evidence that it can improve model performance without compromising data security.

Federated Learningartificial intelligencedata security
0 likes · 15 min read
Understanding Federated Learning: Origins, Applications, and Privacy Protection Techniques