Tagged articles
16 articles
Page 1 of 1
Machine Heart
Machine Heart
May 1, 2026 · Artificial Intelligence

API‑Only Probes Reveal GPT, Claude, Gemini Parameter Counts – Community Buzz

A new arXiv paper introduces Incompressible Knowledge Probes that estimate large language model sizes via black‑box API calls, fitting a log‑linear relation on 89 open‑source models and producing controversial parameter estimates for GPT‑5.5, Claude Opus, Gemini and others, sparking heated community debate.

AI scalingClaude OpusGPT-5.5
0 likes · 7 min read
API‑Only Probes Reveal GPT, Claude, Gemini Parameter Counts – Community Buzz
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Apr 14, 2026 · Artificial Intelligence

Two‑Year‑Old Chinese Forecast Gains Global Consensus as Meta, METR and Others Confirm the Same AI Scaling Law

A Chinese research team’s 2024 "density law"—which predicts that the parameters needed for a given LLM performance halve every 3.5 months—has been independently validated by Meta’s scaling ladder, METR’s time‑horizon report, and subsequent analyses, revealing a unified exponential growth curve that reshapes expectations for inference cost, edge AI feasibility, and optimal model‑development strategies.

AI scalingLLM density lawMETR
0 likes · 11 min read
Two‑Year‑Old Chinese Forecast Gains Global Consensus as Meta, METR and Others Confirm the Same AI Scaling Law
Machine Heart
Machine Heart
Apr 8, 2026 · Artificial Intelligence

Meta Unveils Muse Spark: The First Model from Its Superintelligence Lab

Meta has launched Muse Spark, its inaugural model from the newly formed Superintelligence Lab, showcasing multimodal capabilities, tool use, visual chain‑of‑thought, and multi‑agent orchestration, while detailing pretraining scaling gains, reinforcement‑learning improvements, and test‑time reasoning efficiencies.

AI scalingMetaMuse Spark
0 likes · 9 min read
Meta Unveils Muse Spark: The First Model from Its Superintelligence Lab
AI Info Trend
AI Info Trend
Apr 8, 2026 · Artificial Intelligence

Why Strong Data Foundations Are Crucial for Scaling Agentic AI

A McKinsey report reveals that while two‑thirds of enterprises have tried agentic AI, less than 10% achieve scalable value, and robust, modern data architectures—built on seven concrete principles and a four‑step implementation plan—are the decisive factor.

AI scalingAgentic AIData Architecture
0 likes · 7 min read
Why Strong Data Foundations Are Crucial for Scaling Agentic AI
ByteDance SE Lab
ByteDance SE Lab
Apr 7, 2026 · Artificial Intelligence

How Scale‑SWE Enables 100k Real‑World Coding Tasks for AI Agents

The Scale‑SWE project combines a massive 100k‑sample software engineering dataset with a high‑concurrency sandbox infrastructure and a multi‑agent workflow to dramatically improve code‑agent training, evaluation, and real‑world performance, surpassing existing models on SWE‑bench benchmarks.

AI scalingSWE datasetSoftware Engineering
0 likes · 11 min read
How Scale‑SWE Enables 100k Real‑World Coding Tasks for AI Agents
Fighter's World
Fighter's World
Oct 7, 2025 · Industry Insights

How Many Digital Workers Could Future AI Deploy?

The article analyzes Epoch AI's token‑based framework for estimating AI‑generated digital workers, critiques its static assumptions, and proposes a dynamic, multi‑factor model that incorporates compute supply, hardware constraints, inference efficiency, task reliability, and economic value to forecast a wide range of possible future digital‑worker counts.

AIAI InfrastructureAI scaling
0 likes · 27 min read
How Many Digital Workers Could Future AI Deploy?
DataFunSummit
DataFunSummit
Sep 20, 2025 · Artificial Intelligence

How We Scaled WeChat AI Services with Ray: Lessons from Million‑Node Deployments

This article examines how WeChat’s Astra platform leverages the Ray distributed framework to manage million‑node AI workloads, addressing challenges of scale, heterogeneous GPU resources, operational complexity, and cost, and outlines the architecture that unifies Ray services across multiple Kubernetes clusters.

AI scalingAstra PlatformGPU Management
0 likes · 5 min read
How We Scaled WeChat AI Services with Ray: Lessons from Million‑Node Deployments
DataFunSummit
DataFunSummit
Sep 11, 2025 · Artificial Intelligence

How Ray Powers Massive AI Computing on WeChat: Lessons from Tencent

This article examines how Tencent leverages the Ray distributed framework within the Astra platform to handle WeChat's massive AI workloads, addressing challenges of scale, heterogeneous GPU resources, operational complexity, and cost while outlining the architecture and practical benefits.

AI scalingAstra PlatformRay
0 likes · 5 min read
How Ray Powers Massive AI Computing on WeChat: Lessons from Tencent
DataFunSummit
DataFunSummit
Aug 28, 2025 · Artificial Intelligence

How We Scaled AI Compute to Millions of Nodes with Ray on WeChat

This article explains how Tencent's WeChat team built the Astra platform on Ray to manage millions of AI compute nodes, addressing challenges of massive scale, heterogeneous GPU resources, low‑priority node instability, deployment complexity, and cost, while detailing architecture, scheduling strategies, and practical usage examples.

AI scalingCluster ManagementRay
0 likes · 21 min read
How We Scaled AI Compute to Millions of Nodes with Ray on WeChat
DataFunTalk
DataFunTalk
Jul 16, 2025 · Artificial Intelligence

MiniMax-M1 Revealed: Hybrid Attention, RL Training, and 1M Token Context

MiniMax’s latest M1 model, unveiled after a $300 million funding round, showcases a 4.56‑trillion‑parameter hybrid‑expert architecture with lightning attention, supporting up to one million tokens, and leverages reinforcement‑learning techniques to enhance long‑context handling, inference efficiency, and system‑2 reasoning capabilities.

AI scalingModel architecturehybrid attention
0 likes · 16 min read
MiniMax-M1 Revealed: Hybrid Attention, RL Training, and 1M Token Context
AI Cyberspace
AI Cyberspace
May 20, 2025 · Artificial Intelligence

Why SuperNode and SuperPOD Are Critical for Scaling AI Models

This article explains the scaling laws behind large language models, the explosive growth of model sizes and compute demands, and why modern AI infrastructure must adopt SuperNode and SuperPOD architectures that combine high‑bandwidth Scale‑Up networks with flexible Scale‑Out networking to overcome bandwidth, latency, and power challenges.

AI scalingDistributed TrainingSuperPoD
0 likes · 42 min read
Why SuperNode and SuperPOD Are Critical for Scaling AI Models
Fighter's World
Fighter's World
Dec 7, 2024 · Artificial Intelligence

Does Scaling Law Still Hold? Analyzing OpenAI’s 12‑Day Mini Releases and the Future of GPT‑5

The article examines OpenAI’s 12‑day mini‑series, the emergence of o1 and Reinforcement Fine‑Tuning, and uses Epoch AI’s 2024 report to evaluate four critical constraints—power, chip capacity, data scarcity, and latency—that determine whether AI scaling laws can sustain the compute needed for a GPT‑5‑scale model by 2030.

AI scalingLatencychip manufacturing
0 likes · 11 min read
Does Scaling Law Still Hold? Analyzing OpenAI’s 12‑Day Mini Releases and the Future of GPT‑5
Baobao Algorithm Notes
Baobao Algorithm Notes
Aug 27, 2024 · Industry Insights

What Real‑World LLM Researchers Face: Scaling Limits, Data Bottlenecks, and Deployment Challenges

The author shares a candid account of recent large‑model experiments, highlighting why most labs struggle to exceed 100 B parameters, how data and hardware constraints shape model iteration, and the practical engineering, safety, and multimodal challenges that dictate real‑world LLM deployment.

AI industryAI scalingHardware acceleration
0 likes · 6 min read
What Real‑World LLM Researchers Face: Scaling Limits, Data Bottlenecks, and Deployment Challenges