Tagged articles
171 articles
Page 1 of 2
Machine Heart
Machine Heart
May 19, 2026 · Artificial Intelligence

HyperEyes: Parallel Multimodal Search Agents Move from Deep to Wide for Efficiency

HyperEyes introduces a unified‑location‑as‑search (UGS) action space, parallel data synthesis, and a dual‑granularity efficiency‑aware RL framework that enable multimodal agents to perform simultaneous multi‑target retrieval, dramatically reducing interaction rounds while improving accuracy and cost‑efficiency across benchmark evaluations.

AgentBenchmarkefficiency
0 likes · 9 min read
HyperEyes: Parallel Multimodal Search Agents Move from Deep to Wide for Efficiency
PaperAgent
PaperAgent
May 9, 2026 · Artificial Intelligence

How ActDistill Slashes Deployment Costs of VLA Large Models

ActDistill, proposed by Tongji University and collaborators, reduces the inference latency, compute consumption, and action-loop speed of Vision‑Language‑Action (VLA) models by selectively distilling action‑relevant knowledge, achieving up to 1.67× speedup while preserving control quality on real robot hardware.

ActDistillRoboticsVLA
0 likes · 13 min read
How ActDistill Slashes Deployment Costs of VLA Large Models
Machine Heart
Machine Heart
May 9, 2026 · Artificial Intelligence

BARD-VL Achieves New SOTA for Multimodal Diffusion Models via Autoregressive‑Diffusion Bridge

The BARD-VL framework bridges pretrained autoregressive vision‑language models to diffusion‑based VLMs, preserving or surpassing original performance while boosting decoding throughput up to three times, through progressive block merging, stage‑wise diffusion distillation, and engineering optimizations validated on multiple benchmarks.

BARD-VLBenchmarkdiffusion
0 likes · 9 min read
BARD-VL Achieves New SOTA for Multimodal Diffusion Models via Autoregressive‑Diffusion Bridge
ZhiKe AI
ZhiKe AI
Apr 28, 2026 · Artificial Intelligence

Demystifying DeepSeek‑V4 Benchmarks with Real‑World Data

This article breaks down DeepSeek‑V4's six core capability categories—knowledge, reasoning, programming, math, long‑context, and agent—showing how each benchmark works, presenting concrete scores that place V4 first or second against leading models, and explaining the hidden efficiency gains that make V4 up to 13.7× cheaper to run.

AI EvaluationBenchmarkDeepSeek-V4
0 likes · 14 min read
Demystifying DeepSeek‑V4 Benchmarks with Real‑World Data
SuanNi
SuanNi
Apr 12, 2026 · Artificial Intelligence

How MemPO Gives AI Agents Long‑Term Memory and Cuts Costs by 70%

The paper introduces MemPO, a self‑memory strategy optimization algorithm that lets large language model agents actively manage their memory, dramatically improving accuracy on complex multi‑step tasks while reducing token consumption by up to 73%, and validates the approach with extensive experiments and analysis.

AILong-term MemoryMemory Optimization
0 likes · 11 min read
How MemPO Gives AI Agents Long‑Term Memory and Cuts Costs by 70%
AI Engineering
AI Engineering
Apr 9, 2026 · Artificial Intelligence

Meta Unveils Muse Spark: Does Alexandr Wang’s First MSL Model Deliver?

Meta’s new Muse Spark model, the first output of Meta Superintelligence Labs, claims multimodal reasoning, ten‑fold compute efficiency over comparable models, strong safety rejection rates, and competitive benchmark scores, while being rolled out across Meta’s core apps.

BenchmarkContemplating modeMeta
0 likes · 6 min read
Meta Unveils Muse Spark: Does Alexandr Wang’s First MSL Model Deliver?
PMTalk Product Manager Community
PMTalk Product Manager Community
Mar 30, 2026 · Product Management

What Product Managers Lose When AI Takes Over Their Thinking

The article examines how reliance on generative AI tools boosts product managers' efficiency but erodes essential skills such as independent user insight, structured thinking, judgment, and differentiation, citing research from MIT and Microsoft‑CMU, and offers practical habits to preserve critical thinking while still leveraging AI.

AIcognitive biasefficiency
0 likes · 16 min read
What Product Managers Lose When AI Takes Over Their Thinking
AIWalker
AIWalker
Mar 18, 2026 · Artificial Intelligence

7× Faster Inference: Tsinghua’s Huang‑Gao Team Redesigns Vision‑Transformer Attention via Fourier Transforms

The AAAI 2026 paper by Tsinghua’s Huang‑Gao team shows that modeling Vision‑Transformer attention as a Block‑Circulant matrix and computing it with FFT reduces the quadratic complexity to O(N log N), delivering up to seven‑fold real‑world speedups without sacrificing accuracy.

AAAI 2026Circulant MatricesComputer Vision
0 likes · 15 min read
7× Faster Inference: Tsinghua’s Huang‑Gao Team Redesigns Vision‑Transformer Attention via Fourier Transforms
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Mar 10, 2026 · Artificial Intelligence

How InfLLM‑V2 Achieves Seamless Short‑to‑Long Context Upgrade with Minimal Structural Changes

InfLLM‑V2 introduces a dense‑sparse switchable attention framework that preserves the original dense‑attention parameters while enabling efficient long‑context training, matching full‑attention performance on benchmarks such as RULER, LongBench, and chain‑reasoning tasks, and delivering up to 2.3× end‑to‑end inference speedup without degrading short‑sequence abilities.

InfLLM-V2Transformerdense-sparse attention
0 likes · 16 min read
How InfLLM‑V2 Achieves Seamless Short‑to‑Long Context Upgrade with Minimal Structural Changes
PaperAgent
PaperAgent
Mar 10, 2026 · Artificial Intelligence

How MemSifter Delivers High‑Precision, Low‑Cost Long‑Term Memory for LLMs

MemSifter introduces a lightweight agent that outsources memory retrieval for large language models, using a Think‑and‑Rank pipeline and a task‑result‑oriented reinforcement‑learning training paradigm to achieve superior retrieval accuracy and efficiency across eight benchmark tasks while keeping inference overhead minimal.

AgentBenchmarkLLM
0 likes · 13 min read
How MemSifter Delivers High‑Precision, Low‑Cost Long‑Term Memory for LLMs
AI Explorer
AI Explorer
Mar 7, 2026 · Artificial Intelligence

SenseTime’s Multimodal Model Skips the Encoder, Boosting Performance and Shifting AI Design Paradigms

SenseTime eliminates the intermediate encoder in multimodal AI models, allowing direct cross‑modal learning, which yields markedly higher performance at 2‑trillion‑parameter scale while reducing training cost, and may trigger a broader industry move toward simpler, more efficient architectures.

AI Paradigm ShiftModel architectureMultimodal AI
0 likes · 6 min read
SenseTime’s Multimodal Model Skips the Encoder, Boosting Performance and Shifting AI Design Paradigms
Digital Planet
Digital Planet
Feb 24, 2026 · Industry Insights

Why Are Second‑Tier Distributors Disappearing in China's FMCG Channels?

The article analyzes how fast‑moving consumer goods brands are abandoning second‑tier distributors as digital tools, B2B platforms, and efficiency‑driven channel redesign reshape the market, outlining the historical role of these intermediaries, the pressures that render them obsolete, and strategic steps for brands to adapt.

B2BDigital TransformationFMCG
0 likes · 14 min read
Why Are Second‑Tier Distributors Disappearing in China's FMCG Channels?
PaperAgent
PaperAgent
Jan 30, 2026 · Artificial Intelligence

How LLM‑in‑Sandbox Turns Large Models into General‑Purpose Agents Without Extra Training

The LLM‑in‑Sandbox framework places large language models inside a virtual machine that provides external tool access, persistent storage, and code execution, yielding up to a 24.2% performance boost across six benchmark tasks without additional training, and it scales from zero‑shot to reinforcement‑learning‑enhanced agents while remaining cost‑effective.

Agentic AILLMefficiency
0 likes · 6 min read
How LLM‑in‑Sandbox Turns Large Models into General‑Purpose Agents Without Extra Training
Zhihu Tech Column
Zhihu Tech Column
Jan 20, 2026 · Artificial Intelligence

How AI‑Powered Agentic Workflows Cut Costs and Boosted R&D Efficiency by Over 30% – A Real‑World Case Study

This article details a multi‑year, data‑driven transformation in which a product‑research team leveraged large‑model AI and agentic workflows to automate repetitive coding, streamline hot‑topic discussion creation, and replace a seven‑person outsourcing crew, achieving up to 38.6% project‑time reduction, a 22.5‑25 PD weekly capacity gain, and a dramatic drop in marginal costs.

AICost reductionGoogle ADK
0 likes · 29 min read
How AI‑Powered Agentic Workflows Cut Costs and Boosted R&D Efficiency by Over 30% – A Real‑World Case Study
Design Hub
Design Hub
Dec 12, 2025 · Artificial Intelligence

GPT-5.2 Unveiled: A Cutting-Edge AI Super-Assistant Built for Real-World Work

OpenAI's newly released GPT-5.2 claims to outperform human experts on about 70% of real tasks, achieve a perfect score on the AIME 2025 competition, and deliver dramatic efficiency gains—up to 390× cost reduction—while showcasing impressive examples such as one‑shot ocean shader generation, a full 3D engine built in a single file, and visual‑perception scores rivaling top models.

AI benchmarksAgent AIDesign Automation
0 likes · 8 min read
GPT-5.2 Unveiled: A Cutting-Edge AI Super-Assistant Built for Real-World Work
PMTalk Product Manager Community
PMTalk Product Manager Community
Nov 29, 2025 · Industry Insights

When Proud KPIs Turn Into Layoff Notices: An AI Product Manager’s Path to Redemption

The article reflects on how AI‑driven efficiency in customer‑service centers can initially boost performance and morale, but as growth stalls the same metrics become a justification for staff cuts, prompting product managers to rethink their role from replacer to enabler while preserving human dignity.

AIEthicscustomer-service
0 likes · 8 min read
When Proud KPIs Turn Into Layoff Notices: An AI Product Manager’s Path to Redemption
Tencent Advertising Technology
Tencent Advertising Technology
Nov 28, 2025 · Artificial Intelligence

How Retrv-R1 Redefines Universal Multimodal Retrieval with Reasoning‑Driven MLLM

Retrv‑R1, a reasoning‑driven multimodal large language model framework, tackles the precision‑efficiency dilemma of universal multimodal retrieval by introducing a two‑stage coarse‑to‑fine pipeline, an information‑compression module, a detail‑inspection mechanism, and a three‑stage training strategy, achieving SOTA performance across accuracy, efficiency, and generalization benchmarks.

GeneralizationMLLMMultimodal Retrieval
0 likes · 21 min read
How Retrv-R1 Redefines Universal Multimodal Retrieval with Reasoning‑Driven MLLM
Kuaishou Tech
Kuaishou Tech
Nov 5, 2025 · Artificial Intelligence

How HiPO Gives LLMs a Smart Thinking Switch to Cut Costs and Boost Accuracy

This article explains the overthinking problem of large language models, introduces the HiPO framework with hybrid data cold‑start and reinforcement‑learning reward mechanisms that let models decide when to think deeply or answer directly, and shows experimental results demonstrating significant efficiency gains and accuracy improvements across multiple benchmarks.

Hybrid Policy OptimizationLLMadaptive inference
0 likes · 13 min read
How HiPO Gives LLMs a Smart Thinking Switch to Cut Costs and Boost Accuracy
Alibaba Cloud Developer
Alibaba Cloud Developer
Oct 27, 2025 · Artificial Intelligence

How to Build a Quantifiable AI Coding Efficiency Metric System

This article explains how, amid the rapid rise of AI‑assisted programming, a scientific and actionable R&D efficiency metric framework was designed, detailing core indicators such as AI code adoption rate, data collection methods, platform architecture, and practical insights from a large‑scale implementation.

AIMCPcoding
0 likes · 18 min read
How to Build a Quantifiable AI Coding Efficiency Metric System
Data Party THU
Data Party THU
Oct 25, 2025 · Artificial Intelligence

How InfLLM‑V2 Delivers Fast, Low‑Cost Sparse Attention for Long‑Context LLMs

InfLLM‑V2 introduces a zero‑parameter, train‑efficient sparse‑attention framework that dramatically speeds up long‑sequence processing while requiring only 5 B tokens for training, and the open‑source MiniCPM4.1 model demonstrates comparable performance to dense attention on both long‑text understanding and deep‑thinking benchmarks.

InfLLM-V2MiniCPM4.1efficiency
0 likes · 10 min read
How InfLLM‑V2 Delivers Fast, Low‑Cost Sparse Attention for Long‑Context LLMs
DaTaobao Tech
DaTaobao Tech
Oct 22, 2025 · Artificial Intelligence

How AI Coding Transforms Complex Client Development: Methods, Challenges, and Efficiency Gains

This article reveals the core methodology of applying AI coding to complex client-side development, discusses practical challenges, prompt design, task decomposition, efficiency improvements, and provides actionable guidelines and architectural rules for integrating AI into UI and service layers.

AI CodingPrompt EngineeringSoftware Architecture
0 likes · 15 min read
How AI Coding Transforms Complex Client Development: Methods, Challenges, and Efficiency Gains
Data Party THU
Data Party THU
Oct 18, 2025 · Artificial Intelligence

Can Classic Graph Autoencoders Rival SOTA? Surprising Optimizations Reveal Their Power

Researchers from Peking University demonstrate that, by applying modern optimization techniques to the decades‑old Graph Autoencoder (GAE), the model can achieve state‑of‑the‑art link‑prediction performance on benchmarks like ogbl‑ppa, while delivering orders‑of‑magnitude speed improvements, challenging the trend toward ever‑more complex GNNs.

Model Optimizationefficiencygraph autoencoder
0 likes · 10 min read
Can Classic Graph Autoencoders Rival SOTA? Surprising Optimizations Reveal Their Power
AI2ML AI to Machine Learning
AI2ML AI to Machine Learning
Oct 13, 2025 · Artificial Intelligence

How Large‑and‑Small Language Model Collaboration Is Shaping the Future

The article argues that combining large, high‑capacity models with lightweight, fine‑tuned small models can cut costs, lower latency, enable specialized vertical tasks, and shift development from chasing ever‑bigger models toward optimal system architectures, outlining key techniques such as state‑space models, knowledge distillation, and staged fine‑tuning.

AI ArchitectureFine-tuningefficiency
0 likes · 3 min read
How Large‑and‑Small Language Model Collaboration Is Shaping the Future
Data Party THU
Data Party THU
Oct 13, 2025 · Artificial Intelligence

How BranchGRPO Accelerates and Stabilizes Diffusion Model Alignment

BranchGRPO introduces a tree‑structured branching, reward‑fusion, and lightweight pruning framework that dramatically speeds up diffusion and flow model training while delivering denser, more stable reward signals, achieving up to five‑fold faster convergence and higher alignment scores on image and video generation benchmarks.

BranchGRPORLHFdiffusion models
0 likes · 10 min read
How BranchGRPO Accelerates and Stabilizes Diffusion Model Alignment
Data Party THU
Data Party THU
Oct 6, 2025 · Artificial Intelligence

How OneCAT Redefines Multimodal AI with a Decoder‑Only Architecture

OneCAT introduces a unified decoder‑only transformer that eliminates separate visual encoders, employs a modality‑specific MoE, integrates multi‑scale visual generation, and achieves state‑of‑the‑art performance and efficiency across multimodal understanding, text‑to‑image synthesis, and image editing tasks.

AI modelOneCATdecoder-only
0 likes · 14 min read
How OneCAT Redefines Multimodal AI with a Decoder‑Only Architecture
大转转FE
大转转FE
Sep 19, 2025 · Frontend Development

Boosting Frontend Development Efficiency with AI: A Real‑World Cursor Case Study

This article details how integrating the AI coding assistant Cursor into a membership‑system frontend project increased development efficiency by 21%, reduced a 188‑hour task to 149 hours, and outlines practical methods for AI‑generated routing, UI‑to‑DOM conversion, mock data creation, code refactoring, and the limits of AI assistance in complex interactions.

AI-assisted developmentCode GenerationComponent Architecture
0 likes · 20 min read
Boosting Frontend Development Efficiency with AI: A Real‑World Cursor Case Study
Model Perspective
Model Perspective
Sep 4, 2025 · Fundamentals

Why Economics Matters: Understanding Scarcity, Choice, and Incentives

Economics studies how societies allocate scarce resources to satisfy unlimited wants, exploring concepts such as scarcity, choice, efficiency versus fairness, institutional incentives, rationality versus behavior, and the interplay between micro and macro perspectives, while highlighting its practical relevance in policy, business, and personal decisions.

Fairnessbehavioral economicseconomics
0 likes · 9 min read
Why Economics Matters: Understanding Scarcity, Choice, and Incentives
AIWalker
AIWalker
Sep 2, 2025 · Artificial Intelligence

BEVANet’s Triple Boost for Real-Time Segmentation: Field, Edge, Speed

BEVANet tackles the efficiency‑accuracy trade‑off in real‑time semantic segmentation by integrating large‑kernel attention, an efficient visual attention (EVA) module, a bilateral architecture, and boundary‑guided adaptive fusion, delivering up to 81 % mIoU on Cityscapes at 33 FPS and surpassing prior state‑of‑the‑art models on both accuracy and speed.

Computer VisionReal-Timeefficiency
0 likes · 19 min read
BEVANet’s Triple Boost for Real-Time Segmentation: Field, Edge, Speed
Java Web Project
Java Web Project
Aug 22, 2025 · Industry Insights

What Happens When Over‑Engineering Organizational Structure Turns a Temple into Chaos?

An allegorical case study shows how a temple’s endless creation of specialized departments—water‑carrying, incense‑fund, analysis, and more—fails to solve core resource shortages, exposing flawed processes, unclear responsibilities, and ineffective coordination that ultimately lead to systemic collapse.

Case StudyLeadershipManagement
0 likes · 5 min read
What Happens When Over‑Engineering Organizational Structure Turns a Temple into Chaos?
大转转FE
大转转FE
Aug 11, 2025 · Frontend Development

Frontend Weekly: AI‑Driven Efficiency, Tree Shaking Deep Dive, and Code Refactoring

This newsletter curates five technical articles covering AI‑enhanced frontend productivity, a comparative deep dive into tree shaking across major bundlers, innovative AI code generation for e‑commerce frontends, a rapid AI‑assisted component refactor, and efficiency gains in ad‑monitoring development.

AICode GenerationTree Shaking
0 likes · 4 min read
Frontend Weekly: AI‑Driven Efficiency, Tree Shaking Deep Dive, and Code Refactoring
DaTaobao Tech
DaTaobao Tech
Jul 16, 2025 · Artificial Intelligence

From GPT‑4 to Agentic AI: How LLM Architecture Evolved (2023‑2025)

Since GPT‑4’s 2023 debut, large language models have shifted from sheer scale to efficiency‑driven designs, advanced reasoning with chain‑of‑thought, and agentic tool use, as illustrated by MoE, MLA, and new attention mechanisms, reshaping benchmarks, commercial strategies, and the future of AI.

Agentic AILLMModel Scaling
0 likes · 24 min read
From GPT‑4 to Agentic AI: How LLM Architecture Evolved (2023‑2025)
High Availability Architecture
High Availability Architecture
Jul 9, 2025 · Artificial Intelligence

How LLMs Evolved from GPT‑4 to Agentic AI: Trends, Techniques, and Future Directions

This article analyzes the rapid evolution of large language models from the GPT‑4 era through efficiency‑focused sparsity and attention innovations, to inference‑time reasoning and tool‑using agents, highlighting key architectures, benchmark breakthroughs, competitive strategies, and emerging research directions toward embodied AI.

Agentic AILLMTransformer
0 likes · 24 min read
How LLMs Evolved from GPT‑4 to Agentic AI: Trends, Techniques, and Future Directions
Qiming AI - Digital Management Talk
Qiming AI - Digital Management Talk
Jun 23, 2025 · Operations

9 Essential Supply Chain Metrics to Transform Data‑Driven Decisions

This article outlines nine crucial supply‑chain metrics across procurement, production, logistics and overall efficiency, explains their formulas and real‑world examples, and shows how each indicator can be used to identify problems, benchmark performance, and drive data‑driven decision‑making for cost reduction and customer satisfaction.

Data-drivenLogisticsefficiency
0 likes · 12 min read
9 Essential Supply Chain Metrics to Transform Data‑Driven Decisions
Kuaishou Large Model
Kuaishou Large Model
Jun 20, 2025 · Artificial Intelligence

How OneRec Revolutionizes Short-Video Recommendations with End-to-End Generative AI

OneRec, an end-to-end generative recommendation system from Kuaishou, uses an encoder-decoder architecture, reward-based preference alignment, and reinforcement learning to dramatically improve video recommendation efficiency, boosting user engagement and reducing operational costs while achieving scaling-law performance comparable to large language models.

Kuaishouefficiencygenerative AI
0 likes · 18 min read
How OneRec Revolutionizes Short-Video Recommendations with End-to-End Generative AI
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Jun 19, 2025 · Artificial Intelligence

Can Adaptive Chain‑of‑Thought Learning Halve LLM Thinking Time?

The article introduces the Think When You Need (TWYN) method, a reinforcement‑learning approach that dynamically adapts chain‑of‑thought length, dramatically cuts redundant token generation in large language models, and maintains or improves accuracy across diverse reasoning benchmarks.

adaptive inferencechain-of-thoughtefficiency
0 likes · 9 min read
Can Adaptive Chain‑of‑Thought Learning Halve LLM Thinking Time?
Volcano Engine Developer Services
Volcano Engine Developer Services
May 8, 2025 · Operations

How ByteBrain-LogParser Achieves 1‑2 Orders Faster Log Parsing in Cloud Services

ByteBrain-LogParser is a cloud‑native log‑parsing framework that transforms unstructured logs into dynamic templates with real‑time precision control, delivering parsing speeds up to two orders of magnitude faster than state‑of‑the‑art methods while maintaining near‑SOTA accuracy and low storage overhead.

Cloud ServicesHierarchical ClusteringReal-time analytics
0 likes · 27 min read
How ByteBrain-LogParser Achieves 1‑2 Orders Faster Log Parsing in Cloud Services
58UXD
58UXD
May 8, 2025 · Product Management

How to Transform Consumer Finance UX: From Confusing Jargon to Trustworthy Borrowing

This article examines the pain points of consumer‑finance product users—confusing terminology, security concerns, and cumbersome processes—and proposes a three‑dimensional redesign (cognitive, emotional, efficiency) to make borrowing clearer, safer, and more efficient.

DesignTrustconsumer finance
0 likes · 6 min read
How to Transform Consumer Finance UX: From Confusing Jargon to Trustworthy Borrowing
Tencent Cloud Developer
Tencent Cloud Developer
Apr 8, 2025 · R&D Management

Building an Engineer Culture: Leading Reliable Programmers Amid Uncertainty

The article argues that cultivating a strong engineering culture—through shared values, clear processes, automation, altruistic collaboration, and continuous self‑improvement—empowers programmers to remain reliable and productive despite uncertainty, boosting the odds of sustained success while acknowledging that culture is only one of several critical factors.

Code reviewEngineering Cultureefficiency
0 likes · 21 min read
Building an Engineer Culture: Leading Reliable Programmers Amid Uncertainty
AI Frontier Lectures
AI Frontier Lectures
Mar 21, 2025 · Artificial Intelligence

Can Chain‑of‑Thought Templates Unlock Higher Reasoning Limits in LLMs?

The article examines how chain‑of‑thought (CoT) templates are evolving from short‑term heuristics to long‑range planning in large language models, highlighting recent advances such as OpenAI o1, DeepSeek R1, and Kimi 1.5, and explores template designs that boost reasoning performance, efficiency, and multimodal capabilities.

AI reasoningLong CoTPrompt Engineering
0 likes · 7 min read
Can Chain‑of‑Thought Templates Unlock Higher Reasoning Limits in LLMs?
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Mar 13, 2025 · Artificial Intelligence

UniCBE: A Unified Multi‑Objective Optimization Framework for Contrastive Based Evaluation

UniCBE introduces a unified multi‑objective optimization framework for contrastive‑based evaluation that mitigates sampling bias, unbalanced uncertainty reduction, and inefficient resource allocation by combining three decoupled probability matrices through a greedy and Hadamard‑product strategy, achieving Pearson correlations above 0.995 with only 83 % of the annotation budget and cutting evaluation costs by more than 50 % across diverse LLM evaluators.

Contrastive EvaluationSampling Biasefficiency
0 likes · 10 min read
UniCBE: A Unified Multi‑Objective Optimization Framework for Contrastive Based Evaluation
AIWalker
AIWalker
Feb 13, 2025 · Artificial Intelligence

How FlashVideo Turns Low‑Res Clips into 4K Video with Minimal Compute

FlashVideo introduces a two‑stage framework that first generates low‑resolution videos with strong prompt fidelity and then uses flow‑matching ODE trajectories to upscale to 4K quality in just four function evaluations, achieving top VBench‑Long scores while cutting generation time by up to five‑fold.

AIFlashVideoVideo Generation
0 likes · 26 min read
How FlashVideo Turns Low‑Res Clips into 4K Video with Minimal Compute
DaTaobao Tech
DaTaobao Tech
Jan 10, 2025 · Artificial Intelligence

AI-Driven Efficiency Improvements in a Chatroom Project

The article shows how AI accelerated a chatroom project's development—cutting cycle time, lowering developer skill demands, and delivering over 8% efficiency gains—by auto‑generating boilerplate code, debugging suggestions, and routine components such as API, Redis, MySQL, and WebSocket snippets, while outlining future improvements and a potential additional 10% R&D boost through integrated AI tools.

AIPythonWebSocket
0 likes · 9 min read
AI-Driven Efficiency Improvements in a Chatroom Project
DevOps
DevOps
Nov 24, 2024 · R&D Management

The Engineer Soul of BYD: Innovation, Competition, and Efficiency

The article examines BYD's engineering culture, highlighting how its leaders emphasize innovation, competition, and efficiency, and how this mindset drives technological breakthroughs, rigorous R&D processes, and a distinctive management style that empowers engineers to shape the company's long‑term success.

BYDEngineering CultureInnovation
0 likes · 19 min read
The Engineer Soul of BYD: Innovation, Competition, and Efficiency
NewBeeNLP
NewBeeNLP
Nov 11, 2024 · Artificial Intelligence

What Do Recent Multimodal LLM Papers Reveal About Vision‑Language Models?

This article surveys ten recent multimodal large language model papers, covering vision representation laws, a stricter instruction benchmark, safety impacts of visual adaptation, the Mini‑Gemini architecture, automatic pruning, vision capability boosting, long‑context transfer, efficient token sparsification, math reasoning, and hallucination mitigation.

BenchmarkTraining StrategiesVision-Language Models
0 likes · 18 min read
What Do Recent Multimodal LLM Papers Reveal About Vision‑Language Models?
Data Thinking Notes
Data Thinking Notes
Sep 24, 2024 · Artificial Intelligence

Leveraging Large Models to Transform Data Governance: Quality, Cost, Efficiency

This article explains how large language models enhance data governance by improving data quality, reducing implementation costs, and increasing operational efficiency through knowledge bases and interactive prompt libraries, and it also outlines practical empowerment pathways for organizations seeking to leverage AI-driven analytics.

AICost reductionData Governance
0 likes · 3 min read
Leveraging Large Models to Transform Data Governance: Quality, Cost, Efficiency
Continuous Delivery 2.0
Continuous Delivery 2.0
Jul 26, 2024 · R&D Management

Challenges and Essential Requirements for Improving Software Development Efficiency

Software development efficiency faces multiple challenges such as slow business growth versus high technical labor costs, high customer expectations versus limited delivery capacity, excessive product manager demands, and legacy system debt, and improving it requires enhancing team skills, communication, process quality, and management practices.

efficiencylegacy systemsprocess improvement
0 likes · 5 min read
Challenges and Essential Requirements for Improving Software Development Efficiency
Tech Architecture Stories
Tech Architecture Stories
Jun 10, 2024 · Operations

How to Slash Cloud Costs: Real Lessons from Tencent’s Video Platforms

This article examines Tencent's cost‑optimization journey for its short‑video and streaming services, breaking down human, server, and bandwidth expenses, explaining precise accounting methods, negotiation tactics, usage‑vs‑efficiency strategies, and global resource‑scheduling techniques to achieve sustainable cost reduction.

Cloud ResourcesCost OptimizationResource Management
0 likes · 12 min read
How to Slash Cloud Costs: Real Lessons from Tencent’s Video Platforms
DataFunSummit
DataFunSummit
May 9, 2024 · Fundamentals

Technical Evolution and Challenges of Online A/B Testing

This article reviews the two‑decade evolution of online A/B testing, outlines the business and technical challenges enterprises face, and details three core technical challenges—experiment accuracy, analysis & interpretation, and efficiency—along with practical solutions for each.

A/B testingAnalysisdata-driven decision
0 likes · 6 min read
Technical Evolution and Challenges of Online A/B Testing
Architecture and Beyond
Architecture and Beyond
Apr 27, 2024 · Product Management

Improving Efficiency in Domestic SaaS Companies: Product, R&D, Customer Acquisition, and After‑Sales Challenges

The article analyses the low efficiency problems faced by Chinese SaaS firms across product, development, customer acquisition and after‑sales, identifies root causes such as homogenous offerings, poor demand management, weak R&D processes, and proposes strategic, cultural, architectural, procedural and metric‑driven solutions to boost overall performance.

Customer AcquisitionR&DSaaS
0 likes · 22 min read
Improving Efficiency in Domestic SaaS Companies: Product, R&D, Customer Acquisition, and After‑Sales Challenges
Kuaishou Tech
Kuaishou Tech
Apr 17, 2024 · Artificial Intelligence

Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning

The paper presented at AAAI introduces the EERCF method, a coarse‑to‑fine visual representation and two‑stage recall‑then‑rerank strategy that dramatically reduces cross‑modal matching FLOPs while preserving state‑of‑the‑art retrieval performance on multiple video benchmarks.

AIMultimodal Learningcoarse-to-fine representation
0 likes · 8 min read
Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning
Model Perspective
Model Perspective
Mar 13, 2024 · Operations

Evaluating City Efficiency with DEA’s CCR and BCC Models

This article introduces Data Envelopment Analysis (DEA) as a non‑parametric method for assessing relative efficiency of decision‑making units, explains the CCR and BCC models, and demonstrates their application in evaluating and comparing the efficiency of various U.S. cities using real‑world data.

BCCCCRDEA
0 likes · 9 min read
Evaluating City Efficiency with DEA’s CCR and BCC Models
AntTech
AntTech
Mar 11, 2024 · Artificial Intelligence

Can Small Language Models be Good Reasoners in Recommender Systems?

This article presents SLIM, a knowledge‑distillation framework that transfers the reasoning abilities of large language models to compact models for sequential recommendation, enhancing item representation, user profiling, and bias mitigation while achieving comparable performance with far lower computational resources.

AILLMefficiency
0 likes · 12 min read
Can Small Language Models be Good Reasoners in Recommender Systems?
Alibaba Cloud Developer
Alibaba Cloud Developer
Mar 8, 2024 · R&D Management

From Test Engineer to R&D Leader: Growth, Efficiency & Stability Lessons

The author reflects on five years at Alibaba as a test developer, sharing personal growth stages, the challenges of rapid change and business pressure, practical approaches to R&D efficiency, stability metrics, and team management, offering actionable insights for engineers seeking continuous improvement and leadership.

R&D managementTest Developmentefficiency
0 likes · 33 min read
From Test Engineer to R&D Leader: Growth, Efficiency & Stability Lessons
Ctrip Technology
Ctrip Technology
Feb 22, 2024 · Backend Development

Design and Implementation of a Serverless Data Filling Engine for UnifiedPB in Ctrip Hotel Recommendation System

This article describes how Ctrip's hotel recommendation team built a serverless, configuration‑driven data‑filling engine based on UnifiedPB protobuf schemas to improve development efficiency, reduce cost, ensure data quality, and achieve unified three‑region data delivery across more than twenty recommendation scenarios.

BackendServerlessdata engineering
0 likes · 12 min read
Design and Implementation of a Serverless Data Filling Engine for UnifiedPB in Ctrip Hotel Recommendation System
DevOps
DevOps
Feb 20, 2024 · R&D Management

Leadership in Project Management: Principles, Elements, Value, and Practices

This article explores the concept of leadership within project management, outlining its definition, the L‑E‑A‑D elements (Listen, Efficiency, Action, Development), the value it brings to team culture, collaboration, innovation and risk management, and practical ways to improve leadership skills.

InnovationLeadershipProject Management
0 likes · 12 min read
Leadership in Project Management: Principles, Elements, Value, and Practices
Qunhe Technology User Experience Design
Qunhe Technology User Experience Design
Dec 4, 2023 · Artificial Intelligence

How AI is Transforming Creative Design: Real‑World Cases & Workflow Insights

This article examines how AI tools are reshaping creative design teams by boosting efficiency, outlines feasible application areas through quadrant analysis, and presents four detailed case studies that compare traditional and AI‑augmented workflows, highlighting productivity gains and practical considerations.

AIAIGCCase Study
0 likes · 7 min read
How AI is Transforming Creative Design: Real‑World Cases & Workflow Insights
DataFunTalk
DataFunTalk
Nov 21, 2023 · Artificial Intelligence

Improving Efficiency of Large-Scale Distributed Training for Large Language Models

Recent advances in large language models have dramatically increased model size and training data, leading to soaring computational costs; this article examines the scaling trends, hardware utilization challenges, distributed training techniques, and ethical considerations, highlighting methods to improve efficiency, reduce costs, and mitigate environmental impact.

AI ethicsDistributed Trainingcompute optimization
0 likes · 29 min read
Improving Efficiency of Large-Scale Distributed Training for Large Language Models
Bilibili Tech
Bilibili Tech
Oct 27, 2023 · Game Development

Optimizing Global Game Publishing Efficiency with ONE SDK/API: Architecture, Metrics, and Practices

Bilibili’s ONE SDK/API unifies 22 fragmented game SDK variants across brands, regions, and devices into a single, adapter‑based package with centralized parameter management and automated multi‑channel builds, slashing global integration time from 60 to 15 days, cutting parameter awareness by 90 % and duplicate development effort by at least 75 %.

SDKSoftware Architectureefficiency
0 likes · 30 min read
Optimizing Global Game Publishing Efficiency with ONE SDK/API: Architecture, Metrics, and Practices
Kuaishou Tech
Kuaishou Tech
Oct 26, 2023 · Artificial Intelligence

SHARK: Efficient Embedding Compression for Large-Scale Recommendation Models

The paper introduces SHARK, a two‑component framework that uses a fast Taylor‑expanded permutation method to prune embedding tables and a frequency‑aware quantization scheme to apply mixed‑precision to embeddings, achieving up to 70% memory reduction and 30% QPS improvement in industrial short‑video and e‑commerce recommendation systems.

Model Pruningefficiencyembedding compression
0 likes · 8 min read
SHARK: Efficient Embedding Compression for Large-Scale Recommendation Models
Ximalaya Technology Team
Ximalaya Technology Team
Aug 15, 2023 · Operations

How Scenario‑Based Automated Testing Boosted Efficiency for a Fast‑Growing Audio Platform

This article details how a rapidly expanding audio service built a lightweight, modular testing framework called FAST, defined core scenario‑based testing concepts, and implemented an automated pipeline that reduced manual effort, cut costs, and improved release quality across multiple teams.

Automated TestingScenario TestingSoftware Operations
0 likes · 7 min read
How Scenario‑Based Automated Testing Boosted Efficiency for a Fast‑Growing Audio Platform
Baobao Algorithm Notes
Baobao Algorithm Notes
Jul 23, 2023 · Artificial Intelligence

Why Cold Starts, Reward Hacking, and Evaluation Matter in LLM Training

The article analyzes key challenges in large‑language‑model pipelines—including the necessity of cold‑start pretraining, the pitfalls of reward‑model hacking, efficiency‑effectiveness trade‑offs, evaluation difficulties, and downstream fine‑tuning limits—offering practical insights for more reliable LLM development.

Fine-tuningLLMRLHF
0 likes · 9 min read
Why Cold Starts, Reward Hacking, and Evaluation Matter in LLM Training
DeWu Technology
DeWu Technology
Jun 28, 2023 · R&D Management

Interpreting R&D Data Metrics: From Collection to Actionable Insights

Effective R&D efficiency improvement requires moving from manual, scattered, and unstandardized data collection to mature, integrated systems, defining metrics aligned with business goals, and applying a three‑layer framework—quantifiable, explainable, and intervenable—through cleaning, baseline setting, statistical analysis, root‑cause identification, and ROI‑focused action planning to turn numbers into actionable insights.

R&D metricsdata analysisdecision making
0 likes · 12 min read
Interpreting R&D Data Metrics: From Collection to Actionable Insights
Test Development Learning Exchange
Test Development Learning Exchange
May 12, 2023 · Fundamentals

Ten Benefits of Software Testers Writing Code

Writing code empowers software testers by automating test execution, reducing costs, expanding coverage, improving code quality, enhancing skill sets, increasing test case accuracy, shortening test cycles, boosting result reliability, streamlining test management, and accelerating data comparison, ultimately making testing more efficient and reliable.

Software Testingcode qualityefficiency
0 likes · 3 min read
Ten Benefits of Software Testers Writing Code
DataFunTalk
DataFunTalk
Mar 14, 2023 · Artificial Intelligence

Review of Deep Learning Model Evolution and Future Trends

The article reviews the past six years of deep‑learning model development, highlighting patterns such as increasing scale, growing universality, limited interpretability, and challenges in efficiency, while forecasting future directions like more efficient architectures, enhanced perception, multimodal capabilities, integration with life sciences, and the emergence of general‑purpose intelligent agents, and concludes with a promotion for a deep‑learning practice ebook.

AI trendsFuture AIInterpretability
0 likes · 6 min read
Review of Deep Learning Model Evolution and Future Trends
DevOps
DevOps
Mar 10, 2023 · R&D Management

Improving Software Development Efficiency: Organizational, Architectural, and Management Perspectives

The article examines software development efficiency by defining R&D effectiveness, discussing organizational design, architectural alignment, project management practices, and key performance metrics, and concludes with actionable implementation steps for building high‑performing engineering teams.

architectureefficiencysoftware development
0 likes · 10 min read
Improving Software Development Efficiency: Organizational, Architectural, and Management Perspectives
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Nov 4, 2022 · Artificial Intelligence

How AI Platforms Turn Dreams into Reality: Scaling, Efficiency, and Usability

In this talk from the 2022 Yunqi Conference, Jia Yangqing explains how Alibaba's AI platform addresses efficiency, scale, and usability challenges by moving the Damo Academy to the cloud, open‑sourcing ModelScope, and delivering large‑model training, deployment, and inference services at massive scale.

AI EngineeringModel Scalingefficiency
0 likes · 10 min read
How AI Platforms Turn Dreams into Reality: Scaling, Efficiency, and Usability
Model Perspective
Model Perspective
Oct 1, 2022 · Operations

How Variable Returns to Scale DEA Reveals Input/Output Slack in a Two‑Stage Model

This article explains how variable‑returns‑to‑scale input‑oriented and output‑oriented DEA models use input and output slacks, introduces a two‑stage linear programming approach to identify non‑zero slacks, and defines full and weak efficiency through formal DEA definitions and illustrative decision‑unit examples.

DEALinear Programmingefficiency
0 likes · 6 min read
How Variable Returns to Scale DEA Reveals Input/Output Slack in a Two‑Stage Model
58UXD
58UXD
Sep 30, 2022 · Product Management

Boosting Office Efficiency: Design Strategies from 58’s Cloud Collaboration 2.0

Amid the ongoing pandemic, the article examines how 58’s Cloud Efficiency platform redesigns project collaboration—covering full lifecycle roles, a unified messaging system, concept simplification, high‑efficiency pages, and multi‑dimensional displays—to formulate a clear efficiency equation that drives measurable office productivity gains.

CollaborationUX designefficiency
0 likes · 11 min read
Boosting Office Efficiency: Design Strategies from 58’s Cloud Collaboration 2.0
Dada Group Technology
Dada Group Technology
Sep 5, 2022 · Operations

Design and Implementation of JD.com Data Construction Platform for Testing Efficiency

This article describes the motivation, design, architecture, key features, and outcomes of JD.com's data construction platform, which automates test data creation using a Springboot‑Mybatis‑Vue stack, significantly reducing manual effort and improving testing efficiency across multiple business lines.

OperationsTesting Automationdata construction
0 likes · 9 min read
Design and Implementation of JD.com Data Construction Platform for Testing Efficiency
Java Captain
Java Captain
Aug 24, 2022 · R&D Management

How Apifox Boosted Our Team’s Efficiency: A Real‑World Case Study

In this firsthand account, a technical leader describes how adopting the Apifox API testing and mock tool transformed their small company's development workflow, cutting interface testing, integration, and client support times dramatically while improving code quality and overall team efficiency.

API testingApifoxAutomation
0 likes · 14 min read
How Apifox Boosted Our Team’s Efficiency: A Real‑World Case Study
转转QA
转转QA
Aug 12, 2022 · Backend Development

Improving Test Efficiency through Data Construction: Practices and Insights

This article explains how systematic data construction, using a low‑code front‑end and Java back‑end platform, streamlines complex test scenarios, reduces manual effort, and enhances both testing efficiency and code quality across multiple business systems.

Backend DevelopmentJavaQA
0 likes · 9 min read
Improving Test Efficiency through Data Construction: Practices and Insights
ITPUB
ITPUB
Aug 11, 2022 · Fundamentals

How to Train Yourself into a Programming Expert: Practical Steps and Mindset

This article breaks down what it means to be a programming expert, classifies three types of experts, defines the practical criteria of high efficiency, high quality, and stable output, and offers concrete advice on linking deep knowledge with real‑world problem solving to accelerate growth.

Career GrowthSkill DevelopmentSoftware Engineering
0 likes · 9 min read
How to Train Yourself into a Programming Expert: Practical Steps and Mindset
iQIYI Technical Product Team
iQIYI Technical Product Team
Jul 22, 2022 · Cloud Computing

How iQIYI’s Adoption of IMF Doubles Video Delivery Efficiency

By upgrading to the third‑phase Interoperable Master Format, iQIYI has cut delivery time in half, enabling one film’s upload to cover two, while reducing subtitle edit time to 5% of the original, slashing storage by over a third, and supporting 4K‑HDR and immersive audio for faster, more flexible global distribution.

IMFMedia FormatVideo Delivery
0 likes · 6 min read
How iQIYI’s Adoption of IMF Doubles Video Delivery Efficiency
DaTaobao Tech
DaTaobao Tech
Jul 20, 2022 · Backend Development

Scalable Backend Page Production with Standardization and No‑Code Solutions

By abstracting 86 % of form‑list‑detail scenarios into standardized scene assets and linking them with unified data models, the platform enables no‑code generation of backend pages, cutting development effort fivefold versus source code, boosting overall productivity by 68.6 % while preserving product quality and consistency.

No-codedata modelingefficiency
0 likes · 14 min read
Scalable Backend Page Production with Standardization and No‑Code Solutions
58 Tech
58 Tech
May 10, 2022 · R&D Management

Recap of 58 Technology Salon: R&D Efficiency Measurement and Mobile Quality Assurance

The article summarizes the 28th 58 Technology Salon held on April 27, 2022, featuring presentations on R&D efficiency measurement and mobile quality assurance, including detailed Q&A sessions that discuss metrics, data handling, performance testing tools, and practical improvements for development and testing teams.

Performance TestingR&D metricsSoftware Testing
0 likes · 6 min read
Recap of 58 Technology Salon: R&D Efficiency Measurement and Mobile Quality Assurance
Dada Group Technology
Dada Group Technology
May 6, 2022 · Frontend Development

Improving Usability and Efficiency of the Cashier System: Front‑End Architecture Redesign and Component Refactoring

This article analyzes the usability and efficiency issues of the JD.com to‑home cash register H5 and RN versions, proposes a front‑end architectural redesign with component reuse, layering, and design principles, and demonstrates the resulting performance gains, reduced code entropy, and future improvement plans.

Component DesignUsabilityefficiency
0 likes · 15 min read
Improving Usability and Efficiency of the Cashier System: Front‑End Architecture Redesign and Component Refactoring
DevOps
DevOps
Mar 28, 2022 · R&D Management

Why Efficiency Is the Enemy of Innovation: Insights from ByteDance, Tencent, and Huawei

The article argues that relentless pursuit of efficiency—seeking the shortest path—can stifle innovation, illustrating how purposeful waste, redundant resources, and flexible timing at companies like ByteDance, Tencent, and Huawei foster breakthrough ideas while balancing the need for eventual optimization.

R&D managementefficiencyresource allocation
0 likes · 10 min read
Why Efficiency Is the Enemy of Innovation: Insights from ByteDance, Tencent, and Huawei
Baidu Geek Talk
Baidu Geek Talk
Mar 24, 2022 · Frontend Development

Practical Techniques to Boost Web Development Efficiency

The article presents actionable methods to speed web development, including adopting a front‑back end separation architecture, leveraging scaffolding tools for data, documentation and mock interfaces, applying disciplined front‑end coding practices, and using computer‑vision‑based techniques to accelerate automated testing.

architecturecoding practicesefficiency
0 likes · 8 min read
Practical Techniques to Boost Web Development Efficiency
HomeTech
HomeTech
Mar 22, 2022 · Cloud Computing

AutoPPT System Architecture and Data-Driven PPT Generation

This article explores the AutoPPT system that automates PowerPoint presentation creation by integrating real-time data analysis, reducing manual effort, and enhancing efficiency through a structured workflow from data collection to final output.

AutoPPTJavaPPT generation
0 likes · 10 min read
AutoPPT System Architecture and Data-Driven PPT Generation
Taobao Frontend Technology
Taobao Frontend Technology
Mar 17, 2022 · Backend Development

How No‑Code Platforms Can Revolutionize Mid‑Office Page Production and Boost Development Efficiency

This article explains how a scene‑driven, no‑code platform standardizes UI, data, and API models to enable scalable, high‑efficiency production of middle‑office pages, reduces manual coding, improves collaboration across roles, and provides a unified efficiency‑measurement framework for source, low‑code, and no‑code development modes.

No-codedata modelingefficiency
0 likes · 16 min read
How No‑Code Platforms Can Revolutionize Mid‑Office Page Production and Boost Development Efficiency