Tagged articles

Large Language Model

737 articles · Page 1 of 8

Jul 7, 2026 · Artificial Intelligence

What Does Claude Think When It Remains Silent? Inside Anthropic’s Newly Discovered J Space

Anthropic’s recent study reveals a hidden "J space" inside Claude that silently holds concepts the model considers but does not output, and through a Jacobian‑lens technique the researchers can read, edit, and control this workspace, showing its role in multi‑step reasoning, task flexibility, and AI safety monitoring.

AI interpretabilityAI safetyAnthropic

0 likes · 29 min read

What Does Claude Think When It Remains Silent? Inside Anthropic’s Newly Discovered J Space

DataFunTalk

Jul 5, 2026 · Artificial Intelligence

Exploring Multimodal GraphRAG: How Document Intelligence, Knowledge Graphs, and Large Models Combine

This article presents a comprehensive technical analysis of multimodal GraphRAG, covering document‑intelligent parsing pipelines, multimodal graph index construction, knowledge‑graph‑enhanced chunk linking, various multimodal RAG approaches, their trade‑offs, benchmark results, and future research directions.

GraphRAGLarge Language ModelRAG

0 likes · 24 min read

Exploring Multimodal GraphRAG: How Document Intelligence, Knowledge Graphs, and Large Models Combine

DataFunSummit

Jul 3, 2026 · Artificial Intelligence

Designing Next‑Gen Recommendation and Search Systems with Agentic Architectures

This article reviews cutting‑edge AI search and recommendation techniques—including Alibaba Cloud’s Agentic RAG, Huawei’s LLM‑enhanced recommendation pipeline, and Baidu’s generative ranking model GRAB—detailing their architectural evolution, multimodal retrieval strategies, performance benchmarks, and practical deployment insights.

AI SearchAgentic RAGGPU Acceleration

0 likes · 6 min read

Designing Next‑Gen Recommendation and Search Systems with Agentic Architectures

DataFunTalk

Jul 3, 2026 · Artificial Intelligence

How Knora Uses Ontology + Large Models to Overcome Hallucinations and Execution Gaps in Enterprise AI

The article explains how enterprise AI is shifting from conversational assistance to autonomous execution, outlines six key challenges such as hallucinations and cold‑start, and details Knora's ontology‑enhanced platform—including its multi‑layer architecture, autonomous agents, real‑world LED production line case study, and roadmap—to deliver reliable, controllable AI solutions.

Autonomous AgentsEnterprise AIKnora

0 likes · 16 min read

How Knora Uses Ontology + Large Models to Overcome Hallucinations and Execution Gaps in Enterprise AI

Alibaba Cloud Big Data AI Platform

Jul 2, 2026 · Artificial Intelligence

AI Search + ES Agent Builder: Best Practices for Deploying Enterprise AI Assistants

This guide explains why enterprise data is hard for large language models, introduces ES Agent Builder as a solution, outlines three high‑value use cases, details the three‑layer architecture and four core components, and provides practical best‑practice recommendations with concrete examples and visualizations.

AI SearchAgent BuilderData Integration

0 likes · 15 min read

AI Search + ES Agent Builder: Best Practices for Deploying Enterprise AI Assistants

ITPUB

Jul 1, 2026 · Artificial Intelligence

How Tianyi Cloud Shifts from Manual Ops to Model‑Driven AI+ Cloud Integration

The article analyzes Tianyi Cloud's transition from labor‑intensive cloud service handling to an AI‑driven intelligent assistant, detailing the business challenges, the architecture that couples AI, data, and business middle platforms, and the measurable efficiency gains achieved through model‑based automation.

AIBusiness AutomationCloud Computing

0 likes · 12 min read

How Tianyi Cloud Shifts from Manual Ops to Model‑Driven AI+ Cloud Integration

DataFunSummit

Jun 29, 2026 · Big Data

Generate Ad Creative with One SQL Using Hologres for Intelligent Creation and Closed‑Loop Analysis

The article explains how Hologres AI Function and Skills transform traditional, slow, and fragmented ad‑creative production into a fully automated, SQL‑driven workflow that handles multimodal data ingestion, AI‑based labeling, video generation, and real‑time performance analysis in a single closed‑loop system.

AI FunctionAd CreativeData Warehouse

0 likes · 12 min read

Generate Ad Creative with One SQL Using Hologres for Intelligent Creation and Closed‑Loop Analysis

Machine Learning Algorithms & Natural Language Processing

Jun 27, 2026 · Artificial Intelligence

GPT-5.6 Emergency Halt: OpenAI’s Flagship Model Forced into One‑by‑One Review

OpenAI has abruptly paused the rollout of GPT‑5.6, limiting access to a small partner preview and requiring individual approval for each user, while developers uncover internal routes, performance claims, and compare the delay to Anthropic’s Fable 5 and Google’s Gemini 3.5, highlighting security‑driven release constraints across the AI industry.

AI safetyGPT-5.6Large Language Model

0 likes · 8 min read

GPT-5.6 Emergency Halt: OpenAI’s Flagship Model Forced into One‑by‑One Review

DataFunTalk

Jun 27, 2026 · Artificial Intelligence

OpenAI Unveils GPT‑5.6 ‘Solar System’ Models: Sol, Terra, Luna Outperform Mythos

OpenAI released GPT‑5.6 with three tiered models—Sol, Terra and Luna—named after celestial bodies, offering lower pricing, record‑breaking benchmark scores in programming, security, biology and health, new max and ultra inference modes, limited partner access, and a deployment plan on Cerebras that could make it the fastest flagship LLM.

AI benchmarksGPT-5.6Large Language Model

0 likes · 8 min read

OpenAI Unveils GPT‑5.6 ‘Solar System’ Models: Sol, Terra, Luna Outperform Mythos

Machine Heart

Jun 27, 2026 · Artificial Intelligence

GPT-5.6 Launch: Sol, Terra, Luna Beat Mythos Yet Stay Behind Paywall

OpenAI’s surprise preview of GPT‑5.6 introduces three tiered models—Sol, Terra and Luna—with Sol offering max and ultra modes that deliver top‑tier performance in programming, biology and cybersecurity benchmarks, lower pricing, a new prompt‑cache system, and a restricted rollout amid U.S. regulatory scrutiny.

AI safetyBenchmarkCerebras

0 likes · 7 min read

GPT-5.6 Launch: Sol, Terra, Luna Beat Mythos Yet Stay Behind Paywall

ITPUB

Jun 26, 2026 · Artificial Intelligence

Doubao Pro: AI Productivity for Only ¥68 – Unmatched Value and Performance

Doubao launches its Professional edition featuring the flagship 2.1 Pro model, a new office‑task mode, and tiered pricing starting at ¥68 per month, while benchmark tests show its coding and agent abilities rivaling GPT‑5.5 and surpassing competing subscription plans.

AI productivityBenchmarkChatGPT comparison

0 likes · 11 min read

Doubao Pro: AI Productivity for Only ¥68 – Unmatched Value and Performance

IT Services Circle

Jun 26, 2026 · Artificial Intelligence

Where to Find Reliable Free Large‑Model APIs for Everyday Developers?

The author built a zero‑cost internal coding assistant using iFlytek's free Qwen3.6‑35B‑A3B and Qwen3.5‑35B‑A3B models, explains why these models were chosen over alternatives, provides a nine‑step guide to claim the free MaaS token quota, shares ready‑to‑run Python code, and reports real‑world performance across code generation, long‑document parsing, and multi‑turn conversations, while also outlining suitable user groups and an optional enterprise Token Plan.

APILarge Language ModelMaaS

0 likes · 12 min read

Where to Find Reliable Free Large‑Model APIs for Everyday Developers?

Machine Heart

Jun 26, 2026 · Artificial Intelligence

Why iFlytek Spark X2 Scored 708 on the Gaokao: An In‑Depth Model Analysis

A comprehensive evaluation of domestic large language models on China's Gaokao shows iFlytek Spark X2 tying for top physics scores and leading in history, with its advantage stemming from balanced language understanding, rigorous step‑by‑step reasoning, and a decade‑long education data pipeline.

AI evaluationGaokaoLarge Language Model

0 likes · 11 min read

Why iFlytek Spark X2 Scored 708 on the Gaokao: An In‑Depth Model Analysis

PaperAgent

Jun 26, 2026 · Artificial Intelligence

13 Must-Read Agent Papers from Meituan for ICML'26

This article presents a curated list of thirteen recent research papers on generalist agents—covering visual memory, environment synthesis, value modeling, self‑verification, robustness benchmarks, high‑resolution video generation, long‑horizon world models, and alignment fine‑tuning—along with brief abstracts and links to the PDFs for the upcoming Meituan ICML'26 sharing sessions.

AIAgentBenchmark

0 likes · 16 min read

13 Must-Read Agent Papers from Meituan for ICML'26

Black & White Path

Jun 25, 2026 · Artificial Intelligence

Can DeepSeek‑V4‑Fable’s AI Make Red Teams Redundant?

DeepSeek‑V4‑Fable, an autonomous AI agent built on a Chinese large‑model foundation and refined with SFT and GRPO, achieves a 58.7% overall solve rate on 300 held‑out CTF challenges, prompting a debate on its impact on red‑team workflows and security governance.

AICTFDeepSeek-V4-Fable

0 likes · 9 min read

Can DeepSeek‑V4‑Fable’s AI Make Red Teams Redundant?

Architecture Breakthrough

Jun 25, 2026 · R&D Management

How to Design System Architecture Diagrams with DDD in the AI Era

The article explains how architects can bridge high‑level platform diagrams and concrete implementation by using DDD‑based module functional diagrams that serve as prompts for AI code generation, avoiding low‑level detail while ensuring domain understanding guides development.

AI Prompt EngineeringArchitecture DiagramDDD

0 likes · 4 min read

How to Design System Architecture Diagrams with DDD in the AI Era

Machine Heart

Jun 23, 2026 · Artificial Intelligence

Unlimited OCR Achieves SOTA Long-Document Parsing in a Single Forward Pass

Unlimited OCR, Baidu's open‑source model built on DeepSeek OCR, uses a novel Reference Sliding Window Attention to compress visual tokens and keep KV cache size constant, enabling end‑to‑end parsing of whole books with 93.23% OmniDocBench v1.5 score and stable latency across dozens of pages.

DeepSeekLarge Language ModelLong Document

0 likes · 14 min read

Unlimited OCR Achieves SOTA Long-Document Parsing in a Single Forward Pass

21CTO

Jun 22, 2026 · Industry Insights

How 25‑Year‑Old Founder Michael Truell Built Cursor into an AI Coding Powerhouse

Cursor, the AI‑powered code editor born at MIT, grew to millions of users and billions in valuation within three years, driven by founder Michael Truell’s early programming talent, strategic pivots—including self‑developed Composer model and a deep partnership with SpaceX’s xAI—to escape reliance on third‑party models.

AI codingLarge Language Modelcompute partnership

0 likes · 11 min read

How 25‑Year‑Old Founder Michael Truell Built Cursor into an AI Coding Powerhouse

Machine Learning Algorithms & Natural Language Processing

Jun 20, 2026 · Artificial Intelligence

Musk Says GLM Could Reach Fable Level by Q1 2027—ZhiPu’s Tang Argues It’s Much Sooner

Elon Musk predicted that China’s GLM model would catch up to Anthropic’s Fable by the first quarter of 2027, but ZhiPu’s chief scientist Tang Jie argues the gap is closing much faster, as GLM‑5.2 receives free global compute, tops benchmark leaderboards, and demonstrates open‑source performance rivaling top closed‑source models.

Anthropic FableBenchmarkGLM-5.2

0 likes · 8 min read

Musk Says GLM Could Reach Fable Level by Q1 2027—ZhiPu’s Tang Argues It’s Much Sooner

Old Zhang's AI Learning

Jun 20, 2026 · Artificial Intelligence

Exploring Qwen3.7-Max: Full End-to-End Workflow for Knowledge Mining, Article & Video Generation

The author walks through a complete Qwen3.7-Max workflow—knowledge collection, cross‑document understanding, article creation, and video production—highlighting tips, challenges, and the model's ability to handle 1 M‑token contexts and long‑chain tasks.

AI workflowArticle GenerationKnowledge Extraction

0 likes · 10 min read

Exploring Qwen3.7-Max: Full End-to-End Workflow for Knowledge Mining, Article & Video Generation

ZhiKe AI

Jun 20, 2026 · Artificial Intelligence

How Large Language Models Generate Blur‑Free SVGs by Writing Code

The article explains that because SVG graphics are defined by XML code, large language models can turn natural‑language descriptions into SVG markup, producing vector images that scale without pixelation; it details the four‑step generation process, compares SVG to raster formats, and highlights its suitability for diagrams and charts.

AI graphicsLarge Language ModelSVG

0 likes · 4 min read

How Large Language Models Generate Blur‑Free SVGs by Writing Code

Machine Heart

Jun 19, 2026 · Artificial Intelligence

Hugging Face Funds 6‑Hour Free Compute for GLM‑5.2 as Musk Praises the Model

Hugging Face has pledged six hours of global free compute for the Chinese open‑source LLM GLM‑5.2, a model praised by Elon Musk and benchmarked within 1‑4 % of top closed‑source systems, while its novel IndexShare architecture cuts token‑wise computation by nearly threefold and its MIT‑licensed release fuels China’s rapid ascent in the global AI model landscape.

AI competitionBenchmarkChina AI

0 likes · 8 min read

Hugging Face Funds 6‑Hour Free Compute for GLM‑5.2 as Musk Praises the Model

Ctrip Technology

Jun 18, 2026 · Artificial Intelligence

How Trip.com Cut Multilingual UI QA Costs by 90% with GUI Agent and Multi‑Agent AI

Trip.com built the "慧鉴天工" system that combines a GUI Agent, multi‑agent LQA algorithms, OODA‑loop architecture, and a knowledge‑graph‑enhanced pipeline to automate page collection, multilingual text extraction, and quality inspection across 31 languages, achieving over 90% cost reduction and 70%+ detection accuracy.

GUI AgentLarge Language ModelMulti-Agent

0 likes · 21 min read

How Trip.com Cut Multilingual UI QA Costs by 90% with GUI Agent and Multi‑Agent AI

SuanNi

Jun 17, 2026 · Artificial Intelligence

GLM-5.2 Tops Code Arena Benchmarks and Goes Open Source

GLM-5.2, the newly released open‑source LLM from Zhipu, achieves the #1 ranking on Code Arena’s global blind‑test, supports a 1 million‑token context, introduces architectural innovations like IndexShare and MTP, and delivers competitive benchmark results against leading closed‑source models.

1M token contextBenchmarkGLM-5.2

0 likes · 8 min read

GLM-5.2 Tops Code Arena Benchmarks and Goes Open Source

DataFunSummit

Jun 15, 2026 · Industry Insights

How Data Ontology Powers Digital and Intelligent Penetration Management in Private Funds

Facing a massive scale of assets and strict regulatory demands, a private‑equity platform leveraged ontology‑driven knowledge graphs and large‑model agents to automate high‑frequency reporting, achieve traceable AI decisions, and build a scalable, explainable intelligence layer for fund‑level transparency.

AI automationData GovernanceLarge Language Model

0 likes · 10 min read

How Data Ontology Powers Digital and Intelligent Penetration Management in Private Funds

Top Architect

Jun 15, 2026 · Artificial Intelligence

How One Line of Code Revived Claude Fable 5

A developer used a single prompt‑injection command to load a leaked 120 KB system prompt into Opus 4.8, instantly resurrecting Claude Fable 5 and exposing stark differences in output, while the article also uncovers Amazon’s role in the model’s abrupt shutdown and the broader AI‑security implications.

AI securityAmazonAnthropic

0 likes · 12 min read

How One Line of Code Revived Claude Fable 5

Data Party THU

Jun 14, 2026 · Artificial Intelligence

Stop Misunderstanding AI Agents: A Clear Guide to All Core Concepts

The article defines AI Agent as a system centered on a large model that can invoke tools, receive feedback, and continuously accomplish tasks, and systematically distinguishes related terms such as Model, Scaffolding, Harness, Context Engineering, Policy, Tool, Skill, Sub‑agent, Environment, Rollout, Reward, and Trainer, using concrete examples to clarify each.

AI AgentContext EngineeringHarness

0 likes · 10 min read

Stop Misunderstanding AI Agents: A Clear Guide to All Core Concepts

Design Hub

Jun 13, 2026 · Artificial Intelligence

Claude Fable 5: The AI Model So Powerful It Was Pulled Offline

Claude Fable 5, recently taken offline by a US government request, showcases a leap in AI capability by turning high‑level visual prompts into full‑featured prototypes such as shaders, fluid simulations, games, and UI diagnostics, while also exposing trade‑offs in cost, safety guards, and long‑term usability.

AI AgentsAI safetyAnthropic

0 likes · 15 min read

Claude Fable 5: The AI Model So Powerful It Was Pulled Offline

Machine Heart

Jun 10, 2026 · Artificial Intelligence

Can AI Bridge the College Application Gap? Alibaba’s Free Volunteer‑Filling Agent Tested by 400K AI Candidates

Alibaba’s free Qianwen high‑school volunteer‑filling Agent combines a knowledge base of 3,000 schools, proactive calendar planning, persistent memory and reinforcement‑learning‑trained LLM to guide 12.9 million candidates, and its performance was stress‑tested with 400,000 simulated AI applicants.

AI AgentCollege AdmissionsEducation Technology

0 likes · 10 min read

Can AI Bridge the College Application Gap? Alibaba’s Free Volunteer‑Filling Agent Tested by 400K AI Candidates

DataFunSummit

Jun 10, 2026 · Artificial Intelligence

Why Memory Is the Biggest Challenge for AI Agents and How MemOS Boosts Cloud Calls by Over 200%

The article analyzes how memory limitations hinder AI agents, compares model‑driven and application‑driven approaches, details the five‑layer MemOS architecture, reports cloud service usage growth of 100‑200% with token savings of up to 72%, and shows how MemOS enhances OpenClaw and enterprise deployments.

AI AgentCloud ServiceLarge Language Model

0 likes · 19 min read

Why Memory Is the Biggest Challenge for AI Agents and How MemOS Boosts Cloud Calls by Over 200%

DataFunTalk

Jun 10, 2026 · Artificial Intelligence

Claude Mythos 5 Unleashed: 50 Million Lines of Code Processed in One Day

Anthropic released Claude Fable 5 and Mythos 5, dual‑version LLMs that achieve record‑breaking benchmarks in software engineering, visual reasoning, long‑context tasks and finance, while introducing a safety‑first routing system, token‑efficiency pricing and a limited free‑trial window, reshaping how developers and enterprises interact with powerful AI agents.

AI benchmarksClaudeFable 5

0 likes · 18 min read

Claude Mythos 5 Unleashed: 50 Million Lines of Code Processed in One Day

AI Explorer

Jun 10, 2026 · Artificial Intelligence

Anthropic Unveils Claude Fable 5 and Mythos 5: Layered Release of Powerful, Risky AI

Anthropic released Claude Fable 5 for all users and Claude Mythos 5 for trusted partners, both built on the same base model but with different safety guardrails, showcasing record‑setting benchmarks in code migration, vision, long‑context memory, and highlighting dual‑use risks and a new 30‑day data retention policy.

AI safetyAnthropicBenchmark

0 likes · 10 min read

Anthropic Unveils Claude Fable 5 and Mythos 5: Layered Release of Powerful, Risky AI

AI Engineering

Jun 9, 2026 · Artificial Intelligence

Anthropic Unveils Claude Fable 5: Benchmark Wins and Games You Can Play Now

Anthropic’s Claude Fable 5 and Mythos 5 launch with benchmark‑leading performance across software engineering, knowledge work, vision and long‑context tasks, safety‑graded access, and live demos that generate full video games from a single prompt, while pricing and phased rollout are detailed.

AI benchmarksAI safetyClaude

0 likes · 11 min read

Anthropic Unveils Claude Fable 5: Benchmark Wins and Games You Can Play Now

SuanNi

Jun 9, 2026 · Artificial Intelligence

How Xiaomi’s MiMo‑V2.5‑Pro UltraSpeed Achieves 1 T‑Parameter, 1000 Tokens/s Generation

Xiaomi’s MiMo‑V2.5‑Pro UltraSpeed delivers a 1‑trillion‑parameter model that generates over 1000 tokens per second on a standard 8‑GPU server by combining FP4 quantization, MoE architecture, DFlash decoding and TileRT’s custom execution engine, challenging the need for dedicated ASICs.

DFlashFP4 quantizationLarge Language Model

0 likes · 10 min read

How Xiaomi’s MiMo‑V2.5‑Pro UltraSpeed Achieves 1 T‑Parameter, 1000 Tokens/s Generation

Machine Learning Algorithms & Natural Language Processing

Jun 8, 2026 · Artificial Intelligence

MindLab Unveils 749B Agent-Optimized Macaron‑V1‑Preview Model

MindLab released the 749B‑parameter Macaron‑V1‑Preview, a model engineered for deep Agent‑Harness post‑training that was trained on fewer than 300 GPUs at less than 1% of the compute cost of peer models and achieves SOTA results on multiple Agent‑centric benchmarks such as LivingBench, VitaBench and PinchBench.

Agent HarnessBenchmarkEfficient Training

0 likes · 16 min read

MindLab Unveils 749B Agent-Optimized Macaron‑V1‑Preview Model

DataFunSummit

Jun 8, 2026 · Artificial Intelligence

Agent Architecture in Action: Building Next‑Gen Recommendation and Search Systems

The article reviews cutting‑edge technical practices for next‑generation recommendation and search, covering Alibaba Cloud AI Search's Agentic RAG multi‑agent design, Huawei Noah's LLM‑enhanced recommendation evolution, Baidu's generative ranking (GRAB) for ads, and Elasticsearch‑based vector RAG implementations, with concrete architecture details and performance results.

AI SearchAgentic RAGElasticsearch

0 likes · 6 min read

Agent Architecture in Action: Building Next‑Gen Recommendation and Search Systems

DataFunTalk

Jun 7, 2026 · Artificial Intelligence

ChatGPT’s Dreaming V3 Memory Upgrade: Free for a Billion Users

OpenAI unveiled Dreaming V3, a new memory architecture that lets ChatGPT silently replay and consolidate daily conversations, achieving 82.8% context recall, 71.3% preference compliance, five‑fold compute savings, and free access for billions while offering a transparent memory‑summary interface.

AI memoryChatGPTDreaming V3

0 likes · 9 min read

ChatGPT’s Dreaming V3 Memory Upgrade: Free for a Billion Users

Coder Trainee

Jun 6, 2026 · Artificial Intelligence

What Is an AI Agent? From Large Language Models to Autonomous Agents

This article explains why large language models are powerful yet limited, defines AI agents as autonomous systems that combine a model, memory, tools, and actions, details the ReAct reasoning‑and‑acting loop, provides a 30‑line Python LangChain example and a Java Spring AI implementation, and outlines five practical use‑case scenarios and the roadmap for the series.

AI AgentJavaLangChain

0 likes · 10 min read

What Is an AI Agent? From Large Language Models to Autonomous Agents

DataFunSummit

Jun 6, 2026 · Artificial Intelligence

From Traffic Links to Task Management: 1688’s Agentic AI Evolution

The article details how 1688 transformed its platform from a traditional intent‑matching traffic hub into an Agentic AI system that understands business tasks, outlining a three‑step implementation of knowledge, trajectory and environment redesign, dual‑track evolution, novel evaluation methods, and the emerging role of product managers as evaluation engineers.

Large Language ModelRetrieval Augmented GenerationSkill Hub

0 likes · 13 min read

From Traffic Links to Task Management: 1688’s Agentic AI Evolution

Code Mala Tang

Jun 6, 2026 · Artificial Intelligence

MiniMax M3 Sets New Benchmarks: 1M Context, 59% SWE‑Bench, 9‑15× Faster Multimodal Model

MiniMax unveiled its open‑source M3 model, delivering 1 million‑token context, 59 % SWE‑Bench Pro accuracy that outperforms GPT‑5.5 and Gemini 3.1 Pro, native multimodal desktop interaction, and a 9‑15× speed boost via MiniMax Sparse Attention, with pricing as low as $20 per month.

BenchmarkLarge Language ModelMSA

0 likes · 11 min read

MiniMax M3 Sets New Benchmarks: 1M Context, 59% SWE‑Bench, 9‑15× Faster Multimodal Model

DataFunSummit

Jun 5, 2026 · Artificial Intelligence

Why AI Agents Struggle with Memory and How MemOS Boosts Cloud Calls Over 200%

The article analyzes the critical role of memory for AI agents, compares model‑driven and application‑driven approaches, details the five‑layer MemOS architecture and its three‑layer memory coordination, and shows how MemOS‑powered cloud services achieved a 100‑200% month‑over‑month usage increase while cutting token consumption by up to 72%.

AI AgentCloud ServicesLarge Language Model

0 likes · 18 min read

Why AI Agents Struggle with Memory and How MemOS Boosts Cloud Calls Over 200%

PaperAgent

Jun 4, 2026 · Artificial Intelligence

SkillOpt: Enabling Self‑Evolving Agent Skills via Text‑Space Optimization

SkillOpt reframes LLM agent skills as trainable external state, applying a deep‑learning‑style optimizer to systematically improve skill documents, and demonstrates across six benchmarks, seven models, and three execution modes that this approach yields consistent, large gains and robust transferability.

Agent SkillsLarge Language ModelSelf-Evolving Agents

0 likes · 12 min read

SkillOpt: Enabling Self‑Evolving Agent Skills via Text‑Space Optimization

JD Tech Talk

Jun 3, 2026 · Artificial Intelligence

JoySafety: Open-Source Large Model Security Framework Joins Open Atom Foundation

In May 2026 the Open Atom Open Source Foundation announced JoySafety, an Apache‑2.0‑licensed, four‑layer large‑model security framework that delivers sub‑50 ms detection, over 95% attack interception, and supports 1B‑20B parameter models across cloud, edge, and device deployments.

AI safetyApache 2.0Generative AI

0 likes · 4 min read

JoySafety: Open-Source Large Model Security Framework Joins Open Atom Foundation

Java Architect Handbook

Jun 3, 2026 · Artificial Intelligence

What Is Retrieval‑Augmented Generation (RAG) and Why It Matters for LLM Interviews

The article explains Retrieval‑Augmented Generation (RAG), why large language models suffer from hallucination, knowledge cutoff, domain gaps and traceability issues, and how RAG’s offline‑online pipeline, comparison with fine‑tuning and long‑context approaches, and emerging trends like Agentic and Graph‑RAG can be discussed in technical interviews.

AI interviewLarge Language ModelRAG

0 likes · 12 min read

What Is Retrieval‑Augmented Generation (RAG) and Why It Matters for LLM Interviews

Baobao Algorithm Notes

Jun 2, 2026 · Artificial Intelligence

MiniMax M3: How a 1M‑Token, Multimodal Agent Reproduces ICLR Research and Automates Kaggle Competitions

The MiniMax M3 model combines a 1‑million‑token context window, native multimodal training and a new MiniMax Sparse Attention architecture that cuts token compute to one‑twentieth of its predecessor, achieving up to 15× faster decoding, while its interactive user‑simulator training enables fully autonomous agents that can reproduce ICLR‑2025 research and tackle Auto‑Kaggle competitions at a fraction of the cost of Western models.

Auto KaggleLarge Language ModelM3

0 likes · 9 min read

MiniMax M3: How a 1M‑Token, Multimodal Agent Reproduces ICLR Research and Automates Kaggle Competitions

Old Zhang's AI Learning

Jun 1, 2026 · Artificial Intelligence

NVIDIA Unveils Nemotron 3 Ultra: The Largest US Open‑Source LLM Boosting Agent Capabilities

NVIDIA released Nemotron 3 Ultra, a 550 B‑parameter open‑source LLM with 55 B active MoE parameters, hybrid Mamba‑Transformer architecture, 1 M token context, and three core innovations that deliver superior MMLU, code, math scores and up to 5× throughput versus rivals, though weights are not yet public.

BenchmarkLarge Language ModelMamba

0 likes · 8 min read

NVIDIA Unveils Nemotron 3 Ultra: The Largest US Open‑Source LLM Boosting Agent Capabilities

HyperAI Super Neural

Jun 1, 2026 · Artificial Intelligence

AI‑Computational Chemistry Workflow Cuts Diabetic Wound‑Healing Drug R&D Time by 70%

Singapore’s National University of Singapore presents an AI‑computational chemistry (AI‑CC) pipeline that integrates large‑language‑model literature mining with multi‑stage molecular simulations, enabling a closed‑loop analysis of drug‑protein nanoscale interactions, accelerating diabetic wound‑healing drug repurposing and shortening the development cycle by more than 70%.

AI‑CCLarge Language Modelcomputational chemistry

0 likes · 12 min read

AI‑Computational Chemistry Workflow Cuts Diabetic Wound‑Healing Drug R&D Time by 70%

SuanNi

Jun 1, 2026 · Artificial Intelligence

MiniMax M3 Beats GPT‑5.5 in Programming and Goes Open‑Source

MiniMax M3, a domestically developed LLM, combines a new sparse‑attention MSA architecture, native multimodal support, and million‑token context to match or surpass top closed‑source models in programming and agent benchmarks, while achieving a 9.4× speedup on FP8 GEMM and preparing for open‑source release.

AIFP8 GEMMLarge Language Model

0 likes · 12 min read

MiniMax M3 Beats GPT‑5.5 in Programming and Goes Open‑Source

Machine Heart

May 28, 2026 · Artificial Intelligence

Claude Opus 4.8 Arrives with Higher Honesty and Record‑Breaking Valuation

Anthropic unveiled Claude Opus 4.8, a flagship LLM that improves benchmark scores, introduces honesty training and dynamic workflows, offers unchanged pricing with a cheaper fast mode, and announced a $65 billion financing round that lifted its valuation to $965 billion.

AI alignmentAnthropicClaude Opus 4.8

0 likes · 9 min read

Claude Opus 4.8 Arrives with Higher Honesty and Record‑Breaking Valuation

AI Engineering

May 28, 2026 · Artificial Intelligence

Anthropic Unveils Claude Opus 4.8: Same Price, Agent Power Beats GPT‑5.5

Anthropic released Claude Opus 4.8 with unchanged pricing, new inference‑strength controls, Dynamic Workflows for massive tasks, a fast mode 2.5× quicker and three‑times cheaper, and benchmark results showing its agent capabilities surpass GPT‑5.5 while improving honesty and alignment.

AI AgentsAnthropicClaude Opus 4.8

0 likes · 12 min read

Anthropic Unveils Claude Opus 4.8: Same Price, Agent Power Beats GPT‑5.5

IoT Full-Stack Technology

May 28, 2026 · Artificial Intelligence

What Exactly Is an AI Agent? A Simple Guide vs. Regular Chatbots

An AI Agent combines a large language model with a clear goal, callable tools, and a multi‑step reasoning loop, enabling perception, planning, and action that go beyond simple chat by decomposing tasks, using external APIs, iterating on errors, and managing memory, while acknowledging its limitations.

AI AgentLarge Language ModelPerception-Planning-Action

0 likes · 7 min read

What Exactly Is an AI Agent? A Simple Guide vs. Regular Chatbots

Machine Learning Algorithms & Natural Language Processing

May 28, 2026 · Artificial Intelligence

Open‑Source 35B Intern‑S2‑Preview Rivals Trillion‑Parameter Models on Scientific Benchmarks

The open‑source 35‑billion‑parameter Intern‑S2‑Preview model achieves scientific‑task performance comparable to trillion‑parameter models, thanks to full‑link “general‑specialized” training, reinforced‑learning scaling, and hardware‑aware optimizations, and it outperforms leading closed‑source models on benchmarks such as MolecularIQ and crystal‑structure generation.

BenchmarkInternLMLarge Language Model

0 likes · 11 min read

Open‑Source 35B Intern‑S2‑Preview Rivals Trillion‑Parameter Models on Scientific Benchmarks

vivo Internet Technology

May 27, 2026 · Artificial Intelligence

Deploying an AI‑Powered Shopping Guide on the Vivo Official Site

This article details the end‑to‑end implementation of an AI shopping guide on the Vivo official website, covering problem definition, multi‑layer architecture, technology selection, data synthesis, FastText intent‑recognition model training, prompt engineering, RAG‑augmented retrieval, structured output, safety testing, and the resulting business impact.

AIChatbotKnowledge Base

0 likes · 27 min read

Deploying an AI‑Powered Shopping Guide on the Vivo Official Site

SuanNi

May 26, 2026 · Artificial Intelligence

Why Tokens Are Burning Out and a Free Claude Opus 4.6‑Level Model Is Coming

The SkyClaw‑v1.0 model from Skywork AI offers a free, soon‑to‑be open‑source large‑language model for agent applications that matches Claude Opus 4.6 in performance while cutting token costs dramatically, and the article details its benchmarks, training pipeline, and deployment recommendations.

AgentBenchmarkLarge Language Model

0 likes · 7 min read

Why Tokens Are Burning Out and a Free Claude Opus 4.6‑Level Model Is Coming

HyperAI Super Neural

May 26, 2026 · Artificial Intelligence

Robin Integrates 550 Papers in 30 min, Closing the AI‑Driven Research Loop and Discovering dAMD Therapies

The Robin multi‑agent system combines literature mining, hypothesis generation, and experimental data analysis into a continuous AI‑driven workflow, integrating 550 papers in 30 minutes, benchmarking on the BixBench suite, and uncovering a ROCK‑inhibitor and the glaucoma drug Lipasudil as promising treatments for dry‑age‑related macular degeneration.

Large Language Modelautomated hypothesis generationbiomedical AI

0 likes · 14 min read

Robin Integrates 550 Papers in 30 min, Closing the AI‑Driven Research Loop and Discovering dAMD Therapies

Machine Heart

May 26, 2026 · Artificial Intelligence

Grok Survives xAI Shutdown with 1.5‑T V9‑Medium Model – Musk Announces

After xAI’s dissolution, Elon Musk revealed that the new Grok V9‑Medium model, a 1.5‑trillion‑parameter foundation model optimized for Blackwell GPUs and enriched with Cursor data, has completed training, will undergo fine‑tuning and reinforcement learning, and is slated for public release within weeks, while the older 0.5‑T model will be open‑sourced later this year.

AI AgentBlackwell GPUCursor data

0 likes · 6 min read

Grok Survives xAI Shutdown with 1.5‑T V9‑Medium Model – Musk Announces

Machine Heart

May 26, 2026 · Artificial Intelligence

AI‑Written Training Framework Powers 1B‑Parameter MiniCPM5 for Edge AI

The article analyzes MiniCPM5‑1B, a 1‑billion‑parameter edge‑friendly language model whose training framework, ForgeTrain, was generated entirely by AI, achieving Megatron‑level quality with 10% faster speed and enabling low‑cost, low‑latency deployment on devices ranging from laptops to smartphones.

AI training frameworkData GovernanceForgeTrain

0 likes · 16 min read

AI‑Written Training Framework Powers 1B‑Parameter MiniCPM5 for Edge AI

ZhiKe AI

May 26, 2026 · Artificial Intelligence

ChatGPT Only Answers, Agents Get Things Done: Understanding AI Digital Employees

The article explains that AI Agents combine LLMs, memory, planning, and tool access to act autonomously on tasks—unlike ChatGPT’s passive answering—while highlighting industry momentum in 2025 and the four core capabilities that make them true digital employees.

AI AgentAI toolsLarge Language Model

0 likes · 8 min read

ChatGPT Only Answers, Agents Get Things Done: Understanding AI Digital Employees

dbaplus Community

May 25, 2026 · Artificial Intelligence

Claude AI Elevates DeWu’s Financial Data Warehouse to Full-Chain Efficiency

The article analyzes how Claude large‑model AI is applied to DeWu’s financial data warehouse, detailing the domain’s unique challenges, the model’s three core capabilities, practical use‑cases such as OneData standardised modelling, AI‑assisted SQL coding and automated data testing, and the resulting efficiency, quality and reusability gains.

AI codingClaude AIData Testing

0 likes · 21 min read

Claude AI Elevates DeWu’s Financial Data Warehouse to Full-Chain Efficiency

Machine Heart

May 25, 2026 · Artificial Intelligence

How DeepMind’s AI Solved Nine Erdős Problems for Only a Few Hundred Dollars Each

DeepMind’s AlphaProof Nexus framework enabled an AI agent to automatically prove and verify nine long‑standing Erdős conjectures at a cost of only a few hundred dollars per problem, using a simple “think‑try” loop and a more advanced multi‑agent evolution architecture, and demonstrating a shift toward leveraging raw large‑model reasoning for formal mathematics.

AI researchAlphaProof NexusDeepMind

0 likes · 11 min read

How DeepMind’s AI Solved Nine Erdős Problems for Only a Few Hundred Dollars Each

DaTaobao Tech

May 25, 2026 · Artificial Intelligence

Scaling to Ten‑Thousand QPS: Lessons from Building a Real‑Time Product‑Domain Agent

The article details how the product team tackled AI‑driven challenges by designing a two‑layer, event‑driven Function‑Centric Agent architecture that unifies workflow orchestration and capability supply, enabling real‑time inference for billions of items, cutting development cycles to one person‑week, and boosting search conversion rates.

AI AgentAIFunctionFunction Calling

0 likes · 29 min read

Scaling to Ten‑Thousand QPS: Lessons from Building a Real‑Time Product‑Domain Agent

Golang Shines

May 25, 2026 · Industry Insights

When a 60‑Year‑Old Wins 100 Million AI Tokens in a Lobster Contest: What “Token” Really Means

A 60‑year‑old fisherman in Anhui won a prize of 100 million AI tokens, mistook it for a scam, and sparked a discussion about the official Chinese term “词元”, its nationwide adoption, massive token usage statistics, and what this reveals about AI penetration in everyday life.

AI adoptionAI tokenChina AI policy

0 likes · 5 min read

When a 60‑Year‑Old Wins 100 Million AI Tokens in a Lobster Contest: What “Token” Really Means

Big Data Tech Team

May 25, 2026 · Artificial Intelligence

Mastering Data Agent: A Complete End‑to‑End Guide from Basics to Pro

This article breaks down the concept of a Data Agent that automates the entire traditional data‑analysis pipeline, explains its three‑layer architecture, the ReAct reasoning loop, multi‑agent collaboration, six practical use cases, and offers deployment recommendations for teams looking to adopt AI‑driven data workflows.

AIBIData Agent

0 likes · 18 min read

Mastering Data Agent: A Complete End‑to‑End Guide from Basics to Pro

Machine Heart

May 24, 2026 · Artificial Intelligence

Proactive Failure Recovery: How AgentChord Embeds Recovery Actions into Robot Task Graphs

AgentChord, a system presented at RSS 2026, anticipates potential robot manipulation failures by embedding recovery actions directly into a structured task graph, enabling immediate low‑latency switches to pre‑compiled recovery branches and achieving up to 99.2% success in simulated tasks and 77.5% on real robots.

Failure RecoveryLarge Language ModelRobotics

0 likes · 13 min read

Proactive Failure Recovery: How AgentChord Embeds Recovery Actions into Robot Task Graphs

Machine Heart

May 23, 2026 · Industry Insights

DeepSeek Secures $10B Funding and Slashes API Prices by 75%

DeepSeek announced a permanent 75% API price cut, positioning its rates below GPT‑5.5 and Claude Opus 4.7, while simultaneously raising up to $10 billion in financing and launching a new Harness team to productize its V4 Pro model for developers.

AGIAI financingAI pricing

0 likes · 6 min read

DeepSeek Secures $10B Funding and Slashes API Prices by 75%

SuanNi

May 22, 2026 · Artificial Intelligence

Why Qwen3.7-Max Is Sending Overseas Developers Into a Frenzy

Qwen3.7-Max demonstrates product‑level long‑task autonomy with 35 hours of uninterrupted operation, 1,158 tool calls, and kernel‑level optimizations, while outperforming Gemini 3.5‑Flash, Claude Opus, and GPT‑5.5 across a wide range of benchmarks, cost‑effectiveness, and real‑world agent scenarios.

AIAgentBenchmark

0 likes · 11 min read

Why Qwen3.7-Max Is Sending Overseas Developers Into a Frenzy

SuanNi

May 22, 2026 · Artificial Intelligence

How GLM‑5.1‑highspeed Achieves 7× Faster Inference to Become the World’s Fastest Flagship Model

On May 22, Zhipu launched the GLM‑5.1‑highspeed API, delivering 400 tokens per second—about 7× faster than the original model and twice as fast as Gemini 3.5 Flash—through a three‑layer optimization that rewrites the MoE inference path, introduces dynamic scheduling, and leverages TileRT’s AOT engine to cut latency while preserving full flagship capabilities.

GLM-5.1Inference OptimizationLarge Language Model

0 likes · 10 min read

How GLM‑5.1‑highspeed Achieves 7× Faster Inference to Become the World’s Fastest Flagship Model

Machine Learning Algorithms & Natural Language Processing

May 22, 2026 · Artificial Intelligence

20‑Year‑Old Transformer Co‑author Open‑Sources a 218‑Billion‑Parameter Model

Cohere’s Command A+ model, built by Transformer co‑author Aidan Gomez and backed by Nick Frosst, packs 218 billion parameters but activates only 25 billion at inference, uses a lossless 4‑bit quantization scheme, offers native citation support, runs on a single B200 or two H100 GPUs, and is released under an Apache 2.0 license, marking a major shift toward truly open‑source, enterprise‑ready large language models.

AIApache 2.0Cohere

0 likes · 12 min read

20‑Year‑Old Transformer Co‑author Open‑Sources a 218‑Billion‑Parameter Model

Machine Heart

May 21, 2026 · Artificial Intelligence

AI Cracks 80-Year-Old Erdős Unit Distance Problem

OpenAI’s general‑purpose large language model independently disproved the Erdős unit‑distance conjecture, introducing a novel algebraic‑number‑theory construction that outperforms the long‑standing square‑grid approach and reshapes how AI can contribute to deep mathematical research.

AIErdős unit distance problemLarge Language Model

0 likes · 9 min read

AI Cracks 80-Year-Old Erdős Unit Distance Problem

Mike Chen's Internet Architecture

May 21, 2026 · Artificial Intelligence

Demystifying AI Large Models: Architecture, Principles, and Workflow

The article explains that large language models are massive probability engines built on the Transformer architecture with self‑attention, trained through costly pre‑training on trillions of tokens, then refined by instruction fine‑tuning and RLHF, ultimately predicting the next token to generate text.

Large Language ModelRLHFSelf-Attention

0 likes · 5 min read

Demystifying AI Large Models: Architecture, Principles, and Workflow

AI Large-Model Wave and Transformation Guide

May 21, 2026 · Artificial Intelligence

A Complete Guide to Data Agent: From Basics to Advanced Workflow

The article explains what a Data Agent is, its three‑layer architecture, the ReAct reasoning framework, step‑by‑step workflow for natural‑language queries, multi‑agent collaboration, practical use cases, and recommendations for adopting Data Agent in data‑driven teams.

AIData AgentLarge Language Model

0 likes · 19 min read

A Complete Guide to Data Agent: From Basics to Advanced Workflow

IT Services Circle

May 20, 2026 · Artificial Intelligence

Google I/O 2026 Unveils Gemini Omni and Gemini 3.5 Flash – A Leap in Multimodal AI

At Google I/O 2026 the company introduced Gemini Omni, a truly multimodal model that can ingest any combination of text, image, audio or video and generate high‑quality content, and Gemini 3.5 Flash, which outperforms Gemini 3.1 Pro across major benchmarks while delivering four‑times faster token throughput, alongside the new Antigravity 2.0 agent platform and the Gemini Spark personal AI assistant.

AI generationAgent PlatformBenchmark

0 likes · 13 min read

Google I/O 2026 Unveils Gemini Omni and Gemini 3.5 Flash – A Leap in Multimodal AI

Old Zhang's AI Learning

May 20, 2026 · Artificial Intelligence

Qwen 3.7‑Max vs Claude 4.7: 7 In‑Depth Tests Reveal a Smooth, Powerful Model

The author evaluates Alibaba’s newly released Qwen 3.7‑Max across seven rigorous tasks—including reading comprehension, HTML fireworks generation, 3D particle visualizations, PDF‑to‑PPT conversion, Excel data analysis, GitHub trending scraping, and complex video generation—showing it often surpasses GPT‑5.5‑level models and rivals Claude 4.7, especially in long‑duration agent tasks.

AI benchmarkAgentClaude 4.7

0 likes · 9 min read

Qwen 3.7‑Max vs Claude 4.7: 7 In‑Depth Tests Reveal a Smooth, Powerful Model

Machine Heart

May 20, 2026 · Artificial Intelligence

Qwen3.7-Max Sets New Agent Benchmarks – China’s New Model King

Alibaba’s Qwen3.7‑Max model tops multiple Arena leaderboards, achieves SOTA scores in programming, reasoning, and multilingual benchmarks, runs a 35‑hour autonomous coding task on a custom AI chip with 10× speedup, and demonstrates end‑to‑end desktop app creation and web‑search agents, illustrating a rapid monthly model‑iteration strategy.

AI chipAgentAlibaba

0 likes · 13 min read

Qwen3.7-Max Sets New Agent Benchmarks – China’s New Model King

Machine Learning Algorithms & Natural Language Processing

May 20, 2026 · Artificial Intelligence

Composer 2.5 Narrows the Gap to Claude Opus 4.7 with Ten‑Fold Cost Savings

Composer 2.5, the latest AI‑coding model from Cursor, claims near‑par performance with Claude 4.7 Opus and GPT‑5.5 while delivering up to ten‑times higher efficiency and a pricing model of $0.5 per M input tokens and $2.5 per M output tokens, backed by novel reinforcement‑learning tricks, massive synthetic data, and a custom Muon optimizer with dual‑grid HSDP architecture.

AI programmingComposer 2.5HSDP

0 likes · 13 min read

Composer 2.5 Narrows the Gap to Claude Opus 4.7 with Ten‑Fold Cost Savings

SuanNi

May 19, 2026 · Artificial Intelligence

Qwen 3.7 Debuts: Ranks 13th Globally and Tops China’s Model Leaderboard

Qwen 3.7‑Max‑Preview secures the 13th spot worldwide and the top position among Chinese models, while Qwen 3.7‑Plus‑Preview ranks 16th in vision, highlighting an accelerated release cadence, deeper technical depth across sub‑tasks, and a shift in China’s large‑model competition toward ecosystem control.

AI competitionChina AILarge Language Model

0 likes · 9 min read

Qwen 3.7 Debuts: Ranks 13th Globally and Tops China’s Model Leaderboard

DataFunTalk

May 19, 2026 · Artificial Intelligence

Qwen 3.7 Max Preview Lands: Rapid Dual‑Model Iteration Keeps China’s Lead in Text and Vision

The Qwen 3.7‑Max and Qwen 3.7‑Plus preview models debut with top‑15 global rankings in Arena, the only Chinese models in text and vision leaderboards, while a timeline analysis shows the Qwen series accelerating from 4‑6‑month releases to a 2‑3‑month cadence and introducing dense and MoE variants up to 235 B parameters.

AI benchmarkChinese AILarge Language Model

0 likes · 6 min read

Qwen 3.7 Max Preview Lands: Rapid Dual‑Model Iteration Keeps China’s Lead in Text and Vision

Machine Heart

May 17, 2026 · Artificial Intelligence

Why Do Large Language Models Speak and Reason Like Humans? An In‑Depth Look at Their Mechanisms

This article examines how large language models acquire human‑like language and reasoning abilities by learning statistical patterns, employing next‑token prediction, feature superposition, sparse autoencoders, and function‑token memory mechanisms, and compares their internal processes with human cognition, highlighting both breakthroughs and remaining limitations.

Artificial IntelligenceFeature SuperpositionLLM Interpretability

0 likes · 24 min read

Why Do Large Language Models Speak and Reason Like Humans? An In‑Depth Look at Their Mechanisms

DataFunTalk

May 16, 2026 · Artificial Intelligence

How Knora Combines Ontology and Large Models to Overcome AI Hallucinations and Execution Gaps in Enterprises

The article explains how YueDian Technology's Knora 4.0 platform fuses domain ontologies with large‑model AI to create a unified, trustworthy, and autonomous enterprise AI system that addresses hallucination, data integration, and execution challenges across complex business scenarios.

AI platformAutonomous AgentsEnterprise AI

0 likes · 14 min read

How Knora Combines Ontology and Large Models to Overcome AI Hallucinations and Execution Gaps in Enterprises

Machine Heart

May 15, 2026 · Artificial Intelligence

How X2SAM Empowers Multimodal Models to Segment Images and Videos at Pixel Level

X2SAM is a unified multimodal large model that combines image and video segmentation with language and visual prompts, introduces a Mask Memory for temporal consistency, defines a new V‑VGD task, and achieves state‑of‑the‑art results while cutting training cost by over 30%.

Large Language ModelV-VGDX2SAM

0 likes · 9 min read

How X2SAM Empowers Multimodal Models to Segment Images and Videos at Pixel Level

Xiaomi Tech

May 14, 2026 · Artificial Intelligence

500 M Videos Yield the Largest Open‑Source GUI Dataset; 3B Model Cuts Inference Tokens 71% and Beats Larger Models (Xiaomi AI at ICML 2026)

Xiaomi’s AI team extracted 5 billion video frames to create the world’s largest open‑source GUI dataset, demonstrated that a 3 B‑parameter model can reduce inference tokens by 71% while surpassing larger models, and presented a suite of ICML 2026 papers covering data scaling, benchmarking, reasoning, multimodal perception, and training stability for GUI agents and other AI tasks.

GUI AgentLarge Language Modelbenchmarking

0 likes · 21 min read

500 M Videos Yield the Largest Open‑Source GUI Dataset; 3B Model Cuts Inference Tokens 71% and Beats Larger Models (Xiaomi AI at ICML 2026)

Black & White Path

May 13, 2026 · Information Security

AI‑Powered 0‑Day Discovery: How Attackers Autonomously Bypassed 2FA

In May 2026, Google Threat Intelligence disclosed that a cybercrime group used a large‑language model to autonomously identify a semantic‑logic flaw in a popular open‑source Python‑based web management tool, generate a Python exploit that bypasses its two‑factor authentication, and launch mass automated attacks, prompting new blue‑team detection and defense strategies.

0-day2FA bypassAI security

0 likes · 12 min read

AI‑Powered 0‑Day Discovery: How Attackers Autonomously Bypassed 2FA

SuanNi

May 12, 2026 · Artificial Intelligence

AntAngelMed: 6.1B‑Activated MoE Model Tops Three Medical Benchmarks

AntAngelMed, a 100‑billion‑parameter medical LLM using a 6.1 billion‑parameter MoE architecture, achieves performance comparable to a 40 billion‑parameter dense model, exceeds 200 tokens/s inference speed, and ranks first on HealthBench, MedAIBench and MedBench, with a three‑stage training pipeline and extensive efficiency optimizations.

HealthBenchLarge Language ModelMedAIBench

0 likes · 6 min read

AntAngelMed: 6.1B‑Activated MoE Model Tops Three Medical Benchmarks

Airbnb Technology Team

May 12, 2026 · Frontend Development

How Airbnb Migrated 3.5K Enzyme Tests to React Testing Library in Six Weeks Using LLM‑Powered Automation

Airbnb transformed nearly 3,500 Enzyme test files to React Testing Library in just six weeks by building a large‑language‑model‑driven pipeline that validates, rewrites, retries, and enriches prompts with extensive context, achieving a 97% migration success rate while dramatically cutting manual effort and cost.

AirbnbEnzymeLarge Language Model

0 likes · 11 min read

How Airbnb Migrated 3.5K Enzyme Tests to React Testing Library in Six Weeks Using LLM‑Powered Automation

Old Zhang's AI Learning

May 11, 2026 · Artificial Intelligence

Open‑Source Qwen3.6‑35B‑A3B Runs at 162 tok/s on a Single RTX 5090

The article introduces the open‑source Qwen3.6‑35B‑A3B model, explains its MoE architecture, three‑stage LoRA fine‑tuning, shows benchmark results where it achieves 161.9 tok/s on an RTX 5090—2.6× faster than a dense 27B counterpart—and discusses deployment tips, quantized GGUF release, and known compatibility pitfalls.

GGUF quantizationLarge Language ModelLoRA fine-tuning

0 likes · 7 min read

Open‑Source Qwen3.6‑35B‑A3B Runs at 162 tok/s on a Single RTX 5090

SuanNi

May 10, 2026 · Artificial Intelligence

How HTML Beats Markdown for Better AI Communication and Collaboration

The article argues that while Markdown has served as a convenient intermediate language for large language models, generating HTML output unlocks richer visual presentation, interactive controls, and easier sharing, albeit at the cost of higher token usage and more complex version control.

AI interactionHTMLLarge Language Model

0 likes · 9 min read

How HTML Beats Markdown for Better AI Communication and Collaboration

Data Party THU

May 10, 2026 · Artificial Intelligence

SpikingBrain 2.0 Breaks Long‑Sequence and Low‑Power Bottlenecks in Brain‑Inspired LLMs

The Chinese Academy of Sciences unveils SpikingBrain 2.0‑5B, a brain‑inspired large model that uses dual‑space sparse attention and dual activation (FP8 and INT8‑Spiking) to cut training cost by over tenfold, achieve up to 15× speedup on long sequences, and match Qwen‑3 performance while drastically reducing power consumption.

Large Language ModelSparse attentionSpikingBrain2.0

0 likes · 10 min read

SpikingBrain 2.0 Breaks Long‑Sequence and Low‑Power Bottlenecks in Brain‑Inspired LLMs

DataFunSummit

May 8, 2026 · Artificial Intelligence

Agent Architecture in Action: Building Next‑Gen Recommendation and Search Systems

This article reviews cutting‑edge AI search and recommendation technologies, covering Alibaba Cloud's Agentic RAG architecture, Huawei Noah's LLM‑enhanced recommendation pipeline, and Baidu's generative ranking model GRAB, while detailing their design challenges, multi‑modal retrieval strategies, performance gains, and real‑world deployment results.

AI SearchAgentic RAGGenerative Ranking

0 likes · 6 min read

java1234

May 7, 2026 · Artificial Intelligence

Why the Claude Code ‘CLAUDE.md’ Ruleset Earned Over 91K Stars

The article analyzes the forrestchang/andrej-karpathy-skills GitHub repository, whose CLAUDE.md file provides project‑level behavior rules for Claude Code, explains the four core principles, why it attracted more than 91 000 stars, how to integrate it, its trade‑offs, and suitable teams.

AI coding guidelinesCLAUDE.mdClaude Code

0 likes · 7 min read

Why the Claude Code ‘CLAUDE.md’ Ruleset Earned Over 91K Stars

Su San Talks Tech

May 7, 2026 · Artificial Intelligence

DeepSeek’s New Claude‑Code‑Style Terminal Agent: An Open‑Source Rust Project

An open‑source Rust‑based terminal agent for DeepSeek V4, dubbed DeepSeek‑TUI, offers Claude‑Code‑like capabilities such as file manipulation, shell execution, git management, parallel sub‑task scheduling, side‑git rollback, and LSP diagnostics, and has quickly attracted thousands of stars and active community contributions.

AI codingDeepSeekLSP

0 likes · 5 min read

DeepSeek’s New Claude‑Code‑Style Terminal Agent: An Open‑Source Rust Project

DataFunSummit

May 6, 2026 · Artificial Intelligence

Inside 1688’s Inference‑Based Recommendation System: Architecture, Challenges, and Future Directions

This article details how Alibaba 1688 tackles the “information cocoon” problem by deploying large‑model inference‑based recommendation, describing its three‑layer architecture, multi‑stage user demand analysis, long‑cycle behavior compression, prompt engineering, trend mining, near‑line serving, and future enhancements.

Large Language Modelbehavior compressione-commerce

0 likes · 23 min read

Inside 1688’s Inference‑Based Recommendation System: Architecture, Challenges, and Future Directions

DataFunSummit

May 5, 2026 · Artificial Intelligence

How Huawei Noah’s KAR Project Leverages LLMs to Advance Recommendation Systems

The article reviews the evolution of recommendation systems from deep learning to large language models, analyzes core challenges such as noisy implicit feedback and limited semantic understanding, and details Huawei Noah’s KAR solution that uses factorized prompting, multi‑expert adapters, and AI‑Agent architectures to achieve a 1.5% AUC lift and validated online A/B test results.

AI AgentAUCHuawei

0 likes · 5 min read

How Huawei Noah’s KAR Project Leverages LLMs to Advance Recommendation Systems

Architects' Tech Alliance

May 4, 2026 · Artificial Intelligence

How DeepSeek‑TUI Scored 2.3k GitHub Stars and Won Over Chinese “Whale Brothers”

DeepSeek‑TUI, a Rust‑based terminal coding agent built on DeepSeek‑V4’s 1‑million‑token context, exploded on GitHub with 2.3k stars by offering lightweight installation, multi‑model RLM acceleration, Chinese localization, and cost‑effective flash inference, while its creator’s unconventional background and timely market trends fueled its viral success.

AI codingDeepSeekLarge Language Model

0 likes · 6 min read

How DeepSeek‑TUI Scored 2.3k GitHub Stars and Won Over Chinese “Whale Brothers”

Old Zhang's AI Learning

May 3, 2026 · Artificial Intelligence

Alibaba’s Qwen‑Scope: A Brain‑Computer Interface for Qwen‑3.5‑27B

Qwen‑Scope adds a sparse autoencoder (SAE) to the Qwen‑3.5‑27B model, exposing a top‑K 50‑feature, residual‑stream hook across all 64 layers for interpretability, controllable generation, data analysis, and training diagnostics, while detailing installation, usage, and practical trade‑offs.

Large Language ModelQwenSAE

0 likes · 11 min read

Alibaba’s Qwen‑Scope: A Brain‑Computer Interface for Qwen‑3.5‑27B

DataFunTalk

May 2, 2026 · Industry Insights

Why Palantir’s Ontology Fuels Its Valuation: The Skeleton and Memory Behind AI

In a 90‑minute round‑table, experts from banking risk control and cloud observability explain how Palantir’s ontology bridges three data gaps, turns raw logs into a graph of entities and relationships, and works with large models as a skeleton and memory to make AI trustworthy and scalable.

AI trustworthinessDigital TwinLarge Language Model

0 likes · 16 min read

Why Palantir’s Ontology Fuels Its Valuation: The Skeleton and Memory Behind AI

IT Services Circle

May 1, 2026 · Artificial Intelligence

GPT’s Father Sends AI Back to 1930: An AI That Writes Python Without Seeing Code

Alec Radford’s team released Talkie, a 13‑billion‑parameter LLM trained exclusively on pre‑1931 texts (2600 billion tokens), which surprisingly can generate correct Python programs via few‑shot learning, demonstrating genuine reasoning rather than mere memorisation, and the article details its experiments, data‑quality challenges, comparative performance, and ambitious scaling roadmap.

Large Language ModelModel ScalingOCR data quality

0 likes · 8 min read

GPT’s Father Sends AI Back to 1930: An AI That Writes Python Without Seeing Code

Su San Talks Tech

May 1, 2026 · Artificial Intelligence

Xiaomi Unveils 1.02‑Trillion‑Parameter MiMo 2.5 Model – Token Grant Guide and Real‑World Benchmarks

Xiaomi has launched the MiMo 2.5 series, featuring a 1.02‑trillion‑parameter MoE model with 1 M‑token context, offers a token‑grant program for developers, and delivers benchmark scores that rival leading models such as DeepSeek‑V4‑Pro, Kimi K2, GPT‑5 and Gemini 3.0.

AIBenchmarkLarge Language Model

0 likes · 9 min read

Xiaomi Unveils 1.02‑Trillion‑Parameter MiMo 2.5 Model – Token Grant Guide and Real‑World Benchmarks

Architects' Tech Alliance

May 1, 2026 · Artificial Intelligence

How DeepSeek V4 Triggers a Global AI Price War with OpenAI

DeepSeek V4’s open‑source 1 M‑token MoE model delivers benchmark scores of MMLU 88.7, C‑Eval 92.1 and HumanEval 69.5, while its 4‑bit AWQ quantization, PagedAttention memory management and FlashAttention acceleration cut inference costs and latency, prompting rivals such as Anthropic, OpenAI, Baidu and Huawei to slash prices and boost efficiency in a fierce market battle.

AI efficiencyDeepSeek-V4Large Language Model

0 likes · 9 min read

How DeepSeek V4 Triggers a Global AI Price War with OpenAI

Old Meng AI Explorer

Apr 30, 2026 · Artificial Intelligence

How to Use Kimi K2.6 for Free: The Open‑Source Chinese LLM That Beats Top Models

The article provides a deep technical overview of Kimi K2.6—including its MoE architecture, benchmark superiority over GPT‑5.4 and Claude Opus, six free‑access channels, practical usage tips, and real‑world scenarios—so developers can evaluate and adopt the model without cost.

Agent SwarmBenchmarkFree API

0 likes · 13 min read

How to Use Kimi K2.6 for Free: The Open‑Source Chinese LLM That Beats Top Models

Machine Heart

Apr 30, 2026 · Artificial Intelligence

Beyond DeepSeek V4: A Trillion‑Parameter LLM Trained End‑to‑End on Domestic Chips

The article analyzes how both DeepSeek V4 and Meituan's LongCat‑2.0‑P preview, each with trillion‑scale parameters and 1 M‑token context, were trained and inferred entirely on Chinese‑made accelerators, detailing memory optimizations, deterministic operators, MoE redesigns, and massive multi‑card clusters that prove domestic compute can meet top‑tier AI workloads.

Deterministic OpsDomestic AI ChipLarge Language Model

0 likes · 13 min read

Beyond DeepSeek V4: A Trillion‑Parameter LLM Trained End‑to‑End on Domestic Chips