DataFunTalk
Author

DataFunTalk

Dedicated to sharing and discussing big data and AI technology applications, aiming to empower a million data scientists. Regularly hosts live tech talks and curates articles on big data, recommendation/search algorithms, advertising algorithms, NLP, intelligent risk control, autonomous driving, and machine learning/deep learning.

2.5k
Articles
2
Likes
9.2k
Views
1
Comments
Recent Articles

Latest from DataFunTalk

100 recent articles max
DataFunTalk
DataFunTalk
Jun 12, 2026 · Artificial Intelligence

How Ontology + Large Models Enable Knora to Tackle Hallucinations and Execution Gaps in Enterprise AI

The article explains how Knora 4.0 combines ontology with large‑model AI to move enterprise applications from isolated chat bots to autonomous, end‑to‑end systems, addressing six major challenges such as hallucinations, unstable outputs, weak planning, poor responsiveness, data integration difficulty, and long cold‑start cycles, and demonstrates the approach with real LED‑line use cases, architectural details, and a roadmap for future autonomous agents.

AI PlatformAutonomous AgentsEnterprise AI
0 likes · 17 min read
How Ontology + Large Models Enable Knora to Tackle Hallucinations and Execution Gaps in Enterprise AI
DataFunTalk
DataFunTalk
Jun 12, 2026 · Industry Insights

Why Are True Benchmark Cases for Data Agents Still Rare After Years of Hype?

The article analyzes the surge of interest in Agentic Analytics and Data Agents, explains how market focus has shifted from speed to accuracy and real‑world value, and outlines the concrete criteria that a genuine enterprise‑grade data‑analysis agent benchmark must satisfy.

AccuracyAgentic AnalyticsBenchmark Cases
0 likes · 9 min read
Why Are True Benchmark Cases for Data Agents Still Rare After Years of Hype?
DataFunTalk
DataFunTalk
Jun 11, 2026 · Artificial Intelligence

How Qichacha Leverages Large Language Models for Field‑Level Data Lineage

This article details Qichacha's use of large language models to extract field‑level data lineage from heterogeneous, non‑standard code and ETL assets, describing the motivation, architectural blueprint, practical challenges such as cost, accuracy and hallucination, and the resulting improvements in impact analysis, metric tracing, and sensitive‑data governance.

Big DataData GovernanceFlink
0 likes · 11 min read
How Qichacha Leverages Large Language Models for Field‑Level Data Lineage
DataFunTalk
DataFunTalk
Jun 10, 2026 · Artificial Intelligence

Claude Mythos 5 Unleashed: 50 Million Lines of Code Processed in One Day

Anthropic released Claude Fable 5 and Mythos 5, dual‑version LLMs that achieve record‑breaking benchmarks in software engineering, visual reasoning, long‑context tasks and finance, while introducing a safety‑first routing system, token‑efficiency pricing and a limited free‑trial window, reshaping how developers and enterprises interact with powerful AI agents.

AI benchmarksClaudeFable 5
0 likes · 18 min read
Claude Mythos 5 Unleashed: 50 Million Lines of Code Processed in One Day
DataFunTalk
DataFunTalk
Jun 10, 2026 · Artificial Intelligence

Building an Enterprise‑Grade RAG 2.0 System: Architecture, Challenges, and Practices

This article analyses the enterprise‑level RAG 2.0 solution, covering its background problems, layered architecture, offline and online pipelines, document parsing, multi‑turn query rewriting, hybrid vector‑plus‑BM25 retrieval, ranking models such as RRF, ColBERT and cross‑encoder, knowledge filtering, two‑stage generation with FoRAG, and practical evaluation metrics.

Document ParsingEnterprise AIHybrid retrieval
0 likes · 22 min read
Building an Enterprise‑Grade RAG 2.0 System: Architecture, Challenges, and Practices
DataFunTalk
DataFunTalk
Jun 9, 2026 · Artificial Intelligence

How Ontology‑Driven Agents Enable Controllable Execution in Harness Engineering

The article analyzes why current AI agents often act beyond business rules, proposes an ontology‑driven semantic foundation called Harness Engineering, and details three technical pillars—architectural constraints, context engineering, and feedback loops—illustrated with the Knora implementation and real‑world use cases.

AI AgentsEnterprise AIKnora
0 likes · 20 min read
How Ontology‑Driven Agents Enable Controllable Execution in Harness Engineering
DataFunTalk
DataFunTalk
Jun 9, 2026 · Artificial Intelligence

Anthropic’s Internal Claude Code Skills: 9 Types, Key Practices, and Writing Tips

Anthropic reveals how its teams use Claude Code Skills, classifying them into nine functional categories, emphasizing verification and focus, and sharing concrete guidelines for structuring SKILL.md, progressive disclosure, memory, scripts, hooks, distribution, composition, and usage measurement.

AI automationClaude CodeKnowledge Management
0 likes · 15 min read
Anthropic’s Internal Claude Code Skills: 9 Types, Key Practices, and Writing Tips
DataFunTalk
DataFunTalk
Jun 8, 2026 · Artificial Intelligence

How Harness Transforms AI Coding for Data Warehousing into an End-to-End Pipeline

This article details how a data‑warehouse team built a seven‑layer Harness framework to overcome AI‑coding challenges—semantic drift, strict constraints, and cross‑session context—enabling reliable, end‑to‑end production‑grade wide‑table delivery with up to 25× speedup and near‑zero side‑effects.

AIData WarehousingFramework
0 likes · 32 min read
How Harness Transforms AI Coding for Data Warehousing into an End-to-End Pipeline
DataFunTalk
DataFunTalk
Jun 7, 2026 · Artificial Intelligence

ChatGPT’s Dreaming V3 Memory Upgrade: Free for a Billion Users

OpenAI unveiled Dreaming V3, a new memory architecture that lets ChatGPT silently replay and consolidate daily conversations, achieving 82.8% context recall, 71.3% preference compliance, five‑fold compute savings, and free access for billions while offering a transparent memory‑summary interface.

AI memoryChatGPTDreaming V3
0 likes · 9 min read
ChatGPT’s Dreaming V3 Memory Upgrade: Free for a Billion Users
DataFunTalk
DataFunTalk
Jun 7, 2026 · Artificial Intelligence

Exploring Multimodal GraphRAG: Combining Document Intelligence, Knowledge Graphs, and Large Models

This article presents a comprehensive technical analysis of multimodal GraphRAG, covering document‑intelligence parsing pipelines, multimodal graph indexing, retrieval‑generation workflows, knowledge‑graph enhancements for chunk relations, and a detailed comparison of RAG, GraphRAG, and KG‑QA approaches.

GraphRAGKnowledge GraphRAG
0 likes · 26 min read
Exploring Multimodal GraphRAG: Combining Document Intelligence, Knowledge Graphs, and Large Models