Tagged articles
15 articles
Page 1 of 1
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Apr 17, 2026 · Artificial Intelligence

Can Table Modeling Scale? Rethinking Tree Models in the Age of Massive Compute

The article examines how the dramatic increase in GPU compute power—illustrated by a single H100 GPU equaling about 200 Hadoop instances—challenges the dominance of tree‑based models for structured data, presents scaling‑law experiments with KMLP and FOUND, and argues that pre‑training can redefine the balance between compute, data, and algorithms.

FOUNDGPUKMLP
0 likes · 10 min read
Can Table Modeling Scale? Rethinking Tree Models in the Age of Massive Compute
Machine Heart
Machine Heart
Apr 17, 2026 · Artificial Intelligence

Can Table Modeling Scale? Rethinking the Tree Model Era Amid Compute Shifts

The article examines how a single NVIDIA H100 GPU delivers roughly 200‑fold more FP16 compute than a 96‑core CPU Hadoop node, explores the "Bitter Lesson" of scaling‑driven AI breakthroughs, and presents large‑scale pretraining experiments that show table and sequence models now exhibit clear scaling laws, challenging the dominance of traditional tree‑based approaches.

FOUNDKMLPStructured Data
0 likes · 10 min read
Can Table Modeling Scale? Rethinking the Tree Model Era Amid Compute Shifts
Java One
Java One
Apr 8, 2026 · Artificial Intelligence

Master Claude API: From Model Selection to Streaming Responses

This guide walks you through Claude Code model choices, secure API key handling, Python SDK setup, request construction, multi‑turn conversation management, system prompts, temperature tuning, response streaming, and extracting clean structured data such as JSON, all with practical code examples and diagrams.

Claude APIMulti-turn ConversationPrompt engineering
0 likes · 31 min read
Master Claude API: From Model Selection to Streaming Responses
AI Tech Publishing
AI Tech Publishing
Feb 8, 2026 · Artificial Intelligence

Why Bigger Context Windows Fail and How Structured Graphs Deliver Precise Fact Retrieval

The article argues that large language models struggle with exact factual answers and that extending context windows often degrades performance, while knowledge graphs provide structured, traceable retrieval; it proposes a unified graph monograph and small, focused context slices to empower LLMs with accurate information.

Context RetrievalLLMLong Context Window
0 likes · 10 min read
Why Bigger Context Windows Fail and How Structured Graphs Deliver Precise Fact Retrieval
Baidu Maps Tech Team
Baidu Maps Tech Team
Nov 19, 2025 · Artificial Intelligence

Boosting Socio‑Economic Q&A: The ARAG Framework Merges Structured Data Analysis with RAG

ARAG introduces a novel Retrieval‑Augmented Generation framework that tightly integrates LLM‑driven structured data analysis with unstructured information retrieval, addressing the “structured + unstructured” reasoning gap in socio‑economic queries, and demonstrates superior accuracy, robustness, and hallucination resistance through extensive evaluations.

LLMRAGSocio-economic AI
0 likes · 12 min read
Boosting Socio‑Economic Q&A: The ARAG Framework Merges Structured Data Analysis with RAG
AI Frontier Lectures
AI Frontier Lectures
Nov 13, 2025 · Artificial Intelligence

Can a 2M‑Parameter Model Outperform XGBoost? Inside LimiX‑2M’s Tabular AI Breakthrough

The article examines LimiX‑2M, a lightweight 2‑million‑parameter transformer‑based model for structured tabular data that, through a novel Radial Basis Function embedding layer, achieves classification and regression performance surpassing traditional gradient‑boosting methods like XGBoost and even larger AI models, while remaining easy to fine‑tune and deploy.

LimiXRBF embeddingStructured Data
0 likes · 10 min read
Can a 2M‑Parameter Model Outperform XGBoost? Inside LimiX‑2M’s Tabular AI Breakthrough
AI Algorithm Path
AI Algorithm Path
Mar 24, 2025 · Artificial Intelligence

How to Use Pydantic for Structured LLM Output

The article explains why LLM responses can be inconsistent, introduces Pydantic as a way to define custom output schemas, and walks through concrete examples—both with OpenAI and Ollama models—showing how to build a LangChain pipeline that parses responses into structured data.

LLMLangChainOllama
0 likes · 7 min read
How to Use Pydantic for Structured LLM Output
DataFunTalk
DataFunTalk
Mar 7, 2024 · Artificial Intelligence

Integrating Large Language Models with Knowledge Graphs: Current Status and Future Directions

Large language models enhance human‑machine interaction and natural language understanding, but knowledge graphs remain essential for structured, low‑cost decision making, factual retrieval, and domains like finance; combining both can improve conversational systems, while ongoing challenges in knowledge graph construction persist, as highlighted for the upcoming DataFunSummit2024.

Conversational AIStructured Dataknowledge graph
0 likes · 5 min read
Integrating Large Language Models with Knowledge Graphs: Current Status and Future Directions
Airbnb Technology Team
Airbnb Technology Team
Jan 31, 2024 · Artificial Intelligence

Airbnb’s Listing Attribute Extraction Platform (LAEP): End-to-End Structured Information Extraction Using Machine Learning and NLP

Airbnb’s Listing Attribute Extraction Platform (LAEP) uses a custom NER model, word‑embedding mapping, and a BERT‑based scorer to automatically pull, normalize, and validate structured attributes from hosts’ unstructured text, boosting coverage for downstream tools and enhancing guest‑host matching at scale.

AirbnbBERTNER
0 likes · 11 min read
Airbnb’s Listing Attribute Extraction Platform (LAEP): End-to-End Structured Information Extraction Using Machine Learning and NLP
DataFunTalk
DataFunTalk
Jun 17, 2023 · Artificial Intelligence

Research on Text Generation for Structured Data

This article reviews the rapidly evolving field of structured‑data text generation, covering AI development stages, core concepts, model architectures from pipeline to pretrained transformers, key challenges such as content selection, numeric representation, reasoning and style control, and outlines recent research directions and Q&A insights.

AIStructured DataText Generation
0 likes · 21 min read
Research on Text Generation for Structured Data
Laiye Technology Team
Laiye Technology Team
Dec 31, 2021 · Artificial Intelligence

Overview of Table Recognition Techniques and Practical Implementation

This article reviews the challenges of extracting structured table data from images, compares two‑stage and end‑to‑end OCR approaches, evaluates four state‑of‑the‑art table‑recognition models (SPLERGE, CascadeTabNet, TableMASTER, UnetTable), and presents a practical deployment workflow with performance metrics.

AIComputer VisionDeep Learning
0 likes · 14 min read
Overview of Table Recognition Techniques and Practical Implementation
Xianyu Technology
Xianyu Technology
May 14, 2019 · Frontend Development

Structured Layout Information and Guided Line Method for UI Component Detection

The paper presents a structured layout‑information framework combined with a guided‑line “leader‑follower” algorithm that represents UI controls as Connection objects and matches them via attribute and vector similarity, enabling fast identification of recurring business components and duplicate GridView items without extensive retraining, thereby enhancing code reuse in UI2CODE projects.

Layout AnalysisStructured Datacomponent detection
0 likes · 9 min read
Structured Layout Information and Guided Line Method for UI Component Detection