Big Data Technology Tribe
Author

Big Data Technology Tribe

Focused on computer science and cutting‑edge tech, we distill complex knowledge into clear, actionable insights. We track tech evolution, share industry trends and deep analysis, helping you keep learning, boost your technical edge, and ride the digital wave forward.

41
Articles
0
Likes
102
Views
0
Comments
Recent Articles

Latest from Big Data Technology Tribe

41 recent articles
Big Data Technology Tribe
Big Data Technology Tribe
Feb 25, 2026 · Databases

How Lance Implements MVCC Transactions with Optimistic Concurrency and Automatic Conflict Resolution

Lance uses Multi-Version Concurrency Control to provide ACID guarantees, creating immutable table versions on each commit and employing atomic storage primitives, rebase logic, and retry mechanisms to handle concurrent writes, conflict detection, and resolution across multiple transaction types.

Concurrency ControlDatabase InternalsLance
0 likes · 16 min read
How Lance Implements MVCC Transactions with Optimistic Concurrency and Automatic Conflict Resolution
Big Data Technology Tribe
Big Data Technology Tribe
Jan 20, 2026 · Big Data

Extending Spark SQL with LanceSparkSessionExtensions: A Complete Guide

This article explains how to inject the LanceSpark plugin into Spark, covering the core LanceSparkSessionExtensions class, various ways to register extensions, the custom parser and planner strategy implementations, and the underlying Spark mechanisms such as injectParser, injectPlannerStrategy, and PredicateHelper.

DataSourceV2LanceSparkPlannerStrategy
0 likes · 14 min read
Extending Spark SQL with LanceSparkSessionExtensions: A Complete Guide
Big Data Technology Tribe
Big Data Technology Tribe
Nov 23, 2025 · Artificial Intelligence

How Ray Data Accelerates AI Workloads with Streaming Execution

Ray Data is a scalable library built on Ray that offers high‑performance, streaming‑execution APIs for AI workloads, enabling efficient batch inference, data preprocessing, and training data ingestion across CPU and GPU resources, while supporting diverse data formats and seamless integration with popular frameworks.

AI data processingPythonRay Data
0 likes · 11 min read
How Ray Data Accelerates AI Workloads with Streaming Execution