StarRing Big Data Open Lab
Author

StarRing Big Data Open Lab

Focused on big data technology research, exploring the Big Data era | [email protected]

105
Articles
0
Likes
64
Views
0
Comments
Recent Articles

Latest from StarRing Big Data Open Lab

100 recent articles max
StarRing Big Data Open Lab
StarRing Big Data Open Lab
Nov 11, 2016 · Big Data

Why SQL Still Rules Big Data—and How NoSQL & NewSQL Fit In

The article explores the evolution of data processing from Hadoop and Spark to modern SQL, NoSQL, and NewSQL solutions, comparing their architectures, performance trade‑offs, and use‑cases, while illustrating concepts with examples like MapReduce, Hive, Impala, and streaming platforms such as Storm.

HadoopNewSQLNoSQL
0 likes · 14 min read
Why SQL Still Rules Big Data—and How NoSQL & NewSQL Fit In
StarRing Big Data Open Lab
StarRing Big Data Open Lab
Nov 4, 2016 · Artificial Intelligence

How Item Features Power Music Recommendations: A Hands‑On Guide

This article explains how recommendation systems can use item‑level features instead of user ratings, illustrating the approach with Pandora's music‑gene project, detailing feature selection, scoring, distance calculations, standardization, and classification techniques across music, athlete, Iris, and automobile datasets.

Recommendation SystemsStandardizationclassification
0 likes · 20 min read
How Item Features Power Music Recommendations: A Hands‑On Guide
StarRing Big Data Open Lab
StarRing Big Data Open Lab
Oct 27, 2016 · Artificial Intelligence

Why Explicit vs Implicit Feedback Matters in Recommender Systems

This article explains the difference between explicit and implicit user feedback, discusses their advantages and pitfalls, and shows how collaborative‑filtering techniques such as user‑based, item‑based, adjusted cosine similarity, and Slope One can be applied to build accurate recommendation engines.

Recommender SystemsSlope Oneadjusted cosine similarity
0 likes · 19 min read
Why Explicit vs Implicit Feedback Matters in Recommender Systems
StarRing Big Data Open Lab
StarRing Big Data Open Lab
Oct 24, 2016 · Big Data

Why Kappa Beats Lambda: A Deep Dive into Modern Big Data Architectures

This article compares Lambda and Kappa architectures, explains their three‑layer models, highlights the drawbacks of maintaining separate batch and speed layers in Lambda, introduces Kappa’s unified approach with StreamSQL, provides a smart‑traffic case study, and offers guidance on choosing the right architecture based on data volume, development complexity, and operational costs.

Kappa architectureLambda architectureStreamSQL
0 likes · 15 min read
Why Kappa Beats Lambda: A Deep Dive into Modern Big Data Architectures
StarRing Big Data Open Lab
StarRing Big Data Open Lab
Oct 20, 2016 · Artificial Intelligence

How Collaborative Filtering Powers Recommendations: From Manhattan to Cosine Similarity

This article walks through the fundamentals of recommendation systems, explaining collaborative filtering and various similarity measures—including Manhattan, Euclidean, Minkowski, Pearson correlation, and cosine similarity—while discussing their suitability for dense, sparse, or biased rating data and introducing K‑Nearest Neighbors for practical implementation.

Recommendation Systemscollaborative filteringdata mining
0 likes · 15 min read
How Collaborative Filtering Powers Recommendations: From Manhattan to Cosine Similarity
StarRing Big Data Open Lab
StarRing Big Data Open Lab
Oct 10, 2016 · Big Data

Mastering Lambda Architecture: Real‑Time & Batch Processing for Smart Traffic

This article explains the principles of Lambda Architecture, its three‑layer design for combining batch and real‑time analytics, and demonstrates a detailed smart‑traffic case study with component selection, capacity planning, and implementation guidance for building scalable big‑data systems.

Batch ProcessingLambda architectureSmart Traffic
0 likes · 15 min read
Mastering Lambda Architecture: Real‑Time & Batch Processing for Smart Traffic
StarRing Big Data Open Lab
StarRing Big Data Open Lab
Oct 8, 2016 · Big Data

Evolving Data Warehouses with Hadoop & Spark: Core Technologies

Data warehouses centralize and transform enterprise data for multidimensional analysis, and modern demands have spawned four types—traditional, real‑time, associative discovery, and data marts—each with distinct technical requirements, while Hadoop‑based solutions like Transwarp Data Hub address challenges of scale, variety, latency, and security.

Distributed ComputingHadoopReal-time analytics
0 likes · 21 min read
Evolving Data Warehouses with Hadoop & Spark: Core Technologies