Tag

data clustering


DataFunTalk
Jan 2, 2023 · Artificial Intelligence

Tail Traffic Modeling and Data‑Driven Risk Strategies at 360 Shuke

This article presents 360 Shuke's practical approach to modeling low‑volume (tail) credit traffic using accumulated data, covering the characteristics of tail traffic, sample expansion under low approval rates, timeliness‑based data clustering, and ranking optimization for high‑quality head customers.

data clustering · model optimization · risk modeling
19 min read
Big Data Technology Architecture
Mar 4, 2021 · Big Data

Improving Interactive Analysis on Massive Datasets with Data Clustering and Data Skipping Using Spark and Iceberg

This article explores how data clustering techniques such as linear ordering, Z‑ordering, and Hilbert‑curve ordering can be applied in Apache Spark and Apache Iceberg to achieve efficient data skipping on terabyte‑scale tables, dramatically reducing the number of files scanned and enabling sub‑second interactive analytics for multi‑dimensional queries.

Data Skipping · Spark · Z-Order
20 min read