Tagged articles
151 articles
Page 1 of 2
Model Perspective
Model Perspective
Apr 9, 2026 · Industry Insights

Why the Silly Big Personality Test Feels So Accurate: A 15‑Dimensional Clustering Analysis

The article dissects the Silly Big Personality Test (SBTI), explaining its 15‑dimensional Euclidean space, threshold‑based hierarchical clustering, statistical tricks like the Barnum effect, and its viral spread as a social meme, while highlighting the scientific limits of such entertainment‑focused assessments.

Barnum effectPsychologyclustering
0 likes · 7 min read
Why the Silly Big Personality Test Feels So Accurate: A 15‑Dimensional Clustering Analysis
Baidu Maps Tech Team
Baidu Maps Tech Team
Nov 10, 2025 · Fundamentals

How GENREGION Revolutionizes City Region Segmentation with Vector Road Networks

GENREGION is an open‑source, vector‑based city region segmentation system that leverages road‑network graphs, novel MBR‑based clustering, and recursive left‑most vector search to produce semantically accurate, scalable polygons, outperforming traditional grid and vector methods in both semantic similarity and computational efficiency.

city segmentationclusteringspatial analysis
0 likes · 11 min read
How GENREGION Revolutionizes City Region Segmentation with Vector Road Networks
Architecture Digest
Architecture Digest
Sep 22, 2025 · Databases

From Single Node to Distributed Cluster: Mastering Redis Evolution

This article walks through Redis's journey from a simple single‑instance cache to a robust, highly available, and horizontally scalable distributed system, covering persistence mechanisms, master‑slave replication, Sentinel automatic failover, and sharding clusters for real‑world high‑traffic applications.

Persistenceclusteringdatabase
0 likes · 12 min read
From Single Node to Distributed Cluster: Mastering Redis Evolution
HyperAI Super Neural
HyperAI Super Neural
Sep 15, 2025 · Artificial Intelligence

scSiameseClu Sets New SOTA on Unsupervised Single‑Cell Clustering Across 7 Datasets

The paper introduces scSiameseClu, a Siamese clustering framework that combines dual augmentation, siamese fusion, and optimal‑transport clustering to overcome representation collapse in scRNA‑seq data, and demonstrates state‑of‑the‑art performance on seven diverse single‑cell datasets and downstream annotation tasks.

Graph Neural NetworkSiamese Networkclustering
0 likes · 11 min read
scSiameseClu Sets New SOTA on Unsupervised Single‑Cell Clustering Across 7 Datasets
Python Programming Learning Circle
Python Programming Learning Circle
Sep 11, 2025 · Artificial Intelligence

Essential Machine Learning Algorithms: From Linear Regression to DBSCAN

This article provides a comprehensive overview of key machine‑learning algorithms—including supervised methods like linear regression, SVM, Naive Bayes, logistic regression, k‑NN, decision trees, random forests, GBDT, and unsupervised techniques such as k‑means, hierarchical clustering, DBSCAN, and PCA—explaining their principles, strengths, and typical use cases.

AlgorithmsNaive BayesUnsupervised Learning
0 likes · 20 min read
Essential Machine Learning Algorithms: From Linear Regression to DBSCAN
Su San Talks Tech
Su San Talks Tech
Sep 11, 2025 · Databases

Why Redis Handles Millions of Requests: Deep Dive into Its High‑Performance Architecture

This article explains how Redis achieves ultra‑high concurrency by combining a single‑threaded event‑driven core, highly optimized in‑memory data structures, efficient I/O multiplexing, smart memory management, asynchronous persistence, and scalable clustering, allowing it to comfortably serve hundreds of thousands of simultaneous requests.

Event-drivenMemory OptimizationPersistence
0 likes · 16 min read
Why Redis Handles Millions of Requests: Deep Dive into Its High‑Performance Architecture
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Aug 5, 2025 · Operations

How Alibaba Scales Anomaly Detection Across Millions of Metrics

This article explains how Alibaba tackles anomaly detection for tens of millions of metrics in a 100‑thousand‑machine cluster by comparing vertical time‑series methods with horizontal clustering, choosing DBSCAN for large‑scale monitoring, and detailing the ETL, computation, and visualization pipeline.

DBSCANTime Seriesanomaly detection
0 likes · 6 min read
How Alibaba Scales Anomaly Detection Across Millions of Metrics
Qborfy AI
Qborfy AI
Jun 26, 2025 · Artificial Intelligence

Unlock Hidden Patterns: A Hands‑On Guide to Unsupervised Learning Techniques

This article explains unsupervised learning by defining its core concepts, comparing clustering, dimensionality reduction, and association techniques, and illustrating each with concrete examples—from restaurant dish grouping and housing decision simplification to convenience‑store product analysis—while offering hands‑on experiments and real‑world case studies such as Amazon, NASA, and 7‑Eleven.

AICase StudiesUnsupervised Learning
0 likes · 5 min read
Unlock Hidden Patterns: A Hands‑On Guide to Unsupervised Learning Techniques
Raymond Ops
Raymond Ops
Apr 4, 2025 · Databases

Mastering Redis: Persistence, Cache Pitfalls, Data Types, Clustering & Locks

This article explains Redis persistence mechanisms (RDB and AOF), common cache problems and their solutions, the various Redis data structures and their use cases, cluster deployment options, master‑slave replication, transaction commands, and how to implement distributed locks with SETNX and expiration.

CacheData TypesPersistence
0 likes · 13 min read
Mastering Redis: Persistence, Cache Pitfalls, Data Types, Clustering & Locks
AI Code to Success
AI Code to Success
Mar 12, 2025 · Artificial Intelligence

Mastering K‑Means: Theory, Implementation, and Real‑World Applications

This comprehensive guide explores the K‑Means clustering algorithm, covering its mathematical foundation, step‑by‑step procedure, centroid initialization strategies, practical implementation with Python’s Scikit‑learn on the Iris dataset, evaluation metrics, optimization techniques, and diverse applications ranging from image segmentation to bioinformatics.

K-MeansPythonalgorithm
0 likes · 31 min read
Mastering K‑Means: Theory, Implementation, and Real‑World Applications
Architect's Must-Have
Architect's Must-Have
Feb 26, 2025 · Databases

Master MySQL Indexes: From Basics to B+Tree Optimization

This article explains what MySQL indexes are, how they work, their advantages and drawbacks, the different types of indexes—including primary, ordinary, composite, and full‑text—and dives deep into B‑Tree and B+Tree structures, clustering, page organization, and best practices for efficient query performance.

B+TreeDatabase Optimizationclustering
0 likes · 11 min read
Master MySQL Indexes: From Basics to B+Tree Optimization
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Feb 23, 2025 · Frontend Development

AutoHue.js: Automatic Image Color Extraction for Web Banners

This article introduces AutoHue.js, a lightweight JavaScript library that uses Canvas, Lab color space conversion, and clustering algorithms to automatically extract dominant, secondary, and edge colors from images for seamless background integration in web banners, complete with usage examples and installation instructions.

AutoHueCanvasImage Processing
0 likes · 16 min read
AutoHue.js: Automatic Image Color Extraction for Web Banners
Code Mala Tang
Code Mala Tang
Nov 20, 2024 · Backend Development

Can Node.js Power Millions of Users? Scaling Strategies Revealed

This article explores whether Node.js can handle millions of concurrent users, explains the core non‑blocking architecture, outlines challenges such as the single‑thread model and memory leaks, and provides practical scaling tactics like clustering, load balancing, caching, and database optimization.

BackendNode.jsScalability
0 likes · 10 min read
Can Node.js Power Millions of Users? Scaling Strategies Revealed
Sohu Tech Products
Sohu Tech Products
Sep 11, 2024 · Big Data

Tencent Real-time Lakehouse Intelligent Optimization Practice

Tencent’s real‑time lakehouse combines Spark, Flink, StarRocks and Presto compute layers with Iceberg‑based management and HDFS/COS storage, and its Intelligent Optimize Service—comprising Compaction, Expiration, Cleaning, Clustering, Index and Auto‑Engine modules—automatically reduces merge time, improves query performance, enables secondary indexing, and dynamically routes hot partitions, while future plans target cold/hot separation, materialized view acceleration, and AI‑driven optimizations.

Big DataLakehousePyIceberg
0 likes · 12 min read
Tencent Real-time Lakehouse Intelligent Optimization Practice
DataFunSummit
DataFunSummit
Aug 31, 2024 · Big Data

Apache Hudi Clustering: Workflow and Layout Optimization Strategies (Part 6)

This article explains Apache Hudi's clustering service, detailing its workflow, three execution modes, and layout optimization strategies—including linear, Z‑order, and Hilbert space‑filling curves—to improve storage locality and query performance in large‑scale data lake environments.

Apache HudiBig DataSpace-filling Curves
0 likes · 8 min read
Apache Hudi Clustering: Workflow and Layout Optimization Strategies (Part 6)
Baidu Geek Talk
Baidu Geek Talk
Aug 7, 2024 · Artificial Intelligence

Detecting Time‑Series Anomalies in Embedding Space: A Practical AI Approach

This article presents an embedding‑based method for time‑series anomaly detection in security and anti‑cheat scenarios, explains how to vectorise logs, sample and compute distribution features, details implementation code, and validates the approach with four synthetic experiments showing precision‑recall improvements at day and hour granularity.

EmbeddingSecurityTime Series
0 likes · 12 min read
Detecting Time‑Series Anomalies in Embedding Space: A Practical AI Approach
21CTO
21CTO
May 25, 2024 · Databases

How Pinterest Scaled Its Architecture: From Clusters to Sharding

This article examines Pinterest's evolution through four development phases, the core technologies it adopted, and how it transitioned from complex clustering to a simpler sharding architecture to achieve scalable, high‑availability data storage.

Pinterestarchitectureclustering
0 likes · 11 min read
How Pinterest Scaled Its Architecture: From Clusters to Sharding
Python Programming Learning Circle
Python Programming Learning Circle
May 5, 2024 · Artificial Intelligence

Python Implementation of DBSCAN and KMeans for Point Cloud Clustering and Tracking with Hungarian Matching

This article presents a Python project that reads point‑cloud data from CSV files, applies DBSCAN and KMeans clustering, extracts cluster features, and uses the Hungarian algorithm to match clusters across frames for tracking, complete with full source code and result visualization.

DBSCANHungarian algorithmKMeans
0 likes · 13 min read
Python Implementation of DBSCAN and KMeans for Point Cloud Clustering and Tracking with Hungarian Matching
Wukong Talks Architecture
Wukong Talks Architecture
Apr 24, 2024 · Backend Development

Core Concepts and Common Patterns of RabbitMQ

This article explains why message queues are needed, outlines RabbitMQ's architecture, describes its basic concepts and working modes such as simple, work, fanout, direct and topic, and discusses reliability features like transactions, confirms, dead‑letter queues, TTL, clustering, ordering, and handling message backlogs.

Message QueueRabbitMQTTL
0 likes · 24 min read
Core Concepts and Common Patterns of RabbitMQ
Baobao Algorithm Notes
Baobao Algorithm Notes
Mar 21, 2024 · Artificial Intelligence

Can the CaR Method Achieve Better LLM Performance with Only 1.4% of Training Data?

This article explains how the CaR (Clustering and Ranking) approach evaluates data quality with a scoring model and selects diverse samples via PCA‑reduced sentence embeddings and K‑Means clustering, achieving comparable or superior large‑model performance while using just 1.96% of the original dataset.

CaR methodData QualityLLM training
0 likes · 8 min read
Can the CaR Method Achieve Better LLM Performance with Only 1.4% of Training Data?
Model Perspective
Model Perspective
Feb 1, 2024 · Fundamentals

Essential Guide to Statistical and Probabilistic Model Articles

This curated list gathers recent articles on statistical and probabilistic models, covering clustering analysis, various linear regression techniques, and causal analysis, providing convenient links for students and researchers to explore each topic in depth.

Causal Analysisclusteringlinear regression
0 likes · 3 min read
Essential Guide to Statistical and Probabilistic Model Articles
Baobao Algorithm Notes
Baobao Algorithm Notes
Dec 11, 2023 · Artificial Intelligence

Boost Large‑Model Fine‑Tuning with Low‑Cost Data Selection and Construction

The article explains practical techniques for choosing and constructing fine‑tuning data for large language models, covering data diversity through similarity‑based clustering, semi‑supervised filtering with binary classifiers, and uncertainty‑driven sampling using perplexity or reward models to build an efficient, low‑cost pipeline.

Large ModelReward modelactive learning
0 likes · 9 min read
Boost Large‑Model Fine‑Tuning with Low‑Cost Data Selection and Construction
DataFunTalk
DataFunTalk
Nov 16, 2023 · Product Management

User Operations: Methods for User Analysis, Segmentation, and Aha‑Moment Identification

This article provides a comprehensive guide to user operations, covering the definition of user operation, common user analysis techniques, attribute and behavior analysis, segmentation methods using business logic and clustering algorithms, and the concept of the Aha‑moment or magic number for optimizing retention and value.

Aha MomentSegmentationUser Retention
0 likes · 12 min read
User Operations: Methods for User Analysis, Segmentation, and Aha‑Moment Identification
Programmer DD
Programmer DD
Nov 5, 2023 · Artificial Intelligence

How Fake GitHub Stars Are Bought, Detected, and Why Investors Care

GitHub star‑buying has become a black‑market industry, with prices ranging from under a yuan to six yuan per star, prompting investors to use star counts as a metric, while researchers develop simple and advanced clustering algorithms to detect fake stars and expose the practice.

GitHubclusteringfake detection
0 likes · 9 min read
How Fake GitHub Stars Are Bought, Detected, and Why Investors Care
Test Development Learning Exchange
Test Development Learning Exchange
Sep 12, 2023 · Artificial Intelligence

Various Anomaly Detection Techniques with Python Code Examples

This article introduces ten common anomaly detection approaches—including statistical thresholds, boxplots, clustering, isolation forest, LOF, collaborative filtering, robust covariance, NLP, computer‑vision, and time‑series methods—each accompanied by concise Python code snippets illustrating how to identify outliers in different data domains.

PythonTime Seriesanomaly detection
0 likes · 9 min read
Various Anomaly Detection Techniques with Python Code Examples
dbaplus Community
dbaplus Community
Aug 30, 2023 · Databases

How to Build a Stable, High‑Performance Redis Cluster from Scratch

This guide walks through the evolution of a Redis deployment—from a single‑node cache, through data persistence options, master‑slave replication, Sentinel automatic failover, and finally sharding with Redis Cluster—explaining each technique, its trade‑offs, and practical implementation steps.

PersistenceReplicationclustering
0 likes · 18 min read
How to Build a Stable, High‑Performance Redis Cluster from Scratch
Data Thinking Notes
Data Thinking Notes
Aug 21, 2023 · Product Management

How User Profiling Drives Personalized Marketing and Product Innovation

This article explains the fundamentals, principles, methodologies, and practical applications of user profiling, covering core concepts such as user characteristics, behavior, preferences, needs, and value, the data collection-to-model pipeline, common models like RFM, clustering, association rules, text mining, and how these insights enable personalized recommendation, precise marketing, brand management, service optimization, CRM, market research, and product innovation.

RFM modelSentiment Analysisassociation rules
0 likes · 14 min read
How User Profiling Drives Personalized Marketing and Product Innovation
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Jul 25, 2023 · Artificial Intelligence

Building a Reverse Image Search Engine with Geometric Distance, ResNet Feature Embeddings, Clustering, and Milvus Vector Database

This article walks through implementing a reverse image search system, starting with simple pixel‑based geometric distance, then improving accuracy using ResNet‑derived feature embeddings, accelerating queries with K‑means clustering, and finally deploying a Milvus vector database for fast, scalable similarity retrieval.

MilvusResNet50clustering
0 likes · 17 min read
Building a Reverse Image Search Engine with Geometric Distance, ResNet Feature Embeddings, Clustering, and Milvus Vector Database
Architect
Architect
Jul 14, 2023 · Databases

From Single‑Node to Scalable Redis Cluster: A Step‑by‑Step Architecture Guide

This article walks through Redis's evolution from a simple single‑instance cache to a highly available, high‑performance cluster, explaining persistence mechanisms (RDB, AOF, hybrid), master‑slave replication, Sentinel automatic failover, and sharding strategies with concrete examples and trade‑offs.

Database ArchitecturePersistenceReplication
0 likes · 20 min read
From Single‑Node to Scalable Redis Cluster: A Step‑by‑Step Architecture Guide
Practical DevOps Architecture
Practical DevOps Architecture
May 16, 2023 · Databases

Redis Course Curriculum Overview: Distributed Locks, High Availability, Clustering, Persistence, and Advanced Projects

This article outlines a comprehensive Redis training program covering fundamentals, distributed lock implementation, high‑availability mechanisms, clustering, persistence strategies, and practical projects such as Bloom filter integration and flash‑sale systems, providing learners with the knowledge to master advanced Redis usage.

Persistencebloom-filterclustering
0 likes · 5 min read
Redis Course Curriculum Overview: Distributed Locks, High Availability, Clustering, Persistence, and Advanced Projects
Architects Research Society
Architects Research Society
Apr 28, 2023 · Databases

High‑Availability Cluster Solutions for PostgreSQL

This article explains high‑availability concepts for PostgreSQL, reviews standby database types, describes clustering models, and evaluates several HA solutions such as DRBD, ClusterControl, Rubyrep, Pgpool II, Bucardo, Postgres‑XC, Citus, and PostgresXL, while noting practical considerations and trade‑offs.

Database ReplicationHA SolutionsPostgreSQL
0 likes · 10 min read
High‑Availability Cluster Solutions for PostgreSQL
Model Perspective
Model Perspective
Mar 22, 2023 · Artificial Intelligence

Master DBSCAN Clustering: Theory, Python Code, and Real-World Examples

DBSCAN is a density‑based clustering algorithm that automatically discovers arbitrarily shaped clusters and isolates noise, with detailed explanations of core, border, and noise points, step‑by‑step examples, Python implementations using scikit‑learn, and guidance on key parameters such as eps and min_samples.

DBSCANPythonclustering
0 likes · 10 min read
Master DBSCAN Clustering: Theory, Python Code, and Real-World Examples
Yunxuetang Frontend Team
Yunxuetang Frontend Team
Mar 3, 2023 · Frontend Development

Essential Frontend Techniques: Module Cycles, Proxy & Reflect, Bug Recording, and Guide Pages

This article curates key frontend topics—including how CommonJS and ES modules handle circular dependencies, why Proxy must be paired with Reflect, practical bug‑recording and source‑code navigation, building guide pages without third‑party libraries, plus a look at clustering algorithms and a powerful CRUD code generator—offering concise insights for modern web developers.

Code GenerationJavaScriptProxy
0 likes · 4 min read
Essential Frontend Techniques: Module Cycles, Proxy & Reflect, Bug Recording, and Guide Pages
DataFunTalk
DataFunTalk
Feb 28, 2023 · Artificial Intelligence

Event‑Aware Graph Extraction and Adaptive Clustering‑Gain Network for Insurance Creative Recommendation

This article presents a comprehensive study on insurance creative recommendation, introducing an event‑aware graph extractor, a heterogeneous graph construction, and an adaptive clustering‑gain network that together address data sparsity, counterfactual samples, and cross‑industry cold‑start challenges, achieving significant AUC improvements in experiments.

AIAdvertisingGraph Neural Network
0 likes · 15 min read
Event‑Aware Graph Extraction and Adaptive Clustering‑Gain Network for Insurance Creative Recommendation
DataFunSummit
DataFunSummit
Feb 1, 2023 · Artificial Intelligence

Clustering-Based Global LSTM Models for Large-Scale Time Series Forecasting

The paper proposes clustering thousands of related time series and training separate global LSTM models for each cluster, showing that this reduces heterogeneity, leverages shared information, and improves forecasting accuracy compared to individual models, with extensive experiments on CIF2016 and NN5 datasets.

LSTMRNNclustering
0 likes · 33 min read
Clustering-Based Global LSTM Models for Large-Scale Time Series Forecasting
Architect
Architect
Jan 16, 2023 · Databases

Redis Fundamentals: Pipelines, Pub/Sub, Expiration, Transactions, Persistence, Distributed Locks and Clustering

This article provides a comprehensive overview of Redis, covering basic concepts, pipeline optimization, publish/subscribe messaging, key expiration strategies, transaction behavior, persistence mechanisms (RDB, AOF, hybrid), distributed locking techniques, Redisson and Redlock algorithms, as well as high‑availability setups using replication, Sentinel and Cluster modes.

Distributed LocksPersistenceTransactions
0 likes · 32 min read
Redis Fundamentals: Pipelines, Pub/Sub, Expiration, Transactions, Persistence, Distributed Locks and Clustering
Model Perspective
Model Perspective
Jan 8, 2023 · Artificial Intelligence

Unlock Hidden Patterns: A Deep Dive into Unsupervised Learning Techniques

This article introduces unsupervised learning, covering its motivation, Jensen's inequality, key clustering methods such as EM, k‑means, hierarchical clustering, evaluation metrics, and dimensionality‑reduction techniques like PCA and ICA, providing clear explanations and illustrative diagrams.

EM algorithmICAK-Means
0 likes · 8 min read
Unlock Hidden Patterns: A Deep Dive into Unsupervised Learning Techniques
Architect's Guide
Architect's Guide
Dec 1, 2022 · Databases

Comprehensive Guide to Redis: Architecture, Data Structures, Persistence, Replication, Clustering, and Advanced Features

This article provides an in‑depth overview of Redis, covering its single‑threaded architecture, core data structures (String, Hash, List, Set, ZSet), persistence mechanisms (RDB, AOF, hybrid), replication, Sentinel and cluster designs, memory eviction policies, bitmap analytics, skiplist implementation, and strategies for ensuring data consistency.

ConsistencyData StructuresPersistence
0 likes · 27 min read
Comprehensive Guide to Redis: Architecture, Data Structures, Persistence, Replication, Clustering, and Advanced Features
Model Perspective
Model Perspective
Nov 8, 2022 · Artificial Intelligence

Mastering K-Means: How Distance-Based Clustering Works and How to Implement It

This article explains the fundamentals of the K-means clustering algorithm, describing its distance‑based similarity principle, the objective of minimizing squared error, and a step‑by‑step iterative procedure—including random centroid initialization, assignment, centroid recomputation, and convergence criteria.

Unsupervised Learningalgorithmclustering
0 likes · 3 min read
Mastering K-Means: How Distance-Based Clustering Works and How to Implement It
Architecture Digest
Architecture Digest
Oct 13, 2022 · Databases

Comprehensive Overview of Redis: Architecture, Data Structures, Persistence, Replication, Clustering and Advanced Features

This article provides an in‑depth technical guide to Redis, covering its single‑threaded model, core data structures (String, Hash, List, Set, ZSet) with practical commands, persistence mechanisms (RDB, AOF, hybrid), master‑slave replication, Sentinel and cluster architectures, cache eviction policies, memory management, progressive rehash, skiplist implementation, bitmap statistics, and consistency challenges when coupling Redis with MySQL.

Data StructuresPersistenceclustering
0 likes · 28 min read
Comprehensive Overview of Redis: Architecture, Data Structures, Persistence, Replication, Clustering and Advanced Features
IT Architects Alliance
IT Architects Alliance
Oct 9, 2022 · Databases

Comprehensive Overview of Redis: Architecture, Data Structures, Persistence, Replication, Cluster, Eviction, and Advanced Mechanisms

This article provides an in‑depth technical guide to Redis, covering its single‑threaded I/O model, core data structures (String, Hash, List, Set, ZSet), persistence options (RDB, AOF, hybrid), master‑slave replication, Sentinel, cluster slot allocation, eviction policies, progressive rehash, skip‑list implementation, and consistency challenges between MySQL and Redis.

Data StructuresPersistenceclustering
0 likes · 28 min read
Comprehensive Overview of Redis: Architecture, Data Structures, Persistence, Replication, Cluster, Eviction, and Advanced Mechanisms
Top Architect
Top Architect
Oct 9, 2022 · Databases

Comprehensive Guide to Redis: Architecture, Data Structures, Persistence, Clustering, and Advanced Features

This article provides an in‑depth overview of Redis, covering its single‑threaded design, core data structures, persistence mechanisms (RDB, AOF, hybrid), master‑slave replication, Sentinel and Cluster architectures, eviction policies, progressive rehash, bitmap usage for massive analytics, skiplist implementation, and strategies for handling MySQL‑Redis write inconsistencies.

Data StructuresPersistenceclustering
0 likes · 27 min read
Comprehensive Guide to Redis: Architecture, Data Structures, Persistence, Clustering, and Advanced Features
Laravel Tech Community
Laravel Tech Community
Sep 27, 2022 · Databases

Understanding Redis: Overview, Architecture, and Persistence Model

Redis is an open‑source in‑memory key‑value data‑structure server that serves as a cache, primary database, and messaging system; this article explains its core concepts, deployment options (single instance, HA, Sentinel, Cluster), and persistence mechanisms (RDB, AOF, and hybrid approaches).

In-Memory DatabasePersistenceclustering
0 likes · 18 min read
Understanding Redis: Overview, Architecture, and Persistence Model
Tencent Cloud Developer
Tencent Cloud Developer
Sep 22, 2022 · Databases

Redis Fundamentals: Data Structures, Persistence, Replication, and Clustering

The article explains Redis’s core concepts—including its in‑memory key‑value store, six low‑level data structures, single‑threaded execution, AOF and RDB persistence options, master‑replica replication with Sentinel failover, and horizontal scaling via Cluster’s hash‑slot sharding and rebalancing.

Data StructuresPersistenceReplication
0 likes · 26 min read
Redis Fundamentals: Data Structures, Persistence, Replication, and Clustering
MaGe Linux Operations
MaGe Linux Operations
Sep 8, 2022 · Artificial Intelligence

Master 10 Popular Clustering Algorithms in Python with Scikit‑Learn

This tutorial introduces unsupervised clustering, explains its purpose, and walks through installing scikit‑learn and implementing ten popular clustering algorithms—including AffinityPropagation, Agglomerative, BIRCH, DBSCAN, K‑Means, Mini‑Batch K‑Means, MeanShift, OPTICS, Spectral Clustering, and Gaussian Mixture—complete with code examples and visualizations.

Unsupervised Learningclusteringdata mining
0 likes · 27 min read
Master 10 Popular Clustering Algorithms in Python with Scikit‑Learn
DataFunTalk
DataFunTalk
Aug 22, 2022 · Artificial Intelligence

Live‑Streaming Recommendation System: Interaction Scenarios, User Cold‑Start, Prior Modeling, and Scene Modeling

The article presents a comprehensive technical overview of a live‑streaming recommendation system, covering common and specific characteristics, user cold‑start strategies using unbiased clustering, prior knowledge integration, multi‑task modeling, and scene‑aware routing to improve relevance and engagement in interactive environments.

Recommendation Systemsclusteringfeature modeling
0 likes · 19 min read
Live‑Streaming Recommendation System: Interaction Scenarios, User Cold‑Start, Prior Modeling, and Scene Modeling
Model Perspective
Model Perspective
Aug 18, 2022 · Artificial Intelligence

Master SciPy Clustering: K‑Means and Hierarchical Methods with Python

This guide introduces SciPy's clustering modules, explaining the vector quantization and k‑means algorithm in scipy.cluster.vq, and demonstrates hierarchical clustering with scipy.cluster.hierarchy, accompanied by complete Python code examples and visualizations to help you apply these techniques to real data.

Hierarchical ClusteringK-Meansclustering
0 likes · 4 min read
Master SciPy Clustering: K‑Means and Hierarchical Methods with Python
Model Perspective
Model Perspective
Aug 16, 2022 · Fundamentals

13 Essential Statistical Analysis Methods Every Researcher Should Know

This article outlines thirteen key statistical techniques—including descriptive and inferential methods, hypothesis testing, reliability analysis, contingency tables, regression, clustering, discriminant, factor, and time‑series analysis—explaining their purposes, assumptions, and typical applications for researchers and data analysts.

clusteringdata analysishypothesis testing
0 likes · 25 min read
13 Essential Statistical Analysis Methods Every Researcher Should Know
MaGe Linux Operations
MaGe Linux Operations
Jul 29, 2022 · Artificial Intelligence

Master 10 Popular Clustering Algorithms in Python with Scikit‑Learn

This tutorial introduces clustering, explains why no single algorithm fits all data, and provides step‑by‑step Python examples using scikit‑learn for ten popular unsupervised learning methods, complete with code snippets and visualizations to illustrate results.

PythonUnsupervised Learningclustering
0 likes · 24 min read
Master 10 Popular Clustering Algorithms in Python with Scikit‑Learn
Model Perspective
Model Perspective
Jul 4, 2022 · Fundamentals

Top Model Guides: Clustering, Regression, Queueing & Monte Carlo Simulations

This curated list groups recent explanatory and simulation model articles—covering clustering analysis, linear regression, queueing theory, Markov chains, and Monte Carlo methods—into easy-to-navigate sections for quick reference, helping students and practitioners locate relevant resources efficiently.

ModelingMonte Carloclustering
0 likes · 2 min read
Top Model Guides: Clustering, Regression, Queueing & Monte Carlo Simulations
Model Perspective
Model Perspective
Jun 4, 2022 · Artificial Intelligence

Master K-means Clustering: How the Algorithm Finds Compact Groups

K-means is a classic distance‑based clustering algorithm that iteratively partitions data into k compact, well‑separated groups by minimizing the sum of squared errors, using random centroid initialization and heuristic updates until convergence, making it a fundamental tool in AI and data analysis.

K-MeansUnsupervised Learningalgorithm
0 likes · 3 min read
Master K-means Clustering: How the Algorithm Finds Compact Groups
Model Perspective
Model Perspective
Jun 4, 2022 · Artificial Intelligence

Master Systematic Clustering: From Distance Matrix to Multi-Level Groupings

Systematic clustering, a widely used hierarchical clustering technique, builds a dendrogram by iteratively merging the closest sample points based on a distance matrix, allowing analysts to visualize and select groupings at various distance thresholds, from a single cluster to each point as its own class.

Hierarchical Clusteringclusteringdistance matrix
0 likes · 3 min read
Master Systematic Clustering: From Distance Matrix to Multi-Level Groupings
Model Perspective
Model Perspective
Jun 4, 2022 · Fundamentals

Understanding Sample Similarity: Distance Metrics and Cluster Methods

This article explains how to quantify similarity between data samples using distance metrics such as Manhattan, Euclidean, and Chebyshev, outlines the properties these distances must satisfy, and describes common inter‑class measures like single linkage, complete linkage, centroid, group average, and sum‑of‑squares methods.

Minkowskiclusteringdistance metrics
0 likes · 4 min read
Understanding Sample Similarity: Distance Metrics and Cluster Methods
Top Architect
Top Architect
May 31, 2022 · Backend Development

AKF Principles for Microservice Splitting: X‑Axis, Y‑Axis, and Z‑Axis Strategies

The article explains how to address single‑point failures and capacity limits by clustering servers using replication patterns and introduces the AKF microservice splitting principle, detailing X‑axis horizontal replication, Y‑axis business‑oriented division, and Z‑axis data‑driven partitioning with diagrams.

AKFBackendMicroservices
0 likes · 5 min read
AKF Principles for Microservice Splitting: X‑Axis, Y‑Axis, and Z‑Axis Strategies
Top Architect
Top Architect
May 7, 2022 · Backend Development

AKF Principle for Microservice Splitting: X, Y, and Z Axis Strategies

The article explains the AKF principle for microservice design, detailing how X‑axis horizontal replication, Y‑axis functional splitting, and Z‑axis data‑centric partitioning address single‑point failures, capacity limits, and performance bottlenecks by employing master‑master, master‑slave, and master‑backup replication patterns.

AKFBackendMicroservices
0 likes · 6 min read
AKF Principle for Microservice Splitting: X, Y, and Z Axis Strategies
Code Ape Tech Column
Code Ape Tech Column
Apr 29, 2022 · Backend Development

AKF Microservice Splitting Principles: X‑Axis, Y‑Axis, and Z‑Axis Partitioning for Cluster Design

The article explains how to address single‑node limitations by clustering servers using the AKF microservice splitting principle, detailing X‑axis horizontal replication, Y‑axis business‑oriented partitioning, and Z‑axis data‑source segmentation to achieve scalable, fault‑tolerant backend architectures.

AKF principleBackend ArchitectureMicroservices
0 likes · 5 min read
AKF Microservice Splitting Principles: X‑Axis, Y‑Axis, and Z‑Axis Partitioning for Cluster Design
Python Programming Learning Circle
Python Programming Learning Circle
Apr 14, 2022 · Artificial Intelligence

Top Clustering Algorithms in Python with scikit-learn: A Comprehensive Tutorial

This tutorial explains clustering as an unsupervised learning task, outlines why no single algorithm fits all data, and provides step‑by‑step Python code using scikit‑learn to install the library, generate synthetic datasets, and apply ten popular clustering algorithms with visualizations.

PythonUnsupervised Learningclustering
0 likes · 21 min read
Top Clustering Algorithms in Python with scikit-learn: A Comprehensive Tutorial
Top Architect
Top Architect
Feb 21, 2022 · Databases

Comprehensive Redis Technical Guide and Interview Q&A

This extensive guide covers Redis fundamentals, data structures, persistence mechanisms, clustering, sentinel high‑availability, performance tuning, common pitfalls, and practical use‑case implementations, providing detailed answers and code examples for interview preparation and real‑world development.

MessageQueuePersistencecaching
0 likes · 66 min read
Comprehensive Redis Technical Guide and Interview Q&A
Aotu Lab
Aotu Lab
Feb 17, 2022 · Artificial Intelligence

How DBSCAN Clustering Powers Automatic Layout Generation in Front‑End Design

This article explains the DBSCAN density‑based clustering algorithm, its core concepts, parameters, and step‑by‑step implementation, then shows how dynamically derived eps values enable the algorithm to group design‑draft modules for automatic front‑end code generation, improving development efficiency.

DBSCANclusteringdensity-based clustering
0 likes · 11 min read
How DBSCAN Clustering Powers Automatic Layout Generation in Front‑End Design
ITPUB
ITPUB
Jan 21, 2022 · Backend Development

Designing Scalable High‑Concurrency Architecture for E‑Commerce Platforms

This article explains how to design and evolve a high‑concurrency server architecture for e‑commerce applications, covering load balancing, database clustering, NoSQL caching, CDN static assets, message‑queue based asynchronous processing, tiered caching, service‑oriented decomposition, redundancy, and automation to ensure reliable performance under massive traffic spikes.

cachingclusteringhigh concurrency
0 likes · 21 min read
Designing Scalable High‑Concurrency Architecture for E‑Commerce Platforms
IT Architects Alliance
IT Architects Alliance
Jan 15, 2022 · Backend Development

How to Build a Million‑Message‑Per‑Second RabbitMQ Cluster: Lessons from Google and Real‑World Experiments

This article explains the fundamentals of RabbitMQ, compares normal and mirrored cluster modes, details Google’s large‑scale test setup, and walks through advanced plugins such as sharding, consistent‑hash exchange, federation, and high‑availability strategies for achieving million‑level message throughput.

BackendMessage QueueRabbitMQ
0 likes · 24 min read
How to Build a Million‑Message‑Per‑Second RabbitMQ Cluster: Lessons from Google and Real‑World Experiments
IT Architects Alliance
IT Architects Alliance
Jan 14, 2022 · Operations

Scaling RabbitMQ to Million‑Message Throughput: Architecture, Plugins, and High‑Availability Practices

This article explains how to horizontally scale RabbitMQ clusters, use sharding and federation plugins, configure mirror queues and other high‑availability features, and apply practical patterns such as confirms, retries, and delayed delivery to achieve million‑level message throughput in production environments.

FederationMessage QueueRabbitMQ
0 likes · 23 min read
Scaling RabbitMQ to Million‑Message Throughput: Architecture, Plugins, and High‑Availability Practices
Architecture Digest
Architecture Digest
Jan 13, 2022 · Backend Development

Scaling RabbitMQ to Million‑Message Throughput: Architecture, Sharding, Federation, and High‑Availability Practices

This article explains how to horizontally scale RabbitMQ clusters to handle millions of messages per second by leveraging cluster modes, mirror queues, sharding plugins, consistent‑hash exchanges, federation, and high‑availability configurations, while also covering practical scenarios such as retries, delayed tasks, and Spring AMQP integration.

FederationMessage QueueRabbitMQ
0 likes · 22 min read
Scaling RabbitMQ to Million‑Message Throughput: Architecture, Sharding, Federation, and High‑Availability Practices
21CTO
21CTO
Jan 12, 2022 · Backend Development

How to Build a Million‑Message‑Per‑Second RabbitMQ Service

This article explains how to horizontally scale RabbitMQ clusters, use sharding and federation plugins, and configure high‑availability and reliability features to achieve million‑level message throughput in demanding production environments.

Message QueueRabbitMQclustering
0 likes · 24 min read
How to Build a Million‑Message‑Per‑Second RabbitMQ Service
Architects Research Society
Architects Research Society
Jan 7, 2022 · Databases

High‑Availability Clustering Solutions for PostgreSQL

This article explains the concepts of high availability, continuous recovery, and standby databases, then reviews various PostgreSQL clustering options such as DRBD, ClusterControl, Rubyrep, Pgpool‑II, Bucardo, Postgres‑XC, Citus, and PostgresXL, highlighting their features, advantages, and drawbacks.

ClusterControlDRBDDatabase Replication
0 likes · 16 min read
High‑Availability Clustering Solutions for PostgreSQL
Wukong Talks Architecture
Wukong Talks Architecture
Dec 26, 2021 · Backend Development

Core Concepts and Common Patterns of RabbitMQ

This article explains the purpose of message queues, outlines RabbitMQ's architecture, describes common MQ patterns such as simple, work, fan‑out, direct, and topic, and discusses reliability features like transactions, confirms, dead‑letter queues, TTL, clustering, and ordering guarantees.

Dead Letter QueueMessage QueueMessage Routing
0 likes · 22 min read
Core Concepts and Common Patterns of RabbitMQ
IT Architects Alliance
IT Architects Alliance
Oct 26, 2021 · Backend Development

An Overview of RabbitMQ: Architecture, Message Flow, Persistence, Clustering, and Flow Control

This article provides a comprehensive technical overview of RabbitMQ, covering its origins, core components, message routing patterns, persistence mechanisms, delivery modes, RPC support, clustering design, mirrored queues, and flow‑control strategies, while illustrating concepts with code examples and diagrams.

AMQPFlow ControlMessage Queue
0 likes · 13 min read
An Overview of RabbitMQ: Architecture, Message Flow, Persistence, Clustering, and Flow Control
IT Architects Alliance
IT Architects Alliance
Sep 25, 2021 · Big Data

Top 10 Classic Data Mining Algorithms and Their Core Characteristics

This article introduces the ten classic data‑mining algorithms selected by IEEE ICDM—C4.5, k‑Means, SVM, Apriori, EM, PageRank, AdaBoost, k‑NN, Naive Bayes, and CART—explaining their main ideas, advantages, and typical applications for readers seeking a solid foundation in data analysis.

Algorithmsclassificationclustering
0 likes · 8 min read
Top 10 Classic Data Mining Algorithms and Their Core Characteristics
Full-Stack Internet Architecture
Full-Stack Internet Architecture
Sep 22, 2021 · Databases

Common Redis Interview Questions and Answers

This article provides a comprehensive list of typical Redis interview questions covering its features, performance, data structures, supported data types, common use cases, eviction policies, persistence methods, clustering, high‑availability mechanisms, transaction handling, and comparisons with local caches like Guava and Caffeine.

CacheData StructuresPersistence
0 likes · 15 min read
Common Redis Interview Questions and Answers
DataFunTalk
DataFunTalk
Sep 18, 2021 · Artificial Intelligence

Unsupervised Algorithms for Fraud Detection in Huya's Risk Control System

This article presents Huya's exploration of unsupervised learning techniques for risk control, detailing business risk scenarios, black‑market attack vectors, limitations of traditional defenses, and the design, implementation, and evaluation of graph‑based and density‑based clustering methods to automatically discover and mitigate fraudulent user groups.

AIHuyaUnsupervised Learning
0 likes · 11 min read
Unsupervised Algorithms for Fraud Detection in Huya's Risk Control System
iQIYI Technical Product Team
iQIYI Technical Product Team
May 21, 2021 · Big Data

Design and Implementation of iQIYI's User Feedback Analysis System

iQIYI built an in‑house user‑feedback analysis system that automatically ingests multi‑channel data, classifies and clusters issues, assesses feedback quality, localizes problems, and streamlines repair closure, boosting recall accuracy, alarm precision, closure rates and reducing cycle time across business lines to enhance user experience.

AIBig Dataclassification
0 likes · 15 min read
Design and Implementation of iQIYI's User Feedback Analysis System
ITPUB
ITPUB
May 17, 2021 · Operations

How DBSCAN Clustering and Bayesian Inference Boost Root‑Cause Detection in Securities Trading Systems

This article describes how a Chinese securities firm applied big‑data‑driven clustering and Bayesian methods to automate root‑cause analysis of trading‑system anomalies, detailing the challenges, algorithmic designs, practical implementations, and evaluation results that demonstrate significant reductions in false alarms and faster recovery.

Bayesian inferenceOperationsRoot Cause Analysis
0 likes · 17 min read
How DBSCAN Clustering and Bayesian Inference Boost Root‑Cause Detection in Securities Trading Systems
dbaplus Community
dbaplus Community
May 16, 2021 · Operations

How DBSCAN Clustering and Bayesian Inference Enable Fast Root‑Cause Detection in Securities Trading Systems

This article details the challenges of root‑cause identification in high‑availability securities trading platforms and presents two intelligent‑operations solutions—DBSCAN‑based clustering and Bayesian inference—to quickly locate anomalies and improve recovery efficiency.

Bayesian inferenceDBSCANIntelligent Operations
0 likes · 17 min read
How DBSCAN Clustering and Bayesian Inference Enable Fast Root‑Cause Detection in Securities Trading Systems
Java Architect Essentials
Java Architect Essentials
Mar 31, 2021 · Databases

40 Common Redis Interview Questions and Answers

This article compiles 40 frequently asked Redis interview questions, covering its definition, data types, advantages, comparison with Memcached, persistence mechanisms, performance considerations, eviction policies, clustering, client libraries, security commands, usage scenarios, and best‑practice tips for high‑traffic production environments.

Persistenceclusteringdatabase
0 likes · 19 min read
40 Common Redis Interview Questions and Answers
ITFLY8 Architecture Home
ITFLY8 Architecture Home
Jan 19, 2021 · Backend Development

Mastering RabbitMQ: Core AMQP Concepts and Reliable Messaging Strategies

This article explains RabbitMQ’s AMQP fundamentals—including servers, connections, channels, messages, exchanges, queues, routing keys, and virtual hosts—while covering reliable delivery, message idempotency, consumer flow control, TTL, dead‑letter handling, clustering modes, and high‑availability solutions such as HAProxy and Keepalived.

AMQPMessage QueueRabbitMQ
0 likes · 17 min read
Mastering RabbitMQ: Core AMQP Concepts and Reliable Messaging Strategies
Code Ape Tech Column
Code Ape Tech Column
Jan 15, 2021 · Backend Development

Mastering RabbitMQ: From AMQP Basics to High‑Availability Clusters

This article explains RabbitMQ's AMQP fundamentals, exchange types, reliable delivery mechanisms, idempotency strategies, consumer flow control, TTL and dead‑letter handling, as well as clustering, federation, HAProxy and Keepalived solutions for building a resilient messaging infrastructure.

AMQPConsumer Flow ControlMessage Queue
0 likes · 16 min read
Mastering RabbitMQ: From AMQP Basics to High‑Availability Clusters
Architect
Architect
Jan 12, 2021 · Backend Development

Understanding RabbitMQ: Architecture, Message Routing, Persistence, Clustering, and Flow Control

This article explains RabbitMQ’s origins, core components, message publishing and consumption patterns, routing modes, persistence mechanisms, delivery guarantees, RPC support, clustering and mirrored‑queue designs, as well as its flow‑control strategy, providing a comprehensive overview for backend developers.

BackendMessagingPersistence
0 likes · 13 min read
Understanding RabbitMQ: Architecture, Message Routing, Persistence, Clustering, and Flow Control
Architect
Architect
Jan 8, 2021 · Backend Development

Understanding RabbitMQ: AMQP Fundamentals, Exchange Types, Reliability Mechanisms, and High‑Availability Deployment

This article provides a comprehensive overview of RabbitMQ, covering AMQP core concepts, exchange and queue types, message reliability techniques such as confirms and returns, consumer flow‑control, TTL and dead‑letter handling, as well as clustering, federation, and HAProxy/Keepalived high‑availability solutions.

AMQPBackendRabbitMQ
0 likes · 16 min read
Understanding RabbitMQ: AMQP Fundamentals, Exchange Types, Reliability Mechanisms, and High‑Availability Deployment
Programmer DD
Programmer DD
Dec 6, 2020 · Databases

Master Redis Interview: Persistence, Caching Strategies, Clustering & More

This comprehensive guide covers essential Redis interview topics, including persistence mechanisms (RDB and AOF), cache challenges such as avalanche, penetration and pre‑warming, data types and their use cases, internal structures, eviction policies, single‑thread performance, clustering solutions, distributed locking, transactions, and best practices for high‑traffic environments.

Persistencecachingclustering
0 likes · 25 min read
Master Redis Interview: Persistence, Caching Strategies, Clustering & More
Code Ape Tech Column
Code Ape Tech Column
Nov 30, 2020 · Backend Development

Master Redis: Persistence, Caching Pitfalls, Clustering & Performance Tips

This article provides a comprehensive guide to Redis for interview preparation, covering persistence mechanisms, common caching challenges such as avalanche and penetration, data types and their use cases, internal structures, expiration policies, single‑threaded performance, clustering options, distributed locking, and best‑practice solutions.

Data TypesPersistencecaching
0 likes · 27 min read
Master Redis: Persistence, Caching Pitfalls, Clustering & Performance Tips
Top Architect
Top Architect
Nov 29, 2020 · Backend Development

Software Architecture Patterns: Traditional Three‑Tier, Cluster, Distributed, and Microservice Architectures

This article explains common software architecture patterns—including traditional three‑tier, cluster (horizontal scaling), distributed (vertical splitting), and microservice (vertical division) architectures—detailing their components, advantages, drawbacks, and configuration examples such as Tomcat clustering with Nginx load balancing.

BackendDistributed SystemsMicroservices
0 likes · 9 min read
Software Architecture Patterns: Traditional Three‑Tier, Cluster, Distributed, and Microservice Architectures
Code Ape Tech Column
Code Ape Tech Column
Nov 26, 2020 · Backend Development

Understanding RabbitMQ: Architecture, Messaging Patterns, Persistence, Clustering, and Flow Control

This article provides a comprehensive overview of RabbitMQ, covering its origins, core components, message publishing and consumption, routing modes, persistence mechanisms, delivery guarantees, RPC support, clustering design, mirrored‑queue architecture, and flow‑control strategies for reliable backend messaging.

AMQPBackendDistributed Systems
0 likes · 13 min read
Understanding RabbitMQ: Architecture, Messaging Patterns, Persistence, Clustering, and Flow Control
Architects' Tech Alliance
Architects' Tech Alliance
Nov 1, 2020 · Fundamentals

Essential Storage Fundamentals: RAID, Snapshots, Backup, CDP, Clustering, and VTL Explained

This guide provides a concise overview of core storage concepts—including RAID levels, COW/ROW snapshot mechanisms, backup strategies, continuous data protection, clustering architectures, and virtual tape library (VTL) implementations—helping readers understand how each technology contributes to data reliability and availability.

BackupCDPRAID
0 likes · 4 min read
Essential Storage Fundamentals: RAID, Snapshots, Backup, CDP, Clustering, and VTL Explained
Beike Product & Technology
Beike Product & Technology
Oct 24, 2020 · Artificial Intelligence

FrameX: An AI System for Intelligent Floorplan Analysis and Applications

FrameX is an AI-powered platform developed by Beike’s Data Intelligence Center that leverages vector floorplan data to automatically tag, score, interpret, cluster, and retrieve housing layouts, supporting numerous business scenarios through a layered architecture of data, feature, and application layers.

AIDeep LearningFloorplan Analysis
0 likes · 9 min read
FrameX: An AI System for Intelligent Floorplan Analysis and Applications
Selected Java Interview Questions
Selected Java Interview Questions
Oct 17, 2020 · Databases

Redis Interview Questions and Answers: Persistence, Caching Issues, Data Types, Clustering, and More

This article provides a comprehensive overview of Redis interview topics, covering persistence mechanisms, cache avalanche and penetration problems, hot and cold data concepts, differences from Memcached, single‑thread performance, data structures, expiration policies, clustering solutions, distributed locks, transactions, and practical troubleshooting tips.

Data TypesPersistencecaching
0 likes · 23 min read
Redis Interview Questions and Answers: Persistence, Caching Issues, Data Types, Clustering, and More