Tagged articles
17 articles
Page 1 of 1
php Courses
php Courses
Oct 21, 2025 · Backend Development

Master PHP’s array_unique(): Remove Duplicates Efficiently

Learn how PHP’s built-in array_unique() function efficiently removes duplicate values from arrays, with detailed syntax, practical code examples, and real-world scenarios such as data cleaning, user input validation, and data aggregation to improve code readability and performance.

PHParray_uniquedata deduplication
0 likes · 7 min read
Master PHP’s array_unique(): Remove Duplicates Efficiently
php Courses
php Courses
Jul 4, 2025 · Backend Development

Mastering PHP’s array_unique(): Remove Duplicates Efficiently

This article explains how PHP’s built‑in array_unique() function works, shows syntax and code examples, and demonstrates practical scenarios such as data cleaning, user input validation, and re‑indexing arrays to efficiently remove duplicate values.

Code ExamplesPHParray_unique
0 likes · 6 min read
Mastering PHP’s array_unique(): Remove Duplicates Efficiently
php Courses
php Courses
Jul 29, 2024 · Backend Development

Using PHP’s array_unique() Function to Extract Unique Values

This article explains PHP’s built-in array_unique() function, detailing its syntax, behavior, and flags, and demonstrates how to remove duplicate values from arrays with practical code examples and real-world use cases such as data cleaning, input validation, and aggregation.

ArraysBackendarray_unique
0 likes · 6 min read
Using PHP’s array_unique() Function to Extract Unique Values
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Aug 30, 2023 · Big Data

How Transaction Table2.0 Cuts Data Deduplication Costs by 98% in MaxCompute

This article explains how Renliji's data warehouse team leveraged MaxCompute's Transaction Table2.0 to dramatically reduce incremental data deduplication costs and execution time, while also introducing efficient small‑file merging, time‑travel queries, and future data‑sync strategies for a high‑growth HR SaaS platform.

Big DataCost OptimizationMaxCompute
0 likes · 11 min read
How Transaction Table2.0 Cuts Data Deduplication Costs by 98% in MaxCompute

Probability Algorithms in Big Data: BloomFilter and Count-min Sketch Applications

The article explains how space‑efficient probabilistic structures such as BloomFilter and Count‑min Sketch enable large‑scale data deduplication, join pruning, real‑time idempotent filtering, and approximate top‑K analytics by trading modest accuracy loss for dramatically reduced storage and faster computation.

Big DataBloomFilterCount-Min Sketch
0 likes · 12 min read
Probability Algorithms in Big Data: BloomFilter and Count-min Sketch Applications
Top Architect
Top Architect
Nov 11, 2021 · Databases

How to Remove Duplicate Data in MySQL Tables Efficiently

This article explains why duplicate rows appear in MySQL tables, demonstrates how to identify them with SELECT queries, and provides step‑by‑step SQL solutions—including safe deletion of all duplicates or retaining a single record per group—using subqueries and temporary tables for efficient cleanup.

Duplicate DataSQLdata deduplication
0 likes · 5 min read
How to Remove Duplicate Data in MySQL Tables Efficiently
Laravel Tech Community
Laravel Tech Community
Sep 10, 2021 · Databases

How to Remove Duplicate Records in MySQL Tables

This article explains why duplicate rows appeared in production MySQL tables, demonstrates how to identify them with SELECT queries, and provides two SQL solutions—one to delete all duplicates and another to keep a single record per duplicated key—while preserving data integrity.

SQLdata deduplicationdatabase cleanup
0 likes · 5 min read
How to Remove Duplicate Records in MySQL Tables
Java Backend Technology
Java Backend Technology
Aug 10, 2021 · Databases

How to Efficiently Remove Duplicate Rows in MySQL Tables

This article explains why a naïve Python script for deleting duplicate MySQL rows is too slow, demonstrates the MySQL error caused by deleting from the same table you query, and provides two pure‑SQL solutions: one that removes all duplicates and another that keeps a single row per duplicate key.

SQLdata deduplicationdatabase cleanup
0 likes · 5 min read
How to Efficiently Remove Duplicate Rows in MySQL Tables
Architects' Tech Alliance
Architects' Tech Alliance
Nov 30, 2020 · Industry Insights

Cut Storage Costs and Boost Disaster Recovery with Deduplication and Encryption

Data deduplication eliminates redundant data blocks to lower storage and bandwidth costs, while source‑ and transmission‑level encryption safeguards data in transit and at rest; the article also compares hardware vs software deduplication, various storage architectures (DAS, SAN, NAS, object and distributed storage) and their trade‑offs.

BackupNASSAN
0 likes · 15 min read
Cut Storage Costs and Boost Disaster Recovery with Deduplication and Encryption
Aikesheng Open Source Community
Aikesheng Open Source Community
Feb 10, 2020 · Databases

Handling Duplicate Data in MySQL: Techniques and Examples

This article explains how to identify and remove various kinds of duplicate data in MySQL—including fully duplicated rows, records with duplicate non‑key columns, and unwanted whitespace inside fields—by using SQL statements, table cloning, OS utilities, and regular‑expression updates, with performance measurements for each method.

SQLdata cleaningdata deduplication
0 likes · 13 min read
Handling Duplicate Data in MySQL: Techniques and Examples
Qunar Tech Salon
Qunar Tech Salon
Apr 21, 2017 · Big Data

Ensuring Exact‑Once Semantics in Spark Streaming with Kafka: Offline Repair and Data Deduplication Strategies

This article explains why Spark Streaming combined with Kafka can only guarantee at‑least‑once delivery, outlines the challenges of delayed and out‑of‑order events, and presents practical offline‑repair, deduplication, and output‑format techniques—including code examples—to achieve exact‑once semantics in big‑data pipelines.

Exact-OnceHBaseHDFS
0 likes · 11 min read
Ensuring Exact‑Once Semantics in Spark Streaming with Kafka: Offline Repair and Data Deduplication Strategies
Architects' Tech Alliance
Architects' Tech Alliance
Apr 11, 2017 · Fundamentals

Technical Overview of Huawei Dorado V3 All‑Flash Storage: GRIP and FAST Features

This article provides a detailed technical analysis of Huawei's Dorado V3 all‑flash storage system, explaining its classification, the GRIP (Granular management, ROW, Inline deduplication & compression, Parity RAID) and FAST (FlashLink, Active‑active, Zero‑loss snapshot, RAID‑TP) technologies, and why these features are essential for modern flash‑oriented solutions.

All-Flash StorageDorado V3Flash Optimization
0 likes · 12 min read
Technical Overview of Huawei Dorado V3 All‑Flash Storage: GRIP and FAST Features