Tagged articles
8 articles
Page 1 of 1
Huolala Tech
Huolala Tech
Jun 25, 2024 · Big Data

How Huolala Built a Scalable Big Data Testing Platform to Cut Cycle Time by 70%

Huolala’s data testing platform tackles massive data volume, complexity, and quality challenges by automating test case generation, execution, monitoring, and alerting across multiple storage systems, dramatically reducing testing cycles from five days to 1.5 days and saving over 800 person-days.

SQLdata testingplatform architecture
0 likes · 19 min read
How Huolala Built a Scalable Big Data Testing Platform to Cut Cycle Time by 70%

Big Data Testing: Methods, Tool Selection, and Practical Implementation with Datacompy

This article introduces big data testing concepts, outlines common testing methods, evaluates the Python library Datacompy against alternatives, and details a practical implementation for large-scale data migration and validation, including configuration, volume comparison, content verification, and performance optimizations such as sorting and multithreading.

Data MigrationPythondata testing
0 likes · 10 min read
Big Data Testing: Methods, Tool Selection, and Practical Implementation with Datacompy
DevOps
DevOps
Oct 25, 2023 · Big Data

An Introduction to Big Data: Origins, Definitions, 5V Characteristics, Applications, Hadoop Architecture, and Testing Strategies

This article provides a comprehensive overview of big data, covering its origins, definitions, 5V characteristics, data formats, real‑world applications, Hadoop architecture, testing challenges, functional and performance testing strategies, and the skills required for effective big data testing.

5V CharacteristicsBig DataData Formats
0 likes · 35 min read
An Introduction to Big Data: Origins, Definitions, 5V Characteristics, Applications, Hadoop Architecture, and Testing Strategies
360 Quality & Efficiency
360 Quality & Efficiency
Aug 12, 2022 · Big Data

Testing Plan and Efficiency Strategies for a Data Download Refactoring Project

This article outlines the testing plan for a data download refactoring project involving over 400 metrics, describes automated CSV comparison scripts, evaluates single‑process, multithreaded, and multiprocess approaches with shared memory, and provides practical recommendations for improving verification efficiency and performance.

CSV comparisonPerformance OptimizationPython multiprocessing
0 likes · 8 min read
Testing Plan and Efficiency Strategies for a Data Download Refactoring Project
Software Development Quality
Software Development Quality
Jul 14, 2022 · Fundamentals

Mastering Software Testing: From Basics to Advanced Strategies

This comprehensive guide walks you through the fundamentals of software testing, covering test analysis and design, mobile testing techniques, performance and security testing methods, algorithm validation, data quality assurance, and effective cross‑domain project management for modern development teams.

Software Testingalgorithm testingdata testing
0 likes · 68 min read
Mastering Software Testing: From Basics to Advanced Strategies
Liulishuo Tech Team
Liulishuo Tech Team
Oct 22, 2017 · Big Data

Data-CI: A SQL-Based Data Unit Testing Framework for ETL

The article introduces data-ci, a SQL‑driven unit testing framework that lets engineers write, organize, and automate data validation tests for ETL pipelines, providing assertions, failure callbacks, coverage reporting, and CI integration to improve data quality and reliability.

Big DataData QualityETL
0 likes · 9 min read
Data-CI: A SQL-Based Data Unit Testing Framework for ETL