Topic

Batch Processing

Collection size
121 articles
Page 5 of 7
Code Ape Tech Column
Code Ape Tech Column
Dec 29, 2022 · Backend Development

Batch Request Merging in Spring Boot to Reduce Database Connection Overhead

The article explains how to merge multiple user queries into a single batch request using a queue, scheduled thread pool and CompletableFuture in Spring Boot, thereby reducing database connections and improving resource utilization while handling high‑concurrency scenarios.

Batch ProcessingCompletableFutureConcurrency
0 likes · 13 min read
Batch Request Merging in Spring Boot to Reduce Database Connection Overhead
Architect
Architect
Apr 20, 2023 · Backend Development

Common Interface Performance Optimization Strategies

This article summarizes practical techniques for reducing API latency, including batch processing, asynchronous execution, caching, pre‑processing, pooling, parallelization, indexing, transaction size control, program structure refactoring, deep pagination handling, SQL tuning, and proper lock granularity.

API optimizationBatch ProcessingIndexing
0 likes · 9 min read
Common Interface Performance Optimization Strategies
Top Architect
Top Architect
Jun 7, 2022 · Backend Development

Request Merging Techniques: Hystrix Collapser, Custom BatchCollapser, and ConcurrentHashMultiset

This article compares three request‑merging approaches—Hystrix Collapser, a custom BatchCollapser implementation, and Guava’s ConcurrentHashMultiset—explaining their configurations, code examples, and suitable scenarios for reducing downstream load and improving system throughput while also highlighting performance trade‑offs and practical tips for integration in Spring‑Boot services.

BackendBatch ProcessingGuava
0 likes · 14 min read
Request Merging Techniques: Hystrix Collapser, Custom BatchCollapser, and ConcurrentHashMultiset
DataFunSummit
DataFunSummit
May 17, 2024 · Big Data

Comprehensive Hudi Real-Time Data Lake Ingestion Solutions

This article presents a complete guide to Hudi-based real-time data lake ingestion, covering overall data integration architecture, batch and streaming ingestion strategies, advanced table design, and practical recommendations for handling challenges such as deduplication, latency, partitioning, and performance optimization.

Batch ProcessingHudiReal-time Ingestion
0 likes · 12 min read
Comprehensive Hudi Real-Time Data Lake Ingestion Solutions
DataFunSummit
DataFunSummit
Jan 8, 2023 · Big Data

Streaming‑Batch Integrated Real‑time Multi‑dimensional Analysis

This article presents a comprehensive overview of evolving big‑data architectures—from classic offline warehouses to Lambda and Kappa models—and details a streaming‑batch integrated solution that addresses latency, data freshness, and multi‑table join challenges to achieve minute‑level real‑time multi‑dimensional analytics.

Batch ProcessingKappa architecturebig data
0 likes · 18 min read
Streaming‑Batch Integrated Real‑time Multi‑dimensional Analysis
DataFunTalk
DataFunTalk
Feb 2, 2023 · Big Data

SeaTunnel: Design Goals, Current Status, Architecture, and Future Roadmap

This article provides a comprehensive overview of Apache SeaTunnel, covering its design objectives, current capabilities such as multi‑engine support and extensive connector ecosystem, detailed architecture including engine‑independent APIs and execution flows, and outlines the upcoming roadmap to expand connectors, launch a visual web UI, and introduce a dedicated SeaTunnel Engine.

ApacheBatch ProcessingConnector
0 likes · 12 min read
SeaTunnel: Design Goals, Current Status, Architecture, and Future Roadmap
DataFunTalk
DataFunTalk
Aug 25, 2022 · Big Data

Applying OpenMLDB for Efficient AI Toolchain and Data‑Driven Architecture at Akulaku

This article presents Akulaku’s practical experience with OpenMLDB, describing the company’s data‑driven requirements, the design of a unified stream‑batch architecture, implementation details across offline, online and RocksDB modes, and future recommendations for high‑performance, scenario‑agnostic big‑data processing.

AIBatch ProcessingOpenMLDB
0 likes · 17 min read
Applying OpenMLDB for Efficient AI Toolchain and Data‑Driven Architecture at Akulaku
DataFunTalk
DataFunTalk
Nov 17, 2020 · Artificial Intelligence

Alink: A Flink‑Based Machine Learning Platform – Overview, Features, and Quick‑Start Guide

This article introduces Alink, Alibaba's open‑source machine‑learning platform built on Flink, explains its core algorithms, performance comparison with Spark ML, version‑wise feature evolution, and provides practical quick‑start instructions for both Java (Maven) and Python (PyAlink) users, including data source handling, type conversion components, unified file‑system operations, and an overview of its FM algorithm implementation.

AlinkBatch ProcessingFactorization Machines
0 likes · 13 min read
Alink: A Flink‑Based Machine Learning Platform – Overview, Features, and Quick‑Start Guide
Architect's Guide
Architect's Guide
Nov 3, 2023 · Databases

Performance Evaluation of Inserting Hundreds of Millions of Records into MySQL Using Java, MyBatis, and JDBC

This article experimentally compares three Java‑based insertion strategies—MyBatis without transaction, plain JDBC with and without transaction, and JDBC batch processing—with and without transactions, measuring how they affect the time required to insert tens of millions of rows into MySQL.

Batch ProcessingDatabase OptimizationJDBC
0 likes · 13 min read
Performance Evaluation of Inserting Hundreds of Millions of Records into MySQL Using Java, MyBatis, and JDBC
Python Programming Learning Circle
Python Programming Learning Circle
May 27, 2024 · Fundamentals

Batch Image Compression with Python: A 20‑Line Script Tutorial

This article explains how designers and developers can use a short Python script to automatically compress large numbers of high‑resolution images, covering the problem background, required environment, step‑by‑step implementation, and the impressive compression results achieved.

Batch ProcessingTutorialautomation
0 likes · 3 min read
Batch Image Compression with Python: A 20‑Line Script Tutorial
Python Programming Learning Circle
Python Programming Learning Circle
Jul 8, 2021 · Operations

Python Office Automation Techniques: Document Conversion, Geocoding, Distance Calculation, Batch File Processing, and Outlook Email Automation

This article presents a collection of Python scripts for office automation, covering Word doc‑to‑docx conversion, batch address geocoding using Baidu API, distance computation with geopy, coordinate conversion between Baidu and Gaode, Excel merging, Word‑to‑PDF conversion, extracting tables from Word, and bulk Outlook email sending.

Batch ProcessingFile ConversionOffice Automation
0 likes · 11 min read
Python Office Automation Techniques: Document Conversion, Geocoding, Distance Calculation, Batch File Processing, and Outlook Email Automation
Python Programming Learning Circle
Python Programming Learning Circle
Jun 25, 2021 · Artificial Intelligence

Batch Image Translation Demo Using Youdao OCR API with Python

This article presents a step‑by‑step Python demo that uses Youdao's OCR translation API to batch‑process cosmetic product images, covering API key setup, request parameters, signature generation, GUI implementation with Tkinter, and code snippets for file selection, result storage, and API invocation.

AIBatch ProcessingImage Translation
0 likes · 10 min read
Batch Image Translation Demo Using Youdao OCR API with Python
Vipshop Quality Engineering
Vipshop Quality Engineering
Nov 29, 2018 · Backend Development

Boost Backend Batch Testing Efficiency with a Comparative Test Platform

The article introduces a comparative test platform for financial core systems that automates regression testing of batch jobs, dramatically reducing test cycles from days to under one, while ensuring billing calculations remain accurate across evolving business scenarios.

Batch Processingbackend testingfinancial systems
0 likes · 12 min read
Boost Backend Batch Testing Efficiency with a Comparative Test Platform
Efficient Ops
Efficient Ops
Jun 13, 2023 · Operations

How to Automate Batch Job Retries and Eliminate Midnight Outages

This article explores a real‑world scenario where a support manager faces nightly batch job interruptions, analyzes common database and environment failures, and presents a systematic redesign of the batch framework and executor to enable automatic retry, reducing manual intervention and improving operational reliability.

Batch ProcessingException HandlingMySQL
0 likes · 7 min read
How to Automate Batch Job Retries and Eliminate Midnight Outages
ByteDance Cloud Native
ByteDance Cloud Native
Apr 20, 2023 · Cloud Native

How Dragonfly Accelerates Image Distribution and Scales Kubernetes Batch Processing

At KubeCon+CloudNativeCon 2023 in Amsterdam, Volcano Engine and ByteDance presented two technical sessions covering Dragonfly's P2P image distribution best practices and large‑scale Kubernetes batch processing strategies, offering deep insights and real‑world implementations for cloud‑native developers.

Batch ProcessingCloud NativeDragonfly
0 likes · 4 min read
How Dragonfly Accelerates Image Distribution and Scales Kubernetes Batch Processing
Qunar Tech Salon
Qunar Tech Salon
Jan 21, 2017 · Backend Development

Message Consumption Patterns and Best Practices in Qunar's QMQ

This article shares Qunar's practical experiences with message-driven architecture, detailing consumer handling of duplicate messages, ordering, concurrency control, asynchronous processing, and batch strategies, and presents concrete solutions such as idempotent checks, deduplication tables, versioning, and QMQ's built‑in executors.

Batch ProcessingConcurrencyMessage Queue
0 likes · 18 min read
Message Consumption Patterns and Best Practices in Qunar's QMQ
IT Architects Alliance
IT Architects Alliance
Jan 3, 2023 · Fundamentals

Performance Optimization Techniques: Indexing, Caching, Compression, Prefetching, Throttling, and Batch Processing

The article explores a wide range of performance‑optimization strategies—including indexing, caching, compression, prefetching, peak‑shaving, and batch processing—explaining their trade‑offs, practical applications, and how they relate to hardware latency and system design in modern computing environments.

Batch ProcessingIndexingPerformance
0 likes · 34 min read
Performance Optimization Techniques: Indexing, Caching, Compression, Prefetching, Throttling, and Batch Processing
Architecture Digest
Architecture Digest
Nov 7, 2022 · Backend Development

Comprehensive Guide to Software Performance Optimization: Indexing, Compression, Caching, Prefetching, Throttling, and Batch Processing

This article presents a thorough, multi‑part exploration of software performance optimization techniques—including indexing, compression, caching, prefetching, peak‑shaving, and batch processing—explaining their principles, trade‑offs, practical applications, and how they relate to hardware constraints and system design.

Batch ProcessingIndexingPerformance
0 likes · 36 min read
Comprehensive Guide to Software Performance Optimization: Indexing, Compression, Caching, Prefetching, Throttling, and Batch Processing
Architecture Digest
Architecture Digest
Jan 21, 2022 · Big Data

Building a Real-Time Data Warehouse with Flink: Architecture, Core Concepts, and Practical Implementation

This article explains how to build a unified stream‑batch real‑time data warehouse using FlinkSQL, covering prerequisite knowledge, five core concepts, two implementation approaches, a comparison of traditional versus real‑time architectures, and a comprehensive hands‑on example, illustrated with diagrams.

Batch ProcessingFlinkdata architecture
0 likes · 6 min read
Building a Real-Time Data Warehouse with Flink: Architecture, Core Concepts, and Practical Implementation