Tagged articles
15 articles
Page 1 of 1
StarRing Big Data Open Lab
StarRing Big Data Open Lab
Feb 24, 2023 · Big Data

What Makes MPP Databases the Powerhouse Behind Modern Data Analytics?

MPP (Massive Parallel Processing) databases, designed for large‑scale analytical workloads, use distributed, shared‑nothing architectures with multiple control and compute nodes, offering high scalability, diverse data‑sharding strategies, and powerful SQL compatibility, as illustrated by vendors like Teradata, Vertica, Greenplum, and emerging open‑source solutions.

Big DataGreenplumMPP
0 likes · 15 min read
What Makes MPP Databases the Powerhouse Behind Modern Data Analytics?
Aikesheng Open Source Community
Aikesheng Open Source Community
Jan 30, 2023 · Databases

Greenplum Segment Failure Diagnosis and Recovery Procedures

This article explains how to simulate and diagnose segment failures in a Greenplum cluster, including identifying master, segment, and tablespace issues, generating recovery configuration files, and using gprecoverseg and gpstate commands to restore segment roles and ensure all nodes are operational.

Database RecoveryGreenplumSegment Failure
0 likes · 24 min read
Greenplum Segment Failure Diagnosis and Recovery Procedures
Aikesheng Open Source Community
Aikesheng Open Source Community
Dec 5, 2022 · Databases

Greenplum Standby Failure Diagnosis and Recovery Procedure

This article describes how to identify and fix Greenplum standby segment failures by inspecting segment status, comparing missing files, generating a recovery configuration, executing gprecoverseg commands, and verifying that all segments return to normal operation without service interruption.

GreenplumRecoverySegment
0 likes · 17 min read
Greenplum Standby Failure Diagnosis and Recovery Procedure
Aikesheng Open Source Community
Aikesheng Open Source Community
Nov 28, 2022 · Databases

Recovering a Deleted Greenplum Standby Node: Analysis and Step‑by‑Step Procedure

This article describes the background, architecture, and a detailed troubleshooting process for restoring a mistakenly deleted Greenplum standby data directory, including status checks, configuration cleanup, standby initialization, verification commands, and additional recommendations for handling master failures in a distributed database cluster.

Database AdministrationGreenplumStandby Recovery
0 likes · 10 min read
Recovering a Deleted Greenplum Standby Node: Analysis and Step‑by‑Step Procedure
Alibaba Cloud Developer
Alibaba Cloud Developer
Jan 12, 2022 · Databases

How to Slash Cloud Data Warehouse Costs with ADB PG Disk Optimization

This article explains how enterprises can dramatically reduce cloud‑native data‑warehouse expenses by understanding ADB PG/Greenplum architecture, applying disk‑reservation and lock‑write safeguards, and implementing practical optimizations such as table compression, hot‑cold tiering, vacuuming, redundant‑index cleanup, replication conversion, and isolated temporary‑table spaces.

ADB PGCost reductionGreenplum
0 likes · 25 min read
How to Slash Cloud Data Warehouse Costs with ADB PG Disk Optimization
dbaplus Community
dbaplus Community
Aug 1, 2021 · Databases

Scaling and Optimizing a Greenplum Data Warehouse Cluster: Key Lessons

This article details the background, goals, design decisions, deployment steps, kernel tuning, fault‑recovery testing, performance optimization, and TPCH benchmark results of a Greenplum data‑warehouse cluster expansion, highlighting practical challenges and concrete solutions for large‑scale database environments.

Cluster DeploymentGreenplumLinux
0 likes · 20 min read
Scaling and Optimizing a Greenplum Data Warehouse Cluster: Key Lessons
Big Data Technology & Architecture
Big Data Technology & Architecture
Apr 20, 2021 · Fundamentals

Consistent Hashing and Jump Consistent Hash: Concepts, Implementation, and Use Cases

This article explains the fundamentals of consistent hashing, compares the classic ring‑based method with the more efficient jump consistent hash algorithm, provides reference implementations in C++, discusses their time‑complexity and practical trade‑offs, and shows how they are applied in systems such as Greenplum.

Distributed SystemsGreenplumalgorithm
0 likes · 11 min read
Consistent Hashing and Jump Consistent Hash: Concepts, Implementation, and Use Cases
Qunar Tech Salon
Qunar Tech Salon
Oct 19, 2017 · Databases

Avoiding OOM in Greenplum: Memory Configuration and Resource Queue Best Practices

This article explains why Greenplum can encounter out‑of‑memory (OOM) errors, lists the typical causes, and provides detailed recommendations for memory‑related OS and database parameters, statement_mem settings, and the proper use of Greenplum resource queues to achieve stable and efficient cluster operation.

Database ConfigurationGreenplumMemory Management
0 likes · 13 min read
Avoiding OOM in Greenplum: Memory Configuration and Resource Queue Best Practices
Baidu Waimai Technology Team
Baidu Waimai Technology Team
Apr 20, 2017 · Databases

Greenplum (GPDB) Architecture, Features, and Operational Tools Overview

This article explains Greenplum's MPP architecture, master‑segment design, high‑availability, interconnect network, rich management tools, parallel query planning, data loading techniques, and additional capabilities such as LDAP authentication and resource queues, demonstrating why it is a strong next‑generation big‑data query engine.

Big DataGreenplumMPP
0 likes · 15 min read
Greenplum (GPDB) Architecture, Features, and Operational Tools Overview
Baidu Waimai Technology Team
Baidu Waimai Technology Team
Mar 23, 2017 · Databases

Design and Implementation of the "Little Boy" Greenplum Optimization and Operations Platform

This article introduces the architecture, key modules, and implementation details of the Little Boy platform, a Greenplum optimization and operations system that parses SQL, applies index and distribution‑key tuning, manages resources, and outlines future enhancements for large‑scale data warehouses.

Big DataDatabase OptimizationGreenplum
0 likes · 15 min read
Design and Implementation of the "Little Boy" Greenplum Optimization and Operations Platform
360 Zhihui Cloud Developer
360 Zhihui Cloud Developer
Mar 9, 2017 · Databases

Master Greenplum Table Design & Performance Optimization: Practical Tips

This article explains what Greenplum is, its MPP shared‑nothing architecture, and provides concrete table‑design principles, distribution‑column strategies, indexing guidance, vacuum and table‑rebuilding techniques, as well as SQL, join, insert, update/delete, and resource‑queue optimizations for better performance.

Database designDistributed TablesGreenplum
0 likes · 9 min read
Master Greenplum Table Design & Performance Optimization: Practical Tips
Architects' Tech Alliance
Architects' Tech Alliance
Nov 19, 2016 · Databases

An Overview of Greenplum Database Architecture and Core Components

Greenplum is an open‑source, massively parallel processing (MPP) database built on PostgreSQL, offering ANSI‑SQL compliance, distributed ACID transactions, linear scalability, polymorphic storage, advanced optimizers, and extensive ecosystem integrations, making it suitable for large‑scale data warehousing, analytics, and big‑data workloads.

Data WarehousingGreenplumMPP
0 likes · 15 min read
An Overview of Greenplum Database Architecture and Core Components
dbaplus Community
dbaplus Community
Feb 22, 2016 · Databases

Mastering Greenplum: Planning, Data Modeling, and Daily Ops Best Practices

This article delivers a comprehensive guide to Greenplum deployment, covering early architecture planning, data‑model design, daily maintenance best practices, system‑table management, diagnostic tools like gpcheckcat, and detailed troubleshooting techniques for persistent tables and other common issues.

GreenplumMPPSystem Tables
0 likes · 13 min read
Mastering Greenplum: Planning, Data Modeling, and Daily Ops Best Practices