Tagged articles
6 articles
IT Services Circle
Feb 5, 2022 · Big Data

DataEase: Open‑Source Data Visualization Tool Based on SpringBoot, Apache Doris, and Kettle – Installation and Usage Guide

This article introduces DataEase, an open‑source BI platform built with SpringBoot, Apache Doris, and Kettle, explains its system and functional architecture, provides step‑by‑step installation commands and configuration details, and demonstrates how to create datasets, views, and dashboards for data analysis.

Apache Doris · BI · Data visualization
11 min read
Big Data Technology & Architecture
Nov 29, 2020 · Big Data

Installing and Configuring Kettle (Pentaho Data Integration) on Linux for Hadoop ETL

This step-by-step guide covers preparing a Linux environment (installing Java, the GNOME desktop, VNC remote access, and Chinese language support), downloading and extracting Kettle, configuring its startup scripts, creating desktop shortcuts, and managing the essential Kettle configuration files needed for Hadoop ETL development.

ETL · Installation · Kettle
37 min read
Big Data Technology & Architecture
Nov 28, 2020 · Big Data

ETL Fundamentals and Introduction to Kettle (Pentaho Data Integration)

This article gives an in-depth overview of ETL concepts (extraction, transformation, loading, and data warehouse architecture) and discusses Kettle (Pentaho Data Integration) in detail: its features, design principles, components, transformations, jobs, database connections, and metadata management, with practical examples for building robust data integration pipelines.

Data Integration · ETL · Kettle
57 min read
58 Tech
Sep 6, 2019 · Big Data

Architecture and Technical Implementation of the WMDA Data Analytics Platform

The article details WMDA's end‑to‑end data analytics architecture, covering zero‑event data collection, real‑time and offline processing pipelines built on Spark Streaming, Druid, Hadoop, Kettle, and TaskServer, and explains how these components collaborate to deliver comprehensive user behavior analysis.

Big Data · Druid · ETL
11 min read