NetEase Game Operations Platform
Author

NetEase Game Operations Platform

The NetEase Game Automated Operations Platform delivers stable services for thousands of NetEase titles, focusing on efficient ops workflows, intelligent monitoring, and virtualization.

81
Articles
0
Likes
166
Views
0
Comments
Recent Articles

Latest from NetEase Game Operations Platform

81 recent articles
NetEase Game Operations Platform
NetEase Game Operations Platform
Sep 19, 2022 · Artificial Intelligence

Applying AIOps to Game Operations: Roadmap, Anomaly Detection, and Fault Localization

This article describes NetEase's AIOps journey for game operations, explaining the Gartner definition of intelligent operations, the implementation roadmap, detailed anomaly‑detection techniques for business, performance, and log data, and a comprehensive fault‑localization workflow that combines resource, code, and historical analysis.

Fault Localizationaiopsanomaly detection
0 likes · 12 min read
Applying AIOps to Game Operations: Roadmap, Anomaly Detection, and Fault Localization
NetEase Game Operations Platform
NetEase Game Operations Platform
Jun 10, 2022 · Databases

Apache Doris Deployment and Optimization at NetEase Interactive Entertainment

This article details NetEase Interactive Entertainment's adoption of Apache Doris for large‑scale game data analytics, covering background, Doris architecture, cluster governance, tablet and compaction tuning, scaling strategies, monitoring, alerting, and fault‑handling practices to improve performance and stability.

Apache DorisCompaction TuningMonitoring
0 likes · 22 min read
Apache Doris Deployment and Optimization at NetEase Interactive Entertainment
NetEase Game Operations Platform
NetEase Game Operations Platform
May 9, 2022 · Operations

Intelligent Log Classification and Anomaly Detection: Design and Implementation

This article presents a two‑stage streaming log classification system using an improved prefix‑tree and longest‑common‑subsequence algorithms, along with a statistical unsupervised anomaly detection method that leverages chi‑square aggregation and box‑plot scoring to reduce false alarms and accelerate template convergence.

LCS algorithmlog classificationoperations
0 likes · 11 min read
Intelligent Log Classification and Anomaly Detection: Design and Implementation
NetEase Game Operations Platform
NetEase Game Operations Platform
Apr 23, 2022 · Artificial Intelligence

Design and Implementation of an AI‑Driven Intelligent Operations Platform for Game Services

The article presents a comprehensive overview of an AI‑ops platform for game operations, covering its background, roadmap, team structure, business scenarios, anomaly‑detection techniques, platform architecture, detection workflow, model deployment, and intelligent fault‑management strategies.

Intelligent Operationsfault managementplatform architecture
0 likes · 20 min read
Design and Implementation of an AI‑Driven Intelligent Operations Platform for Game Services
NetEase Game Operations Platform
NetEase Game Operations Platform
Sep 18, 2021 · Big Data

StreamflySQL: NetEase Games’ Journey from Template JAR to SQL Gateway for Flink SQL Platformization

This article details NetEase Games’ evolution of its Flink SQL platform, from the early StreamflySQL v1 template‑JAR approach to the v2 SQL‑Gateway architecture, discussing design decisions, challenges such as metadata persistence, multi‑tenant security, horizontal scaling, and job state management.

FlinkReal-time analyticsSQL
0 likes · 17 min read
StreamflySQL: NetEase Games’ Journey from Template JAR to SQL Gateway for Flink SQL Platformization
NetEase Game Operations Platform
NetEase Game Operations Platform
May 22, 2021 · Big Data

Comprehensive Overview and Source Code Analysis of NetEase Spark Kyuubi

This article systematically introduces NetEase Kyuubi, an open‑source high‑performance JDBC and SQL execution engine built on Apache Spark, covering its background, core architecture, service discovery, session and operation management, startup processes, and key source‑code implementations with detailed code examples.

Apache ThriftDistributed ComputingKyuubi
0 likes · 47 min read
Comprehensive Overview and Source Code Analysis of NetEase Spark Kyuubi
NetEase Game Operations Platform
NetEase Game Operations Platform
Jan 9, 2021 · Operations

Real-Time Log Intelligent Classification Practice

This article describes how NetEase built a real‑time log intelligent classification system using Flink and AI algorithms, detailing the challenges of massive log volumes, the Drain template‑extraction method, algorithm workflow, performance results, and a practical case study that demonstrates reduced alert storms and faster issue diagnosis.

AIDrain algorithmFlink
0 likes · 15 min read
Real-Time Log Intelligent Classification Practice
NetEase Game Operations Platform
NetEase Game Operations Platform
Aug 8, 2020 · Backend Development

Debugging “Instance XXX is not bound to a Session” Errors in Gevent‑Enabled Flask APIs with SQLAlchemy

This article analyzes the intermittent “Instance XXX is not bound to a Session” error that occurs after converting a Flask‑SQLAlchemy endpoint from serial to multithreaded/gevent concurrency, reproduces the issue with test code, explains the root cause in session handling, and provides a concrete fix by patching gevent before session initialization.

FlaskPythonSQLAlchemy
0 likes · 6 min read
Debugging “Instance XXX is not bound to a Session” Errors in Gevent‑Enabled Flask APIs with SQLAlchemy
NetEase Game Operations Platform
NetEase Game Operations Platform
Jul 11, 2020 · Operations

GitLab Code Review Workflow and Configuration Guide

This article provides a comprehensive guide to GitLab code review, covering environment setup, Git flow basics, review methods, configuration of approvals, templates, issue linking, Code Quality integration with CI/CD, step‑by‑step practice, and recommendations for choosing between Community and Enterprise editions.

CI/CDDevOpsGit Flow
0 likes · 10 min read
GitLab Code Review Workflow and Configuration Guide