Big Data 4 min read

How to Evaluate Battery Storage Health with Big Data and LightGBM

This report details a university big‑data project that builds a data‑driven framework for assessing lithium‑ion battery storage health, cleaning operational data, detecting abnormal cells with DBSCAN, and predicting SOC/SOH using LightGBM, while highlighting findings, limitations, and future improvements.

Data Party THU
Data Party THU
Data Party THU
How to Evaluate Battery Storage Health with Big Data and LightGBM

Project Background

Developing advanced storage technologies is crucial for achieving green energy transition and carbon‑neutral goals. In 2024, new‑type storage (mainly lithium‑ion batteries) accounts for nearly 50% of installed capacity, but its lifespan and safety need systematic evaluation.

Specific Requirements

The project focuses on two operational sites—Datang Leizhou and a sodium‑ion plant in Hubei. Tasks include data cleaning, analysis, lifespan and status prediction, and producing a final assessment report.

Overall Framework

The raw data are processed and split into two modules: (1) operational state assessment to detect anomalies, and (2) SOC/SOH prediction for remaining useful life. The workflow is illustrated in the diagram below.

Consistency Evaluation

Consistency is judged by SOC ranging from 5% to 95% and SOH generally above 80%. Temperature and voltage are the primary indicators; derived metrics include average voltage, voltage range, standard deviation, and temperature statistics.

DBSCAN clustering isolates abnormal individual cells—identified as cells 13, 44, and 67 in the example. Visual inspection shows voltage dips in the faulty cells, confirming the anomalies.

SOC/SOH Prediction

Traditional electrochemical models are costly and require precise parameter identification. LSTM models need large datasets and are slow. The team selected a LightGBM model, achieving an R² score of 0.7, though some high‑variance regions remain under‑captured.

Effect Demonstration

The implemented system accepts Excel uploads, performs battery cluster consistency evaluation, isolates abnormal cells, and predicts SOC for individual cells.

Conclusion

The project successfully delivered a battery‑cluster consistency assessment, abnormal cell detection, and SOC/SOH remaining‑life prediction using big‑data techniques.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

Big DataDBSCANLightGBMSOCenergy storagebattery healthSOH
Data Party THU
Written by

Data Party THU

Official platform of Tsinghua Big Data Research Center, sharing the team's latest research, teaching updates, and big data news.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.