Tagged articles
29 articles
Page 1 of 1
HyperAI Super Neural
HyperAI Super Neural
Dec 11, 2025 · Artificial Intelligence

Carnegie Team Uses Random Forests on 406 Samples to Detect 3.3‑Billion‑Year‑Old Life

An interdisciplinary Carnegie research team combined pyrolysis‑GC‑MS with supervised random‑forest machine learning on 406 modern and ancient samples, achieving up to 100% accuracy in distinguishing biogenic from abiotic organic matter and successfully identifying molecular biosignatures dating back 3.3 billion years.

PNASRandom Forestancient life
0 likes · 15 min read
Carnegie Team Uses Random Forests on 406 Samples to Detect 3.3‑Billion‑Year‑Old Life
Data Party THU
Data Party THU
Oct 30, 2025 · Artificial Intelligence

How to Generate Realistic Synthetic Data with Histograms and GMMs

This article explains two practical techniques—histogram‑based per‑column synthesis and Gaussian‑Mixture‑Model generation—for creating large, privacy‑preserving synthetic datasets that retain the statistical distributions and inter‑column relationships of the original data, and shows how to evaluate their quality.

Data GenerationGaussian mixture modelPython
0 likes · 27 min read
How to Generate Realistic Synthetic Data with Histograms and GMMs
Data STUDIO
Data STUDIO
Aug 26, 2025 · Artificial Intelligence

How a Rolling Random Forest Strategy Predicts Bitcoin’s Weekly Direction

This article explains a Python‑based rolling random‑forest classifier that uses a 30‑day training window and selected technical indicators to forecast whether Bitcoin’s price will rise or fall over the next seven days, detailing the methodology, code, back‑test results, and limitations.

BitcoinPythonRandom Forest
0 likes · 7 min read
How a Rolling Random Forest Strategy Predicts Bitcoin’s Weekly Direction
Python Programming Learning Circle
Python Programming Learning Circle
Feb 8, 2025 · Artificial Intelligence

Random Forest Classification with PCA and Hyper‑Parameter Tuning on the Breast Cancer Dataset

This tutorial walks through loading the scikit‑learn breast‑cancer dataset, preprocessing it, building baseline and PCA‑reduced Random Forest models, applying RandomizedSearchCV and GridSearchCV for hyper‑parameter optimization, and evaluating the final models using recall as the primary metric.

Breast CancerPCARandom Forest
0 likes · 17 min read
Random Forest Classification with PCA and Hyper‑Parameter Tuning on the Breast Cancer Dataset
Tencent Cloud Developer
Tencent Cloud Developer
Jul 4, 2024 · Artificial Intelligence

Football Match Outcome Prediction and Betting Strategy Using Machine Learning

The study combines team statistics and bookmaker odds with machine‑learning models—including Poisson, regression, Bayesian, SVM, Random Forest, DNN, and LSTM—to predict football match outcomes, identify confidence‑based betting intervals that yield profit, and suggests extensions to broader data, features, and financial trading.

Random Forestdata miningfootball prediction
0 likes · 23 min read
Football Match Outcome Prediction and Betting Strategy Using Machine Learning
Python Programming Learning Circle
Python Programming Learning Circle
Apr 10, 2024 · Artificial Intelligence

Top 10 Machine Learning Algorithms Explained

This article introduces the No‑Free‑Lunch principle in machine learning and provides concise explanations of ten fundamental algorithms—including linear and logistic regression, LDA, decision trees, Naïve Bayes, K‑Nearest Neighbors, LVQ, SVM, bagging with random forests, and boosting with AdaBoost—guiding beginners on how to choose the right model.

Random Forestailinear regression
0 likes · 14 min read
Top 10 Machine Learning Algorithms Explained
Model Perspective
Model Perspective
Aug 8, 2023 · Artificial Intelligence

Predicting Tomorrow’s Weather with Random Forests: A European City Case Study

Using detailed meteorological records from 18 European cities between 2000 and 2010, this article demonstrates how random forest regression and comprehensive data preprocessing can forecast daily precipitation, evaluate model performance, and compare climatic patterns across cities, highlighting both strengths and limitations of the approach.

Random Forestclimate dataweather prediction
0 likes · 20 min read
Predicting Tomorrow’s Weather with Random Forests: A European City Case Study
Model Perspective
Model Perspective
Aug 5, 2023 · Artificial Intelligence

Can a Random Forest Predict Smoking Habits? 79% Accuracy Explained

This article analyzes a biomedical dataset to identify key factors influencing smoking status, performs descriptive and exploratory data analysis, selects important features with a Random Forest, builds a predictive model achieving about 79% accuracy, and discusses evaluation metrics and future improvements.

Health DataRandom Forestfeature importance
0 likes · 15 min read
Can a Random Forest Predict Smoking Habits? 79% Accuracy Explained
Tencent Cloud Developer
Tencent Cloud Developer
Dec 2, 2022 · Artificial Intelligence

Football Match Prediction Using Machine Learning and Betting Strategy Analysis

The study applies machine‑learning models—including logistic regression, SVM, random forest, deep neural networks and a DNN‑SVM ensemble—to 17‑dimensional team features and 51‑dimensional bookmaker odds, achieving up to 54.5% match‑outcome accuracy, proposing a profit‑condition betting strategy and extending the approach to stock‑price forecasting.

Betting StrategyData ScienceRandom Forest
0 likes · 21 min read
Football Match Prediction Using Machine Learning and Betting Strategy Analysis
Code DAO
Code DAO
Dec 18, 2021 · Artificial Intelligence

Implement Random Forest Regression in Python using Scikit-Learn

This article explains the fundamentals of random forest regression, describes why it outperforms single decision trees for nonlinear or noisy data, defines bootstrapping and bagging, and provides a step‑by‑step Python example using NumPy, Pandas, and Scikit‑Learn’s RandomForestRegressor with data loading, preprocessing, model training, prediction, and evaluation via MSE and R².

BootstrappingPythonRandom Forest
0 likes · 6 min read
Implement Random Forest Regression in Python using Scikit-Learn
Code DAO
Code DAO
Dec 13, 2021 · Artificial Intelligence

A Comprehensive Guide to Ensemble Learning: Bagging, Boosting, and Stacking

This article explains the core concepts of ensemble learning, covering the bias‑variance trade‑off, the mechanics of bagging with bootstrap and random forests, the sequential strategies of boosting (AdaBoost and gradient boosting), and the heterogeneous stacking framework with meta‑models and multi‑layer extensions.

Random ForestStackingbagging
0 likes · 20 min read
A Comprehensive Guide to Ensemble Learning: Bagging, Boosting, and Stacking
Didi Tech
Didi Tech
May 11, 2021 · Artificial Intelligence

Continuous Causal Forest: Extending Uplift Modeling to Multi‑dimensional Continuous Treatments in Ride‑hailing Pricing

By extending binary causal forests with a Continuous Average Partial Effect statistic, the Continuous Causal Forest enables uplift modeling for multi‑dimensional continuous treatments such as ride‑hailing pricing, delivering superior Qini scores and over 15% ROI improvement while simplifying implementation and reducing deployment costs.

Pricing strategyRandom Forestcontinuous treatment
0 likes · 10 min read
Continuous Causal Forest: Extending Uplift Modeling to Multi‑dimensional Continuous Treatments in Ride‑hailing Pricing
21CTO
21CTO
Sep 18, 2020 · Artificial Intelligence

Top 10 Essential Machine Learning Algorithms Every Data Scientist Should Know

This article provides a concise overview of ten fundamental machine learning algorithms—linear regression, logistic regression, linear discriminant analysis, naive Bayes, K‑nearest neighbors, learning vector quantization, decision trees, random forest, support vector machines, and boosting (AdaBoost)—explaining their core concepts, typical use‑cases, and practical considerations.

Naive BayesRandom ForestSupport Vector Machine
0 likes · 13 min read
Top 10 Essential Machine Learning Algorithms Every Data Scientist Should Know
Python Crawling & Data Mining
Python Crawling & Data Mining
Aug 5, 2020 · Artificial Intelligence

Master Random Forest: From Bagging Theory to Python Implementation

This article explains the fundamentals of ensemble learning and bagging, details the random forest algorithm, answers common questions, and provides a complete Python walkthrough—including data exploration, decision‑tree baseline, random‑forest modeling with grid‑search tuning, and practical insights for handling imbalanced and missing data.

PythonRandom Forestbagging
0 likes · 16 min read
Master Random Forest: From Bagging Theory to Python Implementation
Amap Tech
Amap Tech
Jul 16, 2019 · Artificial Intelligence

Mobile Wi‑Fi Identification for Enhanced Network Positioning Using Machine Learning

By replacing rule‑based pipelines with an active‑learning‑driven random‑forest model that extracts clustering, signal, association, IP, and temporal features, Gaode accurately identifies mobile, cloned, and moved Wi‑Fi, cutting large‑error network‑positioning cases by ~18% and boosting overall positioning precision.

Random ForestWiFi fingerprintingmachine learning
0 likes · 13 min read
Mobile Wi‑Fi Identification for Enhanced Network Positioning Using Machine Learning
21CTO
21CTO
Apr 12, 2019 · Artificial Intelligence

Top 10 Essential Machine Learning Algorithms Every Data Scientist Should Know

This article provides a concise overview of ten fundamental machine learning algorithms—linear regression, logistic regression, linear discriminant analysis, naive Bayes, K‑nearest neighbors, learning vector quantization, support vector machines, decision trees, bagging/random forest, and boosting/AdaBoost—explaining their principles, typical use cases, and key characteristics.

Naive BayesRandom ForestSupport Vector Machine
0 likes · 13 min read
Top 10 Essential Machine Learning Algorithms Every Data Scientist Should Know
360 Zhihui Cloud Developer
360 Zhihui Cloud Developer
Jan 30, 2018 · Operations

Can You Predict Switch Failures Before They Happen? Inside PreFix’s ML Approach

This article reviews the PreFix system, which uses machine‑learning on datacenter switch logs to predict hardware failures ahead of time, detailing its design, feature extraction, random‑forest model, experimental validation across multiple switch models, and its broader applicability to disk failure prediction.

Random Forestdatacenter networkslog analysis
0 likes · 12 min read
Can You Predict Switch Failures Before They Happen? Inside PreFix’s ML Approach
Hulu Beijing
Hulu Beijing
Dec 22, 2017 · Artificial Intelligence

Master Ensemble Learning: Boosting, Bagging, and Real-World Examples

This article introduces ensemble learning as a meta‑algorithm that combines multiple base classifiers, explains the two main strategies—Boosting and Bagging—covers their bias‑variance trade‑offs, outlines essential steps, and provides concrete examples such as AdaBoost, Random Forest, and GBDT applied to user age prediction.

AdaBoostGBDTRandom Forest
0 likes · 8 min read
Master Ensemble Learning: Boosting, Bagging, and Real-World Examples
StarRing Big Data Open Lab
StarRing Big Data Open Lab
Oct 20, 2017 · Artificial Intelligence

How to Build a Customer Churn Warning Model with R and Discover

This article demonstrates a step‑by‑step workflow for constructing a churn prediction model using R in Discover, covering data loading, preprocessing, feature extraction, labeling, random‑forest training, prediction, and evaluation to help businesses proactively retain high‑value customers.

DiscoverRRandom Forest
0 likes · 11 min read
How to Build a Customer Churn Warning Model with R and Discover
Architects Research Society
Architects Research Society
Nov 17, 2016 · Artificial Intelligence

Comparative Study of Machine Learning Classifiers and Guidance for Algorithm Selection

The article summarizes a JMLR 2014 study that evaluated 179 classifiers across 121 UCI datasets, finding Random Forests and Gaussian‑kernel SVMs to be top performers, provides a review of supervised learning algorithms, and includes visual guidance for selecting appropriate machine‑learning methods.

Random Forestalgorithm selectionclassifier comparison
0 likes · 3 min read
Comparative Study of Machine Learning Classifiers and Guidance for Algorithm Selection
Architects Research Society
Architects Research Society
Oct 28, 2016 · Artificial Intelligence

Phishing Website Detection Using Machine Learning Models in R

This article presents a step‑by‑step machine‑learning analysis of the UCI Phishing Websites dataset in R, loading the data, training boosted logistic regression, SVM, tree‑bagging, and random‑forest models, comparing their accuracies, and identifying the most important predictive features for phishing detection.

RRandom Forestcaret
0 likes · 11 min read
Phishing Website Detection Using Machine Learning Models in R
Qunar Tech Salon
Qunar Tech Salon
Jul 4, 2016 · Information Security

Xiaomi Risk Control Practices: Architecture, Rule Engine, and Machine Learning

Xiaomi senior R&D engineer Deng Wenjun shares the evolution of Xiaomi's internet‑finance risk‑control system, describing early rule‑based limits, the adoption of Drools for fast rule deployment, data‑driven modeling with random‑forest classifiers, and ongoing challenges in scalability, latency, and privacy.

DroolsRandom Forestfinancial technology
0 likes · 16 min read
Xiaomi Risk Control Practices: Architecture, Rule Engine, and Machine Learning
High Availability Architecture
High Availability Architecture
Jun 24, 2016 · Information Security

Xiaomi's Internet Finance Risk Control Practices: Architecture, Rules Engine, and Machine Learning

The article details Xiaomi's evolution of internet‑finance risk control—from early limit and frequency rules that cut bad‑debt by a third, through adopting the Drools rules engine for rapid deployment and gray‑release, to leveraging random‑forest machine‑learning models and extensive user profiling that reduced fraud by roughly 40%, while addressing privacy and operational challenges.

DroolsRandom ForestXiaomi
0 likes · 15 min read
Xiaomi's Internet Finance Risk Control Practices: Architecture, Rules Engine, and Machine Learning