Tagged articles
3 articles
Page 1 of 1
Model Perspective
Model Perspective
Mar 20, 2025 · Big Data

How to Sample Effectively in the Big Data Era: Methods and Best Practices

This article explores essential sampling strategies for big‑data environments—including simple random, reservoir, stratified, oversampling, undersampling, and weighted sampling—detailing their principles, algorithmic steps, advantages, drawbacks, and suitable application scenarios to help analysts choose the right method.

Big DataSamplingoversampling
0 likes · 8 min read
How to Sample Effectively in the Big Data Era: Methods and Best Practices
Model Perspective
Model Perspective
Mar 19, 2023 · Artificial Intelligence

Master Data Sampling Techniques in Python for Machine Learning

This article explains common data sampling methods—random, stratified, oversampling, undersampling, and adaptive sampling—and provides Python code examples using scikit-learn and imbalanced-learn to implement each technique on the Iris dataset and synthetic data.

data samplingoversamplingscikit-learn
0 likes · 11 min read
Master Data Sampling Techniques in Python for Machine Learning
Code DAO
Code DAO
Dec 3, 2021 · Artificial Intelligence

SMOTE Techniques for Handling Imbalanced Classification in Machine Learning

This article explains the SMOTE oversampling method for imbalanced classification, demonstrates how to generate synthetic minority samples, evaluates models with and without SMOTE using scikit‑learn pipelines, and explores advanced variants such as Borderline‑SMOTE, SVMSMOTE and ADASYN with concrete code examples and benchmark results.

SMOTEclassificationimbalanced learning
0 likes · 24 min read
SMOTE Techniques for Handling Imbalanced Classification in Machine Learning