Tagged articles
240 articles
Page 1 of 3
Model Perspective
Model Perspective
Mar 14, 2026 · Fundamentals

Why Our Perception of Risk Varies: Lessons from The Art of Uncertainty

The article explores David Spiegelhalter's book on uncertainty, explaining how probability reflects personal ignorance, detailing Bayesian versus frequentist views, and illustrating real-world applications such as COVID risk communication, sports luck, and investment performance.

Bayesiandecision makingprobability
0 likes · 11 min read
Why Our Perception of Risk Varies: Lessons from The Art of Uncertainty
Java Baker
Java Baker
Jan 31, 2026 · Backend Development

Mastering Gray Releases and A/B Testing: Strategies, Code, and Analytics

This article provides a comprehensive guide to gray releases and A/B testing, covering common scenarios, implementation methods, layered experiment design, hash-based bucket allocation, data collection workflows, statistical analysis, and practical Java and SQL code examples for reliable feature validation.

A/B testingBackend DevelopmentJava
0 likes · 11 min read
Mastering Gray Releases and A/B Testing: Strategies, Code, and Analytics
Data STUDIO
Data STUDIO
Jan 30, 2026 · Fundamentals

Discover Python’s New Built‑in Functions That Simplify Your Code

From Python 3.9 to 3.12 the standard library adds a suite of ready‑to‑use functions—such as str.removeprefix/removesuffix, math.dist, math.comb, zoneinfo, hashlib.blake2b, and tomllib—that eliminate repetitive code, boost performance, and let developers focus on core logic.

Pythonbuilt-in functionshashlib
0 likes · 9 min read
Discover Python’s New Built‑in Functions That Simplify Your Code
Model Perspective
Model Perspective
Dec 6, 2025 · Artificial Intelligence

Understanding the Ladder of Causation: From Correlation to Counterfactuals

Judea Pearl’s Ladder of Causation framework divides reasoning into three levels—association, intervention, and counterfactuals—explaining how conditional probability, the do‑operator, and structural causal models enable moving from mere data correlation to actionable causal insights, with practical criteria like back‑door and front‑door adjustments.

Judea Pearlcausal inferencecounterfactuals
0 likes · 10 min read
Understanding the Ladder of Causation: From Correlation to Counterfactuals
Model Perspective
Model Perspective
Sep 28, 2025 · Fundamentals

Unlock Hidden Patterns: When to Use PCA vs Factor Analysis

This article explains the core ideas, mathematical steps, geometric intuition, and practical differences between Principal Component Analysis and Factor Analysis, guiding readers on when to apply each technique for dimensionality reduction and latent structure discovery in high‑dimensional data.

Data SciencePCAdimensionality reduction
0 likes · 11 min read
Unlock Hidden Patterns: When to Use PCA vs Factor Analysis
Model Perspective
Model Perspective
Sep 24, 2025 · Fundamentals

What Life Lessons Do Ten Classic Inequalities Reveal?

This article explores ten fundamental mathematical inequalities—AM‑GM, Jensen, Bernoulli, Chebyshev, Markov, Cauchy‑Schwarz, Triangle, Mean‑Inequality Chain, Information, and Hoeffding—explaining their formal statements and illustrating how each offers practical insights for personal growth, risk management, and decision‑making.

inequalitieslife lessonsmathematics
0 likes · 12 min read
What Life Lessons Do Ten Classic Inequalities Reveal?
Model Perspective
Model Perspective
Aug 29, 2025 · Fundamentals

Unlock Complex Math Models with 10 Everyday Metaphors

This article uses ten vivid everyday metaphors—from a stretched rubber band to a domino chain—to illuminate the core ideas behind various mathematical models, helping readers intuitively grasp abstract concepts across fields such as statistics, optimization, and system dynamics.

educationmathematical modelsmetaphors
0 likes · 8 min read
Unlock Complex Math Models with 10 Everyday Metaphors
Data Party THU
Data Party THU
Aug 5, 2025 · Artificial Intelligence

20 Advanced Statistical Techniques Every Data Scientist Must Master

This comprehensive guide introduces twenty essential advanced statistical methods—from Bayesian inference and maximum likelihood estimation to copulas and generalized additive models—explaining their concepts, real‑world use cases, and providing concise Python code examples so data scientists can confidently apply them to complex analytical problems.

AnalysisPythonstatistical methods
0 likes · 24 min read
20 Advanced Statistical Techniques Every Data Scientist Must Master
Data Party THU
Data Party THU
Jul 31, 2025 · Industry Insights

How a 30‑Minute Steel Melt Can Unlock a 10% Production Boost – Insights from Industrial Data Analysis

The article explores real‑world industrial cases—from steel furnace timing and historic lithography to modern manufacturing—showing how continuous improvement, root‑cause analysis, and careful handling of correlation versus causation can reveal hidden inefficiencies, while highlighting the limits of traditional statistics and the emerging role of AI in industrial data analytics.

AIBig DataContinuous Improvement
0 likes · 14 min read
How a 30‑Minute Steel Melt Can Unlock a 10% Production Boost – Insights from Industrial Data Analysis
IT Services Circle
IT Services Circle
Jun 18, 2025 · Fundamentals

Bayes’ Theorem: Uncovering the Real Odds of Top Students and Rare Diseases

Using everyday scenarios—a random student who solved a tough math question and a rare‑disease test result—the article demonstrates how Bayes’ theorem updates prior probabilities with new evidence, revealing that intuitive guesses often misjudge the true likelihood of being a top performer or having a condition.

Bayes theoremconditional probabilitymedical diagnosis
0 likes · 9 min read
Bayes’ Theorem: Uncovering the Real Odds of Top Students and Rare Diseases
Python Crawling & Data Mining
Python Crawling & Data Mining
Jun 15, 2025 · Fundamentals

Why Sitting at the Back of a Plane Boosts Survival Odds and What Statistics Reveal

Research by aviation writer David Nolan shows rear‑seat passengers have higher crash survival rates, and the article explains how frequency distribution tables, probability calculations, and hypothesis testing can be used to analyze such data, illustrating core statistical concepts with real‑world aviation examples.

air crash analysisfrequency distributionhypothesis testing
0 likes · 9 min read
Why Sitting at the Back of a Plane Boosts Survival Odds and What Statistics Reveal
Model Perspective
Model Perspective
Jun 15, 2025 · Fundamentals

How Does Statistical Thinking Differ from Mathematical Thinking? A Deep Dive

This article explores how mathematical and statistical thinking differ in their starting points, research objects, problem formulation, reasoning methods, and attitudes toward uncertainty, highlighting that mathematics emphasizes formal deduction while statistics focuses on probabilistic inference and decision‑making under uncertainty.

data analysismathematical thinkingprobability
0 likes · 7 min read
How Does Statistical Thinking Differ from Mathematical Thinking? A Deep Dive
JD Cloud Developers
JD Cloud Developers
Jun 12, 2025 · Fundamentals

How to Tackle Outliers in Internet A/B Experiments: Methods & Best Practices

This article explores why outliers destabilize online A/B tests, explains their statistical definitions, compares trimming and winsorizing techniques, reviews classic and machine‑learning detection methods, and offers practical guidance for applying these approaches to improve experiment reliability.

A/B testingexperimental designoutlier detection
0 likes · 14 min read
How to Tackle Outliers in Internet A/B Experiments: Methods & Best Practices
Meituan Technology Team
Meituan Technology Team
May 22, 2025 · Fundamentals

Why Write an A/B Experiment Whitepaper? – Overview and Methodology

This whitepaper introduces the importance of data‑driven A/B testing, outlines its theoretical foundations, practical challenges such as small samples and spillover effects, and presents a structured roadmap—including experiment basics, statistical principles, advanced designs, and SDK implementation—to help practitioners design trustworthy experiments.

A/B testingData-drivencausal inference
0 likes · 18 min read
Why Write an A/B Experiment Whitepaper? – Overview and Methodology
php Courses
php Courses
May 8, 2025 · Fundamentals

Data Visualization with Matplotlib and Seaborn in Python

This article introduces Python's Matplotlib and Seaborn libraries for data visualization, covering basic and advanced statistical charts, common plot types, customization techniques, and multi‑plot layouts with clear code examples and a comparative summary of each library's strengths.

Data visualizationMatplotlibPython
0 likes · 5 min read
Data Visualization with Matplotlib and Seaborn in Python
Aikesheng Open Source Community
Aikesheng Open Source Community
May 7, 2025 · Databases

Differences in Manual Statistics Collection for OceanBase: DBMS_STATS vs ANALYZE Across Versions

This article explains the differences between manual statistics collection methods in OceanBase 3.x and 4.x, compares DBMS_STATS and ANALYZE commands across MySQL and Oracle modes, provides version‑specific tables, detailed code examples for table, partition, and schema level collection, and shows how to verify statistics.

ANALYZEDBMS_STATSOceanBase
0 likes · 9 min read
Differences in Manual Statistics Collection for OceanBase: DBMS_STATS vs ANALYZE Across Versions
Model Perspective
Model Perspective
May 5, 2025 · Fundamentals

How Likely Is It to Land a High‑Pay, Low‑Stress Job Near Home? A Probabilistic Analysis

This article models the chance of finding a job that simultaneously offers a high salary, light workload, and proximity to one’s hometown, using conditional probability and real‑world data to show why such an ideal position is extremely rare and how relaxing requirements dramatically improves the odds.

career planningconditional probabilityjob market
0 likes · 7 min read
How Likely Is It to Land a High‑Pay, Low‑Stress Job Near Home? A Probabilistic Analysis
AI Cyberspace
AI Cyberspace
Apr 5, 2025 · Fundamentals

Mastering Statistics: From Data Basics to Regression Analysis

This comprehensive guide explains the fundamentals of statistics—including data types, collection, descriptive analysis, visualization tools, measures of central tendency and dispersion, correlation techniques, and regression modeling—providing practical insights for data scientists and engineers seeking to extract meaningful insights from data.

correlationdata analysisdescriptive statistics
0 likes · 20 min read
Mastering Statistics: From Data Basics to Regression Analysis
Model Perspective
Model Perspective
Mar 17, 2025 · Fundamentals

How the Texas Sharpshooter Fallacy Skews Sports, Tech & Everyday Decisions

The Texas Sharpshooter Fallacy—selectively highlighting data that supports a claim while ignoring contradictory evidence—appears in sports highlights, tech marketing, startup storytelling, and even medical research, and the article explains its mechanics and offers practical steps to avoid being misled.

cognitive biascritical thinkingfallacy
0 likes · 8 min read
How the Texas Sharpshooter Fallacy Skews Sports, Tech & Everyday Decisions
Open Source Tech Hub
Open Source Tech Hub
Dec 25, 2024 · Backend Development

Is PHP Really Dead? Docker Pulls Hit 1 Billion, Proving Its Vitality

Despite long‑standing rumors that PHP is obsolete, recent data shows the official PHP Docker image has been pulled over one billion times, highlighting the language’s continued dominance in web development, with 79% of server‑side sites still powered by PHP and platforms like WordPress serving billions of pages.

DockerPHPWeb Development
0 likes · 3 min read
Is PHP Really Dead? Docker Pulls Hit 1 Billion, Proving Its Vitality
Didi Tech
Didi Tech
Dec 12, 2024 · Product Management

Key AB Testing Interview Questions and Answers for Data Science Candidates

The article reviews common AB‑testing interview questions for data‑science candidates, explaining the role of p‑values, Type I/II errors, the difference between statistical and business significance, why effects can vanish when scaling, and how to improve experiment sensitivity through larger samples, variance‑reduction methods, and careful metric design.

AB testingInterview PreparationMDE
0 likes · 12 min read
Key AB Testing Interview Questions and Answers for Data Science Candidates
Model Perspective
Model Perspective
Dec 2, 2024 · Fundamentals

What Is the Beta Distribution and Why It Matters in A/B Testing?

The Beta distribution is a flexible probability model defined on the interval [0,1] with two shape parameters that control its form, offering useful properties such as mean and variance, and is widely applied in A/B testing, risk assessment, and machine‑learning tasks to model proportions and uncertainties.

A/B testingbeta distributionmachine learning
0 likes · 5 min read
What Is the Beta Distribution and Why It Matters in A/B Testing?
php Courses
php Courses
Nov 26, 2024 · Databases

Using Redis Bitmap Operations for Precise Statistics with PHP

This article explains Redis bitmap data structures, introduces the main bitmap commands (BITSET, BITGET, BITCOUNT), and provides a complete PHP example that demonstrates how to record user sign‑ins and count them efficiently using Redis bit operations.

BitmapNoSQLPHP
0 likes · 5 min read
Using Redis Bitmap Operations for Precise Statistics with PHP
DeWu Technology
DeWu Technology
Nov 25, 2024 · Databases

Redis Hot Key Detection and Kernel-Based Real-Time Statistics

The article describes a kernel‑level hot‑key detection module for Redis that tracks per‑second access counts via an O(1) LRU queue, flags keys exceeding configurable thresholds, and provides real‑time subscription alerts and queryable logs, overcoming the latency and overhead limitations of existing detection methods.

HotKeyKernelperformance
0 likes · 11 min read
Redis Hot Key Detection and Kernel-Based Real-Time Statistics
Test Development Learning Exchange
Test Development Learning Exchange
Nov 10, 2024 · Fundamentals

20 Essential Pandas Data Processing Methods with Code Examples

This article provides a comprehensive overview of 20 essential Pandas data processing methods with detailed code examples covering statistics, data cleaning, transformation, filtering, merging, grouping, sorting, reshaping, aggregation, window functions, time series analysis, conditional selection, indexing, slicing, visualization, type conversion, data filling, filtering, renaming, and import/export operations.

Data visualizationPythondata analysis
0 likes · 16 min read
20 Essential Pandas Data Processing Methods with Code Examples
Model Perspective
Model Perspective
Sep 11, 2024 · Fundamentals

How Expectation Theory Can Guide Your Life Decisions

This article explores how the mathematical concept of expectation helps us understand and manage everyday hopes, assess risk, and make more rational decisions by balancing optimism with realistic assessments of uncertainty and variability.

Psychologydecision makingexpectation
0 likes · 9 min read
How Expectation Theory Can Guide Your Life Decisions
IT Services Circle
IT Services Circle
Sep 8, 2024 · Artificial Intelligence

10 Essential Plots for Linear Regression with Python Code Examples

This tutorial explains ten crucial visualizations for linear regression—scatter plot, trend line, residual plot, normal probability plot, learning curve, bias‑variance tradeoff, residuals vs fitted, partial regression, leverage, and Cook's distance—each illustrated with clear Python code using scikit‑learn, matplotlib, seaborn, and statsmodels.

Data visualizationMatplotlibModel Evaluation
0 likes · 21 min read
10 Essential Plots for Linear Regression with Python Code Examples
Model Perspective
Model Perspective
Aug 20, 2024 · Fundamentals

Why Statistics Is the Hidden Engine Behind Everyday Decisions

Statistics, the science of extracting information from data, underpins everything from medical diagnoses and scientific experiments to e‑commerce recommendations and market research, yet its misuse—confusing correlation with causation or suffering from sample bias—highlights the need for basic statistical literacy in daily life.

decision makingscientific methodstatistical literacy
0 likes · 4 min read
Why Statistics Is the Hidden Engine Behind Everyday Decisions
Model Perspective
Model Perspective
Aug 13, 2024 · Fundamentals

Can You Test Life’s Assumptions with Statistical and Bayesian Methods?

This article explores how everyday decisions—from buying breakfast to quitting a job—are built on hidden assumptions and shows how statistical hypothesis testing and Bayesian thinking can help you identify, test, and adjust those assumptions for better outcomes.

bayesian thinkingdecision makinghypothesis testing
0 likes · 5 min read
Can You Test Life’s Assumptions with Statistical and Bayesian Methods?
Model Perspective
Model Perspective
Aug 3, 2024 · Fundamentals

Why Skewness Makes Statistics Delightful: Visual Guides & Real‑World Problems

This article explores the charm of statistics by explaining skewness with clear diagrams, sharing memorable mnemonics, reviewing a top‑rated textbook, presenting diverse real‑world case studies, and solving probability problems about adult male hip widths using normal distribution and the central limit theorem.

central limit theoremprobabilityskewness
0 likes · 8 min read
Why Skewness Makes Statistics Delightful: Visual Guides & Real‑World Problems
Python Programming Learning Circle
Python Programming Learning Circle
Jul 30, 2024 · Fundamentals

Comprehensive Guide to Essential NumPy Functions for Array Creation, Manipulation, and Analysis

This tutorial presents a detailed overview of over fifty core NumPy functions, covering array creation, reshaping, arithmetic, statistical analysis, set operations, splitting, stacking, printing, and data persistence, with clear explanations and complete code examples for each operation.

ArrayOperationsDataScienceMachineLearning
0 likes · 29 min read
Comprehensive Guide to Essential NumPy Functions for Array Creation, Manipulation, and Analysis
Model Perspective
Model Perspective
Jun 18, 2024 · Fundamentals

Why Statistics Mislead Us: Common Data Traps and How to Spot Them

Statistics permeate daily life, from news to personal decisions, yet common pitfalls like misleading averages, ambiguous percentages, and false causal links often trick us, so understanding these traps helps us interpret data more accurately and avoid faulty judgments.

cognitive biasdata interpretationmisinformation
0 likes · 4 min read
Why Statistics Mislead Us: Common Data Traps and How to Spot Them
php Courses
php Courses
Jun 5, 2024 · Databases

Using Redis Bitmaps for Efficient User Sign‑in Statistics with PHP

This article explains Redis bitmap data structures, introduces common bitmap commands such as BITSET, BITGET, and BITCOUNT, and provides a complete PHP example that records user sign‑ins and calculates daily sign‑in counts using Redis bit operations.

BitmapPHPbitwise operations
0 likes · 5 min read
Using Redis Bitmaps for Efficient User Sign‑in Statistics with PHP
Python Programming Learning Circle
Python Programming Learning Circle
May 29, 2024 · Game Development

Python Implementation of the Zha Jin Hua Card Game

This article demonstrates how to create a Python program that simulates the Chinese card game Zha Jin Hua, covering deck preparation, player handling, card dealing, hand evaluation, scoring rules, and statistical analysis of game outcomes, with complete source code and example results.

PythonZha Jin Huaalgorithm
0 likes · 12 min read
Python Implementation of the Zha Jin Hua Card Game
DataFunTalk
DataFunTalk
May 25, 2024 · Fundamentals

Systematic Solutions to the AA Problem in Random Experiments

This talk explains how combining heavy randomization with regression adjustment can effectively mitigate AA problems in A/B testing, improving experiment credibility by addressing covariate imbalance and enhancing result validity for data‑driven decision making.

A/B testingAA problemData Science
0 likes · 2 min read
Systematic Solutions to the AA Problem in Random Experiments
DataFunTalk
DataFunTalk
May 22, 2024 · Fundamentals

Systematic Solutions to the AA Problem in Random Experiments

This presentation introduces the AA problem that can compromise A/B test validity and explains how combining re‑randomization with regression adjustment provides an effective, practical solution to improve experiment reliability and credibility.

A/B testingAA problemexperiment design
0 likes · 3 min read
Systematic Solutions to the AA Problem in Random Experiments
DataFunSummit
DataFunSummit
May 12, 2024 · Artificial Intelligence

Pairwise Data Based A/B Experiments: Unbiased Causal Inference in Network Experiments

The DataFun Data Science Summit on May 25 will feature Tencent data scientist Li Yilin presenting a comprehensive talk on pairwise‑data A/B experiments, covering unbiased estimation under various randomizations, theoretical analysis, and practical insights for causal inference in network‑driven online experiments.

A/B testingcausal inferencenetwork experiments
0 likes · 4 min read
Pairwise Data Based A/B Experiments: Unbiased Causal Inference in Network Experiments
DataFunTalk
DataFunTalk
May 12, 2024 · Artificial Intelligence

Paired Data Based A/B Experiments: Causal Inference in Network Experiments

The DataFun Data Science Summit on May 25 will feature Tencent data scientist Li Yilin presenting a comprehensive overview of paired‑data A/B experiments, covering causal inference challenges, unbiased estimators under various randomization designs, theoretical analysis, and practical insights for network‑based online experiments.

A/B testingcausal inferencenetwork experiments
0 likes · 5 min read
Paired Data Based A/B Experiments: Causal Inference in Network Experiments
Model Perspective
Model Perspective
Apr 9, 2024 · Fundamentals

Unlocking the Power of Bootstrap: A Practical Guide to Resampling Statistics

Bootstrap, a versatile resampling technique, repeatedly draws samples with replacement from existing data to estimate statistics like means and confidence intervals, offering flexible, distribution-agnostic insights across fields—from education and economics to ecology and finance—illustrated with Python code examples.

Bootstrapconfidence intervalresampling
0 likes · 10 min read
Unlocking the Power of Bootstrap: A Practical Guide to Resampling Statistics
Python Programming Learning Circle
Python Programming Learning Circle
Mar 27, 2024 · Fundamentals

Common Probability Distributions and Their Visualization with Python

This article explains the fundamentals of several common probability distributions—including uniform, normal, log‑normal, Poisson, exponential, binomial, Student's t, and chi‑squared—and provides complete Python code to generate and plot each distribution for data‑science and machine‑learning applications.

data-sciencedistributionprobability
0 likes · 12 min read
Common Probability Distributions and Their Visualization with Python
Baidu Geek Talk
Baidu Geek Talk
Feb 21, 2024 · Fundamentals

Master Data Distribution Visualization with Seaborn: Histograms to Violin Plots

This tutorial walks through essential seaborn techniques for visualizing data distributions—including univariate histograms, conditional histograms, KDE curves, ECDFs, boxplots, violin plots, bivariate histograms, and joint plots—providing code snippets, parameter explanations, and practical examples using the penguins dataset.

Data visualizationHistogramKDE
0 likes · 18 min read
Master Data Distribution Visualization with Seaborn: Histograms to Violin Plots
Test Development Learning Exchange
Test Development Learning Exchange
Jan 18, 2024 · Fundamentals

Common Statistical Methods for Data Analysis with Python Code Examples

This article introduces ten common statistical techniques used in data analysis—including descriptive statistics, correlation, t‑test, ANOVA, linear regression, PCA, outlier detection, frequency distribution, time‑series analysis, and non‑parametric tests—providing concise explanations and Python code snippets for each method.

machine learningstatistical methodsstatistics
0 likes · 7 min read
Common Statistical Methods for Data Analysis with Python Code Examples
Model Perspective
Model Perspective
Jan 10, 2024 · Fundamentals

How Many Survey Responses Do You Really Need? A Practical Guide to Calculating Sample Size

Determining the right sample size is crucial for reliable survey results, and this article explains the key factors—population size, confidence level, margin of error, response rate, and effect size—while walking through a concrete example of calculating the needed respondents for a university library service study.

confidence intervalmargin of errorresearch design
0 likes · 7 min read
How Many Survey Responses Do You Really Need? A Practical Guide to Calculating Sample Size
Model Perspective
Model Perspective
Jan 3, 2024 · Fundamentals

How Randomized Controlled Trials Reveal True Causality

Randomized Controlled Trials (RCTs) are considered the gold standard for establishing causal relationships because randomization balances known and unknown confounders, control groups provide clear comparisons, and reproducibility ensures reliable results, though practical limitations like cost and ethics often require alternative observational methods.

experimental designrandomized controlled trialsresearch methodology
0 likes · 7 min read
How Randomized Controlled Trials Reveal True Causality
Model Perspective
Model Perspective
Dec 19, 2023 · Fundamentals

Why Hospital Survival Rates Can Mislead: Unveiling Simpson’s Paradox

Simpson’s Paradox shows how aggregated data can suggest one trend while each subgroup reveals the opposite, illustrated with hospital survival rates where overall A appears better than B, yet detailed analysis by severity flips the conclusion, highlighting the need to consider background variables in statistical interpretation.

BiasSimpson's paradoxdata analysis
0 likes · 5 min read
Why Hospital Survival Rates Can Mislead: Unveiling Simpson’s Paradox
Model Perspective
Model Perspective
Dec 1, 2023 · Artificial Intelligence

Why Causal Graphs Matter: From Philosophy to AI Insights

This article explores the distinction between causal reasoning and conspiracy thinking, the challenges of defining causality, and how Judea Pearl's causal graph framework provides a powerful tool for AI, epidemiology, and other fields to visualize and analyze complex cause‑effect relationships.

Judea Pearlartificial intelligencecausal graphs
0 likes · 10 min read
Why Causal Graphs Matter: From Philosophy to AI Insights
Huolala Tech
Huolala Tech
Nov 24, 2023 · Fundamentals

Master AB Testing: Hypothesis Testing and Minimum Sample Size Made Simple

This article explains the statistical foundations of AB experiments—hypothesis testing and minimum sample size calculation—showing how to determine whether observed differences are real, how to control type‑I and type‑II errors, and how to plan experiments with sufficient power.

experiment designhypothesis testingsample size
0 likes · 15 min read
Master AB Testing: Hypothesis Testing and Minimum Sample Size Made Simple
Code Ape Tech Column
Code Ape Tech Column
Oct 14, 2023 · Backend Development

Implementing Sign‑In and Statistics with Redis BitMap in Spring Boot

This article explains how to use Redis BitMap to build a memory‑efficient sign‑in feature and continuous‑sign‑in statistics in a Spring Boot microservice, covering basic BitMap commands, key design, core Java code, testing steps, and a bitmap‑based solution for cache‑penetration protection.

BackendBitmapSign-in
0 likes · 10 min read
Implementing Sign‑In and Statistics with Redis BitMap in Spring Boot
Model Perspective
Model Perspective
Oct 1, 2023 · Fundamentals

How Poisson Distribution Models Rare Events Across Real-World Systems

The Poisson distribution, originating from 19th‑century French military statistics, describes the probability of a given number of rare events occurring within a fixed interval, and its formula, derivation from the binomial limit, and diverse applications—from traffic flow to insurance claims—are explored in this article.

ApplicationsPoisson distributionderivation
0 likes · 5 min read
How Poisson Distribution Models Rare Events Across Real-World Systems
Model Perspective
Model Perspective
Sep 21, 2023 · Fundamentals

Unlock the Jargon: Essential Terms Every Math Modeling Beginner Must Know

This comprehensive guide demystifies over one hundred core mathematical modeling terms—from basic concepts like models and abstraction to advanced topics such as optimization, dynamic systems, stochastic processes, statistical methods, and machine learning—helping newcomers confidently navigate the field.

machine learningmathematical modelingmodeling terminology
0 likes · 20 min read
Unlock the Jargon: Essential Terms Every Math Modeling Beginner Must Know
Test Development Learning Exchange
Test Development Learning Exchange
Sep 12, 2023 · Artificial Intelligence

Various Anomaly Detection Techniques with Python Code Examples

This article introduces ten common anomaly detection approaches—including statistical thresholds, boxplots, clustering, isolation forest, LOF, collaborative filtering, robust covariance, NLP, computer‑vision, and time‑series methods—each accompanied by concise Python code snippets illustrating how to identify outliers in different data domains.

PythonTime Seriesanomaly detection
0 likes · 9 min read
Various Anomaly Detection Techniques with Python Code Examples
Model Perspective
Model Perspective
Aug 31, 2023 · Fundamentals

How the Central Limit Theorem Solves Real-World Probability Problems

This article explains the Central Limit Theorem, shows how a sum of independent identically distributed variables approaches a normal distribution, and demonstrates its practical use through six detailed examples ranging from power supply planning to medical donor matching.

Applicationscentral limit theoremnormal distribution
0 likes · 9 min read
How the Central Limit Theorem Solves Real-World Probability Problems
Model Perspective
Model Perspective
Aug 26, 2023 · Artificial Intelligence

75 Essential Data Science Terms Every Practitioner Must Know

This article compiles a comprehensive alphabetically ordered list of 75 crucial data science and machine learning terms—from accuracy and AUC to zero-shot learning—providing concise definitions that help practitioners quickly grasp essential concepts and improve their analytical vocabulary.

AI termsData ScienceGlossary
0 likes · 13 min read
75 Essential Data Science Terms Every Practitioner Must Know
Model Perspective
Model Perspective
Aug 24, 2023 · Fundamentals

Master Essential Data Visualization Techniques for Data Science

This article presents a comprehensive collection of practical data visualization methods—including KS plots, SHAP explanations, Q‑Q plots, cumulative variance, Gini vs Entropy, bias‑variance tradeoff, ROC and precision‑recall curves, and elbow analysis—each illustrated with Python code and clear explanations to help analysts and non‑experts quickly interpret complex datasets.

Data visualizationmachine learningplotting
0 likes · 25 min read
Master Essential Data Visualization Techniques for Data Science
Architect's Guide
Architect's Guide
Aug 21, 2023 · Fundamentals

Guidelines for Structured Data Analysis Reports and Effective Chart Usage

This article outlines a clear framework for writing data analysis reports—including hierarchical structure, concise conclusions, business‑oriented recommendations, reliable data sourcing, and best‑practice chart design—while highlighting common statistical pitfalls and tips for improving readability and impact.

Methodologybest practiceschart design
0 likes · 12 min read
Guidelines for Structured Data Analysis Reports and Effective Chart Usage
dbaplus Community
dbaplus Community
Aug 17, 2023 · Databases

Mastering Redis Set Operations for Scalable Statistics and Aggregations

This article explains how to leverage Redis data structures such as Set, Sorted Set, Bitmap, and HyperLogLog to perform aggregation, sorted, binary‑state, and cardinality statistics efficiently in large‑scale applications, while addressing performance considerations and practical implementation details.

BackendHyperLogLogSet
0 likes · 10 min read
Mastering Redis Set Operations for Scalable Statistics and Aggregations
Model Perspective
Model Perspective
Aug 16, 2023 · Fundamentals

Understanding the Beta Distribution: A Key to Bayesian Inference

This article explores the Beta distribution’s role in Bayesian statistics, detailing its definition, properties, conjugate prior relationship, and practical examples such as coin flips and bus arrivals, illustrating how it simplifies probability updates and supports intuitive belief revision.

Bayesian inferencebeta distributionconjugate prior
0 likes · 10 min read
Understanding the Beta Distribution: A Key to Bayesian Inference
Model Perspective
Model Perspective
Aug 12, 2023 · Fundamentals

Unlocking Bayesian Methods: Theory, Real-World Examples, and Python Demo

This article explains Bayesian methods—its core theorem, historical and everyday applications, a detailed medical testing model, and provides a step‑by‑step Python calculation illustrating how prior probabilities and new evidence combine to produce posterior probabilities.

Bayesian inferenceMedical TestingPython
0 likes · 6 min read
Unlocking Bayesian Methods: Theory, Real-World Examples, and Python Demo
Aikesheng Open Source Community
Aikesheng Open Source Community
Aug 2, 2023 · Databases

Real‑time Update of AUTO_INCREMENT in INFORMATION_SCHEMA.TABLES on MySQL 8.0

This article explains how MySQL 8.0 updates the AUTO_INCREMENT column in INFORMATION_SCHEMA.TABLES, describes the underlying statistics caching mechanism, shows how the information_schema_stats_expiry parameter controls refresh frequency, and provides step‑by‑step tests demonstrating real‑time behavior with code examples.

AUTO_INCREMENTInformation Schemamysql
0 likes · 10 min read
Real‑time Update of AUTO_INCREMENT in INFORMATION_SCHEMA.TABLES on MySQL 8.0
DataFunSummit
DataFunSummit
Jul 3, 2023 · Big Data

Avoiding Data Misuse: Case Studies on Invalid Data, Simpson’s Paradox, and Statistical Pitfalls

This article examines how data can be misused or misinterpreted through real‑world case studies—ranging from breakfast myths and toothpaste advertising to contraceptive risks, crime statistics, judicial decisions, questionnaire bias, airline efficiency, and correlation‑causation confusion—offering practical guidelines to recognize and prevent invalid data analysis in the big‑data era.

BiasSimpson's paradoxdata analysis
0 likes · 22 min read
Avoiding Data Misuse: Case Studies on Invalid Data, Simpson’s Paradox, and Statistical Pitfalls
DaTaobao Tech
DaTaobao Tech
May 22, 2023 · Artificial Intelligence

Statistical and Machine Learning Metrics for Data Analysis

The article presents a practical toolbox of statistical and machine‑learning metrics—including short‑term growth rates, CAGR, Excel forecasting functions, Wilson score adjustment, sigmoid decay weighting, correlation coefficients, KL divergence, elbow detection with KneeLocator, entropy‑based weighting, PCA, and TF‑IDF—offering concise formulas and code snippets for data analysis without deep theory.

PCAcorrelationdata analysis
0 likes · 12 min read
Statistical and Machine Learning Metrics for Data Analysis
ITPUB
ITPUB
Mar 7, 2023 · Databases

Why Oracle Auto Tasks Slow Your Database and How to Fix Them

The article explains how Oracle's built‑in auto‑task jobs run during specific weekday and weekend windows, why they can cause performance degradation by changing execution plans or consuming resources, and provides practical steps to diagnose, adjust windows, and selectively disable problematic tasks.

Auto TaskDatabase PerformanceOracle
0 likes · 9 min read
Why Oracle Auto Tasks Slow Your Database and How to Fix Them
Model Perspective
Model Perspective
Mar 3, 2023 · Fundamentals

Unlock Hidden Patterns: A Practical Guide to Factor Analysis with Python

Factor analysis, a statistical technique for uncovering underlying common factors among variables, is explained alongside its distinction from PCA, detailed procedural steps, adequacy tests, and a hands‑on Python implementation using the factor_analyzer library with visualizations and factor rotation methods.

Pythondata preprocessingfactor analysis
0 likes · 10 min read
Unlock Hidden Patterns: A Practical Guide to Factor Analysis with Python
Model Perspective
Model Perspective
Feb 26, 2023 · Fundamentals

How to Detect Trends with the Mann‑Kendall Test in Python

This article explains how to determine whether a time‑series dataset exhibits a monotonic trend using the non‑parametric Mann‑Kendall test, walks through its statistical foundations, shows the calculation steps with sample sales data, and provides a complete Python implementation for practical analysis.

Mann-KendallPythonstatistics
0 likes · 7 min read
How to Detect Trends with the Mann‑Kendall Test in Python
DataFunSummit
DataFunSummit
Feb 23, 2023 · Artificial Intelligence

An Introduction to Causal Inference: Concepts, Methods, and Real‑World Applications

This article provides a comprehensive overview of causal inference, explaining its definition, the distinction between correlation and causation, classic pitfalls such as Simpson's paradox, key metrics like ATE and ATT, experimental designs, bias mitigation techniques, and practical case studies from content platforms and the Titanic dataset.

A/B testingbias mitigationcausal inference
0 likes · 22 min read
An Introduction to Causal Inference: Concepts, Methods, and Real‑World Applications
Model Perspective
Model Perspective
Feb 19, 2023 · Fundamentals

How the Central Limit Theorem Solves Real-World Probability Problems

This article explains the Central Limit Theorem and demonstrates its practical use through six diverse examples ranging from power planning in factories to ambulance allocation, seed quality assessment, and prize‑draw probabilities, showing how normal approximations simplify complex random‑variable calculations.

Applicationscentral limit theoremnormal approximation
0 likes · 8 min read
How the Central Limit Theorem Solves Real-World Probability Problems
Model Perspective
Model Perspective
Feb 8, 2023 · Artificial Intelligence

Mastering Feature Selection: From Filters to Embedded Methods in Python

This article explains why feature selection is crucial for machine learning, outlines the general workflow, compares filter, wrapper, embedded, and synthesis approaches, and provides practical Python examples—including Pearson correlation, chi‑square tests, mutual information, variance selection, recursive elimination, L1 regularization, and PCA—complete with code snippets and visualizations.

Pythonfeature selectionstatistics
0 likes · 20 min read
Mastering Feature Selection: From Filters to Embedded Methods in Python
DataFunTalk
DataFunTalk
Feb 6, 2023 · Product Management

A Comprehensive Guide to A/B Testing: Principles, Methods, and Applications

This article explains the scientific foundations, historical origins, statistical principles, implementation techniques, and practical applications of A/B testing as a data‑driven growth tool for product optimization, algorithm iteration, and marketing decisions in modern internet companies.

A/B testingdata-driven growthonline experiments
0 likes · 26 min read
A Comprehensive Guide to A/B Testing: Principles, Methods, and Applications
Model Perspective
Model Perspective
Dec 9, 2022 · Fundamentals

How to Perform a 2×2 Chi‑Square Test in SPSS: Step‑by‑Step Guide

This guide explains how to conduct a 2 × 2 chi‑square test in SPSS, covering data entry, weighting, step‑by‑step menu selections, cell options, result interpretation, and decision rules for choosing the appropriate test based on sample size and expected frequencies.

2x2 contingency tableSPSSchi-square
0 likes · 6 min read
How to Perform a 2×2 Chi‑Square Test in SPSS: Step‑by‑Step Guide
Model Perspective
Model Perspective
Dec 7, 2022 · Fundamentals

Can Different Diets Change Mouse Liver Iron? A Step‑by‑Step SPSS t‑Test Guide

This article presents a practical example using SPSS to perform an independent‑samples t‑test on iron concentrations in mouse livers from two diet groups, explains the assumptions, walks through data entry, test configuration, result interpretation, and concludes that the dietary effect is not statistically significant.

SPSSbiostatisticsexperimental design
0 likes · 4 min read
Can Different Diets Change Mouse Liver Iron? A Step‑by‑Step SPSS t‑Test Guide
Model Perspective
Model Perspective
Dec 6, 2022 · Fundamentals

How to Perform a One‑Sample t‑Test in SPSS: Step‑by‑Step Guide

This guide walks through a health‑survey example, showing how to check SPSS assumptions for outliers and normality, perform a one‑sample t‑test on BMI data, interpret the output, and draw a statistically significant conclusion about the sample mean versus the population mean.

BMISPSSnormality test
0 likes · 7 min read
How to Perform a One‑Sample t‑Test in SPSS: Step‑by‑Step Guide
Model Perspective
Model Perspective
Dec 2, 2022 · Fundamentals

Master Linear Regression in R: From Random Data to Insightful Models

This article explains the theory behind simple linear regression, demonstrates how to generate random data and fit a model using R's lm() function, and interprets the statistical output including coefficients, significance tests, and goodness‑of‑fit measures.

Rdata analysislinear regression
0 likes · 4 min read
Master Linear Regression in R: From Random Data to Insightful Models
Model Perspective
Model Perspective
Nov 30, 2022 · Fundamentals

How to Perform One‑Sample and Two‑Sample t‑Tests in R

This article explains the theory behind hypothesis testing, demonstrates one‑sample and two‑sample t‑tests using R’s t.test() function with agricultural yield examples, and shows how to interpret one‑sided versus two‑sided test results for decision making.

Rdata analysishypothesis testing
0 likes · 5 min read
How to Perform One‑Sample and Two‑Sample t‑Tests in R
Model Perspective
Model Perspective
Nov 29, 2022 · Fundamentals

Mastering Probability Distributions in R: From Normal to Poisson

This article explains how various continuous and discrete probability distributions—such as normal, binomial, Poisson, and negative binomial—are used in real‑world contexts, introduces R’s naming conventions for distribution functions, and provides code examples for computing densities, CDFs, quantiles, and random samples.

Rdata analysisdistributions
0 likes · 5 min read
Mastering Probability Distributions in R: From Normal to Poisson
Model Perspective
Model Perspective
Nov 18, 2022 · Fundamentals

How to Use an F-Test to Compare Stock Return Variances: A Step-by-Step Example

This article walks through a complete F‑test example comparing the monthly return variances of IBM and HP stocks over 36 months, detailing hypothesis formulation, calculation of the test statistic, critical value lookup, and interpretation of results to determine if the variances differ significantly.

F-testfinancial analysishypothesis testing
0 likes · 2 min read
How to Use an F-Test to Compare Stock Return Variances: A Step-by-Step Example
Model Perspective
Model Perspective
Nov 17, 2022 · Fundamentals

How to Use an F-Test to Compare Stock Return Variances: A Step-by-Step Guide

This article explains the null and alternative hypotheses for one‑ and two‑tailed variance tests, demonstrates how to compute the F‑statistic and critical value, and walks through a practical example comparing the monthly return standard deviations of IBM and HP stocks, concluding that their variances are not significantly different.

F-testhypothesis testingstatistics
0 likes · 2 min read
How to Use an F-Test to Compare Stock Return Variances: A Step-by-Step Guide
Model Perspective
Model Perspective
Nov 15, 2022 · Fundamentals

How to Use Paired Comparison Tests for Evaluating Investment Strategies

This article explains the paired comparison test for non‑independent samples, outlines the null and alternative hypotheses, presents the t‑test statistic formula, and walks through a detailed example evaluating a Dow‑10 investment strategy versus a buy‑and‑hold approach.

financehypothesis testinginvestment strategy
0 likes · 4 min read
How to Use Paired Comparison Tests for Evaluating Investment Strategies
Model Perspective
Model Perspective
Nov 14, 2022 · Fundamentals

Testing Equality of Means for Two Independent Normal Populations

This article explains how to conduct hypothesis tests for equality of means between two independent normal populations, covering both equal‑variance and unequal‑variance cases, detailing test statistics, degrees of freedom, critical values, and decision rules through examples on S&P 500 returns and bond recovery rates.

hypothesis testingindependent samplesstatistics
0 likes · 6 min read
Testing Equality of Means for Two Independent Normal Populations