Tagged articles
43 articles
Page 1 of 1
Data Party THU
Data Party THU
May 4, 2026 · Artificial Intelligence

Understanding the Mathematical Foundations of Reinforcement Learning

This article provides a concise overview of a ten‑chapter reinforcement‑learning textbook, outlining the progression from basic concepts such as states and rewards to advanced algorithms like policy gradients and actor‑critic methods, and explains how each chapter builds on the previous ones.

Bellman equationMonte Carloactor-critic
0 likes · 11 min read
Understanding the Mathematical Foundations of Reinforcement Learning
Model Perspective
Model Perspective
Sep 29, 2025 · Fundamentals

How Monte Carlo Simulations Power Integration, Finance, Physics, and Optimization

The Monte Carlo method, a probabilistic numerical technique introduced in the 1940s, uses extensive random sampling to tackle high‑dimensional problems, with key applications in numerical integration, statistical simulation, financial risk assessment, physical system modeling, and optimization, highlighting its versatility and limitations.

Monte CarloNumerical MethodsPhysics
0 likes · 11 min read
How Monte Carlo Simulations Power Integration, Finance, Physics, and Optimization
Model Perspective
Model Perspective
Sep 25, 2025 · Fundamentals

Master Quick Estimates: 5 Proven Methods for Accurate Decision‑Making

This guide explores five practical estimation techniques—Fermi, analogy, expert judgment, Monte Carlo simulation, and three‑point PERT—detailing their principles, mathematical models, real‑world examples, and how to combine them for reliable decisions in business, engineering, and research.

AnalogyFermiMonte Carlo
0 likes · 14 min read
Master Quick Estimates: 5 Proven Methods for Accurate Decision‑Making
Didi Tech
Didi Tech
Aug 28, 2025 · Artificial Intelligence

Why Temporal Difference Beats Monte Carlo: Mastering the Bellman Equation

Explore how the Bellman equation underpins reinforcement learning, comparing Dynamic Programming, Monte Carlo, and Temporal‑Difference methods, and discover why TD’s low‑variance, online updates make it a powerful bridge between model‑based planning and sample‑based learning.

Bellman equationMonte CarloQ-Learning
0 likes · 21 min read
Why Temporal Difference Beats Monte Carlo: Mastering the Bellman Equation
AI Algorithm Path
AI Algorithm Path
May 24, 2025 · Artificial Intelligence

How N-step Temporal-Difference Methods Extend TD Learning in Reinforcement AI

This tutorial explains how n-step temporal‑difference (TD) algorithms generalize the one‑step TD and Monte‑Carlo methods, presents the n‑step return update rule, walks through a three‑step TD example, shows how Sarsa and Q‑learning can be extended, and discusses how to choose the optimal n value for a given problem.

Monte CarloQ-Learningalgorithm analysis
0 likes · 9 min read
How N-step Temporal-Difference Methods Extend TD Learning in Reinforcement AI
AI Algorithm Path
AI Algorithm Path
May 23, 2025 · Artificial Intelligence

Understanding Temporal‑Difference Algorithms in Reinforcement Learning

This tutorial explains temporal‑difference (TD) learning, compares it with dynamic programming and Monte‑Carlo methods, walks through concrete soccer‑match examples, shows one‑step TD versus constant‑α Monte‑Carlo updates, discusses convergence, bias, and introduces popular TD variants such as Sarsa, Q‑learning, Expected Sarsa and double learning.

Monte CarloQ-LearningTD learning
0 likes · 18 min read
Understanding Temporal‑Difference Algorithms in Reinforcement Learning
AI Algorithm Path
AI Algorithm Path
May 22, 2025 · Artificial Intelligence

Monte Carlo Policy Improvement in RL: Epsilon‑Greedy, On‑Policy vs Off‑Policy, and Incremental Updates

This tutorial explains how Monte Carlo methods are enhanced in reinforcement learning through epsilon‑greedy and epsilon‑soft policies, Monte Carlo control, a Blackjack Q‑function example, the distinction between on‑policy and off‑policy learning, importance sampling, and efficient incremental update techniques.

Epsilon-GreedyImportance SamplingMonte Carlo
0 likes · 14 min read
Monte Carlo Policy Improvement in RL: Epsilon‑Greedy, On‑Policy vs Off‑Policy, and Incremental Updates
Model Perspective
Model Perspective
Dec 20, 2024 · Artificial Intelligence

From Monte Carlo to Deep Learning: How Algorithms Evolved to Power AI

This article traces the evolution of algorithms—from the random‑sampling Monte Carlo method through classic machine‑learning models to modern deep‑learning architectures—highlighting how data, computing power, and scientific demand have driven each breakthrough and hinting at future trends like interpretability, AGI, and quantum algorithms.

Deep LearningMonte Carloalgorithm evolution
0 likes · 8 min read
From Monte Carlo to Deep Learning: How Algorithms Evolved to Power AI
Model Perspective
Model Perspective
Apr 15, 2024 · Fundamentals

Unlocking Model Insights: A Practical Guide to Sobol Sensitivity Analysis

This article introduces the concept and various methods of sensitivity analysis—including one‑factor, multi‑factor, variance‑based, and Monte Carlo approaches—explains Sobol indices, outlines step‑by‑step procedures, and demonstrates their application with a Python case study on urban air‑quality modeling.

Air QualityEnvironmental ModelingMonte Carlo
0 likes · 10 min read
Unlocking Model Insights: A Practical Guide to Sobol Sensitivity Analysis
Model Perspective
Model Perspective
Nov 30, 2022 · Fundamentals

Simulating Stock Prices with Monte Carlo and Brownian Motion in Python

This article explains Brownian motion and Monte Carlo methods, then demonstrates how to model stock price dynamics as a geometric Brownian motion using Python, providing full code for simulating returns, generating price paths, and visualizing multiple trial outcomes.

Brownian MotionMonte CarloPython
0 likes · 9 min read
Simulating Stock Prices with Monte Carlo and Brownian Motion in Python
Model Perspective
Model Perspective
Nov 29, 2022 · Artificial Intelligence

MCMC Demystified: Monte Carlo Basics, Metropolis-Hastings & Gibbs Sampling

Markov Chain Monte Carlo (MCMC) extends classic Monte Carlo by generating dependent samples via a Markov chain, enabling Bayesian inference through concepts like the plug‑in principle, burn‑in, asymptotic independence, and algorithms such as Metropolis‑Hastings and Gibbs sampling, while addressing convergence and effective sample size.

Bayesian inferenceGibbs samplingMCMC
0 likes · 13 min read
MCMC Demystified: Monte Carlo Basics, Metropolis-Hastings & Gibbs Sampling
Model Perspective
Model Perspective
Nov 9, 2022 · Fundamentals

Explore Key Probability & Monte Carlo Models: Curated Resource List

This article compiles recent resources on probability and statistical modeling, covering Monte Carlo simulation, Markov processes, queueing theory, and Bayesian methods, providing direct links to each detailed write‑up for students and researchers seeking comprehensive study material.

Bayesian methodsMarkov processMonte Carlo
0 likes · 3 min read
Explore Key Probability & Monte Carlo Models: Curated Resource List
Model Perspective
Model Perspective
Nov 7, 2022 · Fundamentals

How Simulated Annealing Mimics Physical Annealing to Find Global Optima

Simulated Annealing, inspired by the physical annealing of solids, uses a Monte‑Carlo based stochastic search that gradually lowers temperature to probabilistically accept worse solutions, enabling it to escape local minima and effectively solve combinatorial optimization problems such as TSP, knapsack, and graph coloring.

Monte Carlocombinatorial optimizationoptimization
0 likes · 5 min read
How Simulated Annealing Mimics Physical Annealing to Find Global Optima
Model Perspective
Model Perspective
Oct 22, 2022 · Fundamentals

Unlocking Bayesian Sampling: How MCMC and Hamiltonian Monte Carlo Work

This article explains the principles behind Markov Chain Monte Carlo methods, including Monte Carlo sampling, the Metropolis‑Hastings algorithm, and the Hamiltonian Monte Carlo (HMC) approach, illustrating how they efficiently approximate posterior distributions in Bayesian analysis.

Bayesian inferenceHamiltonian Monte CarloMCMC
0 likes · 11 min read
Unlocking Bayesian Sampling: How MCMC and Hamiltonian Monte Carlo Work
Model Perspective
Model Perspective
Oct 4, 2022 · Artificial Intelligence

How Metropolis-Hastings Improves MCMC Sampling Efficiency

This article explains the detailed‑balance condition for Markov chains, shows why finding a transition matrix for a given stationary distribution is hard, and demonstrates how Metropolis‑Hastings modifies MCMC to achieve higher acceptance rates with a concrete Python example.

MCMCMarkov chainMetropolis-Hastings
0 likes · 9 min read
How Metropolis-Hastings Improves MCMC Sampling Efficiency
Model Perspective
Model Perspective
Oct 2, 2022 · Fundamentals

Why Do Markov Chains Always Converge? A Hands‑On Exploration

This article explains the basic definition of Markov chains, illustrates a stock‑market example with transition matrices, demonstrates convergence through Python simulations, and shows how the steady‑state distribution enables sampling for Monte Carlo methods.

Monte CarloPythonconvergence
0 likes · 10 min read
Why Do Markov Chains Always Converge? A Hands‑On Exploration
Model Perspective
Model Perspective
Sep 28, 2022 · Artificial Intelligence

How Monte Carlo Sampling Powers AI: From Basics to Acceptance-Rejection

This article introduces Monte Carlo methods, explains how random sampling approximates integrals, discusses uniform and non‑uniform probability distributions, and details acceptance‑rejection sampling as a technique for generating samples from complex distributions, laying the groundwork for understanding Markov Chain Monte Carlo in AI.

Acceptance-RejectionMCMCMonte Carlo
0 likes · 8 min read
How Monte Carlo Sampling Powers AI: From Basics to Acceptance-Rejection
Model Perspective
Model Perspective
Sep 23, 2022 · Fundamentals

Mastering Monte Carlo: From Acceptance-Rejection to Gibbs Sampling in Python

This article explains the motivations behind Monte Carlo methods, introduces acceptance-rejection sampling, details Markov Chain Monte Carlo concepts, and walks through Metropolis-Hastings and Gibbs sampling algorithms with Python implementations, highlighting their use in high‑dimensional probability distribution sampling.

MCMCMonte CarloPython
0 likes · 18 min read
Mastering Monte Carlo: From Acceptance-Rejection to Gibbs Sampling in Python
Model Perspective
Model Perspective
Sep 21, 2022 · Fundamentals

Unlocking Monte Carlo Sampling: From Basics to Acceptance‑Rejection in AI

Monte Carlo methods, originally a gambling-inspired random simulation technique, provide a versatile way to approximate integrals and sums, and by using acceptance‑rejection sampling they enable drawing samples from complex probability distributions, a key step toward effective Markov Chain Monte Carlo algorithms in machine learning and AI.

Acceptance-RejectionMCMCMonte Carlo
0 likes · 7 min read
Unlocking Monte Carlo Sampling: From Basics to Acceptance‑Rejection in AI
Model Perspective
Model Perspective
Sep 8, 2022 · Fundamentals

How Monte Carlo Simulation Optimizes Part Parameter Design and Reduces Losses

This article explains how to design part calibration values and tolerances for a product composed of seven components, models the relationship between component parameters and product quality, and uses a Monte Carlo simulation in Python to estimate the average loss per product, illustrating the trade‑off between quality loss and manufacturing cost.

ManufacturingMonte CarloParameter Design
0 likes · 5 min read
How Monte Carlo Simulation Optimizes Part Parameter Design and Reduces Losses
Model Perspective
Model Perspective
Jul 4, 2022 · Fundamentals

Top Model Guides: Clustering, Regression, Queueing & Monte Carlo Simulations

This curated list groups recent explanatory and simulation model articles—covering clustering analysis, linear regression, queueing theory, Markov chains, and Monte Carlo methods—into easy-to-navigate sections for quick reference, helping students and practitioners locate relevant resources efficiently.

ModelingMonte Carloclustering
0 likes · 2 min read
Top Model Guides: Clustering, Regression, Queueing & Monte Carlo Simulations
Model Perspective
Model Perspective
Jun 30, 2022 · Operations

Simulating a Single-Server Queue: Daily Service Count and Wait Times

This article models a single-mechanic repair shop as a single-server queue with exponentially distributed arrivals and uniformly distributed service times, then uses Python to simulate one workday and 1,000 workdays, reporting average daily serviced customers and average customer waiting time.

Monte CarloOperations ResearchPython
0 likes · 4 min read
Simulating a Single-Server Queue: Daily Service Count and Wait Times
Model Perspective
Model Perspective
Jun 24, 2022 · Fundamentals

Unlocking Complex Systems: How Monte Carlo Simulation Transforms Problem Solving

Monte Carlo simulation, a computer-based random sampling technique originating from the Manhattan Project, offers a powerful way to approximate solutions for complex systems with inherent randomness, bypassing unrealistic analytical assumptions by leveraging massive repeated experiments to estimate probabilities and unknown variables.

Monte CarloRandom Samplingcomputational methods
0 likes · 2 min read
Unlocking Complex Systems: How Monte Carlo Simulation Transforms Problem Solving
Model Perspective
Model Perspective
Jun 20, 2022 · Fundamentals

How Monte Carlo Integration Quickly Estimates Double Integrals

This article explains how Monte Carlo methods can approximate definite integrals by randomly sampling points inside a bounding box, showing the geometric interpretation, probability reasoning, and providing a Python implementation that yields a fast low‑precision estimate.

Monte CarloNumerical MethodsPython
0 likes · 3 min read
How Monte Carlo Integration Quickly Estimates Double Integrals
Model Perspective
Model Perspective
Jun 20, 2022 · Fundamentals

Estimating Projectile Hit Probability Inside an Ellipse with Monte Carlo and Numerical Integration

This article demonstrates how to compute the probability that a projectile, whose impact points follow a bivariate normal distribution with 100 m standard deviations, lands inside a given elliptical target by comparing analytical numerical integration with a Monte Carlo simulation implemented in Python.

Bivariate Normal DistributionMonte CarloNumerical Integration
0 likes · 3 min read
Estimating Projectile Hit Probability Inside an Ellipse with Monte Carlo and Numerical Integration
Python Crawling & Data Mining
Python Crawling & Data Mining
Nov 8, 2021 · Fundamentals

Boost Your Python Combinatorial Solver: Faster Code with NumPy & Set Tricks

This article revisits a Python combinatorial challenge—selecting five numbers from a random fifteen‑element list that include consecutive values—presenting an optimized solution that leverages NumPy arrays, set operations, and intersection logic to cut execution time from twelve seconds to about one and a half seconds, while explaining the underlying reasoning.

Combinatorial AlgorithmsMonte CarloPerformance Optimization
0 likes · 6 min read
Boost Your Python Combinatorial Solver: Faster Code with NumPy & Set Tricks
Python Crawling & Data Mining
Python Crawling & Data Mining
Nov 5, 2021 · Fundamentals

How to Generate All 5-Number Combinations Containing Adjacent Values Using Python

This article walks through solving a combinatorial challenge—selecting 5 numbers from a random set of 15 such that a chosen number and its consecutive successor both appear—by presenting multiple Python implementations, from basic random sampling to optimized Monte‑Carlo approaches, complete with code snippets and performance insights.

Monte CarloRandom Samplingcombinatorics
0 likes · 13 min read
How to Generate All 5-Number Combinations Containing Adjacent Values Using Python
DataFunTalk
DataFunTalk
Dec 10, 2019 · Artificial Intelligence

Applying Deep Reinforcement Learning (DQN) to the 2048 Game: Experiments and Insights

This article details a series of reinforcement‑learning experiments on the 2048 game, from random baselines through DQN implementations, classical value‑iteration methods, network redesigns, and Monte‑Carlo tree search, highlighting challenges such as reward design, over‑estimation, and exploration while achieving scores up to 34 000 and tiles of 2048.

2048AIDQN
0 likes · 8 min read
Applying Deep Reinforcement Learning (DQN) to the 2048 Game: Experiments and Insights
Hulu Beijing
Hulu Beijing
Mar 8, 2018 · Artificial Intelligence

Master Common Sampling Techniques: Inverse Transform, Rejection, Importance & MCMC

This article explains the core ideas and step-by-step procedures of widely used sampling methods—including inverse transform, rejection, importance, and Markov Chain Monte Carlo techniques such as Metropolis‑Hastings and Gibbs—highlighting their mathematical foundations, practical implementations, and when each method is appropriate.

Importance SamplingMCMCMonte Carlo
0 likes · 11 min read
Master Common Sampling Techniques: Inverse Transform, Rejection, Importance & MCMC
Architecture Digest
Architecture Digest
Feb 11, 2018 · Artificial Intelligence

Recent Advances in Bayesian Machine Learning: Foundations, Non‑Parametric Methods, and Large‑Scale Applications

This article reviews recent progress in Bayesian machine learning, covering foundational theory, non‑parametric approaches such as Dirichlet and Indian buffet processes, regularized Bayesian inference, and scalable techniques for big‑data environments including stochastic variational methods, distributed algorithms, and hardware acceleration.

Big DataMonte CarloVariational Inference
0 likes · 23 min read
Recent Advances in Bayesian Machine Learning: Foundations, Non‑Parametric Methods, and Large‑Scale Applications
Qunar Tech Salon
Qunar Tech Salon
Aug 8, 2015 · Fundamentals

Monte Carlo Method: Five Illustrative Examples

This article introduces the Monte Carlo method and demonstrates its versatility through five examples covering π estimation, integral calculation, traffic‑jam simulation, product thickness reliability, and securities market profit forecasting, highlighting its simplicity, power, and broad applicability.

Computational MathematicsMonte Carloprobability
0 likes · 6 min read
Monte Carlo Method: Five Illustrative Examples