Tagged articles
240 articles
Page 1 of 3
Data STUDIO
Data STUDIO
Jan 4, 2026 · Industry Insights

Is Python Still the #1 Programming Language in 2026?

The article argues that Python remains the top programming choice in 2026 because its concise syntax, massive ecosystem, and modern tooling deliver unmatched development speed, lower total cost, and a balanced blend of rapid prototyping with long‑term stability for a wide range of applications.

2026AutomationData Science
0 likes · 11 min read
Is Python Still the #1 Programming Language in 2026?
JD Retail Technology
JD Retail Technology
Nov 20, 2025 · Fundamentals

How Heterogeneous Treatment Effect Analysis Uncovers Sub‑Group Performance

This article explains the concept of heterogeneous treatment effects, outlines how to select dimensions for HTE analysis, describes a Python‑based MVP tool for automated CATE exploration, and showcases a real‑world experiment case where sub‑group insights turned a non‑significant overall result into actionable findings.

CATEData Sciencecausal inference
0 likes · 7 min read
How Heterogeneous Treatment Effect Analysis Uncovers Sub‑Group Performance
Data Party THU
Data Party THU
Nov 1, 2025 · Fundamentals

10 Hidden Jupyter Notebook Tricks That Can Save You an Hour Every Day

Discover ten lesser‑known Jupyter Notebook features—from magic commands that list variables and benchmark code to shortcuts, export utilities, and interactive help—that turn a simple notebook into a highly efficient, production‑ready data‑science workspace.

AutomationData ScienceJupyter
0 likes · 7 min read
10 Hidden Jupyter Notebook Tricks That Can Save You an Hour Every Day
Data STUDIO
Data STUDIO
Oct 24, 2025 · Fundamentals

10 Must‑Have Python Project Repositories on GitHub for 2025

Python remains a top language in 2025 thanks to its simple syntax, massive library ecosystem and broad applicability, and this article curates ten GitHub repositories—ranging from AI and data‑science tutorials to automation scripts and beginner‑friendly projects—each explained with concrete reasons why they’re valuable for learning and building real‑world applications.

AutomationData ScienceGitHub
0 likes · 17 min read
10 Must‑Have Python Project Repositories on GitHub for 2025
Model Perspective
Model Perspective
Sep 28, 2025 · Fundamentals

Unlock Hidden Patterns: When to Use PCA vs Factor Analysis

This article explains the core ideas, mathematical steps, geometric intuition, and practical differences between Principal Component Analysis and Factor Analysis, guiding readers on when to apply each technique for dimensionality reduction and latent structure discovery in high‑dimensional data.

Data SciencePCAdimensionality reduction
0 likes · 11 min read
Unlock Hidden Patterns: When to Use PCA vs Factor Analysis
Architects Research Society
Architects Research Society
Sep 11, 2025 · Artificial Intelligence

12 Essential AI Algorithms: Quick Guide to Use Cases & Benefits

This concise guide presents twelve core AI algorithms—from gradient boosting and deep neural networks to decision trees and K‑nearest neighbors—detailing their strengths, typical applications such as fraud detection, image classification, and price forecasting, and offering practical tips for selecting the right model.

AIAlgorithmsData Science
0 likes · 3 min read
12 Essential AI Algorithms: Quick Guide to Use Cases & Benefits
Data Party THU
Data Party THU
Sep 5, 2025 · Big Data

Key Takeaways from the 2025 China University Big Data Challenge

In this reflective case study, a first‑time undergraduate shares how competing in the 2025 China University Big Data Challenge—predicting Shanghai‑Shenzhen 300 index component movements—deepened his understanding of structured time‑series data processing, algorithm adaptability, iterative model optimization, and the broader value of data‑driven problem solving.

Data ScienceModelingTime Series
0 likes · 5 min read
Key Takeaways from the 2025 China University Big Data Challenge
Model Perspective
Model Perspective
Sep 3, 2025 · Artificial Intelligence

Top Free Datasets for AI, ML, and Data Science Projects – A Curated Guide

This article compiles a comprehensive list of high‑quality, publicly available datasets across domains such as general platforms, education, finance, health, text, and vision, providing URLs, key features, and practical usage tips to help researchers and practitioners quickly find the right data for their AI and data‑science projects.

AIData ScienceDatasets
0 likes · 11 min read
Top Free Datasets for AI, ML, and Data Science Projects – A Curated Guide
DataFunTalk
DataFunTalk
Sep 2, 2025 · Operations

How Data Science Transforms Intelligent Supply Chains: Theory and Real‑World Cases

This article introduces the book “Intelligent Supply Chain: Data Science Theory and Practice”, outlining how data science drives end‑to‑end supply‑chain optimization through real‑world case studies, covering topics from data preprocessing to advanced modeling, delivery efficiency, and customer‑service forecasting.

Data ScienceLogistics Optimizationmachine learning
0 likes · 9 min read
How Data Science Transforms Intelligent Supply Chains: Theory and Real‑World Cases
Meituan Technology Team
Meituan Technology Team
Aug 21, 2025 · Fundamentals

How Meituan’s Trusted Experiment Engine Enables Zero‑Barrier A/B Testing

The article introduces Meituan’s trusted experiment analysis engine, detailing its rich methodological library, system architecture, integration options, and a step‑by‑step offline analysis case that together empower teams to conduct reliable, efficient A/B tests without deep statistical expertise.

Data Scienceexperiment analysisplatform engineering
0 likes · 14 min read
How Meituan’s Trusted Experiment Engine Enables Zero‑Barrier A/B Testing
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 16, 2025 · Artificial Intelligence

What Are the Core Concepts Behind AI? From Data to Models Explained

This article walks readers through the fundamentals of artificial intelligence, covering AI, machine learning, deep learning, data types, linear regression, supervised and unsupervised learning, reinforcement learning, feature engineering, tokenization, vectorization, embeddings, and includes a practical Word2Vec code example.

AIData ScienceDeep Learning
0 likes · 21 min read
What Are the Core Concepts Behind AI? From Data to Models Explained
Code Mala Tang
Code Mala Tang
Jul 1, 2025 · Artificial Intelligence

Why Great Code Starts in Your Mind, Not the IDE

This article argues that successful programming and data‑science projects begin with clear problem definition, logical planning, and simple models before any code is written, emphasizing thinking over tools to ensure transparent, maintainable solutions.

AIData ScienceModeling
0 likes · 7 min read
Why Great Code Starts in Your Mind, Not the IDE
Model Perspective
Model Perspective
Jun 5, 2025 · Fundamentals

How Modeling Assumptions Reflect Values and Shape the World

This essay explains that every modeling choice rests on hidden assumptions that encode the modeler's values, showing how different assumptions lead to different perspectives, ethical trade‑offs, and ultimately influence real‑world decisions across domains such as education, insurance, and public policy.

Data ScienceEthicsModeling
0 likes · 10 min read
How Modeling Assumptions Reflect Values and Shape the World
Python Programming Learning Circle
Python Programming Learning Circle
May 15, 2025 · Artificial Intelligence

Python Dominates the TIOBE Index: Trends and Implications

The May 2024 TIOBE ranking shows Python soaring to a record 25.35% share, widening the gap over C++ and cementing its dominance in AI, data science, and automation, while highlighting the continued relevance of languages like Java, C++, and R for various development needs.

Data ScienceTIOBE Indexartificial intelligence
0 likes · 6 min read
Python Dominates the TIOBE Index: Trends and Implications
DevOps Engineer
DevOps Engineer
Apr 25, 2025 · Big Data

Reflections on PyCon LT 2025 Data Day: Sessions on Static Code Analysis, Data Warehouses, Pipelines, and Data Science Tools

The author recounts attending PyCon LT 2025 Data Day, summarizing talks on building a simple static code analyzer with AST, challenges of data warehouses versus data lakes, cloud cost‑scraping pipelines, A/B testing libraries, privacy‑enhancing data processing, and tools like Panel and Dagster, while noting the inspiring presence of female speakers.

DagsterData SciencePanel
0 likes · 7 min read
Reflections on PyCon LT 2025 Data Day: Sessions on Static Code Analysis, Data Warehouses, Pipelines, and Data Science Tools
Test Development Learning Exchange
Test Development Learning Exchange
Mar 16, 2025 · Backend Development

Comprehensive Python Ecosystem Overview: Web Frameworks, HTTP Clients, Databases, Data Analysis, Machine Learning, Image Processing, NLP, CLI, Concurrency, Testing, and Logging

This guide introduces a wide range of Python libraries and tools—including Flask, Django, FastAPI, Requests, HTTPX, SQLAlchemy, Pandas, NumPy, Scikit‑learn, TensorFlow, PyTorch, Pillow, OpenCV, spaCy, Click, asyncio, pytest, and logging—providing concise descriptions and ready‑to‑run code examples for each domain.

Data SciencePythonmachine-learning
0 likes · 7 min read
Comprehensive Python Ecosystem Overview: Web Frameworks, HTTP Clients, Databases, Data Analysis, Machine Learning, Image Processing, NLP, CLI, Concurrency, Testing, and Logging
php Courses
php Courses
Mar 14, 2025 · Fundamentals

Top 10 Application Areas of Python with Sample Code

This article introduces ten major fields where Python excels—including web development, data science, automation, web crawling, game development, desktop applications, DevOps, IoT, blockchain, and education—provides brief overviews, lists common tools or libraries, and supplies concise code examples for each use case.

AutomationBlockchainData Science
0 likes · 9 min read
Top 10 Application Areas of Python with Sample Code
Code Mala Tang
Code Mala Tang
Mar 4, 2025 · Fundamentals

Essential Python Libraries: 100 Must‑Have Packages for Every Developer

This article presents a curated list of 100 essential Python libraries covering web development, AI and machine learning, data science, automation, DevOps, security, databases, game development, and various utility tools, helping developers boost productivity across diverse projects.

Data SciencePythonWeb Development
0 likes · 14 min read
Essential Python Libraries: 100 Must‑Have Packages for Every Developer
DaTaobao Tech
DaTaobao Tech
Jan 24, 2025 · Artificial Intelligence

MktAI Assistant: AI‑Driven Marketing Data Query and Insight Platform

The MktAI Assistant combines LLM‑powered memory, skill planning, and tool‑calling with real‑time API data to replace slow, manual SQL dashboards, delivering sub‑minute, fresh, explainable marketing queries and attribution insights that boost decision speed, accuracy, and collaboration between data scientists and business users.

AI AgentData ScienceFunction Calling
0 likes · 16 min read
MktAI Assistant: AI‑Driven Marketing Data Query and Insight Platform
Test Development Learning Exchange
Test Development Learning Exchange
Jan 22, 2025 · Artificial Intelligence

Comprehensive Guide to Python Data Science Libraries with Code Examples

This article presents a concise tutorial on essential Python data science libraries, covering data cleaning with Pandas, numerical analysis with NumPy and SciPy, visualization with Matplotlib and Seaborn, machine learning with scikit‑learn, NLP with NLTK and spaCy, time‑series modeling, image processing, database access, and parallel computing, each illustrated with ready‑to‑run code examples.

Data ScienceData visualizationNLP
0 likes · 7 min read
Comprehensive Guide to Python Data Science Libraries with Code Examples
Test Development Learning Exchange
Test Development Learning Exchange
Jan 17, 2025 · Artificial Intelligence

Essential Python Libraries for Data Processing, Visualization, and Machine Learning

This article introduces ten essential Python libraries—including SciPy, Matplotlib, Plotly, Scikit‑learn, TensorFlow, spaCy, BeautifulSoup, OpenPyXL, Feather/Parquet, and SQLAlchemy—detailing their primary uses for scientific computing, visualization, machine learning, deep learning, NLP, web scraping, Excel handling, efficient data storage, and ORM, with practical code examples.

Data ScienceNLPPython
0 likes · 8 min read
Essential Python Libraries for Data Processing, Visualization, and Machine Learning
Architects' Tech Alliance
Architects' Tech Alliance
Jan 12, 2025 · Artificial Intelligence

Explore the Full AI Expert Roadmap: From Data Science to Big Data Engineering

The AI Expert Roadmap on GitHub offers a comprehensive, interactive guide covering data‑science fundamentals, machine‑learning algorithms, deep‑learning techniques, data‑engineering pipelines, and big‑data architectures, with linked resources, up‑to‑date references, and practical tool recommendations for aspiring AI professionals.

AIBig DataData Science
0 likes · 6 min read
Explore the Full AI Expert Roadmap: From Data Science to Big Data Engineering
Java Tech Enthusiast
Java Tech Enthusiast
Jan 9, 2025 · Industry Insights

Why Python Dominates the 2024 TIOBE Index and What It Means for Developers

The January 2025 TIOBE Index reveals Python as the 2024 programming language of the year with a 9.3% growth, while C declines, C++ and Java surge, PHP exits the top ten, and Rust, Kotlin, and Go show varied trajectories, underscoring Python’s dominance across data science, web development, and automation.

AIData ScienceProgramming Language Trends
0 likes · 7 min read
Why Python Dominates the 2024 TIOBE Index and What It Means for Developers
Test Development Learning Exchange
Test Development Learning Exchange
Dec 15, 2024 · Fundamentals

45 Common NumPy Operations with Code Examples

This article presents a comprehensive guide to 45 essential NumPy operations, covering array creation, reshaping, arithmetic, statistical functions, linear algebra, and more, each illustrated with concise explanations and ready-to-run Python code examples to help readers efficiently leverage NumPy for scientific computing.

ArrayData ScienceNumPy
0 likes · 18 min read
45 Common NumPy Operations with Code Examples
Test Development Learning Exchange
Test Development Learning Exchange
Nov 22, 2024 · Artificial Intelligence

Introduction to Data Modeling with Scikit-Learn

This article provides a comprehensive guide to using Scikit-Learn for data modeling, covering linear regression and decision tree algorithms, including data preparation, model training, evaluation metrics, and visualization techniques for predictive analysis.

Data ScienceDecision TreesPython
0 likes · 4 min read
Introduction to Data Modeling with Scikit-Learn
Python Programming Learning Circle
Python Programming Learning Circle
Nov 22, 2024 · Artificial Intelligence

Introducing the Python "communities" Library for Graph Clustering and Visualization

This article introduces the Python "communities" library, explains its support for multiple graph clustering algorithms such as Louvain and Girvan‑Newman, demonstrates how to import algorithms, build adjacency matrices, visualize communities, create animation of the clustering process, and provides author and resource information.

Data ScienceLouvain Algorithmcommunity-detection
0 likes · 7 min read
Introducing the Python "communities" Library for Graph Clustering and Visualization
Test Development Learning Exchange
Test Development Learning Exchange
Nov 16, 2024 · Artificial Intelligence

Basic Operations with NumPy Arrays in Python

This tutorial introduces NumPy array creation, manipulation, and fundamental mathematical operations, providing step‑by‑step code examples for importing the library, generating arrays with various functions, reshaping, indexing, slicing, and computing mean, variance, and standard deviation.

Array OperationsData ScienceNumPy
0 likes · 6 min read
Basic Operations with NumPy Arrays in Python
JD Cloud Developers
JD Cloud Developers
Nov 6, 2024 · Artificial Intelligence

How Data Science Powers JD’s Logistics, Finance, and Healthcare Innovations

This article explains the fundamentals of data science, its key components, and showcases how JD applies it across e‑commerce, finance, healthcare, and logistics, while also reviewing past innovations, common project pitfalls, and future directions such as quantum computing and supply‑chain digital twins.

Data ScienceHealthcareQuantum Computing
0 likes · 21 min read
How Data Science Powers JD’s Logistics, Finance, and Healthcare Innovations
21CTO
21CTO
Nov 5, 2024 · Artificial Intelligence

Why Python Overtook JavaScript on GitHub: AI Projects Fuel Explosive Growth

The 2024 GitHub Octoverse report reveals that Python has surpassed JavaScript as the most used language, driven by a surge in generative AI projects and a 92% rise in Jupyter Notebook usage, highlighting the expanding role of AI and data‑science communities on the platform.

AIData ScienceGitHub
0 likes · 5 min read
Why Python Overtook JavaScript on GitHub: AI Projects Fuel Explosive Growth
Python Programming Learning Circle
Python Programming Learning Circle
Oct 7, 2024 · Fundamentals

50 Classic Python Libraries You Can Master Quickly

This article presents a curated list of fifty essential Python libraries spanning data analysis, scientific computing, visualization, machine learning, web development, database access, testing, and utilities, providing brief descriptions to help developers quickly identify and master the most useful tools in the Python ecosystem.

Data ScienceWeb Developmentlibraries
0 likes · 7 min read
50 Classic Python Libraries You Can Master Quickly
Liangxu Linux
Liangxu Linux
Sep 28, 2024 · Databases

10 Advanced SQL Concepts Every Data Scientist Should Master

This guide walks through ten essential advanced SQL concepts—including CTEs, recursive queries, temporary functions, CASE‑based pivoting, EXCEPT vs NOT IN, self‑joins, ranking functions, delta calculations, cumulative totals, and date‑time manipulation—providing clear explanations and runnable examples to help data‑science professionals ace interview challenges.

Advanced QueriesCTEData Science
0 likes · 11 min read
10 Advanced SQL Concepts Every Data Scientist Should Master
Liangxu Linux
Liangxu Linux
Sep 19, 2024 · Databases

10 Advanced SQL Concepts Every Data Scientist Should Master

This guide walks through ten essential advanced SQL techniques—including CTEs, recursive CTEs, temporary functions, CASE‑WHEN pivots, EXCEPT vs NOT IN, self‑joins, ranking functions, delta calculations with LAG/LEAD, cumulative sums, and date‑time manipulation—to help data professionals ace interview challenges and write cleaner, more powerful queries.

Advanced SQLCTEData Science
0 likes · 11 min read
10 Advanced SQL Concepts Every Data Scientist Should Master
21CTO
21CTO
Sep 14, 2024 · Fundamentals

Why Python Remains the Top Choice for New Developers in 2024

The 2024 JetBrains and Python Software Foundation survey of 25,000 developers reveals Python's continued dominance, detailed demographics, tool preferences, language trends, age distribution, web and data‑science usage, cloud adoption, and security insights, highlighting why newcomers still favor Python as their first language.

Data SciencePythonWeb Development
0 likes · 9 min read
Why Python Remains the Top Choice for New Developers in 2024
21CTO
21CTO
Sep 11, 2024 · Fundamentals

Python, Julia, Rust: Pros, Cons, and Best Use Cases for Data Science

An in‑depth comparison of Python, Julia, and Rust reveals each language’s strengths and weaknesses for data‑science tasks, covering ease of development, library ecosystems, performance, deployment challenges, and suitability for rapid prototyping versus high‑performance, memory‑safe applications.

Data ScienceJuliaPython
0 likes · 9 min read
Python, Julia, Rust: Pros, Cons, and Best Use Cases for Data Science
21CTO
21CTO
Sep 5, 2024 · Artificial Intelligence

Python vs Julia vs Rust: Which Language Wins for Data Science?

This article compares Python, Julia, and Rust—the three leading languages in data science—detailing their strengths, ecosystem support, performance, and deployment challenges to help developers choose the most suitable tool for their projects.

Data ScienceEcosystemJulia
0 likes · 9 min read
Python vs Julia vs Rust: Which Language Wins for Data Science?
DataFunSummit
DataFunSummit
Jul 26, 2024 · Big Data

Understanding Power Law Distributions in Content Ecosystems: Data Science Insights and Applications

This article explores how power‑law and other heavy‑tailed distributions appear in content ecosystems, explains their statistical foundations, discusses why they are common, and presents data‑driven strategies—including integer programming, graph‑based creator analysis, and causal inference—to optimize content production, recommendation, and settlement policies.

Big DataData SciencePower Law
0 likes · 18 min read
Understanding Power Law Distributions in Content Ecosystems: Data Science Insights and Applications
DataFunSummit
DataFunSummit
Jul 13, 2024 · Artificial Intelligence

Causal Inference Knowledge Map: Framework, Application Evaluation, Typical Algorithms, Implementation Challenges, and JD Tech Credit Decision Model

This article presents a comprehensive knowledge map of causal inference covering its overall framework, how to evaluate decision‑making scenarios, typical causal algorithms, practical challenges in deployment, a JD Tech credit‑limit case study, and future research directions.

Data ScienceDecision Modelingalgorithm
0 likes · 15 min read
Causal Inference Knowledge Map: Framework, Application Evaluation, Typical Algorithms, Implementation Challenges, and JD Tech Credit Decision Model
21CTO
21CTO
Jul 1, 2024 · Fundamentals

Explore Positron: The Next‑Gen VS Code‑Based IDE for R and Python

Positron, the new beta IDE from Posit built on Visual Studio Code, offers a ready‑to‑use, cross‑platform environment for R and Python with integrated data exploration tools, seamless language switching, and easy extension management via OpenVSX, while still being an early‑stage project.

Data ScienceIDEOpenVSX
0 likes · 5 min read
Explore Positron: The Next‑Gen VS Code‑Based IDE for R and Python
Python Programming Learning Circle
Python Programming Learning Circle
Jun 27, 2024 · Artificial Intelligence

Eight Python Libraries to Accelerate Data Science and Machine Learning Workflows

This article introduces eight Python libraries—Optuna, ITMO_FS, Shap-hypetune, PyCaret, floWeaver, Gradio, Terality, and Torch-Handle—that streamline data science tasks such as hyperparameter optimization, feature selection, model building, visualization, and rapid prototyping, helping users save coding time and improve productivity.

AutomationData SciencePython
0 likes · 11 min read
Eight Python Libraries to Accelerate Data Science and Machine Learning Workflows
Ops Development & AI Practice
Ops Development & AI Practice
Jun 26, 2024 · Fundamentals

Why Jupyter Notebooks Revolutionized Data Science and Machine Learning

This article explores the origins, key innovations, and lasting impact of Jupyter notebooks, highlighting how their multi‑language support, interactive computing, reproducibility, and extensibility have transformed data exploration, collaboration, education, and research in modern data science and machine learning.

Data ScienceInteractive ComputingJupyter
0 likes · 5 min read
Why Jupyter Notebooks Revolutionized Data Science and Machine Learning
DataFunTalk
DataFunTalk
May 25, 2024 · Fundamentals

Systematic Solutions to the AA Problem in Random Experiments

This talk explains how combining heavy randomization with regression adjustment can effectively mitigate AA problems in A/B testing, improving experiment credibility by addressing covariate imbalance and enhancing result validity for data‑driven decision making.

A/B testingAA problemData Science
0 likes · 2 min read
Systematic Solutions to the AA Problem in Random Experiments
Python Programming Learning Circle
Python Programming Learning Circle
May 11, 2024 · Artificial Intelligence

A Comprehensive Overview of Popular Python Libraries for Artificial Intelligence and Data Science

This article introduces and demonstrates more than twenty widely used Python libraries for artificial intelligence, computer vision, natural language processing, and data analysis, providing concise explanations and runnable code snippets that illustrate each library's core functionality and typical use cases.

Data ScienceNumPyPyTorch
0 likes · 29 min read
A Comprehensive Overview of Popular Python Libraries for Artificial Intelligence and Data Science
Python Programming Learning Circle
Python Programming Learning Circle
Apr 2, 2024 · Artificial Intelligence

Overview of Common Python Libraries for Artificial Intelligence and Data Science with Code Examples

This article provides a comprehensive introduction to popular Python libraries for artificial intelligence, computer vision, data analysis, and machine learning—such as NumPy, OpenCV, scikit‑image, Pillow, TensorFlow, PyTorch, and many others—accompanied by concise code snippets and performance comparisons to help beginners select suitable tools.

AI librariesCode ExamplesData Science
0 likes · 33 min read
Overview of Common Python Libraries for Artificial Intelligence and Data Science with Code Examples
Python Programming Learning Circle
Python Programming Learning Circle
Mar 23, 2024 · Artificial Intelligence

Eight Python Libraries to Accelerate Data‑Science Workflows

This article introduces eight Python libraries—including Optuna, ITMO_FS, shap‑hypetune, PyCaret, floWeaver, Gradio, Terality, and Torch‑Handle—that streamline data‑science tasks such as hyperparameter optimization, feature selection, model building, visualization, and deployment, helping users save coding time and improve productivity.

AutomationData SciencePython
0 likes · 12 min read
Eight Python Libraries to Accelerate Data‑Science Workflows
TAL Education Technology
TAL Education Technology
Mar 20, 2024 · Artificial Intelligence

Understanding AI: From Brain Differences to Data Science Practices and Large Model Applications

This article explains why current AI cannot achieve self‑awareness, outlines data‑science steps for large models—including preprocessing, exploratory analysis, modeling, and evaluation—then surveys general and vertical applications of large language models and details a complete machine‑learning workflow with transformer fine‑tuning techniques.

AIApplicationsData Science
0 likes · 14 min read
Understanding AI: From Brain Differences to Data Science Practices and Large Model Applications
21CTO
21CTO
Mar 12, 2024 · Artificial Intelligence

Top 10 Python Libraries Every Data Scientist Must Master in 2024

Discover the essential Python libraries for data science in 2024, from versatile tools like Taipy and Pandas to powerful machine‑learning frameworks such as TensorFlow, PyTorch, and Scikit‑Learn, each with key features, use‑cases, and GitHub links to boost your analytics career.

AIData SciencePython
0 likes · 7 min read
Top 10 Python Libraries Every Data Scientist Must Master in 2024
Python Programming Learning Circle
Python Programming Learning Circle
Mar 12, 2024 · Fundamentals

Visual Guide to NumPy: Creating Arrays, Operations, Indexing, and Applications

This tutorial provides a visual, step‑by‑step guide to NumPy, covering array creation, arithmetic and broadcasting, indexing, aggregation, matrix operations, reshaping, and practical examples such as computing mean‑squared error for machine‑learning models, illustrated with code snippets and diagrams.

Array OperationsData SciencePython
0 likes · 10 min read
Visual Guide to NumPy: Creating Arrays, Operations, Indexing, and Applications
21CTO
21CTO
Mar 7, 2024 · Artificial Intelligence

Build Production-Ready AI Web Apps in Python with Taipy—No Frontend Code Needed

Taipy is a free, open‑source Python framework that lets data scientists and machine‑learning engineers create full‑stack, production‑grade web applications without writing HTML, JavaScript or CSS, offering three components—frontend, backend pipeline, and REST API—plus integration with common data sources and low‑code UI tools.

AIData ScienceFramework
0 likes · 6 min read
Build Production-Ready AI Web Apps in Python with Taipy—No Frontend Code Needed
DataFunTalk
DataFunTalk
Feb 5, 2024 · Fundamentals

User Portrait Tagging: Construction, Feature Processing, and Evaluation

This article explains how to build user portrait tags—from basic attribute tags to business and strategy tags—covers methods for data collection, anomaly handling, time decay, smoothing, and evaluates tag quality using cohesion, stability, and AUC-related metrics to support data‑driven product decisions.

Data ScienceEvaluation Metricsfeature engineering
0 likes · 12 min read
User Portrait Tagging: Construction, Feature Processing, and Evaluation
Python Programming Learning Circle
Python Programming Learning Circle
Jan 9, 2024 · Artificial Intelligence

Overview of Common Python Libraries for Artificial Intelligence with Code Examples

This article provides a comprehensive introduction to popular Python libraries used in artificial intelligence, such as NumPy, OpenCV, scikit-image, Pillow, SimpleCV, Mahotas, Ilastik, Scikit-learn, SciPy, NLTK, spaCy, LibROSA, Pandas, Matplotlib, Seaborn, Orange, PyBrain, Theano, Keras, Caffe, MXNet, PaddlePaddle, CNTK, and more, including code snippets and usage examples.

AIData SciencePython
0 likes · 34 min read
Overview of Common Python Libraries for Artificial Intelligence with Code Examples
Test Development Learning Exchange
Test Development Learning Exchange
Dec 29, 2023 · Artificial Intelligence

Introduction to Essential Python Data Science Libraries with Example Code

This article introduces key Python libraries for data analysis, visualization, statistical modeling, and machine learning—including NumPy, Pandas, Matplotlib, Seaborn, SciPy, Statsmodels, Scikit-learn, BeautifulSoup, TensorFlow, and Plotly—each accompanied by concise code examples demonstrating their core functionality.

Data SciencePythonlibraries
0 likes · 6 min read
Introduction to Essential Python Data Science Libraries with Example Code
DataFunTalk
DataFunTalk
Dec 14, 2023 · Fundamentals

Evaluating Long-Term vs Short-Term Effects in A/B Experiments

While A/B testing is widely used for data-driven decisions, short-term experimental results often diverge from long-term impacts, leading to misguided strategies; this article examines why such inconsistencies arise and reviews major methods—including extended experiments, holdout groups, post‑analysis, CCD, and surrogate‑metric modeling—to reliably estimate long‑term effects.

A/B testingData ScienceLong-term impact
0 likes · 13 min read
Evaluating Long-Term vs Short-Term Effects in A/B Experiments
DataFunSummit
DataFunSummit
Dec 6, 2023 · Artificial Intelligence

Huya's Experiment Science Platform: Causal Inference, AB Testing, and Uplift Modeling Practices

Huya’s data‑driven experiment platform showcases how causal inference, AB testing, and uplift modeling are applied to advertising, user activation, and growth scenarios, detailing platform evolution, metric design, statistical challenges, and practical solutions such as multi‑test correction, CUPED, RTA, and propensity‑score methods.

AB testingData ScienceExperiment Platform
0 likes · 18 min read
Huya's Experiment Science Platform: Causal Inference, AB Testing, and Uplift Modeling Practices
DaTaobao Tech
DaTaobao Tech
Dec 1, 2023 · Artificial Intelligence

Design, Evaluation, and Production of a VOC Tagging System for Taobao User Experience

Taobao’s Technical Industry Data team designed a four‑level VOC tagging hierarchy to unify fragmented user‑feedback sources, evaluated label similarity with vector‑based distance matrices, optimized tag groups via entropy‑driven re‑grouping, built a stacking ensemble of FastText and TextCNN achieving over 90% accuracy, and deployed an automated production pipeline that generates tags, maintains ODPS tables, and provides APIs for rapid experimentation.

Data ScienceNLPTagging
0 likes · 18 min read
Design, Evaluation, and Production of a VOC Tagging System for Taobao User Experience
Huolala Tech
Huolala Tech
Dec 1, 2023 · Product Management

Tackling AB Testing Pitfalls in Freight Bilateral Markets

This article explores how freight platforms can optimize transaction strategies through AB experiments, detailing common challenges such as split‑testing interference, SUTVA violations, capacity competition, homogeneity issues, and Simpson's paradox, and presents practical solutions like time‑slice routing, city isolation, and advanced statistical corrections.

AB testingData Sciencebilateral market
0 likes · 14 min read
Tackling AB Testing Pitfalls in Freight Bilateral Markets
Model Perspective
Model Perspective
Nov 2, 2023 · Artificial Intelligence

Why Mathematical Modelers Must Embrace LLMs and Forget Outdated Skills

The article explains how rapid advances in data and large language models force mathematical modelers to continuously update their models and skills, discard obsolete knowledge, and adopt lifelong learning to stay effective in a fast‑changing AI‑driven environment.

Data Scienceartificial intelligencecontinuous learning
0 likes · 6 min read
Why Mathematical Modelers Must Embrace LLMs and Forget Outdated Skills
Architects Research Society
Architects Research Society
Oct 30, 2023 · Big Data

Essential Data Science Tools for Elevating Analytics Operations

The article surveys the most important data‑science tools—including Jupyter Notebooks, notebook lab platforms, RStudio, Sweave/Knitr, IDEs, domain‑specific solutions, hardware, and data sources—explaining how they support modern, real‑time analytics and help organizations turn raw data into actionable insights.

Data ScienceIDEJupyter
0 likes · 10 min read
Essential Data Science Tools for Elevating Analytics Operations
Huolala Tech
Huolala Tech
Oct 27, 2023 · R&D Management

How to Overcome Experimentation Challenges in Freight Two‑Sided Markets

This article examines the unique characteristics of freight two‑sided markets, outlines the experimental challenges across transaction, pricing, marketing, and product scenarios, and presents a comprehensive technical framework—including allocation strategies, homogeneity controls, efficient interpretation, and observational study methods—to achieve reliable, actionable insights.

Data Sciencecausal inferenceexperiment design
0 likes · 12 min read
How to Overcome Experimentation Challenges in Freight Two‑Sided Markets
21CTO
21CTO
Oct 23, 2023 · Fundamentals

What the 2022 JetBrains Python Survey Reveals About Language Trends and Tooling

The 2022 JetBrains Python Developer Survey of over 23,000 respondents shows 93% now use Python 3, highlights the decline of Python 2, rising popularity of FastAPI, dominant IDEs like PyCharm and VS Code, and shifting preferences in frameworks, databases, big‑data tools, and operating systems.

Data ScienceIDEPython
0 likes · 5 min read
What the 2022 JetBrains Python Survey Reveals About Language Trends and Tooling
DataFunSummit
DataFunSummit
Oct 17, 2023 · Artificial Intelligence

DataFunSummit2023: Deep Learning‑Driven Multi‑Experiment Causal Inference and Distributed Causal Tools

The DataFunSummit2023 online conference brings together experts from Tencent and Kuaishou to present cutting‑edge research on causal inference for large‑scale A/B testing, including deep‑learning‑based multi‑experiment effect estimation, a distributed causal inference framework (Fast‑Causal‑Inference), and strategies for evaluating long‑term policy impacts.

A/B testingData ScienceDeep Learning
0 likes · 7 min read
DataFunSummit2023: Deep Learning‑Driven Multi‑Experiment Causal Inference and Distributed Causal Tools
Test Development Learning Exchange
Test Development Learning Exchange
Sep 11, 2023 · Fundamentals

Why Do Data Analysis? 10 Practical Python Data Analysis Scenarios with Code Examples

The article explains the importance of data analysis for business insight, problem detection, decision support, operational optimization, forecasting, and competitiveness, and then presents ten practical Python code scenarios covering data loading, cleaning, filtering, aggregation, visualization, statistics, transformation, time‑series analysis, export, and machine‑learning applications.

Data ScienceData visualizationPython
0 likes · 7 min read
Why Do Data Analysis? 10 Practical Python Data Analysis Scenarios with Code Examples
DaTaobao Tech
DaTaobao Tech
Sep 8, 2023 · Product Management

BPPISE Framework for Product Data Science Case Studies

The fourth article in a ten‑part Taobao series introduces the BPPISE framework—Business, Problem, Data, Insight, Strategy, Evaluation—as a product‑data‑science case structure, contrasting it with CRISP‑DM, detailing each stage, offering writing tips, and noting the team’s recruitment for data‑science talent.

BPPISEData ScienceFramework
0 likes · 9 min read
BPPISE Framework for Product Data Science Case Studies
Model Perspective
Model Perspective
Aug 26, 2023 · Artificial Intelligence

75 Essential Data Science Terms Every Practitioner Must Know

This article compiles a comprehensive alphabetically ordered list of 75 crucial data science and machine learning terms—from accuracy and AUC to zero-shot learning—providing concise definitions that help practitioners quickly grasp essential concepts and improve their analytical vocabulary.

AI termsData ScienceGlossary
0 likes · 13 min read
75 Essential Data Science Terms Every Practitioner Must Know
Python Programming Learning Circle
Python Programming Learning Circle
Aug 3, 2023 · Artificial Intelligence

A Comprehensive Overview of Popular Python Libraries for Artificial Intelligence with Sample Code

This article provides a concise yet thorough introduction to the most commonly used Python AI libraries—including NumPy, OpenCV, scikit‑image, Pillow, SimpleCV, Mahotas, Ilastik, scikit‑learn, LibROSA, Pandas, Matplotlib, Seaborn, Orange, PyBrain, Milk, TensorFlow, PyTorch, Theano, Keras, MXNet, PaddlePaddle and CNTK—explaining their main features and offering ready‑to‑run code examples for each.

AI librariesData Science
0 likes · 34 min read
A Comprehensive Overview of Popular Python Libraries for Artificial Intelligence with Sample Code
DataFunSummit
DataFunSummit
Jul 16, 2023 · Game Development

Applying A/B Testing to Drive Growth in Tencent’s Overseas Games

This article explains how Tencent leverages A/B testing across its overseas games, detailing the current market situation, experimental capabilities, multi‑cloud platform architecture, and case studies that illustrate how data‑driven experiments improve user retention, engagement, and overall business growth.

A/B testingData ScienceGame Development
0 likes · 12 min read
Applying A/B Testing to Drive Growth in Tencent’s Overseas Games
Programmer DD
Programmer DD
Jul 13, 2023 · Artificial Intelligence

Unlock Interactive Kotlin Development with the New Kotlin Notebook Plugin

IntelliJ IDEA's Kotlin Notebook plugin brings interactive notebooks to Kotlin, enabling developers to combine code, visualizations, and narrative in a single document, supporting rapid prototyping, data analysis, and AI workflows with seamless library integration and flexible execution controls.

Data ScienceIntelliJ IDEAInteractive Development
0 likes · 4 min read
Unlock Interactive Kotlin Development with the New Kotlin Notebook Plugin
Test Development Learning Exchange
Test Development Learning Exchange
Jul 12, 2023 · Fundamentals

Common Python Libraries and Practical Projects: NumPy, Pandas, Matplotlib, Scikit‑learn, Requests, Beautiful Soup, Selenium, Pygame, Flask, PyTorch

This article introduces ten widely used Python libraries—NumPy, Pandas, Matplotlib, Scikit‑learn, Requests, Beautiful Soup, Selenium, Pygame, Flask, and PyTorch—each accompanied by a concise real‑world project and complete code examples to help readers understand and apply them effectively.

Data ScienceDeep LearningGame Development
0 likes · 18 min read
Common Python Libraries and Practical Projects: NumPy, Pandas, Matplotlib, Scikit‑learn, Requests, Beautiful Soup, Selenium, Pygame, Flask, PyTorch
Architecture Digest
Architecture Digest
Jul 4, 2023 · Fundamentals

JupyterLab 4.0: New Features, Performance Boosts, and Extension Management

JupyterLab 4.0 introduces a flexible multi‑document interface, customizable layouts, an integrated file browser, modular extensibility, built‑in terminal, significant performance improvements, a separate real‑time collaboration package, and a PyPI‑based extension manager, while noting that AI‑assisted coding tools may now outpace it.

Data ScienceExtensionsIDE
0 likes · 4 min read
JupyterLab 4.0: New Features, Performance Boosts, and Extension Management
Architects Research Society
Architects Research Society
Jul 1, 2023 · Artificial Intelligence

What Is Data Science? Definitions, Work Processes, and Roles – Reflections on a Decade of Data Science and Future Visualization Tools

This article reviews a decade of data‑science growth, defines data science as a multidisciplinary field, outlines its four high‑level and fourteen low‑level work processes, categorises nine distinct data‑science roles, and discusses how these insights should shape the next generation of data‑visualisation and analysis tools.

AIAnalyticsData Science
0 likes · 12 min read
What Is Data Science? Definitions, Work Processes, and Roles – Reflections on a Decade of Data Science and Future Visualization Tools
Python Programming Learning Circle
Python Programming Learning Circle
Jun 12, 2023 · Fundamentals

Mojo: A Python‑Compatible Language with Rust‑Level Performance

The article introduces Mojo, a new programming language positioned as a superset of Python that offers Rust‑like speed and safety through ahead‑of‑time compilation, discusses its early‑stage features, performance claims, online playground, and evaluates its potential to complement or replace Python in data‑science and high‑performance scenarios.

Data ScienceMojoProgramming Language
0 likes · 8 min read
Mojo: A Python‑Compatible Language with Rust‑Level Performance
Python Programming Learning Circle
Python Programming Learning Circle
May 12, 2023 · Fundamentals

Comprehensive Guide to NumPy Array Creation, Operations, and Manipulation

This article provides an extensive tutorial on NumPy, covering array creation functions such as array, linspace, arange, and random generators, a wide range of array operations including reshaping, slicing, statistical calculations, set operations, splitting, stacking, comparison, repetition, and data persistence, all illustrated with clear Python code examples.

Array OperationsData Science
0 likes · 17 min read
Comprehensive Guide to NumPy Array Creation, Operations, and Manipulation
21CTO
21CTO
Apr 2, 2023 · Artificial Intelligence

Which Jobs Will Vanish and Which Will Thrive with ChatGPT?

The article examines how ChatGPT automates many manual tasks, reducing demand for translators, editors, customer support reps, and data analysts, while boosting opportunities for chatbot developers, NLP engineers, data scientists, content creators, and software developers, and outlines broader industry advancements driven by AI.

AI ImpactChatGPTData Science
0 likes · 7 min read
Which Jobs Will Vanish and Which Will Thrive with ChatGPT?
Python Programming Learning Circle
Python Programming Learning Circle
Mar 27, 2023 · Artificial Intelligence

Top 10 Machine Learning Algorithms: Concepts, Uses, and Key Characteristics

This article introduces the No‑Free‑Lunch principle in machine learning and provides concise explanations of ten fundamental supervised‑learning algorithms—including linear regression, logistic regression, LDA, decision trees, Naïve Bayes, K‑Nearest Neighbors, LVQ, SVM, random forest, and boosting—highlighting their mathematical basis, typical applications, advantages, and limitations.

Data Scienceartificial intelligencemachine learning
0 likes · 12 min read
Top 10 Machine Learning Algorithms: Concepts, Uses, and Key Characteristics
Python Crawling & Data Mining
Python Crawling & Data Mining
Mar 24, 2023 · Fundamentals

What Are the Three Levels of Business Analytics? A Simple Classification Explained

Business analytics, data science, and related fields are gaining prominence, and this article outlines a straightforward three‑layer classification—descriptive, predictive, and prescriptive analytics—explaining their purposes, techniques, and how organizations progress through these maturity levels to turn data into actionable insights.

Business AnalyticsData Sciencedescriptive analytics
0 likes · 9 min read
What Are the Three Levels of Business Analytics? A Simple Classification Explained
DataFunSummit
DataFunSummit
Feb 9, 2023 · Operations

Designing Experiments for Bilateral Markets in Advertising Platforms

This article explains how to design and evaluate experiments for bilateral markets in advertising platforms, covering the limitations of traditional randomization, the four‑cell traffic‑advertisement experiment, various mitigation strategies such as counterfactual interleaving and joint sampling, and the use of a simulation system to validate methods.

A/B testingData Scienceadvertising experiment
0 likes · 15 min read
Designing Experiments for Bilateral Markets in Advertising Platforms
DataFunTalk
DataFunTalk
Jan 29, 2023 · Artificial Intelligence

Data Science Practices in E‑commerce Search: Experimentation, Causal Inference, and Metric Design

This article presents the JD Retail search data‑science team's practical approaches to e‑commerce search, covering the scene’s unique data characteristics, order attribution methods, AB experiment design, causal‑inference frameworks, variance‑reduction techniques, quasi‑experimental evaluations, and metric design for traffic distribution, all illustrated with real‑world examples and visualizations.

Data Sciencecausal inferencee‑commerce
0 likes · 18 min read
Data Science Practices in E‑commerce Search: Experimentation, Causal Inference, and Metric Design
Tencent Cloud Developer
Tencent Cloud Developer
Dec 2, 2022 · Artificial Intelligence

Football Match Prediction Using Machine Learning and Betting Strategy Analysis

The study applies machine‑learning models—including logistic regression, SVM, random forest, deep neural networks and a DNN‑SVM ensemble—to 17‑dimensional team features and 51‑dimensional bookmaker odds, achieving up to 54.5% match‑outcome accuracy, proposing a profit‑condition betting strategy and extending the approach to stock‑price forecasting.

Betting StrategyData ScienceRandom Forest
0 likes · 21 min read
Football Match Prediction Using Machine Learning and Betting Strategy Analysis
DataFunTalk
DataFunTalk
Nov 20, 2022 · Artificial Intelligence

Construction of Generalized Causal Forests and Their Application in Online Transaction Markets

On November 26, 2022, at the DataFun Summit 2022 online causal inference conference, PhD candidate Wan Shu from Arizona State University will present a talk titled “Construction of Generalized Causal Forests and Their Application in Online Transaction Markets,” covering treatment effect estimation, model building, performance comparison, and practical use cases.

Data ScienceGeneralized Causal ForestOnline Market
0 likes · 3 min read
Construction of Generalized Causal Forests and Their Application in Online Transaction Markets
Bitu Technology
Bitu Technology
Nov 18, 2022 · Fundamentals

Tubi’s Switchback Experiment Platform: Design, Challenges, and Solutions

The article describes Tubi’s internal experimentation platform, explaining how traditional user‑group A/B tests can suffer from network interference and how Switchback experiments—time‑window based designs—address these issues, detailing their implementation, statistical methods, and the practical challenges overcome.

A/B testingData ScienceSwitchback experiments
0 likes · 12 min read
Tubi’s Switchback Experiment Platform: Design, Challenges, and Solutions
Practical DevOps Architecture
Practical DevOps Architecture
Nov 15, 2022 · Fundamentals

Comprehensive Programming Course Curriculum Overview

This article presents a detailed curriculum covering programming fundamentals, web development (HTML, CSS, JavaScript, jQuery), backend frameworks (Flask, Django), database concepts, data analysis with Python (NumPy, pandas, matplotlib, seaborn), machine learning, AI, and related project tutorials, along with a brief promotional note.

CurriculumData Sciencemachine learning
0 likes · 12 min read
Comprehensive Programming Course Curriculum Overview
Model Perspective
Model Perspective
Nov 13, 2022 · Fundamentals

Essential Python Programming Resources: Basics, Libraries, and Modeling Tools

This article compiles a curated list of Python programming fundamentals and popular third‑party libraries for data processing, visualization, scientific computing, optimization, and more, linking to detailed tutorials for each topic, including Numpy, Matplotlib, Pandas, Scipy, PuLP, NetworkX, Geatpy, Numba, and Taichi.

Data Sciencelibrariesprogramming
0 likes · 4 min read
Essential Python Programming Resources: Basics, Libraries, and Modeling Tools
Model Perspective
Model Perspective
Oct 12, 2022 · Fundamentals

Mastering Model‑Centric Statistics: From Exploratory Analysis to Bayesian Inference

This article explains how statistics—through data collection, exploratory analysis, descriptive metrics, visualization, and model‑centric inference—provides a framework for understanding and predicting phenomena, emphasizing the role of programming (e.g., Python) and Bayesian modeling principles such as simplicity and the Occam razor.

Data ScienceModelingexploratory data analysis
0 likes · 6 min read
Mastering Model‑Centric Statistics: From Exploratory Analysis to Bayesian Inference
MaGe Linux Operations
MaGe Linux Operations
Sep 24, 2022 · Fundamentals

Boost Your Jupyter Notebook Productivity with 5 Essential Extensions

This guide walks you through installing Jupyter Notebook extensions, explains why they improve workflow, and highlights five must‑have add‑ons—including Table of Contents, Autopep8, Variable Inspector, ExecuteTime, and Hide Code—to streamline data‑science tasks.

Data ScienceExtensionsJupyter Notebook
0 likes · 7 min read
Boost Your Jupyter Notebook Productivity with 5 Essential Extensions
21CTO
21CTO
Sep 22, 2022 · Fundamentals

Why Every Developer Needs to Master Software Frameworks (And How to Get Started)

This article explains what software frameworks are, why they are essential for developers, outlines the main types—including frontend, backend, mobile, and data‑science frameworks—and offers practical advice on how beginners can start learning them effectively.

Data ScienceDevelopmentMobile
0 likes · 12 min read
Why Every Developer Needs to Master Software Frameworks (And How to Get Started)