Tagged articles
466 articles
Page 5 of 5
MaGe Linux Operations
MaGe Linux Operations
Sep 2, 2020 · Fundamentals

Boost Your Python Data Analysis: 20 Essential Jupyter Tips & Tricks

This article compiles a collection of practical Python and Jupyter Notebook tips—including pandas profiling, interactive plotting with Cufflinks, useful magic commands, debugging shortcuts, and output handling techniques—to help data analysts work faster, produce richer visualizations, and avoid common pitfalls.

Jupyterdata profilinginteractive plotting
0 likes · 9 min read
Boost Your Python Data Analysis: 20 Essential Jupyter Tips & Tricks
Efficient Ops
Efficient Ops
Jul 13, 2020 · Operations

What 13,966 Ops Job Listings Reveal About Salary, Skills, and Hot Cities

This article analyzes 13,966 Chinese operations‑engineer job postings scraped from 51job, cleaning the data with Python and Pandas, then visualizing industry demand, city concentration, salary ranges, education requirements, company size distribution, and keyword trends to guide job seekers and recruiters.

Data visualizationOperationsPython
0 likes · 14 min read
What 13,966 Ops Job Listings Reveal About Salary, Skills, and Hot Cities
Python Crawling & Data Mining
Python Crawling & Data Mining
Jun 30, 2020 · Fundamentals

Master Excel‑Pandas Integration: From Data Import to Visualization in Python

This tutorial demonstrates how to combine Excel’s interactive features with Python’s Pandas library to perform comprehensive data operations—including reading, generating, filtering, sorting, handling missing values, deduplication, merging, grouping, calculation, statistics, visualization, sampling, pivot tables, and VLOOKUP—showing when each tool excels.

ExcelPythondata-processing
0 likes · 13 min read
Master Excel‑Pandas Integration: From Data Import to Visualization in Python
Java Backend Technology
Java Backend Technology
Jun 20, 2020 · Databases

What’s the Real Execution Order of SQL Queries? A Visual Guide

Although most SQL statements begin with SELECT, the actual execution follows a specific order—WHERE, GROUP BY, HAVING, then SELECT—illustrated by a diagram that also clarifies common misconceptions about filtering window functions, column aliases, and how databases may reorder operations for optimization.

LINQQuery ExecutionSQL
0 likes · 6 min read
What’s the Real Execution Order of SQL Queries? A Visual Guide
Python Programming Learning Circle
Python Programming Learning Circle
Jun 4, 2020 · Fundamentals

Superstore Sales Data Analysis: From Data Preprocessing to RFM Modeling

This article presents a comprehensive analysis of a global supermarket's four‑year sales dataset, covering data collection, preprocessing, exploratory visualizations, sales, quantity, profit, market segmentation, product performance, customer segmentation, RFM modeling, and actionable recommendations to improve revenue and customer retention.

RFM modelRetail analyticsSales Forecasting
0 likes · 27 min read
Superstore Sales Data Analysis: From Data Preprocessing to RFM Modeling
Python Programming Learning Circle
Python Programming Learning Circle
May 15, 2020 · Fundamentals

Automating Multi‑Sheet Excel Sales Analysis with Python

The article demonstrates how a programmer can replace tedious manual Excel operations by using Python and pandas to batch‑process 128 sales spreadsheets, calculate brand‑level revenue, and dramatically reduce processing time from hours to seconds, illustrating a practical data‑analysis workflow.

AutomationPythondata-analysis
0 likes · 4 min read
Automating Multi‑Sheet Excel Sales Analysis with Python
Python Programming Learning Circle
Python Programming Learning Circle
May 14, 2020 · Fundamentals

Data Cleaning and Preprocessing for HR Attrition Dataset Using Pandas

This tutorial demonstrates how to download, read, explore, visualize, and preprocess the HR attrition dataset with pandas, covering tasks such as duplicate removal, missing‑value handling, categorical encoding, normalization, and conditional column updates to prepare the data for machine‑learning modeling.

HR datasetdata cleaningmachine learning preprocessing
0 likes · 9 min read
Data Cleaning and Preprocessing for HR Attrition Dataset Using Pandas
Python Programming Learning Circle
Python Programming Learning Circle
Apr 24, 2020 · Fundamentals

Python Basics, Common Pitfalls, and a Simple Web Scraper for Douban Book Ratings

This article introduces Python's core concepts and hierarchy, highlights ten frequent beginner mistakes, and walks through building a basic web scraper that extracts book information from Douban, processes it with pandas, and displays the resulting data, providing a practical learning path for Python fundamentals.

Pythondata-analysisfundamentals
0 likes · 5 min read
Python Basics, Common Pitfalls, and a Simple Web Scraper for Douban Book Ratings
Alibaba Cloud Developer
Alibaba Cloud Developer
Apr 16, 2020 · Artificial Intelligence

How Mars Supercharges Numpy, Pandas, and Scikit‑Learn with Parallel and GPU Acceleration

This article explains how the Mars framework enables parallel and distributed execution of core Python data‑science libraries—Numpy, Pandas, and Scikit‑Learn—while integrating with RAPIDS for GPU acceleration, and demonstrates its performance advantages through code examples and benchmark results.

GPU AccelerationMarsNumPy
0 likes · 16 min read
How Mars Supercharges Numpy, Pandas, and Scikit‑Learn with Parallel and GPU Acceleration
Python Programming Learning Circle
Python Programming Learning Circle
Apr 13, 2020 · Fundamentals

Python Date and Time Manipulation Tutorial

This tutorial explains how to retrieve the current timestamp, format specific dates, convert between strings and datetime objects, perform time indexing with pandas, and execute time arithmetic and offsets in Python using datetime, pandas, and dateutil libraries.

Time Manipulationdatetimedateutil
0 likes · 5 min read
Python Date and Time Manipulation Tutorial
Python Programming Learning Circle
Python Programming Learning Circle
Feb 3, 2020 · Fundamentals

Master Pandas: Essential Data Manipulation Techniques for Python Beginners

This guide introduces pandas, the essential Python library for data science, covering installation, data import/export, basic DataFrame operations, logical filtering, visualization with matplotlib, performance tips using tqdm, and advanced techniques like merging, grouping, and iterating, helping beginners become efficient data analysts.

data analysisdata manipulationpandas
0 likes · 8 min read
Master Pandas: Essential Data Manipulation Techniques for Python Beginners
ITPUB
ITPUB
Jan 21, 2020 · Fundamentals

Mastering Data Queries: Pandas vs SQL – A Step‑by‑Step Comparison

This tutorial walks data analysts through a side‑by‑side comparison of common data‑manipulation tasks using pandas in Python and SQL, covering everything from basic selects and filters to joins, aggregations, unions, ordering, case expressions, and data updates with clear code examples.

ComparisonSQLTutorial
0 likes · 15 min read
Mastering Data Queries: Pandas vs SQL – A Step‑by‑Step Comparison
ITPUB
ITPUB
Dec 9, 2019 · Fundamentals

Master Date Operations in pandas and SQL: Retrieval, Conversion, and Calculation

This tutorial walks through loading order data into pandas and SQL, then demonstrates how to retrieve current dates, extract date components, convert between readable dates and Unix timestamps, transform between 10‑digit and 8‑digit date formats, and perform date arithmetic using pandas, MySQL, and Hive.

HiveSQLdata-processing
0 likes · 16 min read
Master Date Operations in pandas and SQL: Retrieval, Conversion, and Calculation
MaGe Linux Operations
MaGe Linux Operations
Nov 2, 2019 · Fundamentals

Master Pandas: Essential Data Reading, Cleaning, and Merging Techniques

This article introduces essential Pandas techniques for data import, cleaning, type conversion, and merging, providing clear code examples that demonstrate reading from MySQL, handling missing values, transforming columns, and combining multiple DataFrames for comprehensive data analysis.

Pythondata cleaningdata merging
0 likes · 6 min read
Master Pandas: Essential Data Reading, Cleaning, and Merging Techniques
Python Programming Learning Circle
Python Programming Learning Circle
Oct 15, 2019 · Artificial Intelligence

Why Python Beats Java for Data Science: Jupyter, Pandas, scikit-learn & Mapping

Python’s ecosystem—Jupyter notebooks, Pandas for data manipulation, scikit-learn for machine learning, and matplotlib/Basemap for powerful visualizations—offers a streamlined, scriptable environment that outperforms traditional Java or PHP workflows, enabling researchers to write, run, and document code seamlessly in a single web interface.

Data visualizationJupyterMatplotlib
0 likes · 8 min read
Why Python Beats Java for Data Science: Jupyter, Pandas, scikit-learn & Mapping
Efficient Ops
Efficient Ops
Sep 29, 2019 · Backend Development

How to Scrape and Visualize 6,000+ Chinese Tourist Spots with Selenium and Python

This article demonstrates how to use Selenium and Python to crawl over 6,000 Chinese tourist attractions from Qunar, extract ratings, popularity and sales data, and visualize the results with pandas, seaborn, matplotlib, and pyecharts, revealing the most visited sites and regional travel trends during the 2019 National Day holiday.

Data visualizationPythonSelenium
0 likes · 9 min read
How to Scrape and Visualize 6,000+ Chinese Tourist Spots with Selenium and Python
Python Crawling & Data Mining
Python Crawling & Data Mining
Aug 20, 2019 · Fundamentals

Master Jupyter Notebook: A Step‑by‑Step Data Analysis Guide for Beginners

Learn how to install Jupyter via Anaconda or pip, create and manage notebooks, understand cells and kernels, write and run Python code, explore a Fortune 500 dataset with pandas, clean missing values, and visualize profit and revenue trends using matplotlib and seaborn—all illustrated with screenshots and code snippets.

Jupyter NotebookMatplotlibPython
0 likes · 15 min read
Master Jupyter Notebook: A Step‑by‑Step Data Analysis Guide for Beginners
FunTester
FunTester
Jul 28, 2019 · Fundamentals

How to Fix pip Uninstall Errors for NumPy When Installing Pandas on macOS

When rebuilding a Plotly environment on macOS, pip fails to uninstall the system‑installed NumPy, causing pandas installation errors, and the solution involves manually removing NumPy’s egg‑info, using a reliable PyPI mirror, and selecting compatible library versions.

NumPymacOSpandas
0 likes · 4 min read
How to Fix pip Uninstall Errors for NumPy When Installing Pandas on macOS
MaGe Linux Operations
MaGe Linux Operations
Jul 1, 2019 · Artificial Intelligence

How to Quickly Analyze, Visualize, and Predict Stock Prices with Python in 12 Minutes

This tutorial walks you through loading historical stock data from Yahoo Finance with pandas, computing moving averages and returns, comparing multiple tech stocks, engineering features, training linear, quadratic, and K‑Nearest‑Neighbor models using scikit‑learn, evaluating their confidence scores, and visualizing short‑term price forecasts—all in under a dozen minutes of reading and coding.

Data visualizationPythonpandas
0 likes · 15 min read
How to Quickly Analyze, Visualize, and Predict Stock Prices with Python in 12 Minutes
Python Crawling & Data Mining
Python Crawling & Data Mining
Jun 29, 2019 · Frontend Development

Build a Weather Forecast Desktop App with PyQt5 in Minutes

This tutorial walks you through creating a Python desktop weather application using PyQt5, covering environment setup, city code preprocessing with pandas, UI design in Qt Designer, API integration, JSON parsing, shortcut handling, and packaging with PyInstaller.

Desktop ApplicationPyQt5Python
0 likes · 6 min read
Build a Weather Forecast Desktop App with PyQt5 in Minutes
MaGe Linux Operations
MaGe Linux Operations
Mar 29, 2019 · Fundamentals

Unlock Powerful Data Analysis with Pandas: A Hands‑On Guide

This tutorial walks you through importing Pandas, understanding its Series and DataFrame structures, loading CSV data, inspecting, filtering, indexing, reshaping, merging, visualizing, and finally saving datasets, providing a comprehensive foundation for scientific Python data analysis.

dataframefilteringpandas
0 likes · 15 min read
Unlock Powerful Data Analysis with Pandas: A Hands‑On Guide
Python Crawling & Data Mining
Python Crawling & Data Mining
Mar 24, 2019 · Backend Development

How to Scrape Douban Book Data and Analyze It with Python

This tutorial shows how to collect book metadata such as publisher, publication date, ISBN, price, rating and review count from Douban for a list of titles stored in Excel, using Python requests, lxml XPath parsing, pandas for merging and analysis, and visualizing the results with matplotlib.

PythonXPathdata analysis
0 likes · 14 min read
How to Scrape Douban Book Data and Analyze It with Python
Tencent Cloud Developer
Tencent Cloud Developer
Mar 13, 2019 · Fundamentals

Introduction to NumPy, pandas, and Matplotlib for Python Data Analysis

This article introduces Python’s core data‑analysis stack—NumPy for fast multidimensional arrays, pandas for labeled DataFrames, and Matplotlib for interactive plotting—while showing how to set up a Jupyter/VS Code environment, perform basic indexing, slicing, and visualisation, and clean log files with pandas.

JupyterMatplotlibNumPy
0 likes · 9 min read
Introduction to NumPy, pandas, and Matplotlib for Python Data Analysis
MaGe Linux Operations
MaGe Linux Operations
Dec 14, 2018 · Big Data

Analyzing Python Job Trends from Zhaopin: Salary Distribution and Skill Word Clouds

This tutorial walks through extracting Python job postings from Zhaopin, storing them in MongoDB, cleaning the data with pandas, visualizing national and city‑level salary distributions, and generating word clouds of required skills using matplotlib and wordcloud, providing a complete end‑to‑end data analysis pipeline.

MatplotlibMongoDBSalary Distribution
0 likes · 12 min read
Analyzing Python Job Trends from Zhaopin: Salary Distribution and Skill Word Clouds
MaGe Linux Operations
MaGe Linux Operations
Jul 27, 2018 · Fundamentals

Master Pandas: Essential Techniques for Data Exploration and Analysis

This tutorial introduces Pandas fundamentals, covering installation, data structures, importing CSV files, inspecting and reshaping data, filtering with boolean masks, indexing, applying functions, grouping, merging, quick plotting, and saving results, all illustrated with clear examples and images.

Pythondata analysisdataframe
0 likes · 14 min read
Master Pandas: Essential Techniques for Data Exploration and Analysis
ITPUB
ITPUB
Jul 23, 2018 · Big Data

What China's Vaccine Procurement Data Reveals: A Province‑Level Analysis

This article documents the collection, cleaning, and statistical analysis of publicly released second‑category vaccine procurement data from 28 Chinese provinces, highlighting data sources, processing steps with pandas, top manufacturers, regional market shares, and the challenges encountered during the effort.

Big DataChinadata analysis
0 likes · 9 min read
What China's Vaccine Procurement Data Reveals: A Province‑Level Analysis
ITPUB
ITPUB
Jul 23, 2018 · Big Data

Uncovering China’s Vaccine Procurement: A Province‑Level Data Crawl and Analysis

This article documents the collection of public second‑class vaccine procurement data from 28 Chinese provinces, describes the CSV schema, outlines the challenges faced during web scraping, and presents a pandas‑driven statistical analysis that highlights top manufacturers and their provincial market shares.

Chinadata analysispandas
0 likes · 10 min read
Uncovering China’s Vaccine Procurement: A Province‑Level Data Crawl and Analysis
Efficient Ops
Efficient Ops
Jun 21, 2018 · Fundamentals

Can Python Predict the 2018 World Cup Champion? A Data‑Driven Analysis

This article demonstrates how to use Python, pandas, and Jupyter Notebook to explore a comprehensive World Cup dataset, clean and enrich the data, visualize win and goal statistics for all teams, and finally predict the top three contenders for the 2018 tournament.

PythonSports AnalyticsWorld Cup
0 likes · 12 min read
Can Python Predict the 2018 World Cup Champion? A Data‑Driven Analysis
ITPUB
ITPUB
Jun 15, 2018 · Fundamentals

Can Python Predict the 2018 World Cup Champion? A Data‑Driven Analysis

Using a Kaggle dataset of over 40,000 matches from 1872 to 2018, this notebook demonstrates how to clean, transform, and visualize World Cup data with Python, pandas, and Matplotlib to identify top‑winning teams, total goal statistics, and forecast the most likely 2018 champion.

Jupyter NotebookPredictiondata-analysis
0 likes · 11 min read
Can Python Predict the 2018 World Cup Champion? A Data‑Driven Analysis
ITPUB
ITPUB
Jun 15, 2018 · Fundamentals

Can Python Predict the 2018 World Cup Champion? A Data‑Driven Exploration

This article walks through a Python‑based data analysis of World Cup matches from 1872 to 2018, using pandas and Jupyter Notebook to clean the data, compute win counts and total goals, visualize the top teams, and finally predict that Germany, Argentina and Brazil are the strongest contenders for the 2018 title.

Jupyter Notebookdata-analysispandas
0 likes · 11 min read
Can Python Predict the 2018 World Cup Champion? A Data‑Driven Exploration
MaGe Linux Operations
MaGe Linux Operations
Jun 12, 2018 · Backend Development

How to Scrape Douban Movie Reviews in 12 Lines of Python

Learn to quickly build a Python web scraper using requests and Xpath to extract Douban movie 'Black Panther' short reviews, covering setup, HTTP request analysis, data parsing, storage with pandas, and best practices like polite crawling intervals, all demonstrated with concise 12-line code.

XPathpandasrequests
0 likes · 10 min read
How to Scrape Douban Movie Reviews in 12 Lines of Python
MaGe Linux Operations
MaGe Linux Operations
Mar 29, 2018 · Artificial Intelligence

Master Python’s Top Data Analysis & AI Libraries with Hands‑On Code

This article introduces Python’s essential features for data analysis and mining, then reviews the most widely used libraries—NumPy, SciPy, Matplotlib, Pandas, Scikit‑Learn, Keras, and Gensim—each accompanied by concise code examples that demonstrate their core capabilities.

KerasPythondata analysis
0 likes · 14 min read
Master Python’s Top Data Analysis & AI Libraries with Hands‑On Code
MaGe Linux Operations
MaGe Linux Operations
Jan 14, 2018 · Artificial Intelligence

7 Essential Python Tools Every Data Scientist Must Master

This article introduces seven must‑know Python tools—including IPython, GraphLab Create, Pandas, PuLP, Matplotlib, Scikit‑Learn, and Spark—explaining their key features and how they empower data scientists to work efficiently in production environments.

Data ScienceGraphLabIPython
0 likes · 9 min read
7 Essential Python Tools Every Data Scientist Must Master
MaGe Linux Operations
MaGe Linux Operations
Dec 27, 2017 · Fundamentals

Visualizing Stock Data and Building K‑Line Charts with Python

This guide walks you through importing stock data, cleaning column names, visualizing price and volume trends, creating candlestick (K‑line) charts, analyzing relative changes, exploring correlations, and implementing a simple moving‑average trading strategy using pandas, matplotlib, and numpy.

Data visualizationK-LineMatplotlib
0 likes · 14 min read
Visualizing Stock Data and Building K‑Line Charts with Python
MaGe Linux Operations
MaGe Linux Operations
Nov 15, 2017 · Fundamentals

Master Stock Market Data Analysis with Python: Moving Averages Explained

This tutorial walks through using Python and pandas to fetch Yahoo Finance data, visualize stock prices with line and candlestick charts, and apply moving‑average techniques—including 20‑day, 50‑day, and 200‑day averages—to identify trends and build simple trading signals, all while emphasizing that the content is for educational purposes only and not investment advice.

Pythondata analysisfinance
0 likes · 13 min read
Master Stock Market Data Analysis with Python: Moving Averages Explained
MaGe Linux Operations
MaGe Linux Operations
Oct 21, 2017 · Big Data

What 1.38 Million Zhihu Followers Reveal: A Python Scraping & Visualization Journey

This article documents a Python‑based web‑scraping project that harvested over 1.38 million Zhihu followers, filtered high‑impact users, and visualized insights such as follower distribution, gender ratio, top influencers, geographic spread, education, industry, and certification details, highlighting challenges and lessons learned.

big-datadata-visualizationpandas
0 likes · 11 min read
What 1.38 Million Zhihu Followers Reveal: A Python Scraping & Visualization Journey
Tencent Advertising Technology
Tencent Advertising Technology
Jun 17, 2017 · Artificial Intelligence

SkullGreymon Team’s Progress and Technical Insights in the Tencent Social Ads Algorithm Competition

The SkullGreymon team, winners of the biggest improvement award in the Tencent Social Ads university algorithm competition, share their journey from a late start in the preliminaries to significant performance gains in the finals, detailing memory‑saving feature extraction techniques, pandas and numpy usage, and their XGBoost modeling approach.

Memory OptimizationXGBoostalgorithm competition
0 likes · 6 min read
SkullGreymon Team’s Progress and Technical Insights in the Tencent Social Ads Algorithm Competition
ITPUB
ITPUB
May 29, 2017 · Fundamentals

Why R Users Should Learn Python for Data Science: A Hands‑On Guide

This tutorial explains why R programmers should add Python to their toolkit, compares core data types and structures between the two languages, introduces essential Python libraries for data analysis, and walks through a practical Boston housing dataset example to solidify the concepts.

Data ScienceNumPyPython
0 likes · 12 min read
Why R Users Should Learn Python for Data Science: A Hands‑On Guide
Qunar Tech Salon
Qunar Tech Salon
Jan 29, 2016 · Big Data

Python Data Analysis Learning Roadmap (16‑Week Plan)

This article presents a 16‑week Python data‑analysis learning roadmap covering environment setup, basic syntax, web‑scraping techniques, data‑analysis libraries such as pandas and NumPy, and data‑visualization with matplotlib, along with curated free resources and tutorials for each stage.

NumPyRoadmapWeb Scraping
0 likes · 6 min read
Python Data Analysis Learning Roadmap (16‑Week Plan)