Test Development Learning Exchange
Test Development Learning Exchange
Sep 1, 2024 · Fundamentals

Python Utility Scripts for Data Cleaning, Translation, File Sync, Cloud Backup, and More

This article presents a collection of Python utility scripts that demonstrate how to clean CSV data, translate text files, synchronize folders, upload files to S3, count directory contents, classify files by type, perform OCR on images, convert video to audio, extract images from webpages, and generate text summaries using modern libraries.

AIUtilitiescloud storage
0 likes · 6 min read
Python Utility Scripts for Data Cleaning, Translation, File Sync, Cloud Backup, and More
Python Programming Learning Circle
Python Programming Learning Circle
May 18, 2024 · Fundamentals

Pandas Data Modification, Iteration, and Function Application Techniques

This article provides a comprehensive guide to using Pandas for data cleaning and transformation, covering value modification, replacement, filling missing data, renaming, column addition, row insertion, merging, deletion, advanced filtering, iteration methods, and applying functions such as pipe, apply, agg, and transform.

DataFrameFunctionsIteration
0 likes · 9 min read
Pandas Data Modification, Iteration, and Function Application Techniques
Python Programming Learning Circle
Python Programming Learning Circle
Mar 22, 2024 · Fundamentals

Using FuzzyWuzzy for Fuzzy String Matching in Python

This article introduces the FuzzyWuzzy Python library, explains its underlying Levenshtein distance algorithm, demonstrates how to install it, describes the key functions in the fuzz and process modules, and provides practical examples for matching company names and province fields with complete code snippets.

Levenshteindata-cleaningfuzzy logic
0 likes · 10 min read
Using FuzzyWuzzy for Fuzzy String Matching in Python
Python Programming Learning Circle
Python Programming Learning Circle
Jul 10, 2023 · Fundamentals

Using FuzzyWuzzy for Fuzzy String Matching in Python

This article introduces the Python FuzzyWuzzy library, explains its Levenshtein‑based fuzzy string matching functions such as Ratio, Partial Ratio, Token Sort Ratio and Token Set Ratio, demonstrates how to use the process module for extracting best matches, and provides practical code examples for matching company and province names.

LevenshteinPythondata-cleaning
0 likes · 11 min read
Using FuzzyWuzzy for Fuzzy String Matching in Python
Python Crawling & Data Mining
Python Crawling & Data Mining
Jan 23, 2022 · Fundamentals

Master Pandas: From Data Loading to Advanced Manipulation

This comprehensive Pandas tutorial walks you through loading CSV and Excel files, creating Series and DataFrames, performing basic operations, cleaning data, handling missing values, working with hierarchical indexes, grouping, merging, concatenating, and applying time‑series techniques, all illustrated with clear code examples and screenshots.

DataFramePythondata-cleaning
0 likes · 12 min read
Master Pandas: From Data Loading to Advanced Manipulation
Python Programming Learning Circle
Python Programming Learning Circle
Jun 5, 2020 · Fundamentals

Pure Python Techniques: Lambda Functions, List Comprehensions, and Zip

This article introduces several powerful pure‑Python features—lambda functions, list comprehensions, and the zip function—demonstrating how they can simplify data cleaning, feature engineering, and iterative processing with concise code examples, helping programmers reduce reliance on external libraries while improving productivity.

LambdaPythondata-cleaning
0 likes · 7 min read
Pure Python Techniques: Lambda Functions, List Comprehensions, and Zip