Tagged articles

One-hot encoding

6 articles · Page 1 of 1
Lisa Notes
Lisa Notes
Jul 1, 2026 · Artificial Intelligence

How to Convert Text into Numerical Features for NLP: Tokenization, One‑Hot Encoding, and Word Embedding

This article walks through the essential steps of turning raw natural language into machine‑readable numbers, covering categorical vs. numerical features, one‑hot encoding of categorical data, tokenization, building vocabularies, and using word embeddings, illustrated with an IMDB sentiment‑analysis example in Keras.

Data preprocessingIMDB sentiment analysisKeras
0 likes · 7 min read
How to Convert Text into Numerical Features for NLP: Tokenization, One‑Hot Encoding, and Word Embedding
Lisa Notes
Lisa Notes
Jun 30, 2026 · Artificial Intelligence

NLP Study Notes: 4 Essential Steps for Preprocessing Chinese Text Corpora

This article walks through the four core steps of Chinese NLP corpus preparation—collecting data, cleaning it with regex and encoding detection, tokenizing using dictionary‑based or statistical methods such as jieba, HMM and CRF, and finally standardizing with stop‑word removal, vocabulary building and one‑hot encoding—while illustrating each step with concrete code snippets and practical examples.

CRFChineseNLP
0 likes · 12 min read
NLP Study Notes: 4 Essential Steps for Preprocessing Chinese Text Corpora
Python Programming Learning Circle
Python Programming Learning Circle
Jul 8, 2025 · Artificial Intelligence

10 One‑Line Python Tricks to Jump‑Start Your Machine Learning Projects

This article presents ten concise, practical one‑line Python code snippets—ranging from loading CSV data with Pandas to building sophisticated Scikit‑learn pipelines—that streamline common machine‑learning tasks such as data cleaning, encoding, splitting, scaling, model training, evaluation, cross‑validation, and prediction.

Data preprocessingOne-hot encodingPandas
0 likes · 10 min read
10 One‑Line Python Tricks to Jump‑Start Your Machine Learning Projects
Model Perspective
Model Perspective
Sep 10, 2024 · Artificial Intelligence

Why Cross-Entropy Is the Key Loss Function for Classification Models

This article explains how loss functions evaluate model performance, contrasts regression’s mean squared error with classification’s cross‑entropy, describes one‑hot encoding and softmax outputs, and shows why higher predicted probabilities for the correct class yield lower loss, highlighting applications in image, language, and speech tasks.

One-hot encodingclassificationcross entropy
0 likes · 5 min read
Why Cross-Entropy Is the Key Loss Function for Classification Models
Python Programming Learning Circle
Python Programming Learning Circle
Dec 31, 2022 · Artificial Intelligence

A Beginner’s Guide to Data Preprocessing for Machine Learning in Python

This tutorial walks beginners through the essential steps of data preprocessing for any machine learning model, covering library imports, dataset loading, handling missing values, encoding categorical features, splitting into train‑test sets, and applying feature scaling using Python’s scikit‑learn.

Data preprocessingOne-hot encodingPython
0 likes · 11 min read
A Beginner’s Guide to Data Preprocessing for Machine Learning in Python
21CTO
21CTO
Jun 29, 2017 · Artificial Intelligence

Why Machine Learning Mirrors Human Learning: From Features to Reinforcement

The article explores how machine learning models emulate human learning by converting diverse real‑world descriptions into numerical features, illustrating concepts such as one‑hot encoding, supervised, unsupervised, and reinforcement learning, and emphasizing the importance of mapping inputs to outputs for intelligent systems.

AI conceptsOne-hot encodingfeatures
0 likes · 14 min read
Why Machine Learning Mirrors Human Learning: From Features to Reinforcement