Tagged articles
18 articles
AI Algorithm Path
Jan 21, 2026 · Artificial Intelligence

Understanding Vector Similarity in Machine Learning: A Plain‑Language Guide

The article explains key vector similarity measures (dot product, cosine similarity, and L1/L2 distances), illustrates their geometric meanings, compares their behavior with concrete examples and PyTorch/NumPy code, and discusses when to prefer each metric in machine‑learning tasks.

Cosine Similarity · L1 distance · L2 distance
0 likes · 8 min read
Data Party THU
Sep 17, 2025 · Artificial Intelligence

How Matching Networks Tackle Imbalance with Cosine Similarity and Attention

This article provides a comprehensive technical review of Matching Networks, covering cosine similarity mathematics, its transformations, the bias introduced by imbalanced support sets, and a range of mitigation strategies such as adaptive weighting, global distance‑matrix normalization, prior‑based weighting, hierarchical multi‑scale matching, hybrid learning architectures, and attention‑driven dynamic sample selection.

Attention Mechanism · Cosine Similarity · Matching Networks
0 likes · 10 min read
Sohu Smart Platform Tech Team
Aug 9, 2025 · Artificial Intelligence

How SimHash and Cosine Similarity Accelerate Large-Scale Text Deduplication

This article explains why traditional pairwise text comparison is impractical for massive news corpora, introduces cosine similarity and SimHash as efficient deduplication techniques, walks through their mathematical foundations, step‑by‑step implementation details, and code examples, and discusses trade‑offs such as accuracy versus speed.

Big Data · Cosine Similarity · SimHash
0 likes · 12 min read
AI Algorithm Path
Jul 1, 2025 · Artificial Intelligence

Beginner’s Guide to CLIP Inference: Step‑by‑Step with Hugging Face

This tutorial walks through loading the openai/clip‑vit‑base‑patch32 model with Hugging Face, preprocessing images and text, encoding them into a shared embedding space, computing cosine similarity for zero‑shot image‑text matching, and visualizing the results, all with concrete code examples.

CLIP · Cosine Similarity · Hugging Face
0 likes · 6 min read
Mingyi World Elasticsearch
May 16, 2025 · Artificial Intelligence

Easysearch Vector Search: From Theory to Hands‑On Implementation

This article explains the principles of vector search, compares Easysearch's approximate (LSH) and exact kNN APIs, and walks through a complete hands‑on example using Stanford's 50‑dimensional GloVe embeddings to index, import, and query semantically similar words.

Approximate Search · Cosine Similarity · Easysearch
0 likes · 9 min read
Test Development Learning Exchange
Apr 20, 2024 · Artificial Intelligence

Implementing a Simple University Paper Plagiarism Detection System in Python

This article outlines the design and implementation of a basic university paper plagiarism detection system using Python, covering text preprocessing with NLTK, TF‑IDF weighting, cosine similarity calculation, and a sample in‑memory paper database, while also discussing scalability, UI, and legal considerations.

Cosine Similarity · NLP · Python
0 likes · 10 min read
Sohu Tech Products
Feb 28, 2024 · Big Data

How SimHash and Cosine Similarity Accelerate Large‑Scale Text Deduplication

This article explains why massive news feeds need efficient deduplication, compares cosine similarity and SimHash for measuring text similarity, walks through a step‑by‑step implementation with Java code, and shows how a space‑for‑time indexing strategy can reduce duplicate‑detection complexity from O(n²) to near O(1).

Big Data · Cosine Similarity · Near-Duplicate Detection
0 likes · 14 min read
Rare Earth Juejin Tech Community
Oct 19, 2023 · Artificial Intelligence

NLP Basics: Word Embeddings, Word2Vec, and Hand‑crafted RNN Implementation in PyTorch

This article introduces word‑level representations—from one‑hot encoding to dense word embeddings via Word2Vec—explains cosine similarity, then walks through the structure, limitations, and PyTorch implementation of a vanilla RNN, including a custom forward function and verification against the library API.

Cosine Similarity · NLP · PyTorch
0 likes · 19 min read
ELab Team
Nov 29, 2022 · Frontend Development

How to Build Real-Time Mouse Gesture Recognition with TensorFlow.js

This article explains how to design, implement, and evaluate a mouse gesture recognition system using machine learning and geometric analysis, covering data preprocessing, model training with TensorFlow.js, cosine‑similarity matching, performance optimizations, and extensions to three‑dimensional VR/AR environments.

Cosine Similarity · TensorFlow.js · Web Development
0 likes · 32 min read
Sohu Tech Products
Mar 17, 2021 · Big Data

Understanding Simhash: From Traditional Hash to Random Projection LSH

This article explains the principles and implementation of Simhash, covering the shortcomings of traditional hash functions, the use of cosine similarity, random projection for dimensionality reduction, locality‑sensitive hashing, and practical optimizations for large‑scale duplicate detection.

Big Data · Cosine Similarity · Locality Sensitive Hashing
0 likes · 24 min read
DataFunTalk
Sep 22, 2020 · Artificial Intelligence

User-Based Collaborative Filtering with Python: A Step-by-Step Guide

This article explains how to implement a user‑based collaborative filtering recommendation system in Python, covering data loading, preprocessing, cosine‑similarity computation, neighbor selection, rating prediction, and generating top‑5 movie recommendations with detailed code examples.

Cosine Similarity · Python · collaborative filtering
0 likes · 12 min read
Mafengwo Technology
Jan 16, 2020 · Artificial Intelligence

How Machine Learning Transforms Hotel Aggregation for Real‑Time Accurate Pricing

This article explains the evolution of hotel aggregation at Mafengwo, from simple cosine similarity matching to advanced machine‑learning pipelines using tokenization, feature engineering, and LightGBM models, highlighting challenges of accuracy and real‑time performance and presenting practical solutions.

Cosine Similarity · LightGBM · feature engineering
0 likes · 16 min read
Xianyu Technology
May 16, 2018 · Artificial Intelligence

Geographic Alias Mining and Knowledge Base Construction Using Contextual Vectors and Address Similarity

The paper presents two inexpensive techniques for extracting geographic aliases of points of interest—comparing high‑dimensional contextual vectors of nearby shipping addresses and analyzing co‑occurring words in identical addresses—to construct a knowledge base that links official names with their synonyms, improving location‑based service accuracy.

Cosine Similarity · Geographic Alias · Knowledge Base
0 likes · 9 min read
Hulu Beijing
Nov 23, 2017 · Artificial Intelligence

Why Use Cosine Similarity Over Euclidean Distance? Insights & Limits

This article explains the concept of cosine distance, compares it with Euclidean distance, discusses when cosine similarity is preferable, and shows why cosine distance does not satisfy all metric axioms, providing examples and interview‑style analysis.

Cosine Similarity · Interview Preparation · distance metric
0 likes · 7 min read
21CTO
Oct 6, 2017 · Artificial Intelligence

How Cosine Similarity Powers Movie Recommendations: A Python Guide

This tutorial explains various similarity metrics such as cosine similarity, Euclidean distance, Jaccard index, and Pearson correlation, demonstrates a Python function to compute user interest similarity, and shows how to generate movie recommendations with example code and output.

Cosine Similarity · recommendation system · similarity metrics
0 likes · 7 min read
Qunar Tech Salon
Mar 14, 2015 · Artificial Intelligence

Common Distance and Similarity Measures in Machine Learning and Data Mining

This article reviews the most frequently used distance and similarity formulas in machine learning and data mining, explaining their definitions, mathematical properties, practical examples, and when each metric is appropriate for measuring differences between data points or probability distributions.

Cosine Similarity · KL divergence · Mahalanobis distance
0 likes · 13 min read