Tag

cosine similarity

0 views collected around this technical thread.

Test Development Learning Exchange
Test Development Learning Exchange
Apr 20, 2024 · Artificial Intelligence

Implementing a Simple University Paper Plagiarism Detection System in Python

This article outlines the design and implementation of a basic university paper plagiarism detection system using Python, covering text preprocessing with NLTK, TF‑IDF weighting, cosine similarity calculation, and a sample in‑memory paper database, while also discussing scalability, UI, and legal considerations.

NLPPythonTF-IDF
0 likes · 10 min read
Implementing a Simple University Paper Plagiarism Detection System in Python
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Oct 19, 2023 · Artificial Intelligence

NLP Basics: Word Embeddings, Word2Vec, and Hand‑crafted RNN Implementation in PyTorch

This article introduces word‑level representations—from one‑hot encoding to dense word embeddings via Word2Vec—explains cosine similarity, then walks through the structure, limitations, and PyTorch implementation of a vanilla RNN, including a custom forward function and verification against the library API.

NLPPyTorchRNN
0 likes · 19 min read
NLP Basics: Word Embeddings, Word2Vec, and Hand‑crafted RNN Implementation in PyTorch
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Apr 5, 2022 · Artificial Intelligence

One-Stroke Gesture Recognition Using Canvas: From Drawing to Comparison

This tutorial explains how to implement a one‑stroke gesture recognizer with HTML5 canvas, covering drawing, resampling, translation, rotation, scaling, feature extraction, and similarity measurement using Euclidean and cosine distances, complete with full TypeScript code examples.

AlgorithmGesture Recognitioncanvas
0 likes · 19 min read
One-Stroke Gesture Recognition Using Canvas: From Drawing to Comparison
Sohu Tech Products
Sohu Tech Products
Mar 17, 2021 · Big Data

Understanding Simhash: From Traditional Hash to Random Projection LSH

This article explains the principles and implementation of Simhash, covering the shortcomings of traditional hash functions, the use of cosine similarity, random projection for dimensionality reduction, locality‑sensitive hashing, and practical optimizations for large‑scale duplicate detection.

AlgorithmBig DataLocality Sensitive Hashing
0 likes · 24 min read
Understanding Simhash: From Traditional Hash to Random Projection LSH
vivo Internet Technology
vivo Internet Technology
Oct 14, 2020 · Artificial Intelligence

Understanding Cosine Similarity: From Mathematical Foundations to Practical Applications

The article explains cosine similarity from basic geometry and vector math, derives its formula, and shows how it powers precision marketing, image classification, and text retrieval, while also detailing its industrial implementation in Lucene’s vector space model.

LuceneSearch EngineTF-IDF
0 likes · 18 min read
Understanding Cosine Similarity: From Mathematical Foundations to Practical Applications
DataFunTalk
DataFunTalk
Sep 22, 2020 · Artificial Intelligence

User-Based Collaborative Filtering with Python: A Step-by-Step Guide

This article explains how to implement a user‑based collaborative filtering recommendation system in Python, covering data loading, preprocessing, cosine‑similarity computation, neighbor selection, rating prediction, and generating top‑5 movie recommendations with detailed code examples.

Pythoncollaborative filteringcosine similarity
0 likes · 12 min read
User-Based Collaborative Filtering with Python: A Step-by-Step Guide
Xianyu Technology
Xianyu Technology
May 16, 2018 · Artificial Intelligence

Geographic Alias Mining and Knowledge Base Construction Using Contextual Vectors and Address Similarity

The paper presents two inexpensive techniques for extracting geographic aliases of points of interest—comparing high‑dimensional contextual vectors of nearby shipping addresses and analyzing co‑occurring words in identical addresses—to construct a knowledge base that links official names with their synonyms, improving location‑based service accuracy.

Data MiningGeographic Aliascosine similarity
0 likes · 9 min read
Geographic Alias Mining and Knowledge Base Construction Using Contextual Vectors and Address Similarity
Qunar Tech Salon
Qunar Tech Salon
Mar 14, 2015 · Artificial Intelligence

Common Distance and Similarity Measures in Machine Learning and Data Mining

This article reviews the most frequently used distance and similarity formulas in machine learning and data mining, explaining their definitions, mathematical properties, practical examples, and when each metric is appropriate for measuring differences between data points or probability distributions.

Data MiningKL divergenceMahalanobis distance
0 likes · 13 min read
Common Distance and Similarity Measures in Machine Learning and Data Mining