Tagged articles
31 articles
Page 1 of 1
Mingyi World Elasticsearch
Mingyi World Elasticsearch
Mar 11, 2026 · Backend Development

How to Achieve One‑Line Semantic Search for Nearby Clean Coffee Shops with Elasticsearch

This article walks through building a practical Elasticsearch demo that lets users type a single query like “nearby clean coffee shop” and get results by combining dense‑vector semantic search, geo filtering, BM25, and a hybrid RRF‑style ranking, with both LLM‑based structuring and a fallback hash‑based embedding.

BM25FlaskHybrid Search
0 likes · 10 min read
How to Achieve One‑Line Semantic Search for Nearby Clean Coffee Shops with Elasticsearch
Tech Freedom Circle
Tech Freedom Circle
Nov 5, 2025 · Artificial Intelligence

Elasticsearch: BM25, TF‑IDF, Dense Vectors, kNN, L2 & Cosine Distances, RRF

This article provides a comprehensive technical guide to Elasticsearch’s core retrieval models—BM25 and TF‑IDF—while detailing modern vector‑based search using dense_vector, kNN, L2 and cosine distances, and demonstrates how to combine keyword and semantic results through hybrid search and Reciprocal Rank Fusion (RRF) with practical configuration examples.

BM25ElasticsearchRRF
0 likes · 42 min read
Elasticsearch: BM25, TF‑IDF, Dense Vectors, kNN, L2 & Cosine Distances, RRF
Data Party THU
Data Party THU
Oct 8, 2025 · Artificial Intelligence

Build a Music Genre Classifier from Scratch with KNN and MFCC

This tutorial walks through constructing a complete music‑genre classification project using Python, covering dataset preparation, MFCC feature extraction, K‑Nearest Neighbors implementation, train‑test splitting, model evaluation, and testing on new audio files, all with reproducible code snippets.

Audio ProcessingMFCCMusic Genre Classification
0 likes · 14 min read
Build a Music Genre Classifier from Scratch with KNN and MFCC
Data STUDIO
Data STUDIO
Sep 15, 2025 · Artificial Intelligence

Build a Music Genre Classifier with KNN and MFCC from Scratch

This tutorial walks through building a music‑genre classification system using the GTZAN dataset, extracting MFCC features, implementing a K‑Nearest Neighbors classifier in Python, and achieving roughly 70% accuracy on test data.

Audio ProcessingMFCCMusic Genre Classification
0 likes · 14 min read
Build a Music Genre Classifier with KNN and MFCC from Scratch
Mingyi World Elasticsearch
Mingyi World Elasticsearch
May 16, 2025 · Artificial Intelligence

Easysearch Vector Search: From Theory to Hands‑On Implementation

This article explains the principles of vector search, compares Easysearch's approximate (LSH) and exact kNN APIs, and walks through a complete hands‑on example using Stanford's 50‑dimensional GloVe embeddings to index, import, and query semantically similar words.

Approximate SearchCosine SimilarityEasysearch
0 likes · 9 min read
Easysearch Vector Search: From Theory to Hands‑On Implementation
dbaplus Community
dbaplus Community
May 3, 2025 · Backend Development

Boost Elasticsearch with Vector Embeddings: Python & Logstash Step‑by‑Step Guide

This article explains how vector embeddings enhance Elasticsearch for semantic search and recommendation, walks through the complete workflow of generating, storing, and querying embeddings, and provides detailed Python and Logstash implementations with code samples, pros and cons, and guidance on choosing the right approach.

ElasticsearchLogstashPython
0 likes · 11 min read
Boost Elasticsearch with Vector Embeddings: Python & Logstash Step‑by‑Step Guide
Mingyi World Elasticsearch
Mingyi World Elasticsearch
Apr 8, 2025 · Backend Development

Boost Elasticsearch 8.x Search with Vector Embeddings

This article explains how vector embeddings enhance Elasticsearch 8.x search, walks through the concepts of dense vectors, shows step‑by‑step Python and Logstash pipelines for generating and storing embeddings, compares their pros and cons, and offers guidance on selecting the right approach for large‑scale log data.

ElasticsearchLogstashPython
0 likes · 12 min read
Boost Elasticsearch 8.x Search with Vector Embeddings
AI Code to Success
AI Code to Success
Mar 13, 2025 · Artificial Intelligence

Unlocking K-Nearest Neighbors: Theory, Implementation, and Real-World Tips

This article provides a comprehensive guide to the K‑Nearest Neighbors algorithm, covering its intuitive principle, step‑by‑step workflow, distance metrics, strategies for selecting the optimal K via cross‑validation, Python implementation with scikit‑learn, advantages, limitations, and diverse application scenarios.

Pythonclassificationcross-validation
0 likes · 24 min read
Unlocking K-Nearest Neighbors: Theory, Implementation, and Real-World Tips
Mingyi World Elasticsearch
Mingyi World Elasticsearch
Mar 11, 2025 · Backend Development

Master Elasticsearch dense_vector: definition, usage, and kNN search guide

This article explains Elasticsearch's dense_vector field for storing dense vectors, covering its definition, how to define and index vectors, kNN search methods (brute‑force and approximate with HNSW), similarity options, quantization strategies, bit‑vector support, key parameters, and how to update mappings.

Elasticsearchbit vectorsdense_vector
0 likes · 13 min read
Master Elasticsearch dense_vector: definition, usage, and kNN search guide
DaTaobao Tech
DaTaobao Tech
Mar 4, 2024 · Artificial Intelligence

Iris Classification with Machine Learning: Data Exploration and Classic Algorithms

This beginner-friendly guide walks through loading the classic Iris dataset, performing exploratory data analysis, and implementing four fundamental classifiers—Decision Tree, Logistic Regression, Support Vector Machine, and K‑Nearest Neighbors—complete with training, visualization, and accuracy evaluation, illustrating a full machine‑learning workflow.

classificationdecision treeiris dataset
0 likes · 22 min read
Iris Classification with Machine Learning: Data Exploration and Classic Algorithms
Bitu Technology
Bitu Technology
Jan 17, 2024 · Artificial Intelligence

Rosetta Stone: Scalable ID Mapping System for Tubi's Content Library Using LLMs and Embeddings

This article describes how Tubi built the Rosetta Stone system—a flexible ID mapping workflow that leverages large language models, embedding similarity ranking, and K‑nearest‑neighbors to unify and enrich metadata across a 200,000‑title library, improve content recommendation, and streamline operations.

Big DataLLMcontent ID mapping
0 likes · 10 min read
Rosetta Stone: Scalable ID Mapping System for Tubi's Content Library Using LLMs and Embeddings
Architects Research Society
Architects Research Society
Jul 24, 2023 · Artificial Intelligence

Neural Search in Apache Solr: Dense Vector Fields, HNSW Graphs, and K‑Nearest Neighbor Implementation

This article explains how Apache Solr implements neural search using dense vector fields, K‑Nearest Neighbor algorithms, and Hierarchical Navigable Small World graphs, detailing the underlying Lucene support, configuration options, query syntax, and integration with AI‑driven vector representations.

Apache SolrDense VectorsHNSW
0 likes · 15 min read
Neural Search in Apache Solr: Dense Vector Fields, HNSW Graphs, and K‑Nearest Neighbor Implementation
Architects Research Society
Architects Research Society
Jun 6, 2022 · Artificial Intelligence

Neural Search in Apache Solr: Dense Vector Fields, HNSW Graphs, and K‑Nearest Neighbor Implementation

This article explains how Apache Solr and Lucene implement neural search using dense vector fields, hierarchical navigable small‑world (HNSW) graphs, and approximate K‑nearest neighbor algorithms, covering configuration, custom codecs, indexing formats, and query parsers for vector‑based retrieval.

Apache SolrDense VectorsHNSW
0 likes · 15 min read
Neural Search in Apache Solr: Dense Vector Fields, HNSW Graphs, and K‑Nearest Neighbor Implementation
Top Architect
Top Architect
Feb 21, 2022 · Databases

Key New Features in Elasticsearch 8.0

Elasticsearch 8.0 introduces major updates including 7.x REST API compatibility headers, default-enabled security with registration tokens, known issues on ARM/macOS, a preview KNN search API using dense_vector, storage reductions for keyword and text fields, faster geo indexing, PyTorch model support, and numerous other enhancements across aggregations, allocation, analysis, authentication, and core infrastructure.

ElasticsearchPyTorchVersion 8
0 likes · 10 min read
Key New Features in Elasticsearch 8.0
Laravel Tech Community
Laravel Tech Community
Feb 17, 2022 · Backend Development

Key New Features and Changes in Elasticsearch 8.0 Release

Elasticsearch 8.0 introduces major updates including 7.x REST API compatibility headers, default‑enabled security with registration tokens, protected system indices, a preview KNN search API, storage‑efficient keyword/match_only_text/text fields, faster indexing for geo_point and geo_shape, PyTorch model support, and numerous deprecations and enhancements across aggregations, allocation, analysis, authentication, cluster coordination, and engine components.

BackendElasticsearchindexing
0 likes · 8 min read
Key New Features and Changes in Elasticsearch 8.0 Release
MaGe Linux Operations
MaGe Linux Operations
Feb 15, 2022 · Backend Development

What’s New in Elasticsearch 8.0? Key Features, Security, and API Changes

Elasticsearch 8.0 introduces major updates including 7.x REST API compatibility headers, default-enabled security with enrollment tokens, protected system indices, a preview KNN search API, storage‑saving field encodings, faster geo indexing, and numerous deprecations and enhancements across aggregations, authentication, cluster coordination, and packaging.

API compatibilityElasticsearchindexing
0 likes · 10 min read
What’s New in Elasticsearch 8.0? Key Features, Security, and API Changes
Big Data Technology & Architecture
Big Data Technology & Architecture
Feb 13, 2022 · Big Data

What's New in Elasticsearch 8.0 – Key Features and Changes

The article provides a comprehensive overview of Elasticsearch 8.0, highlighting major updates such as 7.x REST API compatibility headers, default-enabled security, system‑index protection, a new KNN search API, storage and indexing optimizations, PyTorch model support, and numerous deprecations and feature removals across the stack.

8.0APIBig Data
0 likes · 10 min read
What's New in Elasticsearch 8.0 – Key Features and Changes
Programmer DD
Programmer DD
Feb 12, 2022 · Databases

What’s New in Elasticsearch 8.0? Key Features and Migration Tips

Elasticsearch 8.0 introduces major changes such as 7.x REST API compatibility headers, default‑enabled security with registration tokens, protected system indices, a technical preview of KNN search, storage‑saving field encodings, faster geo‑point indexing, PyTorch model support for NLP, and numerous deprecations and improvements across aggregations, allocation, analysis, authentication, cluster coordination, and packaging.

APIBig DataElasticsearch
0 likes · 10 min read
What’s New in Elasticsearch 8.0? Key Features and Migration Tips
MaGe Linux Operations
MaGe Linux Operations
Sep 9, 2020 · Artificial Intelligence

Master Machine Learning Basics: Concepts, Types, Algorithms & K‑NN Walkthrough

This comprehensive tutorial introduces machine learning fundamentals, its history, differences from traditional programming, key characteristics, and why Python is the preferred language, then explores supervised, unsupervised, and reinforcement learning, popular algorithms, detailed K‑Nearest Neighbors examples for classification and regression, and the essential steps to build and evaluate models.

PythonReinforcement LearningUnsupervised Learning
0 likes · 21 min read
Master Machine Learning Basics: Concepts, Types, Algorithms & K‑NN Walkthrough
Python Programming Learning Circle
Python Programming Learning Circle
May 7, 2020 · Artificial Intelligence

Understanding the k-Nearest Neighbor (kNN) Classification Algorithm and Its Python Implementation

This article introduces the concept and intuition behind the k-Nearest Neighbor (kNN) classification algorithm, explains its simple and full forms, discusses feature engineering and Euclidean distance calculations, and provides a complete Python implementation with example code.

classificationeuclidean distancefeature engineering
0 likes · 10 min read
Understanding the k-Nearest Neighbor (kNN) Classification Algorithm and Its Python Implementation
Python Programming Learning Circle
Python Programming Learning Circle
Mar 7, 2020 · Artificial Intelligence

k-Nearest Neighbors (kNN) Algorithm: Overview, Pros/Cons, Data Preparation, Implementation, and Handwritten Digit Recognition

This article explains the k‑Nearest Neighbors classification method, discusses its advantages and drawbacks, describes data preparation and normalization, presents Python code for the algorithm and a full handwritten digit recognition project, and reports an error rate of about 1.2%.

classificationeuclidean distancehandwritten digit recognition
0 likes · 9 min read
k-Nearest Neighbors (kNN) Algorithm: Overview, Pros/Cons, Data Preparation, Implementation, and Handwritten Digit Recognition
360 Quality & Efficiency
360 Quality & Efficiency
Aug 23, 2019 · Artificial Intelligence

High‑Performance High‑Dimensional Vector KNN Search Using FAISS

This article introduces the background of vector representations in machine learning, explains the K‑Nearest Neighbors algorithm and its key parameters, reviews traditional tree‑based and modern high‑performance search solutions, and demonstrates how FAISS can achieve microsecond‑level KNN queries on large‑scale high‑dimensional data.

FAISShigh-dimensionalkNN
0 likes · 5 min read
High‑Performance High‑Dimensional Vector KNN Search Using FAISS
21CTO
21CTO
Apr 12, 2019 · Artificial Intelligence

Top 10 Essential Machine Learning Algorithms Every Data Scientist Should Know

This article provides a concise overview of ten fundamental machine learning algorithms—linear regression, logistic regression, linear discriminant analysis, naive Bayes, K‑nearest neighbors, learning vector quantization, support vector machines, decision trees, bagging/random forest, and boosting/AdaBoost—explaining their principles, typical use cases, and key characteristics.

Naive BayesRandom ForestSupport Vector Machine
0 likes · 13 min read
Top 10 Essential Machine Learning Algorithms Every Data Scientist Should Know
JD Tech
JD Tech
Feb 12, 2019 · Artificial Intelligence

Content‑Based Filtering: Concepts, Implementation, and Pros/Cons

The article explains content‑based filtering for recommendation systems, covering its basic concepts, feature requirements, implementation using vector representations and cosine similarity, advantages and disadvantages, and supplementary algorithms such as k‑Nearest Neighbor, Rocchio, decision trees, linear classifiers, and Naive Bayes.

Naive BayesRocchiocontent-based filtering
0 likes · 11 min read
Content‑Based Filtering: Concepts, Implementation, and Pros/Cons
MaGe Linux Operations
MaGe Linux Operations
Dec 17, 2018 · Artificial Intelligence

How to Build a KNN-Based CAPTCHA Solver with OpenCV in Python

This tutorial walks through using OpenCV and a K‑Nearest Neighbors model to preprocess, segment, manually label, train, and finally recognize distorted, noisy CAPTCHA images, achieving about 82% accuracy on a test set of one hundred samples.

CaptchaImage ProcessingOpenCV
0 likes · 12 min read
How to Build a KNN-Based CAPTCHA Solver with OpenCV in Python
MaGe Linux Operations
MaGe Linux Operations
Apr 8, 2018 · Artificial Intelligence

Master Python Data Mining & Machine Learning: From Preprocessing to Classification

This comprehensive tutorial walks you through Python data mining and machine learning fundamentals, covering data preprocessing techniques, common classification algorithms, an Iris flower classification case study, and practical tips for selecting the right algorithm, all illustrated with clear code examples and visualizations.

Classification AlgorithmsNaive BayesPython
0 likes · 22 min read
Master Python Data Mining & Machine Learning: From Preprocessing to Classification
Qunar Tech Salon
Qunar Tech Salon
Mar 15, 2015 · Artificial Intelligence

Overview of Common Classification Algorithms in Data Mining

This article introduces the concepts of classification and prediction in data mining, outlines their workflow, and provides concise explanations of six widely used classification techniques—decision trees, K‑Nearest Neighbour, Support Vector Machine, Vector Space Model, Bayesian methods, and neural networks—highlighting their principles, advantages, and limitations.

Bayesiandata miningdecision tree
0 likes · 9 min read
Overview of Common Classification Algorithms in Data Mining