Author

AI Algorithm Path

A public account focused on deep learning, computer vision, and autonomous driving perception algorithms, covering visual CV, neural networks, pattern recognition, related hardware and software configurations, and open-source projects.

138

Articles

Likes

442

Views

Comments

Latest from AI Algorithm Path

100 recent articles max

AI Algorithm Path

Jul 19, 2025 · Artificial Intelligence

Understanding LoRA and QLoRA: Techniques for Efficient LLM Fine‑Tuning

This article explains how low‑rank adaptation (LoRA) and its quantized variant (QLoRA) compress large language model weights, reduce training cost, and enable flexible adapter switching, while detailing matrix decomposition, training mechanics, and trade‑offs with concrete examples and quantitative analysis.

AdapterLLM fine-tuningLoRA

0 likes · 11 min read

Understanding LoRA and QLoRA: Techniques for Efficient LLM Fine‑Tuning

AI Algorithm Path

Jul 15, 2025 · Artificial Intelligence

Day 8: Fine‑Tuning CLIP for Image‑Text Tasks – A Beginner’s Guide

This tutorial walks through fine‑tuning OpenAI's CLIP ViT‑B/32 on a small image‑text dataset in a Kaggle notebook, covering environment setup, model loading, data preprocessing with CLIPProcessor, training a linear head, and observing loss convergence to align visual and textual embeddings.

CLIPFine-tuningHuggingFace

0 likes · 5 min read

Day 8: Fine‑Tuning CLIP for Image‑Text Tasks – A Beginner’s Guide

AI Algorithm Path

Jul 14, 2025 · Artificial Intelligence

The Most Powerful Open‑Source Agent Model: Kimi K2

Kimi K2, an open‑source trillion‑parameter AI model released by Moonshot AI, offers Base and Instruct variants, achieves leading scores on benchmarks such as SWE‑bench, LiveCodeBench and AceBench, and introduces a novel post‑training autonomous‑exploration stage with MuonClip optimization to enable robust tool use and reinforcement‑learning‑driven self‑improvement.

Autonomous AgentsKimi K2Large Language Model

0 likes · 8 min read

The Most Powerful Open‑Source Agent Model: Kimi K2

AI Algorithm Path

Jul 13, 2025 · Artificial Intelligence

How to Calculate the Right AI Model Size for Your PC (3B, 7B, 13B)

This article explains how to estimate the GPU memory required for running large language models of 3 B, 7 B, and 13 B parameters, walks through step‑by‑step calculations, shows how hardware limits affect feasibility, and offers practical optimization techniques such as quantization and CPU offloading.

AI model sizingCPU offloadingFP16

0 likes · 5 min read

How to Calculate the Right AI Model Size for Your PC (3B, 7B, 13B)

AI Algorithm Path

Jul 5, 2025 · Artificial Intelligence

Beginner’s Guide to Vision‑Language Models Day 7: How CLIP Achieves Joint Visual‑Language Understanding

This article explains CLIP’s dual‑encoder architecture—using a Vision Transformer for images and a Transformer for text—how both encoders map inputs into a shared embedding space, the role of cosine similarity, and the InfoNCE contrastive loss that drives joint visual‑language learning.

CLIPInfoNCEMulti-modal Embedding

0 likes · 8 min read

Beginner’s Guide to Vision‑Language Models Day 7: How CLIP Achieves Joint Visual‑Language Understanding

AI Algorithm Path

Jul 3, 2025 · Artificial Intelligence

Exploring Advanced, Graph, and Agentic RAG: The Evolution of Retrieval‑Augmented Generation

This article examines how Retrieval‑Augmented Generation (RAG) has progressed from simple keyword‑based retrieval to advanced semantic methods, modular architectures, graph‑enhanced reasoning, and autonomous agentic systems, highlighting each approach's workflow, benefits, limitations, and the shift toward dynamic AI decision‑making.

AIAgentic RAGKnowledge Graph

0 likes · 7 min read

Exploring Advanced, Graph, and Agentic RAG: The Evolution of Retrieval‑Augmented Generation

AI Algorithm Path

Jul 2, 2025 · Artificial Intelligence

Exploring the Open‑Source Flux.1 Kontext Dev Model for Advanced Image Editing

Black Forest Labs releases the open‑source Flux.1 Kontext Dev model, a 12‑billion‑parameter image‑editing system whose weights are publicly available; the article details its core features, benchmark‑level performance comparable to leading commercial models, access via HuggingFace, and step‑by‑step usage through Fal AI and Replicate APIs.

AI modelFal AIFlux.1

0 likes · 9 min read

Exploring the Open‑Source Flux.1 Kontext Dev Model for Advanced Image Editing

AI Algorithm Path

Jul 1, 2025 · Artificial Intelligence

Beginner’s Guide to CLIP Inference: Step‑by‑Step with Hugging Face

This tutorial walks through loading the openai/clip‑vit‑base‑patch32 model with Hugging Face, preprocessing images and text, encoding them into a shared embedding space, computing cosine similarity for zero‑shot image‑text matching, and visualizing the results, all with concrete code examples.

CLIPHugging FacePython

0 likes · 6 min read

Beginner’s Guide to CLIP Inference: Step‑by‑Step with Hugging Face

AI Algorithm Path

Jun 29, 2025 · Artificial Intelligence

Understanding CLIP: Theory, Architecture, and Zero‑Shot Vision

CLIP (Contrastive Language‑Image Pre‑training) is an OpenAI model that learns visual concepts from 400 million image‑text pairs using a dual‑encoder architecture, enabling zero‑shot classification, flexible text‑driven search, and cross‑modal reasoning, while its strengths, limitations, and emerging applications are examined in detail.

CLIPContrastive Language-Image PretrainingDual Encoder

0 likes · 15 min read

Understanding CLIP: Theory, Architecture, and Zero‑Shot Vision

AI Algorithm Path

Jun 28, 2025 · Artificial Intelligence

Implementing Greedy and Beam Decoding for Large Language Models from Scratch

This article walks through the mechanics of greedy search and beam search in large language models, demonstrates both methods with GPT‑2 on the prompt "I have a dream", visualizes the decoding trees, compares their scores, and discusses the trade‑offs between efficiency and output quality.

GPT-2Greedy SearchLLM

0 likes · 16 min read

Implementing Greedy and Beam Decoding for Large Language Models from Scratch