Tagged articles
2 articles
DeWu Technology
Jan 22, 2024 · Artificial Intelligence

How to Integrate Business Systems with LLMs: Prompt, RAG, and Fine‑Tuning Strategies

This article outlines three practical approaches to connecting enterprise applications to large language models: direct prompting, retrieval‑augmented generation (RAG), and fine‑tuning. It explains key prompt‑engineering techniques, details the RAG workflow and vector‑database integration, and provides step‑by‑step guidance for fine‑tuning on the KubeAI platform.

AI for business · Fine-tuning · KubeAI
20 min read
DeWu Technology
Jul 5, 2023 · Artificial Intelligence

Fine-tuning Large Language Models with LoRA/QLoRA and Deploying via GPTQ Quantization on KubeAI

The article explains how LoRA and its 4‑bit extension QLoRA sharply reduce the number of trainable parameters and the GPU memory needed to fine‑tune large language models, and how GPTQ post‑training quantization compresses weights for lower‑cost inference. It also shows how KubeAI integrates these techniques into a one‑click workflow for 7B, 13B, and 33B models, from data upload to API deployment.

GPTQ · KubeAI · Large Language Models
13 min read