Tagged articles
13 articles
Page 1 of 1
21CTO
21CTO
Jul 9, 2024 · Artificial Intelligence

How to Run Open-Source LLMs Locally with Ollama: A Step-by-Step Guide

This article explains what Ollama is, how to download it for different operating systems, and provides detailed command‑line examples for running LLaMA 2 and the multimodal LLaVA models locally, showcasing the power of open‑source large language models on your own computer.

CLILLaVALlama-2
0 likes · 7 min read
How to Run Open-Source LLMs Locally with Ollama: A Step-by-Step Guide
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Jun 12, 2024 · Artificial Intelligence

Deploy Llama‑2 on ACK with KServe, Triton, and TensorRT‑LLM – Step‑by‑Step Guide

This tutorial walks through deploying the Llama‑2‑7b‑hf model on Alibaba Cloud Kubernetes (ACK) using KServe, Triton Inference Server with the TensorRT‑LLM backend, covering prerequisites, model preparation, YAML configuration, PV/PVC setup, runtime creation, and troubleshooting steps.

AI inferenceKServeKubernetes
0 likes · 13 min read
Deploy Llama‑2 on ACK with KServe, Triton, and TensorRT‑LLM – Step‑by‑Step Guide
21CTO
21CTO
Apr 8, 2024 · Artificial Intelligence

Download and Run Ollama with LLaMA 2 and LLaVA Locally

This tutorial walks you through downloading Ollama, an open‑source LLM platform, and demonstrates how to run the Meta LLaMA 2 text model and the multimodal LLaVA model on your own computer, including command‑line usage and image‑based queries.

AI TutorialLLaVALlama-2
0 likes · 7 min read
Download and Run Ollama with LLaMA 2 and LLaVA Locally
NewBeeNLP
NewBeeNLP
Apr 1, 2024 · Artificial Intelligence

How Llama 2 Uses RLHF, PPO, Rejection Sampling, and Ghost Attention

This article provides a detailed technical walkthrough of Llama 2's Reinforcement Learning with Human Feedback pipeline, covering human preference data collection, reward‑model design and training, iterative fine‑tuning with PPO and rejection sampling, the Ghost Attention technique for multi‑turn consistency, and the resulting experimental evaluations.

Ghost AttentionLlama-2PPO
0 likes · 18 min read
How Llama 2 Uses RLHF, PPO, Rejection Sampling, and Ghost Attention
NewBeeNLP
NewBeeNLP
Mar 27, 2024 · Artificial Intelligence

Deep Dive into Llama 2: Architecture, Pre‑training, SFT, and Safety Insights

This article provides a comprehensive technical overview of Meta's Llama 2 series, covering its architectural upgrades such as Group Query Attention, the pre‑training dataset and hyper‑parameters, loss behavior, benchmark comparisons, and the supervised fine‑tuning pipeline with safety considerations.

Llama-2Model architectureRLHF
0 likes · 11 min read
Deep Dive into Llama 2: Architecture, Pre‑training, SFT, and Safety Insights
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Jan 3, 2024 · Artificial Intelligence

Llama 2: Open Foundation and Fine‑Tuned Chat Models – Ghost Attention, RLHF Results, and Safety Evaluation

This article summarizes the Llama 2 series, describing the Ghost Attention technique for maintaining system‑message consistency across multi‑turn dialogs, presenting RLHF and human evaluation results, and discussing extensive safety pre‑training, benchmark assessments, and model release details.

AI EvaluationGhost AttentionLarge Language Models
0 likes · 20 min read
Llama 2: Open Foundation and Fine‑Tuned Chat Models – Ghost Attention, RLHF Results, and Safety Evaluation
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Dec 24, 2023 · Artificial Intelligence

Llama 2: Open Foundation and Fine‑Tuned Chat Models – Overview, Training, and RLHF Details

This article provides a comprehensive English overview of Meta's Llama 2 family, describing the model sizes, pre‑training data, architectural improvements, supervised fine‑tuning, reinforcement learning with human feedback, safety evaluations, reward‑model training, and iterative optimization techniques used to produce the high‑performing Llama 2‑Chat models.

Llama-2Open‑sourceRLHF
0 likes · 33 min read
Llama 2: Open Foundation and Fine‑Tuned Chat Models – Overview, Training, and RLHF Details
21CTO
21CTO
Sep 21, 2023 · Artificial Intelligence

Falcon 180B vs Llama 2: Which Open‑Source LLM Leads the AI Race?

This article compares the open‑source large language models Falcon 180B and Meta’s Llama 2, detailing their parameter sizes, training data, licensing, variants, infrastructure, language support, and safety policies, while providing links to official resources and a side‑by‑side feature table.

AI comparisonFalcon 180BLLM
0 likes · 8 min read
Falcon 180B vs Llama 2: Which Open‑Source LLM Leads the AI Race?
UCloud Tech
UCloud Tech
Sep 11, 2023 · Artificial Intelligence

Build a Soul‑Healing Chatbot with LangChain & Llama 2: A Step‑by‑Step Guide

This article walks through constructing a domain‑specific, soul‑healing chatbot using LangChain and Llama 2, comparing fine‑tuning versus external knowledge bases, detailing environment setup, data loading, text splitting, embedding with a Chinese model, vector store creation, prompt engineering, inference, and optimization strategies.

Fine-tuningKnowledge BaseLangChain
0 likes · 14 min read
Build a Soul‑Healing Chatbot with LangChain & Llama 2: A Step‑by‑Step Guide
UCloud Tech
UCloud Tech
Aug 30, 2023 · Artificial Intelligence

Unlocking Llama 2: Architecture, Training Insights, and Cloud Deployment Guide

This article explores Meta's Llama 2 large language model—its performance, expanded training data, architectural details, evaluation results, RLHF fine‑tuning process, and step‑by‑step deployment on UCloud UK8S using Docker and Kubernetes—providing a comprehensive guide for AI practitioners.

AI deploymentLlama-2RLHF
0 likes · 11 min read
Unlocking Llama 2: Architecture, Training Insights, and Cloud Deployment Guide
php Courses
php Courses
Aug 14, 2023 · Artificial Intelligence

Guide to the Five Most Powerful Large Language Models and How to Choose Them

This article explains the fundamentals of modern large language models, outlines the top five most powerful LLMs—including GPT‑4, Claude 2, Llama 2, Orca, and Cohere—and provides practical guidance on selecting and applying them across business and development use cases.

AI applicationsClaude 2GPT-4
0 likes · 9 min read
Guide to the Five Most Powerful Large Language Models and How to Choose Them
21CTO
21CTO
Jul 23, 2023 · Artificial Intelligence

What Nathan Lambert Reveals About Meta’s Llama 2: Key Insights and Technical Deep‑Dive

This article translates and analyzes Nathan Lambert’s commentary on Meta’s Llama 2 paper, detailing the model’s architecture, training data, RLHF pipeline, reward models, evaluation methods, safety improvements, licensing terms, and the broader implications for open‑source large language models.

Llama-2Meta AIModel Evaluation
0 likes · 22 min read
What Nathan Lambert Reveals About Meta’s Llama 2: Key Insights and Technical Deep‑Dive
Baobao Algorithm Notes
Baobao Algorithm Notes
Jul 19, 2023 · Artificial Intelligence

Llama 2’s Breakthroughs: Architecture, Data, and Training Tricks Explained

Llama 2 advances open‑source large‑model research by expanding context length to 4096, adopting GQA attention, scaling training data to 2 trillion tokens, and introducing refined SFT and RLHF techniques such as Ghost Attention, margin‑based reward modeling, and iterative rejection sampling, all detailed in Meta’s 76‑page report.

Llama-2RLHFSFT
0 likes · 8 min read
Llama 2’s Breakthroughs: Architecture, Data, and Training Tricks Explained