Tagged articles
20 articles
AI Engineer Programming
Mar 28, 2026 · Artificial Intelligence

How to Start Training Your Own AI Model: A Complete Roadmap

This guide maps the end-to-end process for building a small AI model: starting from an open-source base model and applying SFT with LoRA/QLoRA, moving through alignment techniques such as DPO or ORPO, and finishing with low-cost distillation and quantization for local deployment. It also recommends free GPU resources and essential tooling.

Alignment · Distillation · LoRA
0 likes · 12 min read
Wu Shixiong's Large Model Academy
Oct 22, 2025 · Artificial Intelligence

Mastering LLM Training: A Step‑by‑Step Blueprint from Data to Alignment

This guide walks through the complete end‑to‑end process of training a large language model from scratch, covering data collection, cleaning, tokenization, pre‑training objectives and engineering, post‑training alignment methods, scaling laws, over‑fitting mitigation, and gradient‑stability techniques.

Alignment · LLM · gradient stability
0 likes · 9 min read
Alibaba Cloud Developer
Jul 31, 2025 · Artificial Intelligence

Why Post‑Training Matters: Scaling Laws, Fine‑Tuning, and RL Strategies for LLMs

This article explores the importance of post‑training for large language models, explains scaling laws for pre‑ and post‑training, details common fine‑tuning methods (full, PEFT, LoRA), outlines alignment techniques such as RLHF, DPO, PPO, and presents practical workflows using Llama 3 and DeepSeek‑R1, while also discussing test‑time reasoning optimizations.

Alignment · Fine-tuning · LLM
0 likes · 19 min read
ITFLY8 Architecture Home
Jun 9, 2025 · Artificial Intelligence

What Are Foundation Agents? A Deep Dive into Next‑Gen AI Architectures

This article reviews the 2025 "Advances and Challenges in Foundation Agents" paper, defining the Foundation Agent concept, detailing its seven core components, exploring self‑evolution, multi‑agent collaboration, and the safety and alignment challenges required to build trustworthy, autonomous AI systems.

AI Architecture · Alignment · Foundation Agents
0 likes · 16 min read
Bilibili Tech
May 20, 2025 · Artificial Intelligence

How AnimeReward and GAPO Transform Anime Video Generation with Human Feedback

Researchers at Bilibili present Index‑Anisora, an open‑source anime video generation framework that builds a 30k‑sample reward dataset, introduces the multi‑dimensional AnimeReward model and a Gap‑Aware Preference Optimization (GAPO) method, and demonstrate through extensive automatic and human evaluations that their approach significantly outperforms baseline video generators.

Alignment · GAPO · Human Feedback
0 likes · 20 min read
Baobao Algorithm Notes
Dec 16, 2024 · Artificial Intelligence

What Do Leading Open‑Source LLMs Do After Pretraining? A Deep Dive into Post‑Training Strategies

This article surveys the post‑training pipelines of major open‑source large language models released in 2024, detailing their alignment algorithms, data synthesis, reward modeling, DPO/GRPO variants, long‑context handling, tool use, and model‑averaging techniques, and highlights emerging trends such as data‑centric pipelines and iterative weak‑to‑strong alignment.

AI research · Alignment · LLM
0 likes · 99 min read
DataFunTalk
Nov 11, 2024 · Artificial Intelligence

OpenAI VP Lilian Weng Departs and Shares Full AI Safety Talk Transcript

The article reports the departure of OpenAI research VP Lilian Weng, provides the full transcript of her recent AI safety and alignment presentation at a Bilibili event, and discusses broader concerns about OpenAI's safety culture, reinforcement learning from human feedback, and the importance of collective involvement in AI safety.

AI Safety · Alignment · OpenAI
0 likes · 10 min read
DaTaobao Tech
Jul 19, 2024 · Artificial Intelligence

Practices and Techniques for Vertical Domain Large Language Models

Vertical domain large language models, fine‑tuned on specialized data, deliver higher expertise and task performance, but require continual knowledge updates and careful alignment; techniques such as BPO‑guided instruction tuning (+1.8% accuracy), Reflexion‑based Text2API (+4% API correctness), advanced RAG preprocessing, and SFT combined with ORPO (+5.2% gain) demonstrate notable improvements while underscoring remaining challenges and collaborative opportunities.

Alignment · RAG · SFT
0 likes · 9 min read
Bilibili Tech
Jun 14, 2024 · Artificial Intelligence

Technical Report on the Index-1.9B Series: Model Variants, Pre‑training Optimizations, and Alignment Experiments

The report presents the open‑source Index‑1.9B family—base, pure, chat, and character variants—detailing benchmark results, pre‑training optimizations such as a normalized LM‑Head and deeper‑slim architectures, the importance of modest instruction data, alignment via SFT/DPO, role‑play enhancements with RAG, and acknowledges remaining safety and factual limitations.

Alignment · Instruction Tuning · LLM
0 likes · 15 min read
Baobao Algorithm Notes
May 30, 2024 · Artificial Intelligence

What’s the Latest RLHF Landscape? From PPO to ORPO Explained

This article surveys the current RLHF ecosystem, comparing on‑policy methods like PPO with off‑policy approaches such as DPO, and examines recent variants—including ReMax, GRPO, DPOP, TDPO, and ORPO—highlighting their algorithmic differences, resource trade‑offs, and practical performance insights.

Alignment · DPO · LLM
0 likes · 23 min read
Advanced AI Application Practice
Mar 29, 2023 · Frontend Development

How to Implement Alignment Features in a Graphics Editor

This article walks through the step‑by‑step implementation of alignment functions—left, center, right, top, middle, and bottom—in a graphics editor by computing AABB boxes, deriving a mixed bounding box, and applying concise JavaScript loops to adjust element positions.

AABB · Alignment · Canvas
0 likes · 6 min read
DataFunTalk
Mar 16, 2023 · Artificial Intelligence

Technical Optimizations and Breakthroughs of GPT‑4: Multimodal Capabilities, Alignment Strategies, and Predictable Scaling

The article summarizes the technical innovations behind GPT‑4, highlighting its multimodal abilities, improved alignment methods, scaling‑law‑based performance prediction, and remaining limitations, while referencing the official OpenAI technical report and community analyses.

AI research · Alignment · GPT-4
0 likes · 10 min read
58UXD
Feb 21, 2023 · Fundamentals

How to Instantly Boost UI Design Quality: Proven Tips for a Premium Look

This guide outlines practical UI design techniques—such as typographic hierarchy, strategic spacing, precise alignment, consistent styling, micro‑textures, and proper image proportions—to help designers create visually rich, high‑quality interfaces that delight users and strengthen brand perception.

Alignment · UI design · design-consistency
0 likes · 8 min read
Zhengtong Technical Team
Dec 30, 2022 · Frontend Development

Implementation of Component Drag‑and‑Drop in the Wukong Low‑Code Visual Platform

This article explains how the Wukong low‑code visual platform implements component drag‑and‑drop, covering material‑to‑canvas dragging, intra‑canvas movement, resizing via eight handles, alignment guide generation, performance optimizations using CSS transforms, and component encapsulation with Vue 3.

Alignment · Component · Drag-and-Drop
0 likes · 14 min read
Tencent Cloud Developer
Apr 27, 2022 · Artificial Intelligence

Alignment-Uniformity Representation Learning for Zero-shot Video Classification (AURL)

The AURL framework, presented by Pu Shi, introduces alignment‑uniformity aware representation learning for zero‑shot video classification, achieving up to 28% top‑1 accuracy gains on UCF101 and HMDB51, and has already boosted business metrics in Tencent’s advertising, search, and video‑channel recommendation systems.

Alignment · Computer Vision · Deep Learning
0 likes · 19 min read