Tagged articles
7 articles
Page 1 of 1
Machine Heart
Machine Heart
Apr 30, 2026 · Artificial Intelligence

From Post‑hoc to Intrinsic: Cutting‑Edge Advances in Making Large Language Models More Transparent

This article surveys recent progress in intrinsic interpretability for large language models, contrasting traditional post‑hoc analysis with design‑level approaches that embed transparency into model architecture, training objectives, and information flow, and outlines five core design paradigms and their challenges.

intrinsic interpretabilitylarge language modelsmodel design principles
0 likes · 11 min read
From Post‑hoc to Intrinsic: Cutting‑Edge Advances in Making Large Language Models More Transparent
DataFunSummit
DataFunSummit
Mar 14, 2024 · Artificial Intelligence

Multi‑Level Efficiency Challenges and Emerging Paradigms for Large AI Models

The article examines how large AI models are moving toward a unified, low‑knowledge‑density paradigm that raises computational efficiency challenges across model, algorithm, framework, and infrastructure layers, while also highlighting NVIDIA's GTC 2024 China AI Day sessions that showcase practical solutions and upcoming training opportunities.

AI InfrastructureAI conferencesNVIDIA GTC
0 likes · 10 min read
Multi‑Level Efficiency Challenges and Emerging Paradigms for Large AI Models
Bilibili Tech
Bilibili Tech
Jun 13, 2023 · Artificial Intelligence

InferX Inference Framework and Its Integration with Triton for High‑Performance AI Model Serving

Bilibili’s self‑developed InferX framework, combined with NVIDIA Triton Inference Server, streamlines AI model serving by adding quantization, structured sparsity, and custom kernels, delivering up to eight‑fold throughput gains, cutting GPU usage by half, and enabling faster, cost‑effective OCR and large‑model deployments.

AI inferenceGPU utilizationInferX
0 likes · 10 min read
InferX Inference Framework and Its Integration with Triton for High‑Performance AI Model Serving
DataFunSummit
DataFunSummit
Sep 4, 2022 · Artificial Intelligence

Sparse Features in Machine Learning: Challenges, NVIDIA Ampere Structured Sparsity, Knowledge Distillation, and GAN Model Compression

This talk explores the challenges and opportunities of leveraging sparsity in machine learning models, covering fine‑grained and coarse‑grained sparsity, NVIDIA Ampere’s 2:4 structured sparsity, knowledge‑distillation techniques for converting unstructured to structured sparsity, and model compression strategies for generative adversarial networks.

Deep LearningGANGPU Acceleration
0 likes · 14 min read
Sparse Features in Machine Learning: Challenges, NVIDIA Ampere Structured Sparsity, Knowledge Distillation, and GAN Model Compression
DataFunTalk
DataFunTalk
Mar 16, 2022 · Artificial Intelligence

Parameter-Efficient Sparsity Training for the PLUG Large-Scale Language Model

This article presents the PLUG 270‑billion‑parameter Chinese language model and introduces a parameter‑efficient sparsity training (PST) framework that combines unstructured and structured pruning with low‑rank decomposition to dramatically reduce model size while preserving downstream performance.

Deep LearningPLUGParameter-Efficient Training
0 likes · 13 min read
Parameter-Efficient Sparsity Training for the PLUG Large-Scale Language Model
MaGe Linux Operations
MaGe Linux Operations
Sep 21, 2018 · Artificial Intelligence

What Classic Diagrams Reveal About Test Error, Overfitting, and Model Selection

The article presents a series of insightful diagrams that illustrate core machine‑learning concepts such as the relationship between training and test error, the dangers of under‑ and over‑fitting, Occam’s razor, feature interactions, discriminative versus generative models, loss functions, least‑squares geometry, and sparsity.

Loss FunctionsModel Selectionbias‑variance
0 likes · 6 min read
What Classic Diagrams Reveal About Test Error, Overfitting, and Model Selection
MaGe Linux Operations
MaGe Linux Operations
Apr 17, 2017 · Artificial Intelligence

Essential Machine Learning Visuals: Test Error, Overfitting, and More

This article presents a curated collection of insightful machine‑learning diagrams that illustrate key concepts such as test versus training error, under‑ and over‑fitting, Occam’s razor, feature interactions, irrelevant features, basis functions, discriminative versus generative models, loss functions, least‑squares geometry, and sparsity.

Loss FunctionsOccam's razorfeature selection
0 likes · 6 min read
Essential Machine Learning Visuals: Test Error, Overfitting, and More