Tagged articles

model analysis

5 articles · Page 1 of 1

May 3, 2026 · Artificial Intelligence

Alibaba’s Qwen‑Scope: A Brain‑Computer Interface for Qwen‑3.5‑27B

Qwen‑Scope adds a sparse autoencoder (SAE) to the Qwen‑3.5‑27B model, exposing a top‑K 50‑feature, residual‑stream hook across all 64 layers for interpretability, controllable generation, data analysis, and training diagnostics, while detailing installation, usage, and practical trade‑offs.

Large Language ModelQwenSAE

0 likes · 11 min read

Alibaba’s Qwen‑Scope: A Brain‑Computer Interface for Qwen‑3.5‑27B

Cognitive Technology Team

Apr 12, 2025 · Artificial Intelligence

Analyzing a Trained Neural Network: Visualizing Hidden Layers and Understanding Its Limitations

This article walks through an interactive exploration of a simple two‑hidden‑layer neural network, showing how real‑time visualizations reveal its learned representations, accuracy limits, and why constrained training leads to over‑confident yet unintelligent predictions before introducing backpropagation.

Backpropagationdeep learninghidden layers

0 likes · 10 min read

Analyzing a Trained Neural Network: Visualizing Hidden Layers and Understanding Its Limitations

Xiaohongshu Tech REDtech

Feb 17, 2025 · Artificial Intelligence

WorldSense: A New Benchmark for Evaluating Multimodal Large Models in Real‑World Scenarios

WorldSense, a new benchmark of 1,662 real‑world video‑audio clips and 3,172 QA pairs across 26 cognitive tasks, reveals that current multimodal large models achieve only 25%–48% accuracy, highlighting the crucial role of combined visual‑audio input and the difficulty of audio‑ and emotion‑related reasoning.

Multimodal AIbenchmark datasetlarge models

0 likes · 12 min read

WorldSense: A New Benchmark for Evaluating Multimodal Large Models in Real‑World Scenarios

NewBeeNLP

Oct 31, 2024 · Artificial Intelligence

How o1 Is Redefining LLM Engineering and What It Means for AI Professionals

The article examines OpenAI's o1 model, highlighting its unprecedented scientific capabilities, its shift from a chat toy to a high‑value tool, the potential impact on algorithm engineers, and the technical directions (RLHF, MCTS, PPO, PRM) that practitioners should master to stay relevant.

AILLMmodel analysis

0 likes · 8 min read

How o1 Is Redefining LLM Engineering and What It Means for AI Professionals

Baobao Algorithm Notes

Mar 22, 2024 · Artificial Intelligence

Unveiling Sora: How OpenAI Might Build Its Groundbreaking Text‑to‑Video Model

This article provides a detailed, step‑by‑step technical analysis of OpenAI's Sora text‑to‑video system, exploring its overall architecture, visual encoder‑decoder choices, Spacetime Latent Patch design, transformer‑based diffusion model, training strategies, and long‑time consistency mechanisms while referencing relevant research papers and open‑source techniques.

AISoradiffusion

0 likes · 50 min read

Unveiling Sora: How OpenAI Might Build Its Groundbreaking Text‑to‑Video Model