AI Explorer
AI Explorer
Apr 16, 2026 · Artificial Intelligence

Claude Opus 4.7: How Anthropic’s New Model Makes AI Programming Autonomous

Anthropic’s Claude Opus 4.7, released on April 16, 2026, boosts visual resolution threefold, adds self‑verifying programming ability, delivers strong benchmark gains across code review, data analysis, legal and financial tasks, and introduces new inference tiers and security controls, reshaping AI‑assisted software development.

AI programmingAnthropicClaude Opus 4.7
0 likes · 11 min read
Claude Opus 4.7: How Anthropic’s New Model Makes AI Programming Autonomous
AntTech
AntTech
Dec 6, 2025 · Artificial Intelligence

FinEval‑KR: Diagnosing Knowledge vs. Reasoning Gaps in Financial Large Language Models

FinEval‑KR, a new EMNLP2025 evaluation framework co‑authored by Shanghai University of Finance and Economics and Ant Group, separates knowledge coverage from logical reasoning to reveal why financial LLMs often hallucinate on calculation tasks, introduces KS, RS, and CS metrics, and ranks 18 state‑of‑the‑art models on a rigorously curated finance dataset.

Knowledge vs reasoningLLM evaluationfinance AI
0 likes · 14 min read
FinEval‑KR: Diagnosing Knowledge vs. Reasoning Gaps in Financial Large Language Models
Fun with Large Models
Fun with Large Models
Aug 19, 2025 · Artificial Intelligence

Deep Dive into OpenAI’s GPT‑OSS and GPT‑5: Features, Performance, and Controversies

The article provides a detailed analysis of OpenAI’s newly released open‑source GPT‑OSS models (20B and 120B) and the closed‑source GPT‑5 family, covering their architectures, training pipelines, benchmark results, practical usage observations, pricing, and the mixed user feedback that surrounds GPT‑5.

GPT-5GPT-OSSOpenAI
0 likes · 13 min read
Deep Dive into OpenAI’s GPT‑OSS and GPT‑5: Features, Performance, and Controversies
Baobao Algorithm Notes
Baobao Algorithm Notes
Apr 11, 2022 · Artificial Intelligence

Can ResNet Still Beat Transformers? A Deep Dive into Modern Training Tricks

This article reviews recent research and official PyTorch blog updates that modify ResNet architectures and training tricks, compares their performance against EfficientNet, ConvNeXt, and Vision Transformers using extensive ImageNet benchmarks, and provides both literature‑based and local evaluation results to assess whether classic CNNs remain competitive.

CNNResNetmodel benchmarking
0 likes · 13 min read
Can ResNet Still Beat Transformers? A Deep Dive into Modern Training Tricks