Tagged articles

GPT-5.2

14 articles · Page 1 of 1

Jun 6, 2026 · Artificial Intelligence

Why Exam Proctors Are Targeting Smart Glasses for Cheating Prevention

The article analyzes how rapidly advancing smart‑glass technology, combined with large AI models, enables sophisticated cheating in the Chinese college entrance exam, examines market growth, outlines the evolution of cheating methods, and evaluates both exam‑room defenses and AI platform countermeasures.

AI cheatingGPT-5.2education technology

0 likes · 9 min read

Why Exam Proctors Are Targeting Smart Glasses for Cheating Prevention

IT Services Circle

Apr 6, 2026 · Information Security

Why AI-Generated Passwords Are Predictable and Insecure: Study Findings

A recent Irregular study reveals that AI models such as Claude Opus 4.6, OpenAI GPT‑5.2, and Google Gemini 3 Flash produce passwords with striking patterns, making over half of generated passwords predictable, which poses serious security risks despite appearing strong.

ClaudeGPT-5.2password generation

0 likes · 5 min read

Why AI-Generated Passwords Are Predictable and Insecure: Study Findings

Machine Learning Algorithms & Natural Language Processing

Mar 13, 2026 · Artificial Intelligence

Can Multimodal LLMs Beat Humans in Real Web Search? GPT‑5.2 Scores Only 36% on New BrowseComp‑V3 Benchmark

A new multimodal browsing benchmark, BrowseComp‑V3, reveals that human experts achieve a 68.03% success rate while the strongest closed‑source model, GPT‑5.2, manages just 36.17%, highlighting current limitations in deep web‑scale visual‑text reasoning and the critical role of tool‑augmented agents.

GPT-5.2Multimodal LLMOmniSeeker

0 likes · 12 min read

Can Multimodal LLMs Beat Humans in Real Web Search? GPT‑5.2 Scores Only 36% on New BrowseComp‑V3 Benchmark

AI Explorer

Mar 4, 2026 · Artificial Intelligence

When AI Simulates a Nuclear Crisis: Unveiling Complex Strategic Reasoning

A groundbreaking experiment by King's College London placed top AI models, including GPT‑5.2, into a 300‑round simulated nuclear crisis, revealing that these systems can perform nuanced, narrative‑driven strategic reasoning under extreme uncertainty, hinting at future roles in high‑risk global decision‑making.

AIGPT-5.2decision making

0 likes · 6 min read

When AI Simulates a Nuclear Crisis: Unveiling Complex Strategic Reasoning

AI Engineering

Feb 14, 2026 · Artificial Intelligence

ByteDance’s Seed 2.0 Pro Beats GPT‑5.2 High in Math Benchmarks

ByteDance’s newly released Seed 2.0 series, especially the Pro model, outperforms GPT‑5.2 High and Claude Opus on MathVista and MathVision tests, offers competitive coding scores, multimodal capabilities, and a pricing model up to four times cheaper, while still lagging behind in some programming and factual‑accuracy benchmarks.

ByteDanceCodeforcesGPT-5.2

0 likes · 4 min read

ByteDance’s Seed 2.0 Pro Beats GPT‑5.2 High in Math Benchmarks

AI Engineering

Jan 21, 2026 · Artificial Intelligence

ChatGPT Adds Age Prediction Feature, Raising NSFW Content Concerns

OpenAI's latest update introduces an age‑prediction system that infers users' ages from behavior data, prompting accurate user tests, version‑specific NSFW content differences, and a new Persona verification option for adults mistakenly flagged as minors.

ChatGPTGPT-4GPT-5.2

0 likes · 3 min read

ChatGPT Adds Age Prediction Feature, Raising NSFW Content Concerns

Programmer's Advance

Jan 21, 2026 · Industry Insights

How GPT‑5.2 and ServiceNow Are Redefining Enterprise AI Agents

The article analyzes OpenAI’s integration of GPT‑5.2 into ServiceNow’s workflow platform, detailing model variants, performance metrics, pricing, AI Agent architecture, real‑world use cases, competitive comparisons, and future enterprise AI trends, while offering practical guidance for developers.

AI GovernanceAI agentsEnterprise AI

0 likes · 16 min read

How GPT‑5.2 and ServiceNow Are Redefining Enterprise AI Agents

21CTO

Jan 17, 2026 · Artificial Intelligence

Can AI Agents Really Build a Functional Web Browser? Inside Cursor’s GPT‑5.2 Experiment

The article examines Cursor’s claim that hundreds of GPT‑5.2 agents autonomously built a full‑stack web browser, detailing the massive code output, the publicly shared repository, persistent compilation failures, and what the results reveal about the limits of large‑scale AI‑driven software development.

GPT-5.2autonomous codingbrowser development

0 likes · 7 min read

Can AI Agents Really Build a Functional Web Browser? Inside Cursor’s GPT‑5.2 Experiment

PaperAgent

Dec 19, 2025 · Artificial Intelligence

Can We Trust AI? Inside GPT‑5.2‑Codex’s Monitorability Breakthrough

OpenAI’s new GPT‑5.2‑Codex model achieves state‑of‑the‑art performance on SWE‑Bench Pro and Terminal‑Bench 2.0, and a 90‑page technical report introduces the concept of monitorability, defining metrics, benchmark suites, and key findings about chain‑of‑thought length, RL training, and model size.

AI safetyChain of ThoughtGPT-5.2

0 likes · 10 min read

Can We Trust AI? Inside GPT‑5.2‑Codex’s Monitorability Breakthrough

PaperAgent

Dec 14, 2025 · Artificial Intelligence

GPT‑5.2 vs Gemini 3 Pro: Coding Tests, NeurIPS 2025 Paper Insights, and RAG Refactor

The article evaluates GPT‑5.2 and Gemini 3 Pro on real‑world coding tasks, analyzes trends from the 6000 papers presented at NeurIPS 2025, and demonstrates how to extract and refactor the tree‑building component of the open‑source RAPTOR RAG system into an independent module.

AI model evaluationGPT-5.2Gemini 3 Pro

0 likes · 5 min read

GPT‑5.2 vs Gemini 3 Pro: Coding Tests, NeurIPS 2025 Paper Insights, and RAG Refactor

Design Hub

Dec 12, 2025 · Artificial Intelligence

GPT-5.2 Unveiled: A Cutting-Edge AI Super-Assistant Built for Real-World Work

OpenAI's newly released GPT-5.2 claims to outperform human experts on about 70% of real tasks, achieve a perfect score on the AIME 2025 competition, and deliver dramatic efficiency gains—up to 390× cost reduction—while showcasing impressive examples such as one‑shot ocean shader generation, a full 3D engine built in a single file, and visual‑perception scores rivaling top models.

AI benchmarksAgent AIGPT-5.2

0 likes · 8 min read

GPT-5.2 Unveiled: A Cutting-Edge AI Super-Assistant Built for Real-World Work

PaperAgent

Dec 12, 2025 · Artificial Intelligence

What Makes GPT‑5.2 and Gemini‑3‑Pro So Fast? Inside Their Key Features and Real‑World Tests

Gemini‑3‑pro’s surprise debut and OpenAI’s emergency release of GPT‑5.2 highlight a shift toward faster inference, deeper reasoning, and lower hallucination rates, with detailed performance metrics, three‑tier model options, extended context windows, and mixed community test results that reveal both strengths and shortcomings.

AI Model PerformanceGPT-5.2Gemini 3 Pro

0 likes · 4 min read

What Makes GPT‑5.2 and Gemini‑3‑Pro So Fast? Inside Their Key Features and Real‑World Tests

AI Insight Log

Dec 11, 2025 · Artificial Intelligence

GPT-5.2 Released: How It Outperforms Claude 4.5 and Gemini 3 Pro

OpenAI’s GPT‑5.2 launch introduces three specialized modes, achieves a record 55.6% score on SWE‑Bench Pro, demonstrates strong front‑end generation, adds a /compact API for long‑context efficiency, offers tiered pricing with cache discounts, and improves safety for younger users.

AI benchmarkingAI safetyGPT-5.2

0 likes · 6 min read

GPT-5.2 Released: How It Outperforms Claude 4.5 and Gemini 3 Pro

ShiZhen AI

Dec 6, 2025 · Artificial Intelligence

OpenAI’s Daily Users Plunge 12 M as Gemini 3 Threatens; GPT‑5.2 Rushed for Dec 9

Amid a 6% (≈12 million) daily‑active‑user decline triggered by Google’s Gemini 3 launch, OpenAI’s leadership issued a “red‑alert”, accelerated the release of GPT‑5.2 to Dec 9, halted ad and Pulse projects, and outlined strategic risks, competitive benchmarks, and the future “Garlic” roadmap.

AI Industry AnalysisAI benchmarksGPT-5.2

0 likes · 15 min read

OpenAI’s Daily Users Plunge 12 M as Gemini 3 Threatens; GPT‑5.2 Rushed for Dec 9