IT Services Circle
Apr 6, 2026 · Information Security

Why AI-Generated Passwords Are Predictable and Insecure: Study Findings

A recent study by Irregular finds that AI models such as Claude Opus 4.6, OpenAI GPT‑5.2, and Google Gemini 3 Flash produce passwords with striking patterns: more than half of the generated passwords are predictable, which poses serious security risks even though the passwords appear strong.

Claude · GPT-5.2 · password generation
5 min read
Machine Learning Algorithms & Natural Language Processing
Mar 13, 2026 · Artificial Intelligence

Can Multimodal LLMs Beat Humans in Real Web Search? GPT‑5.2 Scores Only 36% on New BrowseComp‑V3 Benchmark

A new multimodal browsing benchmark, BrowseComp‑V3, reveals that human experts achieve a 68.03% success rate while the strongest closed‑source model, GPT‑5.2, manages just 36.17%, highlighting current limitations in deep web‑scale visual‑text reasoning and the critical role of tool‑augmented agents.

GPT-5.2 · OmniSeeker · human performance
12 min read
AI Explorer
Mar 4, 2026 · Artificial Intelligence

When AI Simulates a Nuclear Crisis: Unveiling Complex Strategic Reasoning

A groundbreaking experiment by King's College London placed top AI models, including GPT‑5.2, into a 300‑round simulated nuclear crisis, revealing that these systems can perform nuanced, narrative‑driven strategic reasoning under extreme uncertainty, hinting at future roles in high‑risk global decision‑making.

AI · GPT-5.2 · decision making
6 min read
AI Engineering
Feb 14, 2026 · Artificial Intelligence

ByteDance’s Seed 2.0 Pro Beats GPT‑5.2 High in Math Benchmarks

ByteDance’s newly released Seed 2.0 series, especially the Pro model, outperforms GPT‑5.2 High and Claude Opus on MathVista and MathVision tests, offers competitive coding scores, multimodal capabilities, and a pricing model up to four times cheaper, while still lagging behind in some programming and factual‑accuracy benchmarks.

ByteDance · Codeforces · GPT-5.2
4 min read
AI Engineering
Jan 21, 2026 · Artificial Intelligence

ChatGPT Adds Age Prediction Feature, Raising NSFW Content Concerns

OpenAI's latest update introduces an age‑prediction system that infers users' ages from behavior data. The article covers user tests that found the predictions accurate, differences in NSFW content between model versions, and a new Persona verification option for adults mistakenly flagged as minors.

ChatGPT · GPT-4 · GPT-5.2
3 min read
Programmer's Advance
Jan 21, 2026 · Industry Insights

How GPT‑5.2 and ServiceNow Are Redefining Enterprise AI Agents

The article analyzes OpenAI’s integration of GPT‑5.2 into ServiceNow’s workflow platform, detailing model variants, performance metrics, pricing, AI Agent architecture, real‑world use cases, competitive comparisons, and future enterprise AI trends, while offering practical guidance for developers.

AI agents · AI governance · Enterprise AI
16 min read
21CTO
Jan 17, 2026 · Artificial Intelligence

Can AI Agents Really Build a Functional Web Browser? Inside Cursor’s GPT‑5.2 Experiment

The article examines Cursor’s claim that hundreds of GPT‑5.2 agents autonomously built a full‑stack web browser, detailing the massive code output, the publicly shared repository, persistent compilation failures, and what the results reveal about the limits of large‑scale AI‑driven software development.

GPT-5.2 · autonomous coding · browser development
7 min read
PaperAgent
Dec 19, 2025 · Artificial Intelligence

Can We Trust AI? Inside GPT‑5.2‑Codex’s Monitorability Breakthrough

OpenAI’s new GPT‑5.2‑Codex model achieves state‑of‑the‑art performance on SWE‑Bench Pro and Terminal‑Bench 2.0, and a 90‑page technical report introduces the concept of monitorability, defining metrics, benchmark suites, and key findings about chain‑of‑thought length, RL training, and model size.

AI safety · GPT-5.2 · benchmark
10 min read
Design Hub
Dec 12, 2025 · Artificial Intelligence

GPT-5.2 Unveiled: A Cutting-Edge AI Super-Assistant Built for Real-World Work

OpenAI's newly released GPT-5.2 claims to outperform human experts on about 70% of real tasks, achieve a perfect score on the AIME 2025 competition, and deliver dramatic efficiency gains—up to 390× cost reduction—while showcasing impressive examples such as one‑shot ocean shader generation, a full 3D engine built in a single file, and visual‑perception scores rivaling top models.

AI benchmarks · GPT-5.2 · Large Language Model
8 min read
PaperAgent
Dec 12, 2025 · Artificial Intelligence

What Makes GPT‑5.2 and Gemini‑3‑Pro So Fast? Inside Their Key Features and Real‑World Tests

Gemini‑3‑Pro’s surprise debut and OpenAI’s emergency release of GPT‑5.2 highlight a shift toward faster inference, deeper reasoning, and lower hallucination rates. The article covers detailed performance metrics, three‑tier model options, extended context windows, and mixed community test results that reveal both strengths and shortcomings.

AI model performance · GPT-5.2 · Gemini-3-Pro
4 min read
ShiZhen AI
Dec 6, 2025 · Artificial Intelligence

OpenAI’s Daily Users Plunge 12 M as Gemini 3 Threatens; GPT‑5.2 Rushed for Dec 9

Amid a 6% (≈12 million) daily‑active‑user decline triggered by Google’s Gemini 3 launch, OpenAI’s leadership issued a “red‑alert”, accelerated the release of GPT‑5.2 to Dec 9, halted ad and Pulse projects, and outlined strategic risks, competitive benchmarks, and the future “Garlic” roadmap.

AI benchmarks · AI industry analysis · GPT-5.2
15 min read