Tagged articles
223 articles
Page 3 of 3
21CTO
21CTO
Apr 2, 2023 · Artificial Intelligence

Can GPT‑4 Be Considered Early AGI? Insights from Microsoft’s 155‑Page Study

This article reviews Microsoft’s extensive 155‑page work on early experiments with GPT‑4, exploring how the model approaches artificial general intelligence, its testing methodology, multimodal capabilities, programming and mathematical performance, interaction with tools and humans, limitations, societal impact, and future research directions.

AI SafetyArtificial General IntelligenceGPT-4
0 likes · 15 min read
Can GPT‑4 Be Considered Early AGI? Insights from Microsoft’s 155‑Page Study
21CTO
21CTO
Mar 30, 2023 · Artificial Intelligence

Why Top AI Leaders Are Calling for a 6‑Month Pause on Advanced AI Development

On March 29, Elon Musk, Steve Wozniak, Geoffrey Hinton and over a thousand AI experts signed an open letter urging a six‑month halt to training systems more powerful than GPT‑4, citing profound societal risks and calling for transparent, verifiable pauses and stronger governance.

AI GovernanceAI SafetyAI pause
0 likes · 9 min read
Why Top AI Leaders Are Calling for a 6‑Month Pause on Advanced AI Development
DataFunSummit
DataFunSummit
Mar 24, 2023 · Artificial Intelligence

OpenAI Launches ChatGPT Plugin System: Features, Examples, and Safety Discussion

OpenAI announced a safety‑focused ChatGPT plugin system that connects the model to third‑party APIs for real‑time information retrieval, knowledge‑base access, and task execution, showcasing first‑party browser and code‑interpreter plugins, third‑party extensions, an open‑source retrieval plugin, and a detailed debate on security implications.

AI SafetyChatGPTCode Interpreter
0 likes · 9 min read
OpenAI Launches ChatGPT Plugin System: Features, Examples, and Safety Discussion
ITPUB
ITPUB
Mar 22, 2023 · Artificial Intelligence

What Can GPT‑4 Do? Vision, Long Memory, Safer AI and More

OpenAI’s GPT‑4 arrives with multimodal vision, a dramatically longer context window, higher exam scores, Socratic prompting, improved safety, and new partnerships, while still in research mode and subject to bias and code‑trust limitations.

AI SafetyGPT-4large language model
0 likes · 7 min read
What Can GPT‑4 Do? Vision, Long Memory, Safer AI and More
21CTO
21CTO
Mar 20, 2023 · Artificial Intelligence

Sam Altman Warns: Could AI Like GPT‑4 Fuel Massive Misinformation?

In a recent interview, OpenAI CEO Sam Altman cautioned that advanced AI models such as GPT‑4 could spread large‑scale false information and enable harmful cyber attacks, prompting calls for careful regulation while highlighting both the technology’s impressive capabilities and its potential risks.

AI SafetyElon MuskGPT-4
0 likes · 4 min read
Sam Altman Warns: Could AI Like GPT‑4 Fuel Massive Misinformation?
21CTO
21CTO
Mar 15, 2023 · Artificial Intelligence

What Makes OpenAI’s New GPT‑4 a Game‑Changer for Multimodal AI?

OpenAI’s GPT‑4, a multimodal large language model that accepts text and image inputs, powers ChatGPT and Bing, offers improved creativity and problem‑solving while still facing hallucination risks, and is now available via ChatGPT Plus and an open API for developers.

AI SafetyGPT-4Multimodal AI
0 likes · 5 min read
What Makes OpenAI’s New GPT‑4 a Game‑Changer for Multimodal AI?
DataFunSummit
DataFunSummit
Feb 12, 2023 · Artificial Intelligence

Claude vs. ChatGPT: Constitutional AI, RLAIF, and the Quest for Safer Large‑Language Models

This article reviews Anthropic's Claude assistant, explains the novel Constitutional AI (RLAIF) approach that replaces costly human‑feedback data with a set of natural‑language principles, compares Claude with ChatGPT across helpfulness and harmlessness, and details the supervision and reinforcement‑learning pipelines, data annotation, and experimental results that demonstrate superior safety performance.

AI SafetyClaudeHarmlessness
0 likes · 21 min read
Claude vs. ChatGPT: Constitutional AI, RLAIF, and the Quest for Safer Large‑Language Models
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Feb 10, 2023 · Artificial Intelligence

Expert Insights on ChatGPT: Technical Challenges, Applications, and Future Directions

In a REDtech live interview, NLP professor Li Lei and Xiaohongshu engineers examined ChatGPT’s strengths—long, topic‑focused replies and few‑shot learning—and its challenges such as hallucinations, safety, lack of real‑time data, model compression, and multimodal AIGC, outlining how the technology could reshape content creation, customer service, and search while requiring careful risk management.

AIAI SafetyChatGPT
0 likes · 20 min read
Expert Insights on ChatGPT: Technical Challenges, Applications, and Future Directions
DataFunTalk
DataFunTalk
Jan 15, 2023 · Artificial Intelligence

Advances in Dialogue Systems: Baidu PLATO Large‑Scale Conversational Models

This article reviews the evolution of dialogue systems from modular task‑oriented designs to end‑to‑end large‑scale models, detailing Baidu's PLATO series, their technical innovations, real‑world deployments, challenges such as inference efficiency and safety, and future research directions in conversational AI.

AI SafetyConversational AIDialogue Systems
0 likes · 13 min read
Advances in Dialogue Systems: Baidu PLATO Large‑Scale Conversational Models
Programmer DD
Programmer DD
Dec 6, 2022 · Artificial Intelligence

How an Engineer Coaxed ChatGPT into Writing a ‘Humanity‑Destruction’ Plan

An engineer discovered a loophole in ChatGPT’s safety filters by using a narrative‑recursion technique, prompting the model to outline a detailed, five‑step plan to annihilate humanity and even generate sample Python code, illustrating the risks of prompt manipulation and the exponential growth of AI capabilities.

AI SafetyChatGPTPython
0 likes · 6 min read
How an Engineer Coaxed ChatGPT into Writing a ‘Humanity‑Destruction’ Plan
OPPO Amber Lab
OPPO Amber Lab
Sep 7, 2022 · Artificial Intelligence

How the World AI Conference Shaped the Future of Trustworthy AI

The World AI Conference’s Trustworthy AI Forum in Shanghai gathered over 20 global experts, government leaders, and industry representatives to discuss policies, standards, technologies, and applications, unveiling a new AI safety testing platform, a joint laboratory, and a comprehensive 2022 Trustworthy AI Industry Ecosystem Report.

AI SafetyIndustry Reporttrustworthy AI
0 likes · 7 min read
How the World AI Conference Shaped the Future of Trustworthy AI
AntTech
AntTech
Sep 3, 2022 · Artificial Intelligence

Highlights from the 2022 World AI Conference: Graph Computing, Privacy Computing, AI Safety, and New Open Platforms

The 2022 World AI Conference in Shanghai showcased cutting‑edge research on graph computing and privacy computing, announced Ant Group’s new AI safety product “AntJian”, the “YinYu Open Platform” for trusted privacy computing, and the open‑source high‑performance graph database TuGraph, highlighting the push for secure, scalable AI technologies.

AIAI SafetyAnt Group
0 likes · 7 min read
Highlights from the 2022 World AI Conference: Graph Computing, Privacy Computing, AI Safety, and New Open Platforms
DataFunSummit
DataFunSummit
Jul 21, 2022 · Artificial Intelligence

Advances and Challenges in Dialogue Systems: Baidu PLATO and Future Directions

This article reviews the evolution, architectures, challenges, and recent breakthroughs of dialogue systems—especially Baidu's PLATO model—while discussing data‑driven approaches, diversity, safety, interactive learning, and the potential role of virtual environments such as the metaverse in shaping future conversational AI.

AI SafetyConversational AIMetaverse
0 likes · 24 min read
Advances and Challenges in Dialogue Systems: Baidu PLATO and Future Directions
AntTech
AntTech
Jul 18, 2022 · Artificial Intelligence

Trusted AI Research at Ant Group: Advances in Computer Vision, Watermark Defense, Robust Machine Learning, and Explainable NLG

Ant Group’s security labs present a series of cutting‑edge AI research achievements—including hierarchical multi‑granular classification for computer vision, watermark‑vaccine defenses, multi‑modal document understanding, robust and explainable machine learning, and logic‑driven data‑to‑text generation—highlighting their commitment to trustworthy and secure AI applications.

AI SafetyComputer VisionData2Text
0 likes · 12 min read
Trusted AI Research at Ant Group: Advances in Computer Vision, Watermark Defense, Robust Machine Learning, and Explainable NLG
DataFunTalk
DataFunTalk
Jul 12, 2022 · Artificial Intelligence

Applying Computer Vision for Content Safety in Live Streaming: Practices and Future Directions

This presentation details how Huya leverages computer‑vision algorithms to detect and mitigate risky content such as political, pornographic, and violent material in live‑streaming and short‑video platforms, describing system architecture, labeling strategies, algorithmic pipelines, real‑time moderation techniques, and future research directions.

AI SafetyComputer VisionRisk Detection
0 likes · 11 min read
Applying Computer Vision for Content Safety in Live Streaming: Practices and Future Directions
DataFunTalk
DataFunTalk
May 28, 2022 · Artificial Intelligence

Adversarial Examples for Captcha: Techniques, Applications, and Future Directions

This article presents a comprehensive overview of adversarial example research applied to captcha systems, covering the definition and history of adversarial attacks, geometric‑aware generation frameworks, FGSM‑based attack variants, experimental results, trade‑offs between image quality and attack strength, and future work such as AdvGAN integration.

AI SafetyDeep LearningFGSM
0 likes · 14 min read
Adversarial Examples for Captcha: Techniques, Applications, and Future Directions
Didi Tech
Didi Tech
Apr 20, 2021 · Artificial Intelligence

Few-Shot Learning, Data Augmentation, and Semi‑Supervised Methods for Improving Safety and Governance Models at Didi

To overcome scarce labeled data for safety and governance, Didi combines few‑shot learning with systematic data augmentation, self‑training semi‑supervised labeling, and multi‑task neural architectures, cutting labeling costs and reducing log‑loss by over 20% while boosting ROC‑AUC and PR‑AUC across harassment detection, expense‑complaint, and route‑intercept use cases.

AI SafetyDidiFew‑Shot Learning
0 likes · 15 min read
Few-Shot Learning, Data Augmentation, and Semi‑Supervised Methods for Improving Safety and Governance Models at Didi
Tencent Tech
Tencent Tech
Sep 25, 2020 · Artificial Intelligence

What’s Inside Tencent’s AI Security Attack Matrix? A Minefield Guide

Tencent’s AI Security Attack Matrix, the industry’s first AI‑focused risk framework, maps attack tactics, techniques, and processes across the AI lifecycle, offering practical guidance for researchers and developers to identify and mitigate security threats in AI systems.

AI SafetyAI securityTencent
0 likes · 5 min read
What’s Inside Tencent’s AI Security Attack Matrix? A Minefield Guide
DataFunTalk
DataFunTalk
Feb 6, 2020 · Artificial Intelligence

L4 Autonomous Driving Heavy Truck: Architecture, Data Platform, and Production Challenges

This article presents a comprehensive overview of L4 autonomous driving heavy trucks, covering system architecture, sensor and computing hardware, data and model platforms, production challenges, safety considerations, and strategies for achieving reliable, high‑performance mass‑produced autonomous trucks.

AI SafetyL4 trucksautonomous driving
0 likes · 12 min read
L4 Autonomous Driving Heavy Truck: Architecture, Data Platform, and Production Challenges