Tagged articles
14 articles
IT Services Circle
May 20, 2026 · Artificial Intelligence

Google I/O 2026 Unveils Gemini Omni and Gemini 3.5 Flash – A Leap in Multimodal AI

At Google I/O 2026, Google introduced Gemini Omni, a multimodal model that can ingest any combination of text, image, audio, or video and generate high‑quality content, and Gemini 3.5 Flash, which outperforms Gemini 3.1 Pro across major benchmarks while delivering four times faster token throughput. The company also announced the Antigravity 2.0 agent platform and the Gemini Spark personal AI assistant.

AI Generation · Agent Platform · Benchmark
0 likes · 13 min read
AI Engineering
Apr 18, 2026 · Frontend Development

HyperFrames vs. Remotion: How HTML Is Redefining Programmable Video

HeyGen’s open‑source HyperFrames framework turns HTML into a programmable video format, offering faster rendering (60 s vs. 162 s plus a 4‑minute build for Remotion), smaller output files (4 MB vs. 14 MB), AI‑friendly commands, and a simple "one file in, video out" workflow that rivals React‑based Remotion.

AI Generation · ffmpeg · html video
0 likes · 5 min read
AI Open-Source Efficiency Guide
Mar 29, 2026 · Artificial Intelligence

12000‑Star baoyu-skills: One‑Click AI Toolkit for Xiaohongshu Covers, Infographics & Slides

The baoyu-skills library for Claude Code, now with over 12,000 GitHub stars, provides modular AI‑driven commands that let users generate Xiaohongshu post covers, professional infographics, and slide images in a single step, supporting multiple platforms, customizable styles, and automatic updates.

AI Generation · Automation · Claude Code
0 likes · 10 min read
Advanced AI Application Practice
Mar 22, 2026 · Backend Development

How to Auto‑Generate Test API Docs with Trae Skill in Seconds (Step‑by‑Step Guide)

The article explains why manual API documentation drains testers' time, then demonstrates how Trae Skill’s AI‑driven, customizable rules can generate accurate, real‑time interface docs in about ten seconds, complete with examples, curl commands, and reusable configurations.

AI Generation · API documentation · Software Testing
0 likes · 11 min read
JD Retail Technology
Apr 16, 2025 · Artificial Intelligence

AI‑Driven 3D Spatial Video Generation from Monocular 2D Content with MV‑HEVC Encoding

This work presents an end‑to‑end AI pipeline that transforms existing monocular 2D videos into immersive 3D spatial streams by combining DINO‑v2‑based depth estimation, multi‑branch view synthesis, and MV‑HEVC encoding, achieving up to 33 % BD‑Rate reduction, 31 % speed gains, state‑of‑the‑art visual quality, and real‑time production suitability, validated on the new StereoV1K benchmark and deployed in JD.Vision’s e‑commerce catalog.

3D video · AI Generation · AIGC
0 likes · 21 min read
AIWalker
Mar 18, 2025 · Artificial Intelligence

How ImageRAG Boosts Text‑to‑Image Generation with Retrieval‑Augmented Generation

ImageRAG introduces a retrieval‑augmented generation framework that dynamically fetches relevant images to guide diffusion models, dramatically improving the synthesis of rare and fine‑grained concepts across multiple text‑to‑image systems, as demonstrated by extensive quantitative and user studies.

AI Generation · Benchmark · ImageRAG
0 likes · 17 min read
58UXD
Jul 3, 2024 · Artificial Intelligence

Boost B‑Side Icon Design with Stable Diffusion & ControlNet: A Step‑by‑Step Guide

This tutorial shows how designers can streamline the creation of 3D frosted‑glass B‑side icons with Stable Diffusion and ControlNet, covering model setup, line‑art preparation, prompt engineering, generation parameters, and post‑processing to achieve high‑quality results efficiently.

AI Generation · ControlNet · Stable Diffusion
0 likes · 5 min read
Alibaba Cloud Big Data AI Platform
Dec 8, 2023 · Artificial Intelligence

How BeautifulPrompt Automates Prompt Engineering for Text-to-Image Generation

BeautifulPrompt, presented at EMNLP 2023, introduces a deep generative model that automatically crafts high-quality prompts from simple image descriptions, enhancing text-to-image synthesis through data-driven fine‑tuning, reward modeling, and reinforcement learning.

AI Generation · reinforcement learning · text-to-image synthesis
0 likes · 8 min read
Tencent Tech
Oct 26, 2023 · Artificial Intelligence

Unlocking Tencent Hunyuan Text‑to‑Image: A Complete Guide and Prompt Tips

This guide introduces Tencent Hunyuan's upgraded text‑to‑image model, explains its technical innovations, provides detailed prompt engineering advice, showcases example prompts and generated images across various styles, and highlights real‑world applications and performance metrics for developers and creators.

AI Generation · Large Model · Prompt engineering
0 likes · 12 min read
DaTaobao Tech
Mar 13, 2023 · Artificial Intelligence

AI‑Driven 3D Content Creation and Optimization for E‑commerce

The article presents an AI‑driven pipeline that creates, delivers, and optimizes 3D e‑commerce content by leveraging diffusion‑based generation, txt2img/img2img style transfer, Shapley‑value interpretability, and a multi‑level traffic amplification framework to overcome modeling efficiency, asset scarcity, and production cost challenges.

3D content · AI Generation · feature engineering
0 likes · 12 min read
phodal
Feb 20, 2023 · Artificial Intelligence

Prompt Engineering Secrets: Text‑to‑Image, Article & Code Generation with AI

This guide explores how to craft effective prompts for Stable Diffusion image creation, ChatGPT article writing, and GitHub Copilot code generation, covering prompt evolution, negative prompts, ControlNet enhancements, model selection, and practical tips for iterative refinement and context building.

AI Generation · ChatGPT · ControlNet
0 likes · 15 min read
Tencent Cloud Developer
Nov 1, 2022 · Artificial Intelligence

The Rise of AI-Generated Content: Technologies, Applications, and Risks

The article surveys the evolution of AI‑generated content from early art programs to modern diffusion‑based text‑to‑image and text‑to‑video models, outlines key milestones such as Stable Diffusion and DALL‑E 2, explores gaming applications, and highlights limitations, ethical concerns, and copyright risks of open‑source generative AI.

AI Generation · creative AI · text-to-image
0 likes · 22 min read
Alibaba Cloud Big Data AI Platform
Jul 29, 2022 · Artificial Intelligence

Unlock Chinese Text-to-Image Generation with EasyNLP’s Open‑Source Models

This article introduces EasyNLP’s newly integrated Chinese text‑to‑image generation framework, explains the underlying Transformer‑VQGAN architecture, provides model specifications, code snippets, performance benchmarks on multiple datasets, and step‑by‑step tutorials for fine‑tuning and inference using open‑source checkpoints.

AI Generation · Chinese NLP · EasyNLP
0 likes · 20 min read