Survey of Text‑Controlled Image Generation Models: DALL·E 2, Imagen, Stable Diffusion, and ControlNet
This article reviews the key components and design choices of recent text‑controlled image generation systems, including OpenAI's DALL·E 2, Google's Imagen, Stability AI's Stable Diffusion (a latent diffusion model), and the ControlNet extension. It highlights how diffusion models, text encoders, prior modules, super‑resolution stages, and conditioning mechanisms together enable high‑quality, controllable visual synthesis.
