Tag

OCR

0 views collected around this technical thread.

Java Captain
Java Captain
Jun 7, 2025 · Artificial Intelligence

How to Perform OCR in Java with Spire.OCR: Step‑by‑Step Guide

This tutorial shows how to set up the Spire.OCR library in Java, configure dependencies, and write code that scans images to extract their text, complete with Maven setup, project configuration, and sample output.

JavaOCRTutorial
0 likes · 4 min read
How to Perform OCR in Java with Spire.OCR: Step‑by‑Step Guide
Selected Java Interview Questions
Selected Java Interview Questions
Jun 3, 2025 · Artificial Intelligence

Implementing OCR in Java with SpringBoot and Tess4j

This article demonstrates how to build a lightweight OCR service in Java using SpringBoot and the Tess4j library, covering dependency setup, Tesseract engine initialization, RESTful API creation, training data options, and deployment considerations.

JavaOCRRESTful API
0 likes · 7 min read
Implementing OCR in Java with SpringBoot and Tess4j
Python Programming Learning Circle
Python Programming Learning Circle
May 6, 2025 · Artificial Intelligence

Automatic Math Equation Grading with Python: Data Generation, CNN Training, Image Segmentation, and Result Feedback

This tutorial explains how to build a Python-based automatic grading system for handwritten math equations by generating synthetic character images, training a convolutional neural network, segmenting input images using projection techniques, evaluating expressions with eval, and overlaying correctness indicators on the original image.

CNNMath GradingOCR
0 likes · 28 min read
Automatic Math Equation Grading with Python: Data Generation, CNN Training, Image Segmentation, and Result Feedback
Python Programming Learning Circle
Python Programming Learning Circle
Apr 15, 2025 · Artificial Intelligence

Automatic Math Expression Grading with Python, CNN and Image Processing

This tutorial explains how to generate synthetic digit fonts, build a convolutional neural network to recognize handwritten arithmetic expressions, segment images using projection methods, evaluate the results with Python's eval function, and overlay feedback symbols on the original image, providing a complete end‑to‑end solution.

AutomationCNNImageProcessing
0 likes · 27 min read
Automatic Math Expression Grading with Python, CNN and Image Processing
Sohu Tech Products
Sohu Tech Products
Jan 8, 2025 · Artificial Intelligence

Multimodal RAG: Implementation Paths and Development Prospects

The talk outlines Multimodal RAG implementation routes, comparing OCR‑based object recognition, transformer encoder‑decoder encoding, and Visual Language Model processing, explains the ColPali late‑interaction method for multi‑dimensional vector matching, addresses scaling tensors with binarization and reranking, and recommends a hybrid long‑term strategy where VLM excels on abstract imagery while traditional OCR remains valuable.

ColPaliMultimodal RAGOCR
0 likes · 10 min read
Multimodal RAG: Implementation Paths and Development Prospects
Python Programming Learning Circle
Python Programming Learning Circle
Dec 13, 2024 · Artificial Intelligence

Batch Image Translation Demo Using Youdao OCR API with Python

This article demonstrates how to build a Python desktop application that batch‑processes cosmetic product images, sends them to Youdao's OCR translation service, and displays the translated text, covering API preparation, request parameters, signature generation, and full source code.

Image TranslationOCRPython
0 likes · 12 min read
Batch Image Translation Demo Using Youdao OCR API with Python
Test Development Learning Exchange
Test Development Learning Exchange
Dec 6, 2024 · Artificial Intelligence

Using pytesseract and Pillow for OCR: Installation, Configuration, and Accuracy Improvement Techniques

This guide explains how to install Tesseract OCR and the Python libraries pytesseract and Pillow, configure the engine path, perform image-to-text extraction with example code, and apply various preprocessing, detection, and post‑processing methods to significantly improve OCR accuracy.

Computer VisionOCRTesseract
0 likes · 8 min read
Using pytesseract and Pillow for OCR: Installation, Configuration, and Accuracy Improvement Techniques
Bilibili Tech
Bilibili Tech
Nov 8, 2024 · Artificial Intelligence

AI-Powered Game Recognition for League of Legends Live Streaming on Bilibili

Bilibili’s AI‑driven game‑recognition system extracts real‑time LoL events through OCR, hero detection and hot‑spot tagging, generating high‑energy timestamps and interactive overlays that let viewers jump to key moments and view detailed statistics, enhancing spectator engagement and analytical capabilities across major esports tournaments.

AIComputer VisionGame Recognition
0 likes · 14 min read
AI-Powered Game Recognition for League of Legends Live Streaming on Bilibili
DeWu Technology
DeWu Technology
Sep 11, 2024 · Frontend Development

Advanced Watermark Techniques and OCR Integration for Front-End Applications

The article details progressive front‑end watermark schemes—from a basic canvas overlay to mutation‑observer‑protected, hide‑ and cover‑resistant, and low‑opacity dark watermarks—and explains how adaptive tone handling, contrast tuning, region cropping, and a hybrid OCR pipeline (internal service with tesseract.js fallback) ensure robust, invisible data protection and accurate screenshot analysis.

CanvasFront-endMutationObserver
0 likes · 20 min read
Advanced Watermark Techniques and OCR Integration for Front-End Applications
Java Architect Essentials
Java Architect Essentials
Sep 6, 2024 · Artificial Intelligence

Integrating Tess4J OCR into a Spring Boot Application

This guide explains how to set up a Spring Boot project, add the Tess4J dependency, configure language data, implement an OCR service and REST controller, and test both local file uploads and remote image URLs for text recognition.

JavaOCRREST
0 likes · 6 min read
Integrating Tess4J OCR into a Spring Boot Application
Python Programming Learning Circle
Python Programming Learning Circle
Sep 4, 2024 · Artificial Intelligence

Building an Automatic Math Grading System with Python: Data Generation, CNN Training, Image Segmentation, and Result Feedback

This tutorial explains how to create an automatic math‑grading tool in Python by generating synthetic digit images, training a small CNN on the data, segmenting handwritten equations with projection techniques, recognizing characters, evaluating the expressions, and overlaying the results back onto the original image.

AutomationCNNOCR
0 likes · 30 min read
Building an Automatic Math Grading System with Python: Data Generation, CNN Training, Image Segmentation, and Result Feedback
Python Programming Learning Circle
Python Programming Learning Circle
Jun 29, 2024 · Backend Development

Python Parking Lot Management System – Project Structure and Core Code

This article presents a Python parking‑lot management application, detailing its directory layout, explaining each module such as button handling, OCR utilities, and time processing, and showcasing the main pygame‑based code that renders parking information and visualizes remaining spaces.

OCRProject StructurePygame
0 likes · 5 min read
Python Parking Lot Management System – Project Structure and Core Code
Python Programming Learning Circle
Python Programming Learning Circle
Apr 18, 2024 · Artificial Intelligence

Implementing an Automatic Math Expression Grading System with Python and Convolutional Neural Networks

This tutorial walks through building a self‑trained OCR pipeline that generates synthetic digit images, trains a CNN model, segments handwritten math expressions, predicts each character, evaluates the arithmetic result, and overlays checkmarks, crosses or answers onto the original image.

AutomationCNNOCR
0 likes · 28 min read
Implementing an Automatic Math Expression Grading System with Python and Convolutional Neural Networks
Top Architect
Top Architect
Mar 13, 2024 · Backend Development

Integrating Tess4J OCR into a Spring Boot Backend Service

This tutorial walks through setting up a Spring Boot backend, adding the Tess4J OCR library, creating a service and REST controller to recognize text from both local files and remote image URLs, and provides testing steps and deployment tips.

JavaOCRREST API
0 likes · 8 min read
Integrating Tess4J OCR into a Spring Boot Backend Service
Top Architect
Top Architect
Mar 6, 2024 · Backend Development

Integrating Tess4J OCR into a Spring Boot Backend Service

This guide demonstrates how to integrate Tess4J OCR into a Spring Boot application, covering environment setup, Maven dependencies, adding language data, creating an OCR service class, building REST endpoints for local and remote image processing, and testing the solution.

JavaOCRREST
0 likes · 8 min read
Integrating Tess4J OCR into a Spring Boot Backend Service
Code Ape Tech Column
Code Ape Tech Column
Feb 2, 2024 · Artificial Intelligence

Integrating Tess4J OCR into a Spring Boot Application

This guide walks through setting up a Spring Boot project, adding Tess4J dependencies, configuring language data, implementing an OCR service class, exposing REST endpoints for local and remote image recognition, and testing the OCR functionality end‑to‑end.

JavaOCRREST API
0 likes · 6 min read
Integrating Tess4J OCR into a Spring Boot Application
Test Development Learning Exchange
Test Development Learning Exchange
Jan 21, 2024 · Fundamentals

How to Extract MP3 Files from a PDF Using Python

This guide explains step‑by‑step how to install required Python libraries, extract text and images from a PDF, perform OCR on the images, locate embedded MP3 data in the combined text, and save the audio file, providing complete sample code for each stage.

MP3 extractionOCRPDF
0 likes · 4 min read
How to Extract MP3 Files from a PDF Using Python