Old Zhang's AI Learning
Feb 14, 2026 · Artificial Intelligence

Translate Full PDFs While Preserving Layout Using LLMs – Core Code Included

This article presents a two-stage, cache-enabled pipeline that extracts text blocks from a PDF with PyMuPDF, translates them via a large language model API, and re-renders each page as an image with the Chinese text overlaid to preserve the original layout. Full Python code and usage instructions are included.

LLM · Large Language Model · PDF translation
10 min read
Old Meng AI Explorer
Jan 18, 2026 · Artificial Intelligence

How BabelDOC Preserves PDF Layout While Translating & OneAIFW Shields Your Data

Two open-source projects offer practical, privacy-preserving tools for researchers and developers: BabelDOC, a Python-based PDF translator that uses AI models while retaining the original formatting, and OneAIFW, a local AI firewall written in Zig and Rust that anonymizes sensitive data before it is sent to an LLM.

AI privacy · PDF translation · data protection
8 min read
ITPUB
Jun 6, 2020 · Artificial Intelligence

How to Use the Open‑Source OCR Translator for Videos, Games, and PDFs

This guide explains how to set up and operate a free, open-source OCR-based translator that captures on-screen text from videos, games, or PDFs. It covers registering the required Baidu AI API keys and configuring translation sources, and demonstrates the tool's performance on real content.

Baidu AI · GitHub · OCR
5 min read