Tagged articles
9 articles
Page 1 of 1
Architect's Guide
Architect's Guide
Feb 25, 2026 · Backend Development

Convert Word (.docx) to PDF in Spring Boot with docx4j – A Complete Guide

Learn how to seamlessly convert uploaded .docx files to PDF in a Spring Boot application using the pure‑Java docx4j library, covering solution comparison, Maven dependencies, a utility class, controller implementation, and Windows/Linux font‑encoding fixes for reliable, high‑fidelity document rendering.

JavaPDF conversionSpring Boot
0 likes · 11 min read
Convert Word (.docx) to PDF in Spring Boot with docx4j – A Complete Guide
Java Architect Handbook
Java Architect Handbook
Feb 11, 2026 · Backend Development

Convert Word (.docx) to PDF in Spring Boot with docx4j

This guide walks you through a fully open‑source, pure‑Java solution for converting uploaded .docx files to PDF in a Spring Boot application, covering library selection, Maven dependencies, a reusable utility class, controller implementation, and handling Chinese font issues on Windows and Linux.

JavaPDF conversionSpring Boot
0 likes · 12 min read
Convert Word (.docx) to PDF in Spring Boot with docx4j
java1234
java1234
Feb 5, 2026 · Backend Development

Convert DOCX to PDF in Spring Boot with docx4j: A Lightweight Open‑Source Solution

This article walks through building a pure‑Java DOCX‑to‑PDF converter in Spring Boot using docx4j, compares it with alternatives, shows Maven dependencies, provides a utility class and controller example, and explains how to resolve Chinese font garbling on both Windows and Linux.

JavaPDF conversionSpring Boot
0 likes · 11 min read
Convert DOCX to PDF in Spring Boot with docx4j: A Lightweight Open‑Source Solution
Baidu Geek Talk
Baidu Geek Talk
Jul 26, 2021 · Artificial Intelligence

Document Rendering and Structured Extraction Techniques in Baidu Wenku

Baidu Wenku converts all document types to PDF, parses the PDF into a proprietary format, uses absolute‑position layout for PC rendering, and transforms this into flow‑type structural data for mobile devices by re‑typing layout, extracting OOXML structures, and detecting charts, thereby enabling adaptive layouts, accurate formula rendering, and interactive chart extraction.

Mobile OptimizationOOXML parsingPDF conversion
0 likes · 12 min read
Document Rendering and Structured Extraction Techniques in Baidu Wenku
Python Programming Learning Circle
Python Programming Learning Circle
Jul 3, 2021 · Artificial Intelligence

Automatic PDF Slide Transcription Using Deep Learning OCR

This article demonstrates how to automatically convert PDF slide decks into editable markdown text by first converting each page to images, then applying a deep‑learning OCR pipeline (CTPN for detection and CRNN for recognition) with Python code examples, achieving high transcription accuracy.

Deep LearningImage ProcessingOCR
0 likes · 6 min read
Automatic PDF Slide Transcription Using Deep Learning OCR