Tagged articles
83 articles
Page 1 of 1
Geek Labs
Geek Labs
May 11, 2026 · Artificial Intelligence

Why AI Translators Feel Mechanical and This Tool Makes PDFs Read Like Chinese

Most AI translation services produce literal, word‑for‑word output that reads like machine‑generated text, but the open‑source Cuimao Translator on GitHub analyzes the source’s style, tone, and domain before rendering English PDFs into fluent, context‑aware Chinese using five translation modes and six built‑in quality checks.

AI translationChinese localizationClaude Code
0 likes · 8 min read
Why AI Translators Feel Mechanical and This Tool Makes PDFs Read Like Chinese
Data STUDIO
Data STUDIO
Mar 26, 2026 · Operations

10 Open‑Source Python Tools That Replace Paid SaaS Apps

The article presents ten Python libraries—pikepdf, Playwright, pdf2image + pytesseract, moviepy, pydub + ffmpeg, reportlab, yt‑dlp, watchdog, pyvirtualcam, and rich + textual—each with code samples, runtime requirements, complexity analysis, practical tips, and common pitfalls, showing how they can substitute costly commercial software while offering greater control, privacy, and customization.

Audio ProcessingAutomationFile Monitoring
0 likes · 19 min read
10 Open‑Source Python Tools That Replace Paid SaaS Apps
DevOps Coach
DevOps Coach
Jan 8, 2026 · Backend Development

How a 300‑Line Go PDF Invoice API Generates $2K/Month with Zero Overhead

This article explains how a compact Go‑based PDF invoice generator for freelancers, built as a single‑binary API, evolved from a weekend prototype to a steady side‑business earning $1,500‑$2,000 monthly, detailing its architecture, performance gains, low‑cost hosting, and monetization model.

BackendFly.ioGo
0 likes · 6 min read
How a 300‑Line Go PDF Invoice API Generates $2K/Month with Zero Overhead
Old Meng AI Explorer
Old Meng AI Explorer
Dec 26, 2025 · Operations

How PDFPatcher Turns Tedious PDF Tasks into One‑Click Solutions

Facing endless PDF hassles like missing bookmarks, stubborn watermarks, low‑resolution screenshots, and bulky files, the open‑source PDFPatcher lets you locally generate bookmarks, strip watermarks, extract images and tables, batch‑process documents, compress PDFs, and convert to PDF/A—all without costly software or privacy risks.

BatchProcessingDocumentManagementOpenSource
0 likes · 9 min read
How PDFPatcher Turns Tedious PDF Tasks into One‑Click Solutions
php Courses
php Courses
Oct 23, 2025 · Backend Development

How to Convert Filled PDF Forms into Pre‑Populated HTML Forms with PHP

Learn how to automatically extract data from completed PDF forms and generate pre‑filled HTML forms using PHP, with step‑by‑step guidance on installing pdftk, leveraging php‑pdftk or smalot/pdfparser libraries, handling field mapping, security, and best practices for seamless integration.

Form AutomationHTMLPDF
0 likes · 8 min read
How to Convert Filled PDF Forms into Pre‑Populated HTML Forms with PHP
MaGe Linux Operations
MaGe Linux Operations
Oct 9, 2025 · Cloud Native

100 Essential Kubectl Commands for Kubernetes Diagnosis (Free PDF)

The article reports a recent outage of the popular game 'Honor of Kings' with official apologies and promises to restore points, then shifts to a comprehensive guide of 100 Kubectl commands for diagnosing Kubernetes clusters, including cluster info, pod, service, network, and security checks, with a free PDF download.

K8s commandsKubernetesPDF
0 likes · 5 min read
100 Essential Kubectl Commands for Kubernetes Diagnosis (Free PDF)
Code Mala Tang
Code Mala Tang
Jul 22, 2025 · Artificial Intelligence

Convert Any PDF to Clean Markdown with a Local LLM (Gemma 3)

Learn how to transform any PDF—including scanned documents—into well‑structured Markdown using a local LLM (Gemma 3 via Ollama), Python, PyMuPDF and Pillow, without cloud APIs or API keys, by converting pages to images, prompting the model, and saving the output.

GemmaLLMOllama
0 likes · 12 min read
Convert Any PDF to Clean Markdown with a Local LLM (Gemma 3)
Java Captain
Java Captain
Jun 7, 2025 · Backend Development

Add Custom PDF Headers and Footers with Java and Free Spire.PDF

This tutorial explains how to programmatically add headers and footers—including text, images, page numbers, and separator lines—to every page of a PDF using Java and the free Spire.PDF library, with complete code examples and visual results.

PDFfooterheader
0 likes · 8 min read
Add Custom PDF Headers and Footers with Java and Free Spire.PDF
Top Architecture Tech Stack
Top Architecture Tech Stack
Apr 1, 2025 · Backend Development

Generating PDF Templates with iText in Java: Step-by-Step Guide

This article demonstrates how to create and fill PDF templates using a PDF editor and Java iText library, covering importing Word documents, inserting text, option, and image fields (including signatures and stamps), and provides complete code examples for generating customized PDFs.

Backend DevelopmentJavaPDF
0 likes · 7 min read
Generating PDF Templates with iText in Java: Step-by-Step Guide
Architect's Guide
Architect's Guide
Mar 9, 2025 · Backend Development

Generating PDF Templates with iText in Java: A Step‑by‑Step Tutorial

This article demonstrates how to create a PDF template by first designing a Word document, converting it to PDF, inserting text, option, and image fields using a PDF editor, and then programmatically filling those fields with Java iText code, including dependency setup and image insertion.

BackendJavaPDF
0 likes · 7 min read
Generating PDF Templates with iText in Java: A Step‑by‑Step Tutorial
Open Source Linux
Open Source Linux
Feb 18, 2025 · Frontend Development

Run Linux Inside a PDF: How a Teen Turned a PDF into a Live OS

A teenage developer has ingeniously run a full Linux system inside a PDF by adapting the TinyEMU RISC‑V emulator to asm.js and using PDF‑embedded JavaScript, enabling interactive command‑line access via a virtual keyboard in Chromium‑based browsers.

BrowserPDFRISC-V
0 likes · 3 min read
Run Linux Inside a PDF: How a Teen Turned a PDF into a Live OS
Java Tech Enthusiast
Java Tech Enthusiast
Feb 14, 2025 · Information Security

Electronic Signature Implementation Using Java and PDF

This tutorial explains China’s electronic‑seal regulations, shows how to generate a CA‑based X.509 certificate and a custom seal image with BouncyCastle, and demonstrates signing a PDF in Java using iText 5 to embed a visible seal and a verifiable digital signature.

BouncyCastleJavaPDF
0 likes · 16 min read
Electronic Signature Implementation Using Java and PDF
Pan Zhi's Tech Notes
Pan Zhi's Tech Notes
Feb 14, 2025 · Backend Development

How to Generate PDF Files in Java: A Complete Step‑by‑Step Tutorial

This guide walks through the need for PDF generation in business scenarios, introduces the iText library (both iText5 and iText7), shows how to add Maven dependencies, provides a minimal “Hello World” PDF example, then details converting HTML (with CSS, images, and Freemarker variables) into PDF, and discusses when custom adaptation is required.

FreemarkerHTML to PDFJava
0 likes · 13 min read
How to Generate PDF Files in Java: A Complete Step‑by‑Step Tutorial
macrozheng
macrozheng
Jan 24, 2025 · Backend Development

Boost Java Excel Performance with FastExcel: Features, Usage, and Comparison

This article introduces FastExcel, an upgraded Java library for high‑performance Excel read/write, outlines its key features, provides step‑by‑step code examples for entity creation, event listeners, writing, reading, PDF conversion, compares it with EasyExcel, and concludes with its suitability for large‑scale data processing.

ExcelFastExcelPDF
0 likes · 8 min read
Boost Java Excel Performance with FastExcel: Features, Usage, and Comparison
Python Programming Learning Circle
Python Programming Learning Circle
Jan 20, 2025 · Fundamentals

10 Practical Python Automation Scripts for Everyday Tasks

This article presents ten ready‑to‑use Python automation scripts that demonstrate how to optimize images, process videos, convert PDFs to images, fetch and post API data, monitor battery levels, correct grammar and spelling, download files, retrieve world news, and build a simple PySide2 GUI, providing clear code examples for each use case.

APIAutomationGUI
0 likes · 11 min read
10 Practical Python Automation Scripts for Everyday Tasks
Architecture Development Notes
Architecture Development Notes
Dec 1, 2024 · Fundamentals

How to Add Importance‑Sampling PDFs to a Rust Ray Tracer

This article walks through implementing probability‑density‑function (PDF) based importance sampling in a Rust ray‑tracing renderer, covering trait definitions, concrete PDF types for spheres, cosine distributions, hittable objects, quad geometry, material adjustments, and integration into the rendering loop to achieve faster convergence and higher image quality.

GraphicsImportance SamplingPDF
0 likes · 15 min read
How to Add Importance‑Sampling PDFs to a Rust Ray Tracer
Java Architect Essentials
Java Architect Essentials
Oct 31, 2024 · Backend Development

How to Read Excel, Word, PDF, and Text Files in Java

This article explains how to use Java libraries such as Apache POI, PDFBox, and EasyExcel to read Excel, DOC/DOCX, PDF, and plain text files, providing complete code examples, required Maven dependencies, and step‑by‑step usage instructions for each file type.

Apache POIJavaPDF
0 likes · 13 min read
How to Read Excel, Word, PDF, and Text Files in Java
Code Ape Tech Column
Code Ape Tech Column
Oct 8, 2024 · Backend Development

Generating Dynamic PDF Invoices with iTextPdf in Java

This article explains how to use the iTextPdf library in Java to design a PDF template, fill fixed fields, dynamically generate product tables, and merge the two parts into a single invoice PDF, including Maven dependencies and sample code.

JavaPDFSpringBoot
0 likes · 12 min read
Generating Dynamic PDF Invoices with iTextPdf in Java
Python Crawling & Data Mining
Python Crawling & Data Mining
Sep 26, 2024 · Backend Development

How to Insert One PDF into Another Using PyPDF2 in Python

This article walks through a practical Python solution for inserting pages from one PDF into another using the PyPDF2 library, covering installation, three possible approaches, the chosen method with code examples, and tips for handling multiple pages efficiently.

PDFPDF manipulationPyPDF2
0 likes · 4 min read
How to Insert One PDF into Another Using PyPDF2 in Python
Test Development Learning Exchange
Test Development Learning Exchange
Sep 2, 2024 · Backend Development

Python PDF Operations: Merging, Splitting, Encryption, Decryption, Text Extraction, Watermarking, Page Removal, Rotation, and HTML-to-PDF Conversion

This guide demonstrates how to use Python libraries such as PyPDF2, ReportLab, and WeasyPrint to merge, split, encrypt, decrypt, extract text, add watermarks, remove or rotate pages, and convert HTML files into PDFs, providing complete code examples for each operation.

PDFPDF manipulationPyPDF2
0 likes · 7 min read
Python PDF Operations: Merging, Splitting, Encryption, Decryption, Text Extraction, Watermarking, Page Removal, Rotation, and HTML-to-PDF Conversion
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Sep 2, 2024 · Artificial Intelligence

Turning PDFs and Word Docs into Searchable Knowledge for RAG Systems

This article explains why generic large language models struggle with domain‑specific data, introduces Retrieval‑Augmented Generation (RAG) as a solution, compares Word and PDF formats, outlines document‑parsing pipelines, reviews open‑source PDF tools, and presents Alibaba Cloud's rule‑based parsing architecture with performance results.

AIDocument ParsingLLM
0 likes · 13 min read
Turning PDFs and Word Docs into Searchable Knowledge for RAG Systems
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Aug 12, 2024 · Frontend Development

Implementing Multi‑Page Print Preview and PDF Export in Vue 3 with vue3‑print‑nb, html2canvas, and jsPDF

This article explains how to achieve paginated print preview and PDF generation in a Vue 3 application by using vue3‑print‑nb for on‑demand printing, html2canvas to capture DOM elements as images, and jsPDF to compose multi‑page A4 PDFs with custom headers, footers, and spacing.

PDFVue3frontend
0 likes · 12 min read
Implementing Multi‑Page Print Preview and PDF Export in Vue 3 with vue3‑print‑nb, html2canvas, and jsPDF
Full-Stack Cultivation Path
Full-Stack Cultivation Path
Aug 8, 2024 · Artificial Intelligence

MegaParse: A Precision Document Parser Built for LLMs

MegaParse is an open‑source document parser that transforms PDFs, Word, PPT, Excel and CSV files into LLM‑friendly formats, preserving full information, boosting processing efficiency, and enabling deeper semantic analysis, with quick‑start installation steps and a roadmap for future features.

AI toolsDocument ParsingLLM
0 likes · 4 min read
MegaParse: A Precision Document Parser Built for LLMs
Java Captain
Java Captain
Mar 7, 2024 · Backend Development

Various Methods for Transferring PDF Files in Java

This article explains several common approaches for transferring PDF files in Java, including HTTP, FTP, email via JavaMail, and real‑time WebSocket transmission, outlining their implementation steps, advantages, and considerations such as security and performance.

EmailFTPPDF
0 likes · 5 min read
Various Methods for Transferring PDF Files in Java
Python Programming Learning Circle
Python Programming Learning Circle
Feb 18, 2024 · Backend Development

Introduction, Installation, and Usage of PyMuPDF (Python Bindings for MuPDF)

This article provides a comprehensive overview of PyMuPDF, covering its purpose as Python bindings for the lightweight MuPDF viewer, detailed installation instructions, essential dependencies, naming conventions, and extensive usage examples for opening documents, accessing pages, extracting text and images, manipulating PDFs, and saving changes.

Document ProcessingLibraryMuPDF
0 likes · 12 min read
Introduction, Installation, and Usage of PyMuPDF (Python Bindings for MuPDF)
Java Architect Essentials
Java Architect Essentials
Dec 24, 2023 · Backend Development

How to Add Watermarks to PDFs with Spring Boot: PDFBox, iText, Ghostscript, and More

This article walks through multiple ways to add text or image watermarks to PDF files in a Spring Boot application, covering Apache PDFBox, iText, Ghostscript command‑line, Free Spire.PDF, and Aspose.PDF, with Maven dependencies, step‑by‑step code examples, and practical tips for preserving original documents.

GhostscriptJavaPDF
0 likes · 14 min read
How to Add Watermarks to PDFs with Spring Boot: PDFBox, iText, Ghostscript, and More
Python Programming Learning Circle
Python Programming Learning Circle
Nov 30, 2023 · Fundamentals

Introduction and Usage Guide for PyMuPDF (Python Bindings for MuPDF)

This article provides a comprehensive overview of PyMuPDF, covering its relationship to MuPDF, core features, installation methods, import conventions, and detailed usage examples for opening documents, handling pages, extracting text and images, and performing PDF-specific operations such as merging, splitting, and saving.

Document ProcessingLibraryMuPDF
0 likes · 12 min read
Introduction and Usage Guide for PyMuPDF (Python Bindings for MuPDF)
Python Programming Learning Circle
Python Programming Learning Circle
Nov 20, 2023 · Fundamentals

Convert Excel to PDF Using Python and Spire.XLS

This tutorial explains how to install the Spire.XLS library, import its modules, and use Python code examples to convert entire Excel workbooks, individual worksheets, or specific sheets with custom page settings into PDF files for consistent document sharing.

AutomationExcelPDF
0 likes · 4 min read
Convert Excel to PDF Using Python and Spire.XLS
21CTO
21CTO
Aug 21, 2023 · R&D Management

How John Warnock’s Vision Shaped Adobe and Modern Digital Creativity

The article chronicles John Warnock’s journey from a struggling student to co‑founder of Adobe, highlighting his technical innovations, leadership roles, entrepreneurial lessons, and lasting impact on digital publishing, graphics, and the broader tech industry.

AdobeEntrepreneurshipJohn Warnock
0 likes · 11 min read
How John Warnock’s Vision Shaped Adobe and Modern Digital Creativity
Test Development Learning Exchange
Test Development Learning Exchange
Jul 27, 2023 · Artificial Intelligence

Splitting PDF Files and Recognizing MP3 Audio with Python

This guide explains how to split a PDF into separate files using PyPDF2 and provides two Python approaches for converting MP3 audio to text—one leveraging Google Speech‑Recognition for higher accuracy and another using PocketSphinx for complete transcription—complete with ready‑to‑run code examples.

PDFPyPDF2Python
0 likes · 5 min read
Splitting PDF Files and Recognizing MP3 Audio with Python
Python Programming Learning Circle
Python Programming Learning Circle
May 24, 2023 · Fundamentals

pdf2docx: Python Library for Converting PDF Files to DOCX with Features, Limitations, Installation, and Example

The pdf2docx library uses PyMuPDF and python-docx to extract PDF layouts, paragraphs, images, and tables, offering multi‑process conversion while noting current limitations such as lack of OCR and support for only left‑to‑right languages, and provides simple pip installation and a code example.

ConversionLibraryPDF
0 likes · 3 min read
pdf2docx: Python Library for Converting PDF Files to DOCX with Features, Limitations, Installation, and Example
Top Architect
Top Architect
Mar 24, 2023 · Backend Development

Generating PDFs with Dynamic Tables and Images Using Java iText

This tutorial demonstrates how to generate PDF files in Java with iText by filling form templates, inserting images and charts, creating dynamic tables, handling page breaks, and adding custom page headers, footers, and watermarks for backend applications.

BackendCode TutorialDynamic Table
0 likes · 25 min read
Generating PDFs with Dynamic Tables and Images Using Java iText
MaGe Linux Operations
MaGe Linux Operations
Jan 4, 2023 · Fundamentals

10 Powerful Python Automation Scripts to Supercharge Your Workflow

This article presents ten practical Python automation scripts—including image and video optimizers, PDF-to‑image conversion, API data fetching, battery notifications, grammar and spell checkers, internet downloader, world news fetcher, and a PySide2 GUI—each with clear explanations and ready‑to‑run code examples.

APIImageProcessingPDF
0 likes · 12 min read
10 Powerful Python Automation Scripts to Supercharge Your Workflow
360 Quality & Efficiency
360 Quality & Efficiency
Aug 26, 2022 · Backend Development

Using ReportLab Table to Create Custom PDF Greeting Cards in Python

This article demonstrates how to use Python's ReportLab library, specifically the Table and TableStyle classes, to programmatically generate PDF documents such as custom greeting cards, covering basic Table usage, styling options, and step-by-step code examples for creating and customizing PDF layouts.

Greeting CardPDFPython
0 likes · 8 min read
Using ReportLab Table to Create Custom PDF Greeting Cards in Python
360 Quality & Efficiency
360 Quality & Efficiency
Mar 25, 2022 · Information Security

Adding a Graphical Seal and Network Timestamp Electronic Signature to PDF Documents with Java and Spire

This tutorial demonstrates how to programmatically embed a graphical seal and a trusted network timestamp electronic signature into a PDF using Java, the Spire library, and a .pfx certificate, covering preparation, signature creation, positioning, timestamp configuration, visual settings, and final saving.

Electronic SealPDFSpire
0 likes · 5 min read
Adding a Graphical Seal and Network Timestamp Electronic Signature to PDF Documents with Java and Spire
360 Quality & Efficiency
360 Quality & Efficiency
Nov 26, 2021 · Fundamentals

Generating Custom PDF Documents with ReportLab in Python

This article explains how to use Python's ReportLab library to create highly customized PDF documents, covering basic components like canvas, templates, page headers/footers, and demonstrating single‑column and double‑column layouts with text, charts, images, and tables through detailed code examples.

PDFPythonReportLab
0 likes · 9 min read
Generating Custom PDF Documents with ReportLab in Python
Architect
Architect
Nov 1, 2021 · Fundamentals

Document Rendering and Structured Data Extraction in Baidu Wenku: From Layout Data to Flow Data and Chart Metadata

The article explains Baidu Wenku's document conversion pipeline, detailing how various office formats are transformed into PDF layout data, then into adaptive flow data for mobile devices, and describes the technical methods for extracting structured content and chart metadata from PDFs and OOXML documents.

Baidu WenkuData ExtractionOOXML
0 likes · 11 min read
Document Rendering and Structured Data Extraction in Baidu Wenku: From Layout Data to Flow Data and Chart Metadata
Architect's Journey
Architect's Journey
Dec 22, 2020 · Frontend Development

Best Practices for Silent Web PDF Printing in E‑commerce Workflows

This article analyzes three implementation scenarios for silently printing PDF procurement lists in a shipping process, compares PDF‑related technologies on stability, integration and cost, and concludes with a Lodop‑based solution that converts backend data into a hidden canvas for direct printing.

LodopPDFWeb Development
0 likes · 4 min read
Best Practices for Silent Web PDF Printing in E‑commerce Workflows
Sohu Tech Products
Sohu Tech Products
Jul 15, 2020 · Frontend Development

Understanding and Using the Blob Web API in JavaScript

This article explains what a Blob is, introduces the Blob Web API with its constructor, properties, and methods, and demonstrates practical scenarios such as chunked uploads, downloading data, creating Blob URLs, converting to Base64, image compression, PDF generation, and compares Blob with ArrayBuffer, providing complete code examples for each use case.

Base64BlobFile
0 likes · 20 min read
Understanding and Using the Blob Web API in JavaScript
Full-Stack Internet Architecture
Full-Stack Internet Architecture
May 22, 2019 · Backend Development

Common Java Libraries for Project Development

This article introduces a curated list of widely used Java libraries—including logging, XML, PDF, Excel, JSON, HTTP, Apache Commons, Joda‑Time, and HTML parsing tools—explaining their main features and when to choose each for backend development projects.

Apache CommonsExcelPDF
0 likes · 5 min read
Common Java Libraries for Project Development
MaGe Linux Operations
MaGe Linux Operations
Sep 12, 2016 · Operations

Top 6 Linux PDF Viewers You Should Install Today

This guide reviews six popular PDF viewers for Linux, detailing their key features, supported formats, and step‑by‑step installation commands via apt‑get or yum, helping users choose the best tool for reading and managing PDF documents on the desktop.

EvinceFoxit ReaderLinux
0 likes · 7 min read
Top 6 Linux PDF Viewers You Should Install Today