Tagged articles
27 articles
Page 1 of 1
Java Architect Essentials
Java Architect Essentials
Oct 31, 2024 · Backend Development

How to Read Excel, Word, PDF, and Text Files in Java

This article explains how to use Java libraries such as Apache POI, PDFBox, and EasyExcel to read Excel, DOC/DOCX, PDF, and plain text files, providing complete code examples, required Maven dependencies, and step‑by‑step usage instructions for each file type.

Apache POIJavaPDF
0 likes · 13 min read
How to Read Excel, Word, PDF, and Text Files in Java
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Sep 2, 2024 · Artificial Intelligence

Turning PDFs and Word Docs into Searchable Knowledge for RAG Systems

This article explains why generic large language models struggle with domain‑specific data, introduces Retrieval‑Augmented Generation (RAG) as a solution, compares Word and PDF formats, outlines document‑parsing pipelines, reviews open‑source PDF tools, and presents Alibaba Cloud's rule‑based parsing architecture with performance results.

AIDocument ParsingLLM
0 likes · 13 min read
Turning PDFs and Word Docs into Searchable Knowledge for RAG Systems
MaGe Linux Operations
MaGe Linux Operations
Jan 16, 2022 · Backend Development

How to Batch‑Generate Word Files from Excel Using Python’s zipfile and python‑docx

This article demonstrates two Python‑based solutions for automatically replacing placeholders in a Word document with data from an Excel sheet—one using python‑docx (with win32com conversion for .doc files) and another leveraging the zipfile module to edit the underlying XML—complete with code, troubleshooting tips, and handling of formatting issues.

AutomationExcelWindows
0 likes · 10 min read
How to Batch‑Generate Word Files from Excel Using Python’s zipfile and python‑docx
Tencent IMWeb Frontend Team
Tencent IMWeb Frontend Team
Aug 5, 2021 · Fundamentals

Why Markdown Beats Word for Modern Writing and Development

The article compares Microsoft Word's heavyweight rich‑text workflow with lightweight, readable Markdown, explains Markdown's history and standards, and recommends several Markdown‑friendly tools for writing, documentation, and front‑end development.

DocumentationSoftwareRecommendationsWord
0 likes · 9 min read
Why Markdown Beats Word for Modern Writing and Development