Tagged articles
74 articles
Page 1 of 1
AI Engineer Programming
AI Engineer Programming
May 17, 2026 · Fundamentals

Why Are We Still Using Markdown?

The article analyses Markdown's minimalist design, its ambiguous syntax, security flaws such as ReDoS and XSS vulnerabilities, and the growing gap between its original simple transliteration goal and the complex compiler‑like features developers now demand.

CommonMarkReDoSSecurity
0 likes · 14 min read
Why Are We Still Using Markdown?
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Nov 6, 2025 · Artificial Intelligence

How to Optimize RAG Knowledge Base Construction: Parsing, Chunking, and Retrieval

This article explains why building a high‑quality RAG knowledge base is critical, outlines offline parsing techniques for multi‑format documents, presents semantic chunking strategies that preserve structure and context, and shows how to answer interview questions with a robust, production‑ready pipeline.

AI InterviewKnowledge BaseRAG
0 likes · 8 min read
How to Optimize RAG Knowledge Base Construction: Parsing, Chunking, and Retrieval
php Courses
php Courses
Sep 12, 2025 · Backend Development

Boost Your C++ Apps with RapidJSON and pugixml: A Practical Guide

This article explains why C++ developers should adopt third‑party libraries like RapidJSON for high‑performance JSON handling and pugixml for lightweight XML processing, covering integration steps, core APIs, and detailed code examples for parsing, generating, and traversing data structures.

CJSONLibrary
0 likes · 10 min read
Boost Your C++ Apps with RapidJSON and pugixml: A Practical Guide
ITPUB
ITPUB
Jun 28, 2025 · Databases

How Oracle Executes a SQL Statement: From Parsing to Optimizer

This article explains Oracle's step‑by‑step processing of a SQL statement—including the client submission, parsing checks, optimizer decision‑making, row‑source generation, and final execution—while illustrating hard and soft parses, optimizer components, and practical query‑plan queries.

OracleSQLdatabase
0 likes · 25 min read
How Oracle Executes a SQL Statement: From Parsing to Optimizer
Java Tech Enthusiast
Java Tech Enthusiast
Jan 5, 2025 · Backend Development

Six Common JSON Parsing Methods in Java

Java developers can parse JSON using six popular approaches—Jackson for high‑performance, annotation‑driven serialization; Gson for a lightweight, easy‑to‑use API; FastJSON for speed; JsonPath for XPath‑style nested extraction; org.json for simple utility; or manual parsing for full control—each suited to different performance and complexity needs.

GsonJSONJackson
0 likes · 9 min read
Six Common JSON Parsing Methods in Java
DataFunSummit
DataFunSummit
Nov 11, 2024 · Big Data

Understanding Spark SQL Parsing Layer and Its Optimizations

This talk, the third in a Spark series, introduces the Spark SQL parsing layer, explains its architecture and integration with ANTLR4, details core implementation classes, and presents a real‑world optimization case that reduces code complexity and improves maintainability.

Antlr4Big DataScala
0 likes · 15 min read
Understanding Spark SQL Parsing Layer and Its Optimizations
FunTester
FunTester
Oct 24, 2024 · Backend Development

Using gopkg.in/yaml.v3 for YAML Parsing and Generation in Go

This article introduces the YAML format, outlines its syntax rules, and demonstrates how the Go library gopkg.in/yaml.v3 can be used for parsing and generating YAML files, including basic examples, advanced struct mapping, and a summary of its key advantages for backend development.

GoYAMLparsing
0 likes · 9 min read
Using gopkg.in/yaml.v3 for YAML Parsing and Generation in Go
21CTO
21CTO
Apr 6, 2024 · Fundamentals

How I Built Pinecone: From Zero to a Working Compiler

This article chronicles the author's six‑month journey creating the Pinecone programming language, covering its core features, design decisions about compilation versus interpretation, custom lexing and parsing, implementation choices, and practical advice for anyone wanting to build their own language.

C++Compiler designProgramming Language
0 likes · 18 min read
How I Built Pinecone: From Zero to a Working Compiler
php Courses
php Courses
Jul 28, 2023 · Backend Development

Parsing and Generating XML Files in PHP

This article explains how to use PHP's built-in XML extension to parse XML files into a DOM tree and to generate XML documents programmatically, providing step‑by‑step code examples for loading, traversing, creating elements, setting attributes, and saving the resulting XML files.

DOMGenerationPHP
0 likes · 4 min read
Parsing and Generating XML Files in PHP
DataFunSummit
DataFunSummit
Mar 25, 2023 · Artificial Intelligence

How GPT‑4 Has Changed NLP Research: Community Perspectives

A collection of Zhihu answers reflects on how the release of GPT‑4 has reshaped NLP research, dividing the community into LLM‑enthusiasts and skeptics, discussing the relevance of parsing, resource‑driven research directions, and the existential challenges faced by researchers.

AIAcademic CommunityGPT-4
0 likes · 10 min read
How GPT‑4 Has Changed NLP Research: Community Perspectives
Programmer DD
Programmer DD
Mar 17, 2023 · Backend Development

Parse and Format SQL in Java Instantly with JSqlParser

This article introduces JSqlParser, a Java library that parses SQL statements into object structures, demonstrates how to extract query components with sample code, lists supported databases, and highlights additional features like formatting and framework integration.

Backend DevelopmentJSqlParserJava
0 likes · 3 min read
Parse and Format SQL in Java Instantly with JSqlParser
Tencent Cloud Developer
Tencent Cloud Developer
Mar 15, 2023 · Frontend Development

Deep Dive into Tencent Docs' Contribution to VSCode Configuration System

Tencent Docs contributed over 400 lines of core code to VSCode, extending the editor’s configuration system with new package.json fields, dynamic color theming, a conditional @If decorator, a robust expression‑tree parser, and a runtime context map, empowering developers to build highly customizable extensions while keeping the core lightweight.

ExtensionTypeScriptVSCode
0 likes · 20 min read
Deep Dive into Tencent Docs' Contribution to VSCode Configuration System
ByteDance SYS Tech
ByteDance SYS Tech
Dec 2, 2022 · Backend Development

How Sonic‑CPP Boosts JSON Parsing Speed 2.5× Faster Than RapidJSON

Sonic‑CPP, an open‑source C++ JSON library co‑developed by ByteDance’s STE and Service Framework teams, leverages SIMD vectorization, optimized memory layout, on‑demand parsing, and a compact DOM design to achieve up to 2.5× faster parsing than RapidJSON and competitive serialization performance, with extensive benchmark results and production‑grade usage.

C++JSONSIMD
0 likes · 13 min read
How Sonic‑CPP Boosts JSON Parsing Speed 2.5× Faster Than RapidJSON
JD Cloud Developers
JD Cloud Developers
Oct 18, 2022 · Databases

How SQL Engines Turn Queries into Results: Inside ASTs and ANTLR4 Parsing

This article explores the inner workings of SQL engines, detailing the end‑to‑end process from query submission through lexical analysis, parsing, abstract syntax tree generation with ANTLR4, and subsequent logical and physical plan optimization, while also demonstrating practical Java examples for SQL parsing and rewriting.

ASTAntlr4Java
0 likes · 11 min read
How SQL Engines Turn Queries into Results: Inside ASTs and ANTLR4 Parsing
DaTaobao Tech
DaTaobao Tech
Jul 14, 2022 · Frontend Development

Implementing a Simple HTML Parser in JavaScript

The article walks through building a simple JavaScript HTML parser by explaining browser parsing basics, using regular expressions to detect tags, managing a stack to match opening and closing elements, creating element and text node objects, and outlining code snippets while noting omitted features like script and style handling.

DOMHTML parserJavaScript
0 likes · 9 min read
Implementing a Simple HTML Parser in JavaScript
IT Services Circle
IT Services Circle
Jun 18, 2022 · Backend Development

Investigating an Intermittent Fastjson Generic Parsing Bug in a Java Backend

This article recounts a developer's step‑by‑step investigation of an intermittent Fastjson generic‑type parsing error in a Java backend, detailing the initial symptom, debugging process, code examples, discovery of Fastjson’s cached generic handling bug, and the resolution by upgrading to version 1.2.33.

JSONbugfastjson
0 likes · 10 min read
Investigating an Intermittent Fastjson Generic Parsing Bug in a Java Backend
Java Captain
Java Captain
Apr 2, 2022 · Backend Development

Building a Site Search Engine with Java Indexing and File Parsing

This article explains how to build a site‑wide search engine using Java, covering crawling concepts, forward and inverted indexing, module design, tokenization methods, and detailed code examples for file enumeration, HTML parsing, and index generation.

Site Searchindexingparsing
0 likes · 14 min read
Building a Site Search Engine with Java Indexing and File Parsing
Laravel Tech Community
Laravel Tech Community
Mar 8, 2022 · Backend Development

Parsing XML in PHP with DOMDocument, XMLReader, and DOMXPath

The article provides a step‑by‑step guide on parsing XML in PHP, showcasing three techniques—DOMDocument, XMLReader, and DOMXPath—to extract weather data such as condition and temperature, complete with full code examples and explanations of key functions.

DOMDocumentXMLdomxpath
0 likes · 4 min read
Parsing XML in PHP with DOMDocument, XMLReader, and DOMXPath
MaGe Linux Operations
MaGe Linux Operations
Jan 28, 2022 · Databases

How to Parse MySQL Binlog with Go: Build a Simple Replication Demo

This article demonstrates how to use Go to parse MySQL binlog events and synchronize data by implementing a lightweight replication client, covering configuration, server module, packet handling, handshake, registration, dump command, and providing complete source code examples with explanations.

BinlogGoReplication
0 likes · 16 min read
How to Parse MySQL Binlog with Go: Build a Simple Replication Demo
MaGe Linux Operations
MaGe Linux Operations
Jan 3, 2022 · Backend Development

Build a Simple Arithmetic Interpreter in Python Using PLY

This article walks through creating a Python arithmetic interpreter with the PLY library, covering token definitions, lexer rules, BNF grammar, parser implementation, operator precedence, and a runnable REPL, providing complete code and explanations for each step.

BNFPLYPython
0 likes · 9 min read
Build a Simple Arithmetic Interpreter in Python Using PLY
ELab Team
ELab Team
Oct 13, 2021 · Frontend Development

How V8 Supercharges JavaScript: Inside the Engine’s Magic

This article explores the V8 JavaScript engine’s architecture, from its historical origins and source‑code acquisition to its loading strategies, lexical analysis, parsing, compilation, and execution optimizations, illustrating how V8 achieves high performance in browsers and Node.js.

CompilationEngineJIT
0 likes · 23 min read
How V8 Supercharges JavaScript: Inside the Engine’s Magic
Tencent IMWeb Frontend Team
Tencent IMWeb Frontend Team
Aug 26, 2021 · Frontend Development

How to Fix HTML Entity Bugs That Break Rich Text Rendering

This article explains why HTML entities like "<" and ">" can disappear in rich‑text fields, analyzes the underlying tokenizer state machine, and provides a lightweight hack that inserts empty comment nodes to preserve the original text without breaking legacy rendering logic.

EntityHTMLJavaScript
0 likes · 12 min read
How to Fix HTML Entity Bugs That Break Rich Text Rendering
Tencent Cloud Developer
Tencent Cloud Developer
Aug 17, 2021 · Backend Development

Design and Implementation of a Calculation DSL and Engine

The article presents a domain‑specific language that mimics Excel formulas, a stack‑based parser and recursive engine for evaluating calculations, and a multi‑layer architecture—including a dynamic priority scheduler—to efficiently resolve field dependencies, improve maintainability, and enable monitoring across large data systems.

Backend DevelopmentCalculation EngineDSL
0 likes · 11 min read
Design and Implementation of a Calculation DSL and Engine
政采云技术
政采云技术
Jun 8, 2021 · Frontend Development

The Process of Executing JavaScript in V8

This article explains how the V8 engine parses JavaScript into an AST, generates bytecode, employs lazy compilation, caches machine code, and uses the TurboFan optimizing compiler with JIT techniques to balance speed and memory consumption during script execution.

EngineJITV8
0 likes · 12 min read
The Process of Executing JavaScript in V8
Sohu Tech Products
Sohu Tech Products
Jun 2, 2021 · Mobile Development

In‑Depth Analysis of YYModel Source Code and Its JSON Parsing Mechanism

This article provides a detailed examination of the YYModel framework’s source code, explaining how it leverages Objective‑C runtime to parse JSON into model objects, covering key files, internal classes, mapping mechanisms, and the core conversion methods with illustrative code examples.

JSONObjective‑CRuntime
0 likes · 12 min read
In‑Depth Analysis of YYModel Source Code and Its JSON Parsing Mechanism
ByteFE
ByteFE
May 20, 2021 · Fundamentals

Understanding ECMAScript: Cover Grammars and Finite Lookahead

This article explains ECMAScript's cover grammars and finite lookahead mechanisms for resolving ambiguous syntax, focusing on arrow function parameter lists versus parenthesized expressions.

Arrow FunctionsECMAScriptJavaScript
0 likes · 8 min read
Understanding ECMAScript: Cover Grammars and Finite Lookahead
Python Programming Learning Circle
Python Programming Learning Circle
Nov 5, 2020 · Backend Development

Comprehensive Guide to Python Libraries for Web Crawling, Parsing, and Web Development

This article provides an extensive overview of Python libraries and frameworks for web crawling, data extraction, HTML/XML parsing, text processing, asynchronous programming, queue management, cloud execution, and popular web development frameworks such as Django, Flask, Web2py, Tornado, and CherryPy.

data-extractionlibrariesparsing
0 likes · 9 min read
Comprehensive Guide to Python Libraries for Web Crawling, Parsing, and Web Development
FunTester
FunTester
Sep 29, 2020 · Backend Development

How to Quickly Parse Zookeeper XML Configs with Groovy: A Practical Guide

This article walks through building a Groovy utility to parse Zookeeper XML configuration files—covering parsing methods, choosing DOM for small files, implementing a reusable XMLUtil class, and demonstrating the tool with sample code and console output screenshots.

ConfigurationDOM parsingGroovy
0 likes · 5 min read
How to Quickly Parse Zookeeper XML Configs with Groovy: A Practical Guide
Sohu Tech Products
Sohu Tech Products
Jun 24, 2020 · Frontend Development

Understanding Abstract Syntax Trees (AST) and Their Applications in JavaScript Tooling

This article explains what an Abstract Syntax Tree (AST) is, how JavaScript code is parsed into ASTs, the processes of lexical and syntactic analysis, and demonstrates practical AST manipulation using tools like Esprima, Estraverse, Escodegen, and Babel to transform code such as renaming functions, converting arrow functions, and implementing on‑demand imports.

ASTbabelcode transformation
0 likes · 31 min read
Understanding Abstract Syntax Trees (AST) and Their Applications in JavaScript Tooling
Programmer DD
Programmer DD
Nov 27, 2019 · Backend Development

Master Fastjson: Fast Java JSON Parsing and Serialization Guide

This article introduces JSON basics, explains the Fastjson library from Alibaba, outlines its key features, and provides practical Java code examples for parsing, serializing, and converting between JSON strings, objects, arrays, maps, lists, and beans using Fastjson’s API.

DeserializationJSONJava
0 likes · 6 min read
Master Fastjson: Fast Java JSON Parsing and Serialization Guide
WecTeam
WecTeam
Nov 5, 2019 · Fundamentals

Unlocking V8: How JavaScript Is Parsed and Optimized for Speed

This article explains how Google's V8 engine parses JavaScript, distinguishes eager and lazy parsing, inlines functions, manages object hidden classes, and offers practical optimization tips to improve runtime performance in Chrome, Node.js, and Edge.

EngineJavaScriptV8
0 likes · 11 min read
Unlocking V8: How JavaScript Is Parsed and Optimized for Speed
Xueersi Online School Tech Team
Xueersi Online School Tech Team
Oct 25, 2019 · Backend Development

Understanding Nginx Configuration Parsing: Data Structures, ngx_conf_parse, and HTTP Block Processing

This article explains how Nginx parses its configuration file by introducing the core data structures ngx_conf_t and ngx_command_t, describing the versatile ngx_conf_parse function, and detailing the step‑by‑step processing of HTTP, server, and location blocks, including conflict resolution.

BackendConfigurationHTTP
0 likes · 18 min read
Understanding Nginx Configuration Parsing: Data Structures, ngx_conf_parse, and HTTP Block Processing
WecTeam
WecTeam
Oct 24, 2019 · Fundamentals

How to Build a JavaScript Lexer for Arithmetic Expressions Using a Finite State Machine

This article explains how to implement a lexical analyzer in JavaScript that tokenizes simple arithmetic expressions by using a finite state machine, covering the conversion from infix notation to an abstract syntax tree, token definitions, state transitions, and complete source code examples.

ASTFinite State MachineJavaScript
0 likes · 9 min read
How to Build a JavaScript Lexer for Arithmetic Expressions Using a Finite State Machine
WecTeam
WecTeam
Sep 3, 2019 · Frontend Development

Why Understanding JavaScript ASTs Is Essential for Modern Frontend Development

Understanding JavaScript's Abstract Syntax Tree (AST) is crucial for modern code analysis, enabling tools like Babel, ESLint, and Webpack to parse, transform, and generate code beyond regex, with a clear process from lexical analysis to syntax analysis and standardized specifications such as ESTree.

ASTJavaScriptcode analysis
0 likes · 9 min read
Why Understanding JavaScript ASTs Is Essential for Modern Frontend Development
MaGe Linux Operations
MaGe Linux Operations
Jul 2, 2019 · Backend Development

Master Web Scraping with BeautifulSoup: A Complete Python Guide

This tutorial introduces BeautifulSoup, a powerful Python library for parsing HTML and XML, covering installation, basic usage, tag selection, attribute extraction, navigation of parent and sibling nodes, method and CSS selectors, and best‑practice recommendations for efficient web data extraction.

Data ExtractionPythonWeb Scraping
0 likes · 30 min read
Master Web Scraping with BeautifulSoup: A Complete Python Guide
MaoDou Frontend Team
MaoDou Frontend Team
Apr 28, 2019 · Frontend Development

How JavaScript ASTs Power Babel, UglifyJS, and Code Transformations

This article explains what an Abstract Syntax Tree (AST) is, its role in JavaScript tooling such as Babel, UglifyJS, and type checking, details the lexical and syntactic analysis stages of AST generation, and demonstrates a practical Babel plugin example that modifies console output using visitors.

ASTJavaScriptVisitor Pattern
0 likes · 6 min read
How JavaScript ASTs Power Babel, UglifyJS, and Code Transformations
MaGe Linux Operations
MaGe Linux Operations
Dec 31, 2018 · Backend Development

Master Python Web Scraping: 8 Essential urllib2 Techniques

This guide walks through eight practical Python urllib2 techniques for web crawling, covering basic GET/POST requests, proxy usage, cookie management, header spoofing, page parsing with regex and BeautifulSoup, captcha handling, gzip compression, and multithreaded fetching with a simple thread pool.

GzipProxyPython
0 likes · 8 min read
Master Python Web Scraping: 8 Essential urllib2 Techniques
MaGe Linux Operations
MaGe Linux Operations
Dec 24, 2018 · Fundamentals

How Programming Languages Really Work: Inside the Compiler Journey

This article demystifies how programming languages are transformed by compilers, covering the roles of lexical analysis, parsing, abstract syntax trees, code generation, and linking, with practical Rust examples, diagrams, and references to deepen your understanding of language implementation.

Code GenerationProgramming LanguageRust
0 likes · 14 min read
How Programming Languages Really Work: Inside the Compiler Journey
UC Tech Team
UC Tech Team
Dec 11, 2018 · Frontend Development

Binary AST Proposal: Accelerating JavaScript Parsing Performance

The Binary AST proposal introduces a three‑layer binary encoding for JavaScript abstract syntax trees, aiming to reduce parsing time by up to 90 % and improve web application startup performance, with early prototypes showing promising results and growing community interest.

Binary ASTJavaScriptWebAssembly
0 likes · 6 min read
Binary AST Proposal: Accelerating JavaScript Parsing Performance
MaGe Linux Operations
MaGe Linux Operations
Apr 23, 2018 · Backend Development

Essential Python Libraries for Web Scraping and Data Processing

A comprehensive catalog of Python libraries covering network communication, web crawling frameworks, HTML/XML parsing, text manipulation, file format handling, natural language processing, browser automation, concurrency, cloud services, email processing, URL manipulation, multimedia extraction, WebSocket support, DNS resolution, computer vision, proxy servers, and other useful tools for developers.

AutomationPythonWeb Scraping
0 likes · 16 min read
Essential Python Libraries for Web Scraping and Data Processing
Java Captain
Java Captain
Apr 16, 2018 · Fundamentals

XML Parsing in Java: DOM, SAX, JDOM, and DOM4J Comparison and Examples

This article provides a comprehensive guide to parsing XML in Java, covering the official DOM and SAX methods, third‑party JDOM and DOM4J libraries, detailed code examples for each approach, and a performance and feature comparison to help developers choose the most suitable parser.

DOMDOM4JJDOM
0 likes · 20 min read
XML Parsing in Java: DOM, SAX, JDOM, and DOM4J Comparison and Examples
Tencent Music Tech Team
Tencent Music Tech Team
Apr 21, 2017 · Fundamentals

Understanding M4A File Structure and Using the Sample Table Box for Random Access

The article outlines the MP4‑based M4A container’s hierarchical box format, details the Sample Table (stbl) and its essential sub‑boxes (stts, stsc, stco/co64, stsz) that map playback time to file offsets, explains time‑scale conversion, optional boxes, and provides pseudo‑code and parsing tips for precise random‑access seeking.

M4AMedia File FormatSample Table Box
0 likes · 14 min read
Understanding M4A File Structure and Using the Sample Table Box for Random Access
ITFLY8 Architecture Home
ITFLY8 Architecture Home
Mar 4, 2017 · Frontend Development

Why JavaScript Parsing Slows Your Site and How to Speed It Up

This article examines how JavaScript parsing, compilation, and execution dominate web page startup time, presents data from V8 runtime statistics across desktop and mobile browsers, and offers practical techniques—such as code splitting, script streaming, and code caching—to dramatically reduce launch delays.

CompilationJavaScriptV8
0 likes · 15 min read
Why JavaScript Parsing Slows Your Site and How to Speed It Up
21CTO
21CTO
Nov 13, 2015 · Backend Development

Essential Python Libraries for Web Scraping and Data Processing

Discover a comprehensive collection of Python libraries covering network requests, web crawling frameworks, HTML/XML parsing, text manipulation, file format handling, natural language processing, browser automation, asynchronous programming, and more, providing developers with essential tools for efficient web scraping and data processing tasks.

PythonWeb Scrapingdata-processing
0 likes · 18 min read
Essential Python Libraries for Web Scraping and Data Processing
Qunar Tech Salon
Qunar Tech Salon
Jul 27, 2015 · Backend Development

Parsing XML with Perl in Three Lines

This short tutorial shows how to use Perl's XML::Simple module to load an XML file and iterate over its elements with just three lines of code, providing a quick backend solution for XML data extraction.

BackendCodeSnippetXML
0 likes · 2 min read
Parsing XML with Perl in Three Lines