Tagged articles
2 articles
Page 1 of 1
Baobao Algorithm Notes
Baobao Algorithm Notes
Nov 13, 2024 · Artificial Intelligence

Why Cleaning SFT Data Is a Nightmare: Hidden JSON Formatting Pitfalls

Cleaning SFT data for LLMs is surprisingly complex, as subtle JSON formatting variations, inconsistent markdown wrappers, intent settings, and unit handling can cause model inconsistencies, requiring unified standards, careful prompt design, and extensive manual review to ensure reliable training outputs.

JSON formattingLLM data cleaningModel Training
0 likes · 8 min read
Why Cleaning SFT Data Is a Nightmare: Hidden JSON Formatting Pitfalls
Python Crawling & Data Mining
Python Crawling & Data Mining
Dec 25, 2019 · Fundamentals

35 PyCharm Power‑Tips to Boost Your Python Productivity

This article shares 35 practical PyCharm shortcuts and features—including clipboard history, JSON formatting, Linux command integration, quick code refactoring, and Git version control—to help Python developers work faster and more efficiently, with step‑by‑step instructions and visual examples.

Code RefactoringGit integrationIDE Tips
0 likes · 7 min read
35 PyCharm Power‑Tips to Boost Your Python Productivity