Tag

hyperparameters

0 views collected around this technical thread.

Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Jul 5, 2024 · Artificial Intelligence

Understanding and Tuning Hyperparameters for Large Language Models

This article explores the role of hyperparameters in large language models, explains each key hyperparameter, and guides readers through manual and automated tuning methods such as random search, grid search, and Bayesian optimization to achieve optimal model performance.

LLMModel Tuningai
0 likes · 18 min read
Understanding and Tuning Hyperparameters for Large Language Models
Architects' Tech Alliance
Architects' Tech Alliance
Sep 3, 2020 · Artificial Intelligence

Deep Learning Specialization Infographic Overview

This article presents a comprehensive English summary of the deep learning specialization infographics originally shared by Andrew Ng, covering fundamentals, logistic regression, shallow and deep neural networks, regularization, optimization, hyperparameters, convolutional and recurrent networks, and practical advice for model building and evaluation.

CNNDeep LearningRNN
0 likes · 21 min read
Deep Learning Specialization Infographic Overview
Sohu Tech Products
Sohu Tech Products
Mar 6, 2019 · Artificial Intelligence

Applying Word2Vec Embeddings to Rental and News Recommendation: Model, Hyper‑parameters, and Optimization

This article explains the fundamentals of the Word2Vec SGNS model, details its hyper‑parameters and training tricks, and demonstrates how customized embeddings are built for rental‑listing and news‑article recommendation, covering data preparation, objective‑function redesign, evaluation, and deployment in both recall and ranking stages.

Cold StartSGNSWord2Vec
0 likes · 14 min read
Applying Word2Vec Embeddings to Rental and News Recommendation: Model, Hyper‑parameters, and Optimization