NewBeeNLP
Author

NewBeeNLP

Always insightful, always fun

119
Articles
0
Likes
1
Views
0
Comments
Recent Articles

Latest from NewBeeNLP

100 recent articles max
NewBeeNLP
NewBeeNLP
Jun 7, 2024 · Artificial Intelligence

Scaling Laws, Synthetic Data, and New Model Architectures: What’s Next?

In a recent round‑table, experts debated the validity of scaling laws, the role of synthetic and semi‑synthetic data in overcoming data scarcity, explored alternatives to Transformers such as RNN‑based models and MOE, and examined techniques for handling long‑context inference efficiently.

Mixture of Expertsmodel architecturescaling laws
0 likes · 12 min read
Scaling Laws, Synthetic Data, and New Model Architectures: What’s Next?
NewBeeNLP
NewBeeNLP
Jun 5, 2024 · Industry Insights

How Top E‑Commerce Platforms Rerank Recommendations: Models, Metrics, Practices

This article examines the role of reranking in modern recommendation pipelines, explains why context‑aware listwise models are needed, surveys the evolution from pointwise to generative and diversity‑aware approaches, and reviews real‑world deployments at companies such as Kuaishou, Alibaba, WeChat, iQIYI, and Meituan, highlighting key challenges, evaluation metrics, and business‑rule integrations.

DiversityRecommender SystemsReranking
0 likes · 28 min read
How Top E‑Commerce Platforms Rerank Recommendations: Models, Metrics, Practices
NewBeeNLP
NewBeeNLP
Jun 3, 2024 · Industry Insights

Tech Industry Pulse: TikTok Rumors, Google Cloud Layoffs, AI Showdowns, and More

A comprehensive roundup of recent tech industry developments covering ByteDance's AI hardware plans, Google Cloud's large‑scale layoffs, Musk versus Yang Li‑kun's AI debate, OpenAI's potential restructuring, Siri's delayed AI upgrade, Pinduoduo's price‑matching tool, and several other notable corporate moves.

AITechnology Newscloud computing
0 likes · 17 min read
Tech Industry Pulse: TikTok Rumors, Google Cloud Layoffs, AI Showdowns, and More
NewBeeNLP
NewBeeNLP
May 31, 2024 · Artificial Intelligence

Can Cleaned Web Data Rival Proprietary Corpora for LLM Training?

This article analyzes whether large‑scale web crawls, when meticulously filtered and deduplicated, can match or surpass the performance of high‑quality curated datasets in training large language models, covering dataset composition, processing pipelines, experimental results, scaling‑law implications, and future data‑efficiency strategies.

Artificial IntelligenceDataset CleaningLLM
0 likes · 23 min read
Can Cleaned Web Data Rival Proprietary Corpora for LLM Training?
NewBeeNLP
NewBeeNLP
May 29, 2024 · Artificial Intelligence

How Ant’s Multimodal Team Boosted Video‑Text Retrieval by 24% and Cut Copyright Search Costs 85%

This article presents Ant Group's multimodal research on video retrieval, detailing a large Chinese video‑text pre‑training dataset, three techniques that raise video‑text semantic search performance by up to 24.5%, and an end‑to‑end video‑video copyright detection system that reduces storage by 85% and speeds up inference 18‑fold.

copyright detectionfine-grained modelinghard sample mining
0 likes · 40 min read
How Ant’s Multimodal Team Boosted Video‑Text Retrieval by 24% and Cut Copyright Search Costs 85%
NewBeeNLP
NewBeeNLP
May 28, 2024 · Artificial Intelligence

How Generative Models Are Redefining Recommendation Systems

This article reviews recent advances in generative recommendation, highlighting challenges such as item representation and multimodal fusion, and summarizing four key research papers that propose novel tokenization, collaborative integration, and transformer-based multimodal approaches to improve recommendation performance.

AI researchGenerative RecommendationLLM
0 likes · 8 min read
How Generative Models Are Redefining Recommendation Systems
NewBeeNLP
NewBeeNLP
May 24, 2024 · Artificial Intelligence

How NoteLLM Boosts Cold‑Start Recommendation with Generative Contrastive Learning

This article reviews the NoteLLM paper, which leverages Llama 2 to create richer text embeddings and automatically generate tags and categories for note recommendation, addressing cold‑start issues through a multitask prompt design, generative‑contrastive learning, and collaborative supervised fine‑tuning, and demonstrates strong offline and online gains.

EmbeddingGenerative Contrastive LearningLLM
0 likes · 14 min read
How NoteLLM Boosts Cold‑Start Recommendation with Generative Contrastive Learning
NewBeeNLP
NewBeeNLP
May 21, 2024 · Industry Insights

How EcomXL Supercharges E‑commerce Image Generation with SDXL Optimizations and 3‑Second Inference

This article details how Alibaba's Wanxiang Lab adapted the SDXL diffusion model for large‑scale e‑commerce image generation, introducing the EcomXL series, a weighted‑distillation fine‑tuning method, hierarchical model fusion, specialized ControlNet variants, and the SLAM inference accelerator to achieve high‑quality, controllable product images within three seconds while boosting business metrics.

AIGCControlNetEcomXL
0 likes · 14 min read
How EcomXL Supercharges E‑commerce Image Generation with SDXL Optimizations and 3‑Second Inference
NewBeeNLP
NewBeeNLP
May 20, 2024 · Artificial Intelligence

How RecGPT Leverages ChatGPT‑Style Prompt Tuning for Better Sequential Recommendation

RecGPT applies a ChatGPT‑like pre‑training and personalized prompt‑tuning paradigm to sequential recommendation, introducing a two‑stage recall mechanism that improves offline HR/NDCG metrics and yields modest online interaction gains in a real‑world short‑video platform.

RecGPTauto-regressive pretraininglarge language model
0 likes · 8 min read
How RecGPT Leverages ChatGPT‑Style Prompt Tuning for Better Sequential Recommendation