Author

NewBeeNLP

Always insightful, always fun

119

Articles

Likes

170

Views

Comments

Latest from NewBeeNLP

100 recent articles max

NewBeeNLP

Jun 7, 2024 · Artificial Intelligence

Scaling Laws, Synthetic Data, and New Model Architectures: What’s Next?

In a recent round‑table, experts debated the validity of scaling laws, the role of synthetic and semi‑synthetic data in overcoming data scarcity, explored alternatives to Transformers such as RNN‑based models and MOE, and examined techniques for handling long‑context inference efficiently.

Mixture of Expertsmodel architecturescaling laws

0 likes · 12 min read

Scaling Laws, Synthetic Data, and New Model Architectures: What’s Next?

NewBeeNLP

Jun 5, 2024 · Industry Insights

How Top E‑Commerce Platforms Rerank Recommendations: Models, Metrics, Practices

This article examines the role of reranking in modern recommendation pipelines, explains why context‑aware listwise models are needed, surveys the evolution from pointwise to generative and diversity‑aware approaches, and reviews real‑world deployments at companies such as Kuaishou, Alibaba, WeChat, iQIYI, and Meituan, highlighting key challenges, evaluation metrics, and business‑rule integrations.

DiversityRerankingindustry practice

0 likes · 28 min read

How Top E‑Commerce Platforms Rerank Recommendations: Models, Metrics, Practices

NewBeeNLP

Jun 3, 2024 · Industry Insights

Tech Industry Pulse: TikTok Rumors, Google Cloud Layoffs, AI Showdowns, and More

A comprehensive roundup of recent tech industry developments covering ByteDance's AI hardware plans, Google Cloud's large‑scale layoffs, Musk versus Yang Li‑kun's AI debate, OpenAI's potential restructuring, Siri's delayed AI upgrade, Pinduoduo's price‑matching tool, and several other notable corporate moves.

AITechnology Newscloud computing

0 likes · 17 min read

Tech Industry Pulse: TikTok Rumors, Google Cloud Layoffs, AI Showdowns, and More

NewBeeNLP

May 31, 2024 · Artificial Intelligence

Can Cleaned Web Data Rival Proprietary Corpora for LLM Training?

This article analyzes whether large‑scale web crawls, when meticulously filtered and deduplicated, can match or surpass the performance of high‑quality curated datasets in training large language models, covering dataset composition, processing pipelines, experimental results, scaling‑law implications, and future data‑efficiency strategies.

Artificial IntelligenceDataset CleaningLLM

0 likes · 23 min read

Can Cleaned Web Data Rival Proprietary Corpora for LLM Training?

NewBeeNLP

May 29, 2024 · Artificial Intelligence

How Ant’s Multimodal Team Boosted Video‑Text Retrieval by 24% and Cut Copyright Search Costs 85%

This article presents Ant Group's multimodal research on video retrieval, detailing a large Chinese video‑text pre‑training dataset, three techniques that raise video‑text semantic search performance by up to 24.5%, and an end‑to‑end video‑video copyright detection system that reduces storage by 85% and speeds up inference 18‑fold.

0 likes · 40 min read

How Ant’s Multimodal Team Boosted Video‑Text Retrieval by 24% and Cut Copyright Search Costs 85%

NewBeeNLP

May 28, 2024 · Artificial Intelligence

How Generative Models Are Redefining Recommendation Systems

This article reviews recent advances in generative recommendation, highlighting challenges such as item representation and multimodal fusion, and summarizing four key research papers that propose novel tokenization, collaborative integration, and transformer-based multimodal approaches to improve recommendation performance.

AI researchGenerative RecommendationLLM

0 likes · 8 min read

How Generative Models Are Redefining Recommendation Systems

NewBeeNLP

May 26, 2024 · Industry Insights

How LMSYS Chatbot Arena Ranks Yi‑Large Among Global LLMs: Insights & Methodology

The LMSYS Chatbot Arena benchmark, using blind user voting and an Elo scoring system, placed China's Yi‑Large model among the top global large language models, detailing its methodology, ranking results, and the broader implications for the AI industry.

AI benchmarkingChatbot ArenaElo ranking

0 likes · 12 min read

How LMSYS Chatbot Arena Ranks Yi‑Large Among Global LLMs: Insights & Methodology

NewBeeNLP

May 24, 2024 · Artificial Intelligence

How NoteLLM Boosts Cold‑Start Recommendation with Generative Contrastive Learning

This article reviews the NoteLLM paper, which leverages Llama 2 to create richer text embeddings and automatically generate tags and categories for note recommendation, addressing cold‑start issues through a multitask prompt design, generative‑contrastive learning, and collaborative supervised fine‑tuning, and demonstrates strong offline and online gains.

EmbeddingGenerative Contrastive LearningLLM

0 likes · 14 min read

How NoteLLM Boosts Cold‑Start Recommendation with Generative Contrastive Learning

NewBeeNLP

May 21, 2024 · Industry Insights

How EcomXL Supercharges E‑commerce Image Generation with SDXL Optimizations and 3‑Second Inference

This article details how Alibaba's Wanxiang Lab adapted the SDXL diffusion model for large‑scale e‑commerce image generation, introducing the EcomXL series, a weighted‑distillation fine‑tuning method, hierarchical model fusion, specialized ControlNet variants, and the SLAM inference accelerator to achieve high‑quality, controllable product images within three seconds while boosting business metrics.

AIGCControlNetEcomXL

0 likes · 14 min read

How EcomXL Supercharges E‑commerce Image Generation with SDXL Optimizations and 3‑Second Inference

NewBeeNLP

May 20, 2024 · Artificial Intelligence

How RecGPT Leverages ChatGPT‑Style Prompt Tuning for Better Sequential Recommendation

RecGPT applies a ChatGPT‑like pre‑training and personalized prompt‑tuning paradigm to sequential recommendation, introducing a two‑stage recall mechanism that improves offline HR/NDCG metrics and yields modest online interaction gains in a real‑world short‑video platform.

Prompt TuningRecGPTauto-regressive pretraining

0 likes · 8 min read

How RecGPT Leverages ChatGPT‑Style Prompt Tuning for Better Sequential Recommendation