Tagged articles

268 articles

Page 3 of 3

Mar 29, 2024 · Artificial Intelligence

Text-to-SQL with Large Language Models: DIN-SQL Approach

The DIN‑SQL approach enhances Text‑to‑SQL performance by using large language models in a decomposed in‑context learning framework with schema linking, query classification, SQL generation, and self‑correction modules, achieving state‑of‑the‑art 85.3% execution accuracy on the Spider benchmark by breaking complex queries into manageable sub‑tasks.

AI researchDatabase QueryingNLP

0 likes · 34 min read

Text-to-SQL with Large Language Models: DIN-SQL Approach

Open Source Tech Hub

Mar 17, 2024 · Artificial Intelligence

What Is Grok? Inside Elon Musk’s New Open‑Source LLM and the ‘Grokking’ Phenomenon

Elon Musk announced the open‑source release of Grok, xAI’s new large‑language‑model chatbot, while recalling his lawsuit against OpenAI; the article explains Grok’s rapid development, links to the GitHub repository, summarizes the seminal “Grokking” research paper that describes a sudden generalization breakthrough in neural networks, and provides reference links.

AI researchgrokgrokking

0 likes · 3 min read

What Is Grok? Inside Elon Musk’s New Open‑Source LLM and the ‘Grokking’ Phenomenon

JD Retail Technology

Mar 12, 2024 · Artificial Intelligence

Multimodal Large Models: Recent Advances, Industry Impact, and Challenges – An Expert Interview

In a detailed interview, Tsinghua researcher Zhao Sicheng and JD Retail senior director Peng Changping discuss the latest progress in multimodal large models, their practical applications in advertising and e‑commerce, persistent challenges such as hallucinations and data alignment, and the skills engineers need to thrive in the emerging AI era.

AI researchMultimodal AIe‑commerce

0 likes · 19 min read

Multimodal Large Models: Recent Advances, Industry Impact, and Challenges – An Expert Interview

Baobao Algorithm Notes

Mar 10, 2024 · Artificial Intelligence

Unlocking Large Model Power: 5 Effective Model Fusion Techniques Explained

This article examines why ensemble methods are crucial for large language models, outlines five core fusion strategies—including model integration, probability integration, graft learning, crowdsourced voting, and Mixture of Experts—provides implementation details, pseudo‑code, and discusses practical challenges and recent research advances.

AI researchMixture of ExpertsModel Fusion

0 likes · 16 min read

Unlocking Large Model Power: 5 Effective Model Fusion Techniques Explained

DataFunTalk

Mar 10, 2024 · Artificial Intelligence

Aligning Graph Models with Large Language Models for Open-Task Scenarios

This talk presents GraphTranslator, a framework that bridges pretrained graph models and large language models to enable unified handling of both predefined and open-ended graph analysis tasks by translating node representations into language tokens and training an alignment producer for node‑text pairs.

AI researchModel Alignmentgraph neural networks

0 likes · 3 min read

Aligning Graph Models with Large Language Models for Open-Task Scenarios

Sohu Tech Products

Mar 6, 2024 · Artificial Intelligence

Analysis of OpenAI Sora: Data Engineering, Network Architecture, and World Model Implications

OpenAI’s Sora video model unifies image and video data into latent spacetime patches via a VAE, trains on original resolutions with GPT‑4‑expanded captions, employs a Diffusion Transformer backbone for patch‑wise denoising, and demonstrates 3D‑consistent, long‑term world‑model capabilities that hint at a unified computer‑vision paradigm and steps toward AGI.

AI researchOpenAI SoraTransformer

0 likes · 9 min read

Analysis of OpenAI Sora: Data Engineering, Network Architecture, and World Model Implications

NetEase Smart Enterprise Tech+

Feb 28, 2024 · Artificial Intelligence

Mastering Multi-Task Learning: Network Designs & Loss Balancing

This article reviews the challenges of multi‑task learning, compares various network architectures such as hard‑parameter sharing, MMoE, CGC, and PLE, and examines loss‑balancing techniques like GradNorm, Dynamic Weight Average and task‑prioritization, offering insights on how to mitigate the “seesaw” effect and improve overall performance.

AI researchNeural Networksdynamic weighting

0 likes · 15 min read

Mastering Multi-Task Learning: Network Designs & Loss Balancing

21CTO

Feb 17, 2024 · Artificial Intelligence

How OpenAI’s Sora Is Pushing Video Generation to New Frontiers

OpenAI’s Sora model demonstrates large‑scale text‑conditional video generation using a diffusion transformer that operates on spatiotemporal patches, supporting variable durations, resolutions, and aspect ratios while showcasing emergent simulation abilities, flexible sampling, and multimodal editing capabilities, though it still has notable limitations.

AI researchSoraTransformer

0 likes · 19 min read

How OpenAI’s Sora Is Pushing Video Generation to New Frontiers

NewBeeNLP

Feb 12, 2024 · Artificial Intelligence

Beyond Dual‑Tower: Advanced Distillation and Interaction Techniques for Recommendation Systems

This article reviews recent advances that enhance dual‑tower recommendation models by injecting interaction information through various knowledge‑distillation strategies and interaction‑enhanced architectures, summarizing methods such as PFD, ENDX, TRMD, VIRT, Distilled‑DualEncoder, ERNIE‑Search, ColBert, IntTower and MVKE.

AI researchdual-towerinteraction modeling

0 likes · 13 min read

Beyond Dual‑Tower: Advanced Distillation and Interaction Techniques for Recommendation Systems

IT Services Circle

Jan 3, 2024 · Artificial Intelligence

Sergey Brin’s Role in Google’s Gemini AI Model and His Return to Technical Work

The article recounts Sergey Brin’s surprising appearance as a core contributor to Google’s Gemini AI model, tracing his early technical career, his semi‑retirement from Alphabet, the company’s AI challenges after ChatGPT, and how Brin returned to help develop Gemini, highlighting internal reactions and his lasting influence.

AI researchGeminiGoogle

0 likes · 9 min read

Sergey Brin’s Role in Google’s Gemini AI Model and His Return to Technical Work

DataFunTalk

Dec 21, 2023 · Artificial Intelligence

Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning – Best Long Paper at EMNLP 2023

At EMNLP 2023, the joint WeChat AI and Peking University paper 'Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning' won the Best Long Paper award, revealing that label tokens act as anchors driving information aggregation in shallow layers and prediction flow in deep layers, and proposing methods to improve and diagnose in‑context learning.

AI researchIn-Context LearningInformation Flow

0 likes · 13 min read

Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning – Best Long Paper at EMNLP 2023

Baobao Algorithm Notes

Nov 9, 2023 · Artificial Intelligence

Building High‑Performance Vertical Domain LLMs: From Continued Pre‑Training to Retrieval‑Augmented Generation

This article systematically explains how to create vertical domain large language models by continuing pre‑training on domain data, constructing fine‑tuning datasets with self‑instruct, reducing hallucinations, and integrating knowledge retrieval, while also reviewing related papers, products, and system architectures.

AI researchKnowledge Retrievalself-instruct

0 likes · 21 min read

Building High‑Performance Vertical Domain LLMs: From Continued Pre‑Training to Retrieval‑Augmented Generation

DataFunSummit

Oct 17, 2023 · Artificial Intelligence

Enhancing Vision and Language Models with External Knowledge Graphs and Tool Integration

This article reviews recent research on augmenting language and vision models by incorporating external knowledge sources such as knowledge graphs, multi‑source retrieval, and dynamic tool‑calling frameworks, presenting three systems—OREO‑LM, REVEAL, and AVIS—and their experimental results.

AI researchLanguage ModelTool integration

0 likes · 27 min read

Enhancing Vision and Language Models with External Knowledge Graphs and Tool Integration

Ximalaya Technology Team

Oct 10, 2023 · Artificial Intelligence

MiniGPT-5: A Novel Multimodal Generation Model for Coherent Text-Image Synthesis

MiniGPT-5 is a novel multimodal generation model using generative vokens to interleave text and image synthesis, integrating Stable Diffusion and LLMs with a two-stage training that requires no domain-specific annotations, achieving state‑of‑the‑art coherence and quality on benchmarks like CC3M, VIST, and MMDialog.

AI researchStable DiffusionVision Transformer

0 likes · 9 min read

MiniGPT-5: A Novel Multimodal Generation Model for Coherent Text-Image Synthesis

Baobao Algorithm Notes

Oct 8, 2023 · Interview Experience

Must‑Know Large‑Model Interview Questions for RLHF Candidates

The article shares a practitioner’s transition story from reinforcement‑learning‑focused game AI to large‑model work, outlines the challenges faced during job hunting at major Chinese tech firms, and provides a curated list of 23 technical interview questions covering PPO, RLHF, dataset evaluation, model fine‑tuning, and broader LLM concepts.

AI researchInterview PreparationLLM

0 likes · 10 min read

Must‑Know Large‑Model Interview Questions for RLHF Candidates

DataFunTalk

Sep 26, 2023 · Artificial Intelligence

MiniGPT-4: Enhancing Vision‑Language Understanding with Large Language Models

This article presents MiniGPT-4, a multimodal system that combines a frozen visual encoder (Q‑Former + ViT) with an open‑source large language model (Vicuna), describes its motivation, training pipeline, demo capabilities, observed limitations, and includes a brief Q&A session.

AI researchImage CaptioningMiniGPT-4

0 likes · 15 min read

MiniGPT-4: Enhancing Vision‑Language Understanding with Large Language Models

AntTech

Sep 15, 2023 · Artificial Intelligence

Ant Group Unveils Large Graph Model (LGM) Merging Graph Computing with Large Language Models

At the 2023 Bund Conference, Ant Group presented the Large Graph Model (LGM), a research effort that combines graph computing, graph learning, and large language models to enrich heterogeneous graph data and enable more precise insights for complex digital applications, with results accepted at WWW 2023.

AI researchAnt GroupLarge Graph Model

0 likes · 6 min read

Ant Group Unveils Large Graph Model (LGM) Merging Graph Computing with Large Language Models

Baobao Algorithm Notes

Aug 18, 2023 · Artificial Intelligence

Unlocking Domain-Specific Large Model Training: Proven Tricks and Pitfalls

This article shares practical techniques for domain‑specific large model continue pre‑training, including data selection, mixing ratios with general data, multi‑task instruction pre‑training, resource‑aware fine‑tuning strategies, evaluation set design, vocabulary considerations, and deployment constraints for 7‑13B models.

AI researchModel EvaluationSFT

0 likes · 9 min read

Unlocking Domain-Specific Large Model Training: Proven Tricks and Pitfalls

21CTO

Aug 15, 2023 · Artificial Intelligence

Why Do Neural Networks Suddenly ‘Grok’ After Long Training? Insights from Google

Google’s recent research reveals that when small neural networks are trained for extended periods on tasks like modular addition, they can abruptly shift from memorizing training data to genuinely generalizing—a sudden “grokking” phenomenon driven by weight decay and the emergence of periodic weight structures.

AI researchGeneralizationMLP

0 likes · 9 min read

Why Do Neural Networks Suddenly ‘Grok’ After Long Training? Insights from Google

Rare Earth Juejin Tech Community

Aug 1, 2023 · Artificial Intelligence

Do Language Models Learn Language in the Same Stages as Children? An Analysis of GPT‑2 Developmental Trajectories

This article reviews a study that compares the stage‑wise language acquisition of infants with the learning trajectory of GPT‑2, using linguistic probes and statistical tests to determine whether deep language models follow sequential or parallel learning patterns similar to children.

AI researchGPT-2developmental learning

0 likes · 17 min read

Do Language Models Learn Language in the Same Stages as Children? An Analysis of GPT‑2 Developmental Trajectories

Baidu Geek Talk

Jul 26, 2023 · Artificial Intelligence

Insights on AIGC Development and Commercial Applications by Baidu's Chief Architect

Baidu’s chief architect Li Shuanglong outlined how AIGC, driven by advanced large‑language and multimodal models, is already powering commercial tools such as automated copywriting, 2D digital‑human video creation and lead‑generation chatbots, while emphasizing future progress in engineering scalability, algorithmic fidelity, data quality, and scenario‑focused applications.

AI commercializationAI researchAIGC

0 likes · 8 min read

Insights on AIGC Development and Commercial Applications by Baidu's Chief Architect

DataFunSummit

Jun 28, 2023 · Artificial Intelligence

OPPO's CHAOS Pretrained Large Model and GammaE Knowledge‑Graph Multi‑hop Reasoning: Techniques and Insights

This article presents OPPO Research Institute's recent advances in large‑model AI, detailing the CHAOS pretrained model that topped the CLUE leaderboard, the knowledge‑enhanced training pipeline, and the GammaE model for multi‑hop reasoning over knowledge graphs, together with experimental results and practical training tips.

AI researchGammaEKnowledge Graph

0 likes · 20 min read

OPPO's CHAOS Pretrained Large Model and GammaE Knowledge‑Graph Multi‑hop Reasoning: Techniques and Insights

DataFunTalk

Jun 9, 2023 · Artificial Intelligence

Expert Roundtable on Causal Inference and Large Language Models: Opportunities and Challenges

Leading experts discuss how causal inference intersects with large language models, exploring opportunities, challenges, industry applications, and future research directions, while sharing personal journeys into causal reasoning and offering practical advice for practitioners.

AI researchcausal inferenceexpert interview

0 likes · 16 min read

Expert Roundtable on Causal Inference and Large Language Models: Opportunities and Challenges

DataFunSummit

May 31, 2023 · Artificial Intelligence

Evolution of Face Detection Techniques: Datasets, Research Directions, and Future Work

This article reviews the evolution of face detection, covering the Widely‑Face dataset, major research directions such as feature fusion, label assignment, auxiliary supervision, anchor‑free methods, NAS‑based designs, summarizes key papers from S3FD to MogFace, introduces ModelScope implementations, and outlines future challenges and opportunities.

AI researchComputer VisionDatasets

0 likes · 13 min read

Evolution of Face Detection Techniques: Datasets, Research Directions, and Future Work

Kuaishou Tech

Apr 28, 2023 · Artificial Intelligence

How Hyper‑Actor Critic Redefines Reinforcement Learning for Recommendation Systems

This article presents the Hyper‑Actor Critic (HAC) framework that splits reinforcement‑learning policies into continuous hyper‑actions and effective recommendation lists, introduces alignment and supervised losses, and demonstrates superior performance on an online simulator compared to existing RL and supervised methods.

AI researchRecommendation Systemshyper-actor critic

0 likes · 9 min read

How Hyper‑Actor Critic Redefines Reinforcement Learning for Recommendation Systems

Architect

Apr 27, 2023 · Artificial Intelligence

Survey of Large Language Model Research: From GPT‑1 to ChatGPT and Open‑Source Alternatives

This article provides a comprehensive overview of the development of large language models, reviewing classic papers from GPT‑1 through GPT‑4, discussing open‑source implementations such as LLaMA, Alpaca, GLM, and ChatGLM, and analyzing training methods, datasets, and future research directions.

AI researchGPTlarge language models

0 likes · 36 min read

Survey of Large Language Model Research: From GPT‑1 to ChatGPT and Open‑Source Alternatives

21CTO

Apr 21, 2023 · Artificial Intelligence

Essential AI Reading List: LLMs, AutoGPT, Distributed Training & More

This curated collection highlights the latest open‑source LLM breakthroughs, comprehensive surveys, AutoGPT developments, distributed training pitfalls, and practical tools for AI engineers, providing concise descriptions and direct links to each resource for deeper exploration.

AI researchAutoGPTDistributed Training

0 likes · 10 min read

Essential AI Reading List: LLMs, AutoGPT, Distributed Training & More

IT Architects Alliance

Apr 20, 2023 · Artificial Intelligence

Overview of Prominent Large Language Models and Instruction‑Finetuned Variants

This article provides a comprehensive overview of major large language models—including GPT series, T5, LaMDA, LLaMA, BLOOM, and others—detailing their architectures, parameter scales, open‑source status, and the evolution of instruction‑fine‑tuning techniques that improve zero‑shot and few‑shot performance.

AI researchInstruction TuningLLM comparison

0 likes · 24 min read

Overview of Prominent Large Language Models and Instruction‑Finetuned Variants

Architect

Apr 19, 2023 · Artificial Intelligence

Emergence in Large Language Models: Phenomena, Explanations, and Implications

This article reviews the emergence phenomena observed in large language models, explains how model scale, in‑context learning and chain‑of‑thought prompting contribute to sudden performance gains, discusses small‑model alternatives, and explores the relationship between emergence and the training‑time Grokking effect.

AI researchEmergenceIn-Context Learning

0 likes · 13 min read

Emergence in Large Language Models: Phenomena, Explanations, and Implications

DataFunTalk

Apr 19, 2023 · Artificial Intelligence

Is the Daily Emergence of Large Language Models Beneficial?

The article examines the rapid proliferation of large language models, weighing both the opportunities for experimentation and the drawbacks of noise, and argues that establishing authoritative Chinese LLM evaluation benchmarks is essential to guide meaningful progress in the field.

AI researchLLM evaluationlarge language models

0 likes · 7 min read

Is the Daily Emergence of Large Language Models Beneficial?

21CTO

Apr 9, 2023 · Artificial Intelligence

8 Open-Source ChatGPT Alternatives You Can Deploy Today

This article surveys eight popular open‑source ChatGPT alternatives, detailing each model’s size, training data, performance relative to proprietary systems, and providing links to code repositories, demos, and papers for developers interested in building or researching large language models.

AI researchChatGPT alternativesmodel comparison

0 likes · 8 min read

8 Open-Source ChatGPT Alternatives You Can Deploy Today

DataFunTalk

Apr 6, 2023 · Artificial Intelligence

A Comprehensive Survey of Large Language Models: Background, Capabilities, Key Technologies, and Future Directions

This article reviews the rapid progress of large language models (LLMs), covering their historical development, scaling laws, emergent abilities, core technologies such as training and alignment, resource ecosystems, evaluation methods, safety concerns, and prospective research challenges.

AI researchAlignmentLLM

0 likes · 21 min read

A Comprehensive Survey of Large Language Models: Background, Capabilities, Key Technologies, and Future Directions

21CTO

Mar 31, 2023 · Artificial Intelligence

From Student to AI Pioneer: Ilya Sutskever’s Journey Behind ChatGPT

This article chronicles Ilya Sutskever’s two‑decade rise from a young researcher to a leading figure in artificial intelligence, highlighting his early mentorship, breakthroughs in image recognition, language translation, the founding of OpenAI, and the development of GPT and DALL‑E models.

AI researchDeep LearningGPT

0 likes · 13 min read

From Student to AI Pioneer: Ilya Sutskever’s Journey Behind ChatGPT

DataFunSummit

Mar 21, 2023 · Artificial Intelligence

Interview with Huawei Noah's Ark Lab Senior Researcher Zhou Min on Graph Machine Learning: Research, Deployment, Challenges, and Trends

In this DataFun interview, Huawei Noah's Ark Lab senior researcher Zhou Min discusses the state of graph machine learning in academia and industry, covering algorithmic foundations, model variants, practical applications, scalability challenges, and future directions for more universal feature extraction across domains.

AI researchDataFunGraph Machine Learning

0 likes · 9 min read

Interview with Huawei Noah's Ark Lab Senior Researcher Zhou Min on Graph Machine Learning: Research, Deployment, Challenges, and Trends

DataFunTalk

Mar 16, 2023 · Artificial Intelligence

Technical Optimizations and Breakthroughs of GPT‑4: Multimodal Capabilities, Alignment Strategies, and Predictable Scaling

The article summarizes the technical innovations behind GPT‑4, highlighting its multimodal abilities, improved alignment methods, scaling‑law‑based performance prediction, and remaining limitations, while referencing the official OpenAI technical report and community analyses.

AI researchAlignmentGPT-4

0 likes · 10 min read

Technical Optimizations and Breakthroughs of GPT‑4: Multimodal Capabilities, Alignment Strategies, and Predictable Scaling

Python Programming Learning Circle

Mar 10, 2023 · Artificial Intelligence

Google's i‑S2R and GoalsEye: Robot Table‑Tennis Learning from Human Interaction

The article explains how Google's i‑S2R and GoalsEye projects use iterative simulation‑to‑real training, behavior cloning and goal‑conditioned learning to enable robots to play table‑tennis with humans, highlighting the challenges, experimental setup, and performance improvements achieved across player skill levels.

AI researchRoboticsbehavior cloning

0 likes · 6 min read

Google's i‑S2R and GoalsEye: Robot Table‑Tennis Learning from Human Interaction

21CTO

Feb 27, 2023 · Artificial Intelligence

What’s Next for Large Language Models? Emerging Trends Shaping AI

The article explores three emerging directions for next‑generation large language models—self‑generated training data, built‑in verification with external retrieval, and massive sparse‑expert architectures—highlighting recent research, practical challenges, and their potential to reshape AI development.

AI researchgenerative AIlarge language models

0 likes · 17 min read

What’s Next for Large Language Models? Emerging Trends Shaping AI

DataFunTalk

Feb 15, 2023 · Artificial Intelligence

Three Emerging Directions for Next‑Generation Large Language Models

The article outlines three promising research avenues—self‑generated training data, model‑driven fact‑checking, and sparse expert architectures—that could shape the next wave of large language model innovation and address current limitations such as data scarcity and hallucinations.

AI researchlarge language modelsmodel self‑improvement

0 likes · 14 min read

Three Emerging Directions for Next‑Generation Large Language Models

Architect

Feb 9, 2023 · Artificial Intelligence

Emergent Abilities of Large Language Models: Complex Reasoning, Knowledge Reasoning, and Out‑of‑Distribution Robustness

This article reviews recent research on the emergent abilities of large language models—such as chain‑of‑thought reasoning, knowledge retrieval without external sources, and robustness to distribution shifts—examining scaling laws, model size thresholds, and the open questions surrounding a potential paradigm shift from fine‑tuning to in‑context learning.

AI researchchain-of-thought promptingemergent abilities

0 likes · 23 min read

Emergent Abilities of Large Language Models: Complex Reasoning, Knowledge Reasoning, and Out‑of‑Distribution Robustness

DataFunSummit

Feb 7, 2023 · Artificial Intelligence

How to Evaluate OpenAI's Super Conversational Model ChatGPT?

This article compiles three highly upvoted Zhihu answers that examine OpenAI's ChatGPT, discussing its breakthrough impact on NLP, visual in‑context learning, reinforcement‑learning‑from‑human‑feedback, and the broader implications for AI research and development.

AI researchChatGPTIn-Context Learning

0 likes · 10 min read

How to Evaluate OpenAI's Super Conversational Model ChatGPT?

21CTO

Jan 13, 2023 · Artificial Intelligence

How Google’s Muse Is Redefining Text‑to‑Image Generation with Parallel Decoding

Google’s new Muse model, a Transformer‑based text‑to‑image system running on TPUv4, claims to generate 256×256 images in 0.5 seconds—far faster than Imagen—while delivering unprecedented photorealism and deep language understanding through parallel decoding and large‑scale LLM‑conditioned training.

AI researchGoogle MuseLLM conditioning

0 likes · 4 min read

How Google’s Muse Is Redefining Text‑to‑Image Generation with Parallel Decoding

DataFunTalk

Jan 10, 2023 · Artificial Intelligence

Paradigm Shifts in Large Language Model Research and Future Directions

The article reviews the evolution of large language models from the pre‑GPT‑3 era to the present, analyzes the conceptual and technical gaps between Chinese and global research, and outlines key future research directions such as scaling laws, prompting techniques, multimodal training, and efficient model architectures.

AI researchChatGPTIn-Context Learning

0 likes · 73 min read

Paradigm Shifts in Large Language Model Research and Future Directions

Xiaohongshu Tech REDtech

Jan 3, 2023 · Artificial Intelligence

Insights into ChatGPT: Capabilities, Limitations, and Implications for AI Research

During Xiaohongshu’s REDtech livestream, AI researchers examined ChatGPT’s rapid adoption, versatile task performance, and underlying large‑scale pre‑training with in‑context learning, while highlighting persistent hallucinations, weak reasoning, high costs, and limited search‑engine replacement potential, and emphasized the importance of RLHF‑driven human feedback for future multimodal AI research.

AI researchChatGPTRLHF

0 likes · 14 min read

Insights into ChatGPT: Capabilities, Limitations, and Implications for AI Research

Meituan Technology Team

Nov 17, 2022 · Artificial Intelligence

Overview of Recent Meituan Visual Intelligence Research Papers on Content Production, Distribution, and Model Quantization

Meituan’s Visual Intelligence team recently published eight top‑conference papers that advance weakly supervised segmentation, future‑aware captioning, panoptic narrative grounding, video‑text retrieval, open‑vocabulary detection, counterfactual image‑text matching, zero‑shot video classification, and efficient Vision‑Transformer quantization, all directly boosting real‑world content creation, distribution, and model efficiency.

AI researchImage CaptioningModel Quantization

0 likes · 19 min read

Overview of Recent Meituan Visual Intelligence Research Papers on Content Production, Distribution, and Model Quantization

AntTech

Sep 27, 2022 · Artificial Intelligence

Ant Group’s Research Institute Publishes Four NeurIPS 2022 Papers on Advanced Computer Vision and AI

Ant Group’s Ant Technology Research Institute had four papers from its Visual Intelligence Lab accepted at NeurIPS 2022, covering rank diminishing in deep networks, geometry‑aware 3D image synthesis, dynamic discriminators for GANs, and uncertainty‑aware hierarchical refinement for incremental classification, highlighting the institute’s cutting‑edge AI research.

AI researchComputer VisionDeep Learning

0 likes · 8 min read

Ant Group’s Research Institute Publishes Four NeurIPS 2022 Papers on Advanced Computer Vision and AI

JD Cloud Developers

Jan 4, 2022 · Artificial Intelligence

How JD’s Vega v1 Model Dominated GLUE Benchmark, Surpassing Human Performance

JD Explore’s Vega v1 model topped the GLUE benchmark with a 91.3 average score, outperforming Microsoft, Facebook, and Stanford across multiple NLP tasks, including first‑ever human‑level results on sentiment analysis and coreference, showcasing JD’s leading position in deep‑learning research.

AI researchDeep LearningGLUE benchmark

0 likes · 3 min read

How JD’s Vega v1 Model Dominated GLUE Benchmark, Surpassing Human Performance

DataFunTalk

Nov 1, 2021 · Artificial Intelligence

Reflections on Working as an Algorithm Engineer at Meituan and the Rise of Contrastive Learning

The author shares personal experiences as a Meituan algorithm engineer, emphasizing the critical role of labeled data, the emergence of contrastive (self‑supervised) learning across computer vision, NLP, and recommendation systems, and offers practical advice for algorithm engineers to stay competitive.

AI researchMeituanalgorithm engineering

0 likes · 8 min read

Reflections on Working as an Algorithm Engineer at Meituan and the Rise of Contrastive Learning

Kuaishou Tech

Oct 28, 2021 · Artificial Intelligence

Kuaishou Showcases Multimedia Research at ACM MM2021 and Announces Strategic Collaboration with CCF‑MM Committee

At ACM MM2021 in Chengdu, Kuaishou presented two accepted papers on recommendation and outfit compatibility, won the Grand Challenge with its DAP congestion‑control system, and forged a strategic partnership with the CCF‑MM committee to deepen multimedia research collaboration across academia and industry.

ACM MM2021AI researchKuaishou

0 likes · 8 min read

Kuaishou Showcases Multimedia Research at ACM MM2021 and Announces Strategic Collaboration with CCF‑MM Committee

Youku Technology

Sep 29, 2021 · Artificial Intelligence

Reducing the Covariate Shift by Mirror Samples in Cross Domain Alignment

By constructing virtual mirror samples that occupy identical positions across source and target domains, the authors eliminate covariate shift while preserving distribution structure, enabling superior unsupervised domain adaptation that achieves state‑of‑the‑art performance on Office and VisDA benchmarks and improves real‑world lighting and gender‑recognition tasks.

AI researchSOTAcovariate shift

0 likes · 3 min read

Reducing the Covariate Shift by Mirror Samples in Cross Domain Alignment

DataFunTalk

Jul 1, 2021 · Artificial Intelligence

Pre‑Trained Models: Past, Present, and Future – A Comprehensive Survey

This article surveys the evolution of pre‑trained models, covering the origins of transfer and self‑supervised learning, the rise of transformer‑based PTMs such as BERT and GPT, efficient architecture designs, multimodal and multilingual extensions, theoretical analyses, and future research directions for scalable and robust AI systems.

AI researchefficient traininglarge language models

0 likes · 27 min read

Pre‑Trained Models: Past, Present, and Future – A Comprehensive Survey

AntTech

Mar 3, 2021 · Artificial Intelligence

Ant Group Intelligent Service Research Overview: NLP, Dialogue, Recommendation, and Anti‑fraud Papers

The article presents a comprehensive overview of Ant Group's intelligent service research, summarizing recent AI‑focused papers on text classification, stance detection, data augmentation, knowledge distillation for ranking, reinforcement‑learning‑based dialogue clarification, behavior‑cloning dialogue systems, anti‑fraud outbound bots, tag‑based service recommendation, and multi‑agent service groups, while also highlighting future directions and recruitment opportunities.

AI researchAnti‑fraudDialogue Systems

0 likes · 17 min read

Ant Group Intelligent Service Research Overview: NLP, Dialogue, Recommendation, and Anti‑fraud Papers

JD Cloud Developers

Feb 10, 2021 · Artificial Intelligence

Three JD Tech AI Papers Shine at ICASSP 2021

At ICASSP 2021, JD Tech presented three AI research papers—introducing a Neural Kalman Filtering framework for speech enhancement, a cross‑utterance BERT‑based prosody modeling method for end‑to‑end speech synthesis, and a self‑supervised conversational query rewriting approach—each demonstrating superior performance over existing baselines on benchmark datasets.

AI researchICASSP 2021prosody modeling

0 likes · 9 min read

Three JD Tech AI Papers Shine at ICASSP 2021

JD Cloud Developers

Nov 2, 2020 · Artificial Intelligence

This Week’s Tech Highlights: AI Research Breakthroughs, 5G Surge, Multi‑Cloud DB & More

The newsletter recaps recent tech developments, including JD's four AI papers at Interspeech 2020, Shenzhen's supercomputing boost, T‑Mobile's mid‑band 5G expansion, Apple's upcoming A14T iMac processor, MongoDB Atlas multi‑cloud support, Wikimedia's migration to GitLab, and advances in graph neural network pre‑training and deep clustering.

5G expansionAI researchApple Silicon

0 likes · 9 min read

This Week’s Tech Highlights: AI Research Breakthroughs, 5G Surge, Multi‑Cloud DB & More

21CTO

Jan 31, 2020 · Artificial Intelligence

How Microsoft’s First Chinese AI Fellow Is Driving Speech and Language Breakthroughs

Microsoft appointed its first Chinese Global Technical Fellow, Huang Xuedong, as the company’s Global AI CTO, overseeing Azure’s speech, translation, vision, and language services, while highlighting his groundbreaking achievements such as achieving human‑level word error rates and leading AI research teams.

AI researchAzureMicrosoft

0 likes · 7 min read

How Microsoft’s First Chinese AI Fellow Is Driving Speech and Language Breakthroughs

Hulu Beijing

Apr 2, 2019 · Artificial Intelligence

From Object Detection to Language Models: A Deep Dive into AI Advances

This article surveys the evolution of object detection models—comparing one‑stage and two‑stage approaches, their performance trade‑offs, and recent state‑of‑the‑art methods—while also outlining key concepts and breakthroughs in natural language processing, highlighting the impact of deep‑learning models such as BERT.

AI researchBERTDeep Learning

0 likes · 14 min read

From Object Detection to Language Models: A Deep Dive into AI Advances

ITPUB

Feb 23, 2019 · Artificial Intelligence

Explore a 1.59 Million Image NSFW Dataset with 159 Fine-Grained Categories

A data scientist from Besedo has open‑sourced a massive NSFW image dataset containing 1.589 million pictures, organized into 159 primary categories and further sub‑categories, with download scripts and GitHub links, requiring about 500 GB of storage and cautioning against viewing in the office.

AI researchComputer VisionGitHub

0 likes · 3 min read

Explore a 1.59 Million Image NSFW Dataset with 159 Fine-Grained Categories

DataFunTalk

Jan 11, 2019 · Artificial Intelligence

Challenges in Natural Language Understanding and the Neural‑Symbolic Approach (Object‑Oriented Neural Programming)

The article examines why natural language understanding is intrinsically difficult, outlines four core linguistic challenges, proposes a neural‑symbolic integration framework with three design principles, introduces the Object‑Oriented Neural Programming (OONP) architecture, and showcases real‑world applications in public security, legal document analysis, and financial fraud detection.

AI researchKnowledge Graphneural-symbolic integration

0 likes · 16 min read

Challenges in Natural Language Understanding and the Neural‑Symbolic Approach (Object‑Oriented Neural Programming)

Alibaba Cloud Developer

Sep 20, 2018 · Artificial Intelligence

Inside Alibaba DAMO Academy’s AI Vision: New Labs, Chips, and Quantum Ambitions

At the 2018 Hangzhou Yunqi Conference, Alibaba's CTO Zhang Jianfeng outlined DAMO Academy’s expansive AI strategy, unveiling new research labs, a semiconductor venture, quantum computing initiatives, and a youth science award aimed at accelerating data, algorithm, and computing breakthroughs.

AI researchAlibabaDAMO Academy

0 likes · 8 min read

Inside Alibaba DAMO Academy’s AI Vision: New Labs, Chips, and Quantum Ambitions

Meitu Technology

Jun 25, 2018 · Artificial Intelligence

Meitu Short Video Real-Time Classification Challenge MTSVRC

The Meitu Short Video Real-Time Classification Challenge (MTSVRC), co‑hosted by the PRCV conference, Meitu and the Chinese Academy of Sciences, releases the industry‑largest dataset of over 100,000 five‑to‑15‑second videos in 50 categories, judging both classification accuracy and real‑time speed, and offers cash prizes up to ¥100,000 plus presentation opportunities for top teams.

AI researchMTSVRCPRCV 2018

0 likes · 5 min read

Meitu Short Video Real-Time Classification Challenge MTSVRC

Tencent Cloud Developer

Apr 16, 2018 · Artificial Intelligence

Google AI’s ‘Talk to Books’: Conversing with Literature via Semantic Search

Google Research unveiled “Talk to Books,” a semantic‑experience that lets users type statements or questions and receive full‑sentence matches from books, showcasing how AI trained on billions of conversational pairs can understand meaning beyond simple keyword matching.

AI researchGoogle AITalk to Books

0 likes · 4 min read

Google AI’s ‘Talk to Books’: Conversing with Literature via Semantic Search

AntTech

Jan 18, 2018 · Artificial Intelligence

cw2vec: Learning Chinese Word Embeddings with Stroke n-grams

The cw2vec paper, presented at AAAI 2018, introduces a Chinese word embedding method that leverages stroke n‑grams to capture character semantics, proposes a novel loss function, demonstrates consistent improvements over existing models across similarity, analogy, classification and NER tasks, and discusses real‑world AI applications.

AAAI 2018AI researchChinese NLP

0 likes · 7 min read

cw2vec: Learning Chinese Word Embeddings with Stroke n-grams

AntTech

Dec 22, 2017 · Artificial Intelligence

Transfer Learning: Concepts, Challenges, and Recent Research Highlights from CIKM 2017

This article reviews the key concepts, challenges, and recent research on transfer learning presented at CIKM 2017, covering instance, feature, parameter, and relation‑based methods, supervised and unsupervised deep TL approaches, and transitive transfer learning with associated loss formulations and optimization strategies.

AI researchDeep Learningmachine learning

0 likes · 9 min read

Transfer Learning: Concepts, Challenges, and Recent Research Highlights from CIKM 2017

Alibaba Cloud Developer

Jul 28, 2017 · Artificial Intelligence

Inside Alibaba AI Lab: Dr. Wang Gang on Multimodal AI and Edge Computing

In an exclusive interview, Alibaba AI Lab's distinguished scientist Dr. Wang Gang discusses the lab's research on multimodal AI, edge computing, AI hardware, bio‑inspired cognition, quantum‑deep‑learning integration, and the challenges of moving from recognition to true understanding, while also outlining Alibaba's AI talent recruitment plans.

AI researchAI talent recruitmentComputer Vision

0 likes · 25 min read

Inside Alibaba AI Lab: Dr. Wang Gang on Multimodal AI and Edge Computing

Qunar Tech Salon

Jul 3, 2017 · Artificial Intelligence

Interview with Dr. Lv Zhengdong on Neural‑Symbolic Systems and the Future of Natural Language Understanding

Dr. Lv Zhengdong discusses the challenges of true language understanding, the integration of symbolic reasoning with neural networks, recent advances in neural‑symbolic models, and the practical prospects of NLP in domains such as law and finance, emphasizing the need for hybrid approaches.

AI researchNLPinterview

0 likes · 16 min read

Interview with Dr. Lv Zhengdong on Neural‑Symbolic Systems and the Future of Natural Language Understanding

Alibaba Cloud Developer

May 26, 2017 · Artificial Intelligence

How StarCraft Became a Testbed for Advanced AI and Multi‑Agent Learning

This article explains why Alibaba's Cognitive Computing Lab uses StarCraft as a research platform, outlines the game's unique challenges for AI, and details their deep reinforcement learning and multi‑agent approaches, including the BiCNet architecture and experimental results.

AI researchBiCNetMulti-Agent

0 likes · 18 min read

How StarCraft Became a Testbed for Advanced AI and Multi‑Agent Learning

Alibaba Cloud Developer

Feb 24, 2017 · Artificial Intelligence

Unlocking StarCraft AI Research with the Open-Source Gym StarCraft Platform

StarCraft, a classic real‑time strategy game, has become a key testbed for deep reinforcement learning and AI research, and Alibaba's open‑source Gym StarCraft platform now offers Python‑based, TensorFlow‑compatible tools that simplify agent development and evaluation within the OpenAI Gym ecosystem.

AI researchDeep LearningStarCraft

0 likes · 3 min read

Unlocking StarCraft AI Research with the Open-Source Gym StarCraft Platform

Ctrip Technology

Aug 5, 2016 · Artificial Intelligence

Advances in Deep Learning for Speech and Semantic Understanding: Insights from Huawei Noah's Ark Lab

The article reviews a decade of deep‑learning breakthroughs, highlights Huawei Noah's recent research on speech, image and natural‑language processing, and discusses future trends such as neural‑symbolic integration, end‑to‑end learning, and knowledge‑driven AI systems.

AI researchHuaweinatural language processing

0 likes · 8 min read

Advances in Deep Learning for Speech and Semantic Understanding: Insights from Huawei Noah's Ark Lab

Art of Distributed System Architecture Design

Oct 8, 2015 · Artificial Intelligence

Facebook AI Research (FAIR): History, Teams, Projects, and Vision

The article chronicles Facebook's evolution from a social platform into a leading AI research hub, detailing the founding of FAIR, its key personnel, ambitious goals, major projects such as memory networks, embedding world, DeepFace, language technology, and the M assistant, and highlights the open, collaborative nature of its AI work.

AI researchDeep LearningFAIR

0 likes · 17 min read