Tagged articles
268 articles
Page 3 of 3
DaTaobao Tech
DaTaobao Tech
Mar 29, 2024 · Artificial Intelligence

Text-to-SQL with Large Language Models: DIN-SQL Approach

The DIN‑SQL approach enhances Text‑to‑SQL performance by using large language models in a decomposed in‑context learning framework with schema linking, query classification, SQL generation, and self‑correction modules, achieving state‑of‑the‑art 85.3% execution accuracy on the Spider benchmark by breaking complex queries into manageable sub‑tasks.

AI researchDatabase QueryingNLP
0 likes · 34 min read
Text-to-SQL with Large Language Models: DIN-SQL Approach
Open Source Tech Hub
Open Source Tech Hub
Mar 17, 2024 · Artificial Intelligence

What Is Grok? Inside Elon Musk’s New Open‑Source LLM and the ‘Grokking’ Phenomenon

Elon Musk announced the open‑source release of Grok, xAI’s new large‑language‑model chatbot, while recalling his lawsuit against OpenAI; the article explains Grok’s rapid development, links to the GitHub repository, summarizes the seminal “Grokking” research paper that describes a sudden generalization breakthrough in neural networks, and provides reference links.

AI researchgrokgrokking
0 likes · 3 min read
What Is Grok? Inside Elon Musk’s New Open‑Source LLM and the ‘Grokking’ Phenomenon
JD Retail Technology
JD Retail Technology
Mar 12, 2024 · Artificial Intelligence

Multimodal Large Models: Recent Advances, Industry Impact, and Challenges – An Expert Interview

In a detailed interview, Tsinghua researcher Zhao Sicheng and JD Retail senior director Peng Changping discuss the latest progress in multimodal large models, their practical applications in advertising and e‑commerce, persistent challenges such as hallucinations and data alignment, and the skills engineers need to thrive in the emerging AI era.

AI researchMultimodal AIe‑commerce
0 likes · 19 min read
Multimodal Large Models: Recent Advances, Industry Impact, and Challenges – An Expert Interview
Baobao Algorithm Notes
Baobao Algorithm Notes
Mar 10, 2024 · Artificial Intelligence

Unlocking Large Model Power: 5 Effective Model Fusion Techniques Explained

This article examines why ensemble methods are crucial for large language models, outlines five core fusion strategies—including model integration, probability integration, graft learning, crowdsourced voting, and Mixture of Experts—provides implementation details, pseudo‑code, and discusses practical challenges and recent research advances.

AI researchMixture of ExpertsModel Fusion
0 likes · 16 min read
Unlocking Large Model Power: 5 Effective Model Fusion Techniques Explained
DataFunTalk
DataFunTalk
Mar 10, 2024 · Artificial Intelligence

Aligning Graph Models with Large Language Models for Open-Task Scenarios

This talk presents GraphTranslator, a framework that bridges pretrained graph models and large language models to enable unified handling of both predefined and open-ended graph analysis tasks by translating node representations into language tokens and training an alignment producer for node‑text pairs.

AI researchModel Alignmentgraph neural networks
0 likes · 3 min read
Aligning Graph Models with Large Language Models for Open-Task Scenarios
Sohu Tech Products
Sohu Tech Products
Mar 6, 2024 · Artificial Intelligence

Analysis of OpenAI Sora: Data Engineering, Network Architecture, and World Model Implications

OpenAI’s Sora video model unifies image and video data into latent spacetime patches via a VAE, trains on original resolutions with GPT‑4‑expanded captions, employs a Diffusion Transformer backbone for patch‑wise denoising, and demonstrates 3D‑consistent, long‑term world‑model capabilities that hint at a unified computer‑vision paradigm and steps toward AGI.

AI researchOpenAI SoraTransformer
0 likes · 9 min read
Analysis of OpenAI Sora: Data Engineering, Network Architecture, and World Model Implications
NetEase Smart Enterprise Tech+
NetEase Smart Enterprise Tech+
Feb 28, 2024 · Artificial Intelligence

Mastering Multi-Task Learning: Network Designs & Loss Balancing

This article reviews the challenges of multi‑task learning, compares various network architectures such as hard‑parameter sharing, MMoE, CGC, and PLE, and examines loss‑balancing techniques like GradNorm, Dynamic Weight Average and task‑prioritization, offering insights on how to mitigate the “seesaw” effect and improve overall performance.

AI researchNeural Networksdynamic weighting
0 likes · 15 min read
Mastering Multi-Task Learning: Network Designs & Loss Balancing
21CTO
21CTO
Feb 17, 2024 · Artificial Intelligence

How OpenAI’s Sora Is Pushing Video Generation to New Frontiers

OpenAI’s Sora model demonstrates large‑scale text‑conditional video generation using a diffusion transformer that operates on spatiotemporal patches, supporting variable durations, resolutions, and aspect ratios while showcasing emergent simulation abilities, flexible sampling, and multimodal editing capabilities, though it still has notable limitations.

AI researchSoraTransformer
0 likes · 19 min read
How OpenAI’s Sora Is Pushing Video Generation to New Frontiers
NewBeeNLP
NewBeeNLP
Feb 12, 2024 · Artificial Intelligence

Beyond Dual‑Tower: Advanced Distillation and Interaction Techniques for Recommendation Systems

This article reviews recent advances that enhance dual‑tower recommendation models by injecting interaction information through various knowledge‑distillation strategies and interaction‑enhanced architectures, summarizing methods such as PFD, ENDX, TRMD, VIRT, Distilled‑DualEncoder, ERNIE‑Search, ColBert, IntTower and MVKE.

AI researchdual-towerinteraction modeling
0 likes · 13 min read
Beyond Dual‑Tower: Advanced Distillation and Interaction Techniques for Recommendation Systems
IT Services Circle
IT Services Circle
Jan 3, 2024 · Artificial Intelligence

Sergey Brin’s Role in Google’s Gemini AI Model and His Return to Technical Work

The article recounts Sergey Brin’s surprising appearance as a core contributor to Google’s Gemini AI model, tracing his early technical career, his semi‑retirement from Alphabet, the company’s AI challenges after ChatGPT, and how Brin returned to help develop Gemini, highlighting internal reactions and his lasting influence.

AI researchGeminiGoogle
0 likes · 9 min read
Sergey Brin’s Role in Google’s Gemini AI Model and His Return to Technical Work
DataFunTalk
DataFunTalk
Dec 21, 2023 · Artificial Intelligence

Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning – Best Long Paper at EMNLP 2023

At EMNLP 2023, the joint WeChat AI and Peking University paper 'Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning' won the Best Long Paper award, revealing that label tokens act as anchors driving information aggregation in shallow layers and prediction flow in deep layers, and proposing methods to improve and diagnose in‑context learning.

AI researchIn-Context LearningInformation Flow
0 likes · 13 min read
Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning – Best Long Paper at EMNLP 2023
Baobao Algorithm Notes
Baobao Algorithm Notes
Nov 9, 2023 · Artificial Intelligence

Building High‑Performance Vertical Domain LLMs: From Continued Pre‑Training to Retrieval‑Augmented Generation

This article systematically explains how to create vertical domain large language models by continuing pre‑training on domain data, constructing fine‑tuning datasets with self‑instruct, reducing hallucinations, and integrating knowledge retrieval, while also reviewing related papers, products, and system architectures.

AI researchKnowledge Retrievalself-instruct
0 likes · 21 min read
Building High‑Performance Vertical Domain LLMs: From Continued Pre‑Training to Retrieval‑Augmented Generation
DataFunSummit
DataFunSummit
Oct 17, 2023 · Artificial Intelligence

Enhancing Vision and Language Models with External Knowledge Graphs and Tool Integration

This article reviews recent research on augmenting language and vision models by incorporating external knowledge sources such as knowledge graphs, multi‑source retrieval, and dynamic tool‑calling frameworks, presenting three systems—OREO‑LM, REVEAL, and AVIS—and their experimental results.

AI researchLanguage ModelTool integration
0 likes · 27 min read
Enhancing Vision and Language Models with External Knowledge Graphs and Tool Integration
Ximalaya Technology Team
Ximalaya Technology Team
Oct 10, 2023 · Artificial Intelligence

MiniGPT-5: A Novel Multimodal Generation Model for Coherent Text-Image Synthesis

MiniGPT-5 is a novel multimodal generation model using generative vokens to interleave text and image synthesis, integrating Stable Diffusion and LLMs with a two-stage training that requires no domain-specific annotations, achieving state‑of‑the‑art coherence and quality on benchmarks like CC3M, VIST, and MMDialog.

AI researchStable DiffusionVision Transformer
0 likes · 9 min read
MiniGPT-5: A Novel Multimodal Generation Model for Coherent Text-Image Synthesis
Baobao Algorithm Notes
Baobao Algorithm Notes
Oct 8, 2023 · Interview Experience

Must‑Know Large‑Model Interview Questions for RLHF Candidates

The article shares a practitioner’s transition story from reinforcement‑learning‑focused game AI to large‑model work, outlines the challenges faced during job hunting at major Chinese tech firms, and provides a curated list of 23 technical interview questions covering PPO, RLHF, dataset evaluation, model fine‑tuning, and broader LLM concepts.

AI researchInterview PreparationLLM
0 likes · 10 min read
Must‑Know Large‑Model Interview Questions for RLHF Candidates
DataFunTalk
DataFunTalk
Sep 26, 2023 · Artificial Intelligence

MiniGPT-4: Enhancing Vision‑Language Understanding with Large Language Models

This article presents MiniGPT-4, a multimodal system that combines a frozen visual encoder (Q‑Former + ViT) with an open‑source large language model (Vicuna), describes its motivation, training pipeline, demo capabilities, observed limitations, and includes a brief Q&A session.

AI researchImage CaptioningMiniGPT-4
0 likes · 15 min read
MiniGPT-4: Enhancing Vision‑Language Understanding with Large Language Models
AntTech
AntTech
Sep 15, 2023 · Artificial Intelligence

Ant Group Unveils Large Graph Model (LGM) Merging Graph Computing with Large Language Models

At the 2023 Bund Conference, Ant Group presented the Large Graph Model (LGM), a research effort that combines graph computing, graph learning, and large language models to enrich heterogeneous graph data and enable more precise insights for complex digital applications, with results accepted at WWW 2023.

AI researchAnt GroupLarge Graph Model
0 likes · 6 min read
Ant Group Unveils Large Graph Model (LGM) Merging Graph Computing with Large Language Models
Baobao Algorithm Notes
Baobao Algorithm Notes
Aug 18, 2023 · Artificial Intelligence

Unlocking Domain-Specific Large Model Training: Proven Tricks and Pitfalls

This article shares practical techniques for domain‑specific large model continue pre‑training, including data selection, mixing ratios with general data, multi‑task instruction pre‑training, resource‑aware fine‑tuning strategies, evaluation set design, vocabulary considerations, and deployment constraints for 7‑13B models.

AI researchModel EvaluationSFT
0 likes · 9 min read
Unlocking Domain-Specific Large Model Training: Proven Tricks and Pitfalls
21CTO
21CTO
Aug 15, 2023 · Artificial Intelligence

Why Do Neural Networks Suddenly ‘Grok’ After Long Training? Insights from Google

Google’s recent research reveals that when small neural networks are trained for extended periods on tasks like modular addition, they can abruptly shift from memorizing training data to genuinely generalizing—a sudden “grokking” phenomenon driven by weight decay and the emergence of periodic weight structures.

AI researchGeneralizationMLP
0 likes · 9 min read
Why Do Neural Networks Suddenly ‘Grok’ After Long Training? Insights from Google
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Aug 1, 2023 · Artificial Intelligence

Do Language Models Learn Language in the Same Stages as Children? An Analysis of GPT‑2 Developmental Trajectories

This article reviews a study that compares the stage‑wise language acquisition of infants with the learning trajectory of GPT‑2, using linguistic probes and statistical tests to determine whether deep language models follow sequential or parallel learning patterns similar to children.

AI researchGPT-2developmental learning
0 likes · 17 min read
Do Language Models Learn Language in the Same Stages as Children? An Analysis of GPT‑2 Developmental Trajectories
Baidu Geek Talk
Baidu Geek Talk
Jul 26, 2023 · Artificial Intelligence

Insights on AIGC Development and Commercial Applications by Baidu's Chief Architect

Baidu’s chief architect Li Shuanglong outlined how AIGC, driven by advanced large‑language and multimodal models, is already powering commercial tools such as automated copywriting, 2D digital‑human video creation and lead‑generation chatbots, while emphasizing future progress in engineering scalability, algorithmic fidelity, data quality, and scenario‑focused applications.

AI commercializationAI researchAIGC
0 likes · 8 min read
Insights on AIGC Development and Commercial Applications by Baidu's Chief Architect
DataFunSummit
DataFunSummit
Jun 28, 2023 · Artificial Intelligence

OPPO's CHAOS Pretrained Large Model and GammaE Knowledge‑Graph Multi‑hop Reasoning: Techniques and Insights

This article presents OPPO Research Institute's recent advances in large‑model AI, detailing the CHAOS pretrained model that topped the CLUE leaderboard, the knowledge‑enhanced training pipeline, and the GammaE model for multi‑hop reasoning over knowledge graphs, together with experimental results and practical training tips.

AI researchGammaEKnowledge Graph
0 likes · 20 min read
OPPO's CHAOS Pretrained Large Model and GammaE Knowledge‑Graph Multi‑hop Reasoning: Techniques and Insights
DataFunSummit
DataFunSummit
May 31, 2023 · Artificial Intelligence

Evolution of Face Detection Techniques: Datasets, Research Directions, and Future Work

This article reviews the evolution of face detection, covering the Widely‑Face dataset, major research directions such as feature fusion, label assignment, auxiliary supervision, anchor‑free methods, NAS‑based designs, summarizes key papers from S3FD to MogFace, introduces ModelScope implementations, and outlines future challenges and opportunities.

AI researchComputer VisionDatasets
0 likes · 13 min read
Evolution of Face Detection Techniques: Datasets, Research Directions, and Future Work
Kuaishou Tech
Kuaishou Tech
Apr 28, 2023 · Artificial Intelligence

How Hyper‑Actor Critic Redefines Reinforcement Learning for Recommendation Systems

This article presents the Hyper‑Actor Critic (HAC) framework that splits reinforcement‑learning policies into continuous hyper‑actions and effective recommendation lists, introduces alignment and supervised losses, and demonstrates superior performance on an online simulator compared to existing RL and supervised methods.

AI researchRecommendation Systemshyper-actor critic
0 likes · 9 min read
How Hyper‑Actor Critic Redefines Reinforcement Learning for Recommendation Systems
Architect
Architect
Apr 27, 2023 · Artificial Intelligence

Survey of Large Language Model Research: From GPT‑1 to ChatGPT and Open‑Source Alternatives

This article provides a comprehensive overview of the development of large language models, reviewing classic papers from GPT‑1 through GPT‑4, discussing open‑source implementations such as LLaMA, Alpaca, GLM, and ChatGLM, and analyzing training methods, datasets, and future research directions.

AI researchGPTlarge language models
0 likes · 36 min read
Survey of Large Language Model Research: From GPT‑1 to ChatGPT and Open‑Source Alternatives
21CTO
21CTO
Apr 21, 2023 · Artificial Intelligence

Essential AI Reading List: LLMs, AutoGPT, Distributed Training & More

This curated collection highlights the latest open‑source LLM breakthroughs, comprehensive surveys, AutoGPT developments, distributed training pitfalls, and practical tools for AI engineers, providing concise descriptions and direct links to each resource for deeper exploration.

AI researchAutoGPTDistributed Training
0 likes · 10 min read
Essential AI Reading List: LLMs, AutoGPT, Distributed Training & More
IT Architects Alliance
IT Architects Alliance
Apr 20, 2023 · Artificial Intelligence

Overview of Prominent Large Language Models and Instruction‑Finetuned Variants

This article provides a comprehensive overview of major large language models—including GPT series, T5, LaMDA, LLaMA, BLOOM, and others—detailing their architectures, parameter scales, open‑source status, and the evolution of instruction‑fine‑tuning techniques that improve zero‑shot and few‑shot performance.

AI researchInstruction TuningLLM comparison
0 likes · 24 min read
Overview of Prominent Large Language Models and Instruction‑Finetuned Variants
Architect
Architect
Apr 19, 2023 · Artificial Intelligence

Emergence in Large Language Models: Phenomena, Explanations, and Implications

This article reviews the emergence phenomena observed in large language models, explains how model scale, in‑context learning and chain‑of‑thought prompting contribute to sudden performance gains, discusses small‑model alternatives, and explores the relationship between emergence and the training‑time Grokking effect.

AI researchEmergenceIn-Context Learning
0 likes · 13 min read
Emergence in Large Language Models: Phenomena, Explanations, and Implications
DataFunTalk
DataFunTalk
Apr 19, 2023 · Artificial Intelligence

Is the Daily Emergence of Large Language Models Beneficial?

The article examines the rapid proliferation of large language models, weighing both the opportunities for experimentation and the drawbacks of noise, and argues that establishing authoritative Chinese LLM evaluation benchmarks is essential to guide meaningful progress in the field.

AI researchLLM evaluationlarge language models
0 likes · 7 min read
Is the Daily Emergence of Large Language Models Beneficial?
21CTO
21CTO
Apr 9, 2023 · Artificial Intelligence

8 Open-Source ChatGPT Alternatives You Can Deploy Today

This article surveys eight popular open‑source ChatGPT alternatives, detailing each model’s size, training data, performance relative to proprietary systems, and providing links to code repositories, demos, and papers for developers interested in building or researching large language models.

AI researchChatGPT alternativesmodel comparison
0 likes · 8 min read
8 Open-Source ChatGPT Alternatives You Can Deploy Today
21CTO
21CTO
Mar 31, 2023 · Artificial Intelligence

From Student to AI Pioneer: Ilya Sutskever’s Journey Behind ChatGPT

This article chronicles Ilya Sutskever’s two‑decade rise from a young researcher to a leading figure in artificial intelligence, highlighting his early mentorship, breakthroughs in image recognition, language translation, the founding of OpenAI, and the development of GPT and DALL‑E models.

AI researchDeep LearningGPT
0 likes · 13 min read
From Student to AI Pioneer: Ilya Sutskever’s Journey Behind ChatGPT
DataFunSummit
DataFunSummit
Mar 21, 2023 · Artificial Intelligence

Interview with Huawei Noah's Ark Lab Senior Researcher Zhou Min on Graph Machine Learning: Research, Deployment, Challenges, and Trends

In this DataFun interview, Huawei Noah's Ark Lab senior researcher Zhou Min discusses the state of graph machine learning in academia and industry, covering algorithmic foundations, model variants, practical applications, scalability challenges, and future directions for more universal feature extraction across domains.

AI researchDataFunGraph Machine Learning
0 likes · 9 min read
Interview with Huawei Noah's Ark Lab Senior Researcher Zhou Min on Graph Machine Learning: Research, Deployment, Challenges, and Trends
DataFunTalk
DataFunTalk
Mar 16, 2023 · Artificial Intelligence

Technical Optimizations and Breakthroughs of GPT‑4: Multimodal Capabilities, Alignment Strategies, and Predictable Scaling

The article summarizes the technical innovations behind GPT‑4, highlighting its multimodal abilities, improved alignment methods, scaling‑law‑based performance prediction, and remaining limitations, while referencing the official OpenAI technical report and community analyses.

AI researchAlignmentGPT-4
0 likes · 10 min read
Technical Optimizations and Breakthroughs of GPT‑4: Multimodal Capabilities, Alignment Strategies, and Predictable Scaling
Python Programming Learning Circle
Python Programming Learning Circle
Mar 10, 2023 · Artificial Intelligence

Google's i‑S2R and GoalsEye: Robot Table‑Tennis Learning from Human Interaction

The article explains how Google's i‑S2R and GoalsEye projects use iterative simulation‑to‑real training, behavior cloning and goal‑conditioned learning to enable robots to play table‑tennis with humans, highlighting the challenges, experimental setup, and performance improvements achieved across player skill levels.

AI researchRoboticsbehavior cloning
0 likes · 6 min read
Google's i‑S2R and GoalsEye: Robot Table‑Tennis Learning from Human Interaction
21CTO
21CTO
Feb 27, 2023 · Artificial Intelligence

What’s Next for Large Language Models? Emerging Trends Shaping AI

The article explores three emerging directions for next‑generation large language models—self‑generated training data, built‑in verification with external retrieval, and massive sparse‑expert architectures—highlighting recent research, practical challenges, and their potential to reshape AI development.

AI researchgenerative AIlarge language models
0 likes · 17 min read
What’s Next for Large Language Models? Emerging Trends Shaping AI
DataFunTalk
DataFunTalk
Feb 15, 2023 · Artificial Intelligence

Three Emerging Directions for Next‑Generation Large Language Models

The article outlines three promising research avenues—self‑generated training data, model‑driven fact‑checking, and sparse expert architectures—that could shape the next wave of large language model innovation and address current limitations such as data scarcity and hallucinations.

AI researchlarge language modelsmodel self‑improvement
0 likes · 14 min read
Three Emerging Directions for Next‑Generation Large Language Models
Architect
Architect
Feb 9, 2023 · Artificial Intelligence

Emergent Abilities of Large Language Models: Complex Reasoning, Knowledge Reasoning, and Out‑of‑Distribution Robustness

This article reviews recent research on the emergent abilities of large language models—such as chain‑of‑thought reasoning, knowledge retrieval without external sources, and robustness to distribution shifts—examining scaling laws, model size thresholds, and the open questions surrounding a potential paradigm shift from fine‑tuning to in‑context learning.

AI researchchain-of-thought promptingemergent abilities
0 likes · 23 min read
Emergent Abilities of Large Language Models: Complex Reasoning, Knowledge Reasoning, and Out‑of‑Distribution Robustness
DataFunSummit
DataFunSummit
Feb 7, 2023 · Artificial Intelligence

How to Evaluate OpenAI's Super Conversational Model ChatGPT?

This article compiles three highly upvoted Zhihu answers that examine OpenAI's ChatGPT, discussing its breakthrough impact on NLP, visual in‑context learning, reinforcement‑learning‑from‑human‑feedback, and the broader implications for AI research and development.

AI researchChatGPTIn-Context Learning
0 likes · 10 min read
How to Evaluate OpenAI's Super Conversational Model ChatGPT?
21CTO
21CTO
Jan 13, 2023 · Artificial Intelligence

How Google’s Muse Is Redefining Text‑to‑Image Generation with Parallel Decoding

Google’s new Muse model, a Transformer‑based text‑to‑image system running on TPUv4, claims to generate 256×256 images in 0.5 seconds—far faster than Imagen—while delivering unprecedented photorealism and deep language understanding through parallel decoding and large‑scale LLM‑conditioned training.

AI researchGoogle MuseLLM conditioning
0 likes · 4 min read
How Google’s Muse Is Redefining Text‑to‑Image Generation with Parallel Decoding
DataFunTalk
DataFunTalk
Jan 10, 2023 · Artificial Intelligence

Paradigm Shifts in Large Language Model Research and Future Directions

The article reviews the evolution of large language models from the pre‑GPT‑3 era to the present, analyzes the conceptual and technical gaps between Chinese and global research, and outlines key future research directions such as scaling laws, prompting techniques, multimodal training, and efficient model architectures.

AI researchChatGPTIn-Context Learning
0 likes · 73 min read
Paradigm Shifts in Large Language Model Research and Future Directions
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Jan 3, 2023 · Artificial Intelligence

Insights into ChatGPT: Capabilities, Limitations, and Implications for AI Research

During Xiaohongshu’s REDtech livestream, AI researchers examined ChatGPT’s rapid adoption, versatile task performance, and underlying large‑scale pre‑training with in‑context learning, while highlighting persistent hallucinations, weak reasoning, high costs, and limited search‑engine replacement potential, and emphasized the importance of RLHF‑driven human feedback for future multimodal AI research.

AI researchChatGPTRLHF
0 likes · 14 min read
Insights into ChatGPT: Capabilities, Limitations, and Implications for AI Research
Meituan Technology Team
Meituan Technology Team
Nov 17, 2022 · Artificial Intelligence

Overview of Recent Meituan Visual Intelligence Research Papers on Content Production, Distribution, and Model Quantization

Meituan’s Visual Intelligence team recently published eight top‑conference papers that advance weakly supervised segmentation, future‑aware captioning, panoptic narrative grounding, video‑text retrieval, open‑vocabulary detection, counterfactual image‑text matching, zero‑shot video classification, and efficient Vision‑Transformer quantization, all directly boosting real‑world content creation, distribution, and model efficiency.

AI researchImage CaptioningModel Quantization
0 likes · 19 min read
Overview of Recent Meituan Visual Intelligence Research Papers on Content Production, Distribution, and Model Quantization
AntTech
AntTech
Sep 27, 2022 · Artificial Intelligence

Ant Group’s Research Institute Publishes Four NeurIPS 2022 Papers on Advanced Computer Vision and AI

Ant Group’s Ant Technology Research Institute had four papers from its Visual Intelligence Lab accepted at NeurIPS 2022, covering rank diminishing in deep networks, geometry‑aware 3D image synthesis, dynamic discriminators for GANs, and uncertainty‑aware hierarchical refinement for incremental classification, highlighting the institute’s cutting‑edge AI research.

AI researchComputer VisionDeep Learning
0 likes · 8 min read
Ant Group’s Research Institute Publishes Four NeurIPS 2022 Papers on Advanced Computer Vision and AI
JD Cloud Developers
JD Cloud Developers
Jan 4, 2022 · Artificial Intelligence

How JD’s Vega v1 Model Dominated GLUE Benchmark, Surpassing Human Performance

JD Explore’s Vega v1 model topped the GLUE benchmark with a 91.3 average score, outperforming Microsoft, Facebook, and Stanford across multiple NLP tasks, including first‑ever human‑level results on sentiment analysis and coreference, showcasing JD’s leading position in deep‑learning research.

AI researchDeep LearningGLUE benchmark
0 likes · 3 min read
How JD’s Vega v1 Model Dominated GLUE Benchmark, Surpassing Human Performance
DataFunTalk
DataFunTalk
Nov 1, 2021 · Artificial Intelligence

Reflections on Working as an Algorithm Engineer at Meituan and the Rise of Contrastive Learning

The author shares personal experiences as a Meituan algorithm engineer, emphasizing the critical role of labeled data, the emergence of contrastive (self‑supervised) learning across computer vision, NLP, and recommendation systems, and offers practical advice for algorithm engineers to stay competitive.

AI researchMeituanalgorithm engineering
0 likes · 8 min read
Reflections on Working as an Algorithm Engineer at Meituan and the Rise of Contrastive Learning
Kuaishou Tech
Kuaishou Tech
Oct 28, 2021 · Artificial Intelligence

Kuaishou Showcases Multimedia Research at ACM MM2021 and Announces Strategic Collaboration with CCF‑MM Committee

At ACM MM2021 in Chengdu, Kuaishou presented two accepted papers on recommendation and outfit compatibility, won the Grand Challenge with its DAP congestion‑control system, and forged a strategic partnership with the CCF‑MM committee to deepen multimedia research collaboration across academia and industry.

ACM MM2021AI researchKuaishou
0 likes · 8 min read
Kuaishou Showcases Multimedia Research at ACM MM2021 and Announces Strategic Collaboration with CCF‑MM Committee
Youku Technology
Youku Technology
Sep 29, 2021 · Artificial Intelligence

Reducing the Covariate Shift by Mirror Samples in Cross Domain Alignment

By constructing virtual mirror samples that occupy identical positions across source and target domains, the authors eliminate covariate shift while preserving distribution structure, enabling superior unsupervised domain adaptation that achieves state‑of‑the‑art performance on Office and VisDA benchmarks and improves real‑world lighting and gender‑recognition tasks.

AI researchSOTAcovariate shift
0 likes · 3 min read
Reducing the Covariate Shift by Mirror Samples in Cross Domain Alignment
DataFunTalk
DataFunTalk
Jul 1, 2021 · Artificial Intelligence

Pre‑Trained Models: Past, Present, and Future – A Comprehensive Survey

This article surveys the evolution of pre‑trained models, covering the origins of transfer and self‑supervised learning, the rise of transformer‑based PTMs such as BERT and GPT, efficient architecture designs, multimodal and multilingual extensions, theoretical analyses, and future research directions for scalable and robust AI systems.

AI researchefficient traininglarge language models
0 likes · 27 min read
Pre‑Trained Models: Past, Present, and Future – A Comprehensive Survey
AntTech
AntTech
Mar 3, 2021 · Artificial Intelligence

Ant Group Intelligent Service Research Overview: NLP, Dialogue, Recommendation, and Anti‑fraud Papers

The article presents a comprehensive overview of Ant Group's intelligent service research, summarizing recent AI‑focused papers on text classification, stance detection, data augmentation, knowledge distillation for ranking, reinforcement‑learning‑based dialogue clarification, behavior‑cloning dialogue systems, anti‑fraud outbound bots, tag‑based service recommendation, and multi‑agent service groups, while also highlighting future directions and recruitment opportunities.

AI researchAnti‑fraudDialogue Systems
0 likes · 17 min read
Ant Group Intelligent Service Research Overview: NLP, Dialogue, Recommendation, and Anti‑fraud Papers
JD Cloud Developers
JD Cloud Developers
Feb 10, 2021 · Artificial Intelligence

Three JD Tech AI Papers Shine at ICASSP 2021

At ICASSP 2021, JD Tech presented three AI research papers—introducing a Neural Kalman Filtering framework for speech enhancement, a cross‑utterance BERT‑based prosody modeling method for end‑to‑end speech synthesis, and a self‑supervised conversational query rewriting approach—each demonstrating superior performance over existing baselines on benchmark datasets.

AI researchICASSP 2021prosody modeling
0 likes · 9 min read
Three JD Tech AI Papers Shine at ICASSP 2021
JD Cloud Developers
JD Cloud Developers
Nov 2, 2020 · Artificial Intelligence

This Week’s Tech Highlights: AI Research Breakthroughs, 5G Surge, Multi‑Cloud DB & More

The newsletter recaps recent tech developments, including JD's four AI papers at Interspeech 2020, Shenzhen's supercomputing boost, T‑Mobile's mid‑band 5G expansion, Apple's upcoming A14T iMac processor, MongoDB Atlas multi‑cloud support, Wikimedia's migration to GitLab, and advances in graph neural network pre‑training and deep clustering.

5G expansionAI researchApple Silicon
0 likes · 9 min read
This Week’s Tech Highlights: AI Research Breakthroughs, 5G Surge, Multi‑Cloud DB & More
21CTO
21CTO
Jan 31, 2020 · Artificial Intelligence

How Microsoft’s First Chinese AI Fellow Is Driving Speech and Language Breakthroughs

Microsoft appointed its first Chinese Global Technical Fellow, Huang Xuedong, as the company’s Global AI CTO, overseeing Azure’s speech, translation, vision, and language services, while highlighting his groundbreaking achievements such as achieving human‑level word error rates and leading AI research teams.

AI researchAzureMicrosoft
0 likes · 7 min read
How Microsoft’s First Chinese AI Fellow Is Driving Speech and Language Breakthroughs
Hulu Beijing
Hulu Beijing
Apr 2, 2019 · Artificial Intelligence

From Object Detection to Language Models: A Deep Dive into AI Advances

This article surveys the evolution of object detection models—comparing one‑stage and two‑stage approaches, their performance trade‑offs, and recent state‑of‑the‑art methods—while also outlining key concepts and breakthroughs in natural language processing, highlighting the impact of deep‑learning models such as BERT.

AI researchBERTDeep Learning
0 likes · 14 min read
From Object Detection to Language Models: A Deep Dive into AI Advances
ITPUB
ITPUB
Feb 23, 2019 · Artificial Intelligence

Explore a 1.59 Million Image NSFW Dataset with 159 Fine-Grained Categories

A data scientist from Besedo has open‑sourced a massive NSFW image dataset containing 1.589 million pictures, organized into 159 primary categories and further sub‑categories, with download scripts and GitHub links, requiring about 500 GB of storage and cautioning against viewing in the office.

AI researchComputer VisionGitHub
0 likes · 3 min read
Explore a 1.59 Million Image NSFW Dataset with 159 Fine-Grained Categories
DataFunTalk
DataFunTalk
Jan 11, 2019 · Artificial Intelligence

Challenges in Natural Language Understanding and the Neural‑Symbolic Approach (Object‑Oriented Neural Programming)

The article examines why natural language understanding is intrinsically difficult, outlines four core linguistic challenges, proposes a neural‑symbolic integration framework with three design principles, introduces the Object‑Oriented Neural Programming (OONP) architecture, and showcases real‑world applications in public security, legal document analysis, and financial fraud detection.

AI researchKnowledge Graphneural-symbolic integration
0 likes · 16 min read
Challenges in Natural Language Understanding and the Neural‑Symbolic Approach (Object‑Oriented Neural Programming)
Meitu Technology
Meitu Technology
Jun 25, 2018 · Artificial Intelligence

Meitu Short Video Real-Time Classification Challenge MTSVRC

The Meitu Short Video Real-Time Classification Challenge (MTSVRC), co‑hosted by the PRCV conference, Meitu and the Chinese Academy of Sciences, releases the industry‑largest dataset of over 100,000 five‑to‑15‑second videos in 50 categories, judging both classification accuracy and real‑time speed, and offers cash prizes up to ¥100,000 plus presentation opportunities for top teams.

AI researchMTSVRCPRCV 2018
0 likes · 5 min read
Meitu Short Video Real-Time Classification Challenge MTSVRC
AntTech
AntTech
Jan 18, 2018 · Artificial Intelligence

cw2vec: Learning Chinese Word Embeddings with Stroke n-grams

The cw2vec paper, presented at AAAI 2018, introduces a Chinese word embedding method that leverages stroke n‑grams to capture character semantics, proposes a novel loss function, demonstrates consistent improvements over existing models across similarity, analogy, classification and NER tasks, and discusses real‑world AI applications.

AAAI 2018AI researchChinese NLP
0 likes · 7 min read
cw2vec: Learning Chinese Word Embeddings with Stroke n-grams
AntTech
AntTech
Dec 22, 2017 · Artificial Intelligence

Transfer Learning: Concepts, Challenges, and Recent Research Highlights from CIKM 2017

This article reviews the key concepts, challenges, and recent research on transfer learning presented at CIKM 2017, covering instance, feature, parameter, and relation‑based methods, supervised and unsupervised deep TL approaches, and transitive transfer learning with associated loss formulations and optimization strategies.

AI researchDeep Learningmachine learning
0 likes · 9 min read
Transfer Learning: Concepts, Challenges, and Recent Research Highlights from CIKM 2017
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 28, 2017 · Artificial Intelligence

Inside Alibaba AI Lab: Dr. Wang Gang on Multimodal AI and Edge Computing

In an exclusive interview, Alibaba AI Lab's distinguished scientist Dr. Wang Gang discusses the lab's research on multimodal AI, edge computing, AI hardware, bio‑inspired cognition, quantum‑deep‑learning integration, and the challenges of moving from recognition to true understanding, while also outlining Alibaba's AI talent recruitment plans.

AI researchAI talent recruitmentComputer Vision
0 likes · 25 min read
Inside Alibaba AI Lab: Dr. Wang Gang on Multimodal AI and Edge Computing
Qunar Tech Salon
Qunar Tech Salon
Jul 3, 2017 · Artificial Intelligence

Interview with Dr. Lv Zhengdong on Neural‑Symbolic Systems and the Future of Natural Language Understanding

Dr. Lv Zhengdong discusses the challenges of true language understanding, the integration of symbolic reasoning with neural networks, recent advances in neural‑symbolic models, and the practical prospects of NLP in domains such as law and finance, emphasizing the need for hybrid approaches.

AI researchNLPinterview
0 likes · 16 min read
Interview with Dr. Lv Zhengdong on Neural‑Symbolic Systems and the Future of Natural Language Understanding
Alibaba Cloud Developer
Alibaba Cloud Developer
Feb 24, 2017 · Artificial Intelligence

Unlocking StarCraft AI Research with the Open-Source Gym StarCraft Platform

StarCraft, a classic real‑time strategy game, has become a key testbed for deep reinforcement learning and AI research, and Alibaba's open‑source Gym StarCraft platform now offers Python‑based, TensorFlow‑compatible tools that simplify agent development and evaluation within the OpenAI Gym ecosystem.

AI researchDeep LearningStarCraft
0 likes · 3 min read
Unlocking StarCraft AI Research with the Open-Source Gym StarCraft Platform
Ctrip Technology
Ctrip Technology
Aug 5, 2016 · Artificial Intelligence

Advances in Deep Learning for Speech and Semantic Understanding: Insights from Huawei Noah's Ark Lab

The article reviews a decade of deep‑learning breakthroughs, highlights Huawei Noah's recent research on speech, image and natural‑language processing, and discusses future trends such as neural‑symbolic integration, end‑to‑end learning, and knowledge‑driven AI systems.

AI researchHuaweinatural language processing
0 likes · 8 min read
Advances in Deep Learning for Speech and Semantic Understanding: Insights from Huawei Noah's Ark Lab
Art of Distributed System Architecture Design
Art of Distributed System Architecture Design
Oct 8, 2015 · Artificial Intelligence

Facebook AI Research (FAIR): History, Teams, Projects, and Vision

The article chronicles Facebook's evolution from a social platform into a leading AI research hub, detailing the founding of FAIR, its key personnel, ambitious goals, major projects such as memory networks, embedding world, DeepFace, language technology, and the M assistant, and highlights the open, collaborative nature of its AI work.

AI researchDeep LearningFAIR
0 likes · 17 min read
Facebook AI Research (FAIR): History, Teams, Projects, and Vision