Tagged articles
3 articles
Page 1 of 1
DataFunTalk
DataFunTalk
Apr 19, 2025 · Artificial Intelligence

Microsoft Research's Open‑Source Native 1‑Bit LLM BitNet b1.58 2B4T: Design, Performance, and Deployment

Microsoft Research released BitNet b1.58 2B4T, the first open‑source native 1‑bit large language model with 2 billion parameters, 1.58‑bit effective precision and a 0.4 GB footprint, achieving full‑precision performance while enabling efficient CPU and GPU inference for edge AI applications.

1-bit quantizationCPU inferenceLLM
0 likes · 10 min read
Microsoft Research's Open‑Source Native 1‑Bit LLM BitNet b1.58 2B4T: Design, Performance, and Deployment
Network Intelligence Research Center (NIRC)
Network Intelligence Research Center (NIRC)
May 22, 2023 · Artificial Intelligence

How Microsoft Leverages LLMs to Auto‑Generate Cloud Incident Root Causes and Fixes

Microsoft researchers fine‑tuned GPT‑3.x models with LoRA on over 40,000 cloud incident records, evaluated them with six NLP metrics and human interviews, and found that LLMs can generate root‑cause analyses and mitigation steps comparable to BERT models, especially for machine‑detected failures.

AI for operationsGPT-3LLM
0 likes · 8 min read
How Microsoft Leverages LLMs to Auto‑Generate Cloud Incident Root Causes and Fixes
DataFunTalk
DataFunTalk
Jul 9, 2022 · Artificial Intelligence

Graph Neural Networks Enter the Transformer Era – Seminar by Dr. Zheng Shuxin

The LOGS seminar on July 9, 2022 featured Dr. Zheng Shuxin from Microsoft Research presenting an overview of Transformer models, their success in NLP and CV, recent breakthroughs in applying Transformers to graph data, and future directions for graph processing.

AI SeminarMicrosoft researchTransformer
0 likes · 4 min read
Graph Neural Networks Enter the Transformer Era – Seminar by Dr. Zheng Shuxin