Advancing AI for Humanity

Blog

2026

Microsoft Open-Sources Industry-Leading Embedding Model

April 8, 2026
E5 | Embedding Model

Introducing VibeVoice ASR: Longform, Structured Speech Recognition At Scale (by Microsoft Foundry Blog)

Mar 13, 2026
VibeVoice

Microsoft's new AI training method eliminates bloated system prompts without sacrificing model performance (by VentureBeat)

Mar 3, 2026
On-Policy Context Distillation | Experiential Learning

人工智能的下一代前沿: 科学规模化与学习范式的革命 (by Furu Wei)

Jan 15, 2026
The Next Frontier of AI

2025

Microsoft Releases VibeVoice Open-Source AI Model for Generating Multi-Speaker Podcasts (by WinBuzzer)

Sep 3, 2025
VibeVoice | LatentLM

Microsoft researchers say they’ve developed a hyper-efficient AI model that can run on CPUs (by TechCrunch)

Apr 16, 2025
BitNet | The Era of 1-bit LLMs

2024

Small Bits, Big Ideas: The Amazing Rise of 1-Bit LLMs for Building Faster and Slimer Generative AI Apps (by Forbes)

Nov 22, 2024
BitNet | The Era of 1-bit LLMs

How Microsoft’s next-gen BitNet architecture is turbocharging LLM efficiency (by VentureBeat)

Nov 14, 2024
BitNet (a4.8)

Microsoft’s Differential Transformer cancels attention noise in LLMs (by VentureBeat)

Oct 16, 2024
DIFF Transformer

1-bit LLMs Could Solve AI’s Energy Demands (by IEEE Spectrum)

May 31, 2024
BitNet | 1-bit LLMs

The Era of 1-bit LLMs

February 28, 2024
BitNet (b1.58)

TrOCR | Qualcomm® AI Hub

February 27, 2024
TrOCR

Kosmos-2 | NVIDIA NGC

February 7, 2024
Kosmos-2 MLLM

The Second Curve of Scaling Law

January 15, 2024
Scaling Law

2023

人工智能基础创新的第二增长曲线 (By Furu Wei)

December 1, 2023
General AI

2023

AI voice synthesis tech 🤝 Impromptu (By Reid Hoffman)

November 6, 2023
VALL-E

2023

Turing Bletchley v3 - A Vision-Language Foundation Model

August 29, 2023
(Multilingual) BEiT-3

2023

Retentive Network: A Successor to Transformer for Large Language Models

July 17, 2023
RetNet

2023

LongNet: Scaling Transformers to 1,000,000,000 Tokens

July 5, 2023
LongNet

2023

Kosmos-2: Grounding Multimodal Large Language Models to the World

June 26, 2023
Kosmos-2 MLLM

2023

Achieving Zero-COGS with Microsoft Editor Neural Grammar Checker

May 19, 2023
LLMOps EdgeFormer LLM Accelerator

2023

Kosmos-1: A Multimodal Large Language Model (MLLM)

March 1, 2023
Kosmos-1 MLLM

2023

让天下没有难训练的大模型，微软亚洲研究院开源TorchScale

February 01, 2023
#TorchScale DeepNet Magneto X-MoE

2023

Revolutionizing Document AI with Multimodal Document Foundation Models

January 26, 2023
Layout(X)LM

2023

After ChatGPT and DALL-E, meet VALL-E - the text-to-speech AI that can mimic anyone’s voice

January 12, 2023
VALL-E

2022

Microsoft Turing Universal Language Representation model, T-ULRv6, tops both XTREME and GLUE leaderboards with a single model

October 31, 2022
XLM-E

2022

文档基础模型引领文档智能走向多模态大一统

October 26, 2022
Layout(X)LM MarkupLM

2022

通用多模态基础模型BEiT-3：引领文本、图像、多模态预训练迈向“大一统”

August 30, 2022

BEiT(-3)

2022

文档智能多模态预训练模型LayoutLMv3：兼具通用性与优越性

July 26, 2022
LayoutLM

2021

通用模型、全新框架，WavLM语音预训练模型全解

December 23, 2021
WavLM

2021

WMT 2021冠军来了！重建巴别塔之多语言翻译模型

December 22, 2021
DeltaLM mT6

2021

Multilingual translation at scale: 10000 language pairs and beyond

November 22, 2021
DeltaLM mT6

2021

Microsoft Turing Universal Language Representation model, T-ULRv5, tops XTREME leaderboard and trains 100x faster

September 28, 2021
XLM-E InfoXLM

2021

The science behind semantic search: How AI from Bing is powering Azure Cognitive Search

March 2, 2021
UniLM MiniLM

2020

Microsoft Turing Universal Language Representation model, T-ULRv2, tops XTREME leaderboard

October 19, 2020
InfoXLM

2019

Microsoft’s UniLM AI achieves state-of-the-art performance on summarization and language generation

October 16, 2019
UniLM