Home
Research
Blog
About
Blog
The Second Curve of Scaling Law
January 15, 2024
Scaling Law
2024
Microsoft’s Differential Transformer cancels attention noise in LLMs (by VentureBeat)
Oct 16, 2024
DIFF Transformer
1-bit LLMs Could Solve AI’s Energy Demands (by IEEE Spectrum)
May 31, 2024
BitNet | 1-bit LLMs
The Era of 1-bit LLMs
February 28, 2024
BitNet (b1.58)
TrOCR | Qualcomm® AI Hub
February 27, 2024
TrOCR
Kosmos-2 | NVIDIA NGC
February 7, 2024
Kosmos-2 MLLM
2023
人工智能基础创新的第二增长曲线 (By Furu Wei)
December 1, 2023
General AI
2023
AI voice synthesis tech 🤝 Impromptu (By Reid Hoffman)
November 6, 2023
VALL-E
2023
Turing Bletchley v3 - A Vision-Language Foundation Model
August 29, 2023
(Multilingual) BEiT-3
2023
Retentive Network: A Successor to Transformer for Large Language Models
July 17, 2023
RetNet
2023
LongNet: Scaling Transformers to 1,000,000,000 Tokens
July 5, 2023
LongNet
2023
Kosmos-2: Grounding Multimodal Large Language Models to the World
June 26, 2023
Kosmos-2 MLLM
2023
Achieving Zero-COGS with Microsoft Editor Neural Grammar Checker
May 19, 2023
LLMOps EdgeFormer LLM Accelerator
2023
Kosmos-1: A Multimodal Large Language Model (MLLM)
March 1, 2023
Kosmos-1 MLLM
2023
让天下没有难训练的大模型,微软亚洲研究院开源TorchScale
February 01, 2023
#TorchScale DeepNet Magneto X-MoE
2023
Revolutionizing Document AI with Multimodal Document Foundation Models
January 26, 2023
Layout(X)LM
2023
After ChatGPT and DALL-E, meet VALL-E - the text-to-speech AI that can mimic anyone’s voice
January 12, 2023
VALL-E
2022
Microsoft Turing Universal Language Representation model, T-ULRv6, tops both XTREME and GLUE leaderboards with a single model
October 31, 2022
XLM-E
2022
文档基础模型引领文档智能走向多模态大一统
October 26, 2022
Layout(X)LM MarkupLM
2022
通用多模态基础模型BEiT-3:引领文本、图像、多模态预训练迈向“大一统”
August 30, 2022
BEiT(-3)
2022
文档智能多模态预训练模型LayoutLMv3:兼具通用性与优越性
July 26, 2022
LayoutLM
2021
通用模型、全新框架,WavLM语音预训练模型全解
December 23, 2021
WavLM
2021
WMT 2021冠军来了!重建巴别塔之多语言翻译模型
December 22, 2021
DeltaLM mT6
2021
Multilingual translation at scale: 10000 language pairs and beyond
November 22, 2021
DeltaLM mT6
2021
Microsoft Turing Universal Language Representation model, T-ULRv5, tops XTREME leaderboard and trains 100x faster
September 28, 2021
XLM-E InfoXLM
2021
The science behind semantic search: How AI from Bing is powering Azure Cognitive Search
March 2, 2021
UniLM MiniLM
2020
Microsoft Turing Universal Language Representation model, T-ULRv2, tops XTREME leaderboard
October 19, 2020
InfoXLM
2019
Microsoft’s UniLM AI achieves state-of-the-art performance on summarization and language generation
October 16, 2019
UniLM