Home
Research
Blog
About
Blog
The Second Curve of Scaling Law
January 15, 2024
Scaling Law
2024
How Microsoft’s next-gen BitNet architecture is turbocharging LLM efficiency (by VentureBeat)
Nov 14, 2024
BitNet (a4.8)
Microsoft’s Differential Transformer cancels attention noise in LLMs (by VentureBeat)
Oct 16, 2024
DIFF Transformer
1-bit LLMs Could Solve AI’s Energy Demands (by IEEE Spectrum)
May 31, 2024
BitNet | 1-bit LLMs
The Era of 1-bit LLMs
February 28, 2024
BitNet (b1.58)
TrOCR | Qualcomm® AI Hub
February 27, 2024
TrOCR
Kosmos-2 | NVIDIA NGC
February 7, 2024
Kosmos-2 MLLM
2023
人工智能基础创新的第二增长曲线 (By Furu Wei)
December 1, 2023
General AI
2023
AI voice synthesis tech 🤝 Impromptu (By Reid Hoffman)
November 6, 2023
VALL-E
2023
Turing Bletchley v3 - A Vision-Language Foundation Model
August 29, 2023
(Multilingual) BEiT-3
2023
Retentive Network: A Successor to Transformer for Large Language Models
July 17, 2023
RetNet
2023
LongNet: Scaling Transformers to 1,000,000,000 Tokens
July 5, 2023
LongNet
2023
Kosmos-2: Grounding Multimodal Large Language Models to the World
June 26, 2023
Kosmos-2 MLLM
2023
Achieving Zero-COGS with Microsoft Editor Neural Grammar Checker
May 19, 2023
LLMOps EdgeFormer LLM Accelerator
2023
Kosmos-1: A Multimodal Large Language Model (MLLM)
March 1, 2023
Kosmos-1 MLLM
2023
让天下没有难训练的大模型,微软亚洲研究院开源TorchScale
February 01, 2023
#TorchScale DeepNet Magneto X-MoE
2023
Revolutionizing Document AI with Multimodal Document Foundation Models
January 26, 2023
Layout(X)LM
2023
After ChatGPT and DALL-E, meet VALL-E - the text-to-speech AI that can mimic anyone’s voice
January 12, 2023
VALL-E
2022
Microsoft Turing Universal Language Representation model, T-ULRv6, tops both XTREME and GLUE leaderboards with a single model
October 31, 2022
XLM-E
2022
文档基础模型引领文档智能走向多模态大一统
October 26, 2022
Layout(X)LM MarkupLM
2022
通用多模态基础模型BEiT-3:引领文本、图像、多模态预训练迈向“大一统”
August 30, 2022
BEiT(-3)
2022
文档智能多模态预训练模型LayoutLMv3:兼具通用性与优越性
July 26, 2022
LayoutLM
2021
通用模型、全新框架,WavLM语音预训练模型全解
December 23, 2021
WavLM
2021
WMT 2021冠军来了!重建巴别塔之多语言翻译模型
December 22, 2021
DeltaLM mT6
2021
Multilingual translation at scale: 10000 language pairs and beyond
November 22, 2021
DeltaLM mT6
2021
Microsoft Turing Universal Language Representation model, T-ULRv5, tops XTREME leaderboard and trains 100x faster
September 28, 2021
XLM-E InfoXLM
2021
The science behind semantic search: How AI from Bing is powering Azure Cognitive Search
March 2, 2021
UniLM MiniLM
2020
Microsoft Turing Universal Language Representation model, T-ULRv2, tops XTREME leaderboard
October 19, 2020
InfoXLM
2019
Microsoft’s UniLM AI achieves state-of-the-art performance on summarization and language generation
October 16, 2019
UniLM