Microsoft

Microsoft And The University Of California, Merced Introduces ZeRO-Offload, A Novel Heterogeneous DeepLearning Training Technology To Train Multi-Billion Parameter Models On A Single GPU

We are progressing towards an era of technology that is becoming heavily dependent on Deep Learning (DL) models. As these models' size increases exponentially, it...

Introducing BVR ‘Bridging Visual Representations’: A Novel Module And Applied Plug-In Designed To Better Integrate Different Computer Vision (CV) Object Representations

Microsoft Research Asia and The Institute of Automation, CAS present a unique module based on an attention-based decoder to integrate different computer vision (CV)...

Microsoft Introduces Lobe: A Free Machine Learning Application That Allows You To Create AI Models Without Coding

Microsoft has released Lobe, a free desktop application that lets Windows and Mac users create customized AI models without writing any code. Several customers are...

Adversarial Machine Learning Threat Matrix – A Framework To Defend AI Systems From Adversarial Attacks

Microsoft, in collaboration with MITRE research organization and a dozen other organizations, including IBM, Nvidia, Airbus, and Bosch, has released the Adversarial ML Threat...

Microsoft releases a new version of DeepSpeed tool to enable the creation of deep learning models with a trillion parameters

DeepSpeed, Microsoft’s deep learning optimization library, makes distributed training easy, effective, and efficient. It’s an essential part of Microsoft’s initiative, AI at Scale, to enable...

Hummingbird: A library for compiling trained traditional machine learning models into tensor computations

This is really a cool work out of Microsoft research called hummingbird. You can convert traditional machine learning models to tensor computations to take...

Galileo Introduces Luna: An Evaluation Foundation Model to Catch Language Model Hallucinations with High...

0
The Galileo Luna represents a significant advancement in language model evaluation. It is specifically designed to address the prevalent issue of hallucinations in large...

Yandex Introduces YaFSDP: An Open-Source AI Tool that Promises to Revolutionize LLM Training by...

0
Developing large language models requires substantial investments in time and GPU resources, translating directly into high costs. The larger the model, the more pronounced...

Gretel AI Releases a New Multilingual Synthetic Financial Dataset on HuggingFace 🤗 for AI...

0
Detecting personally identifiable information PII in documents involves navigating various regulations, such as the EU’s General Data Protection Regulation (GDPR) and various U.S. financial...

Snowflake AI Research Team Unveils Arctic: An Open-Source Enterprise-Grade Large Language Model (LLM) with...

0
Snowflake AI Research has launched the Arctic, a cutting-edge open-source large language model (LLM) specifically designed for enterprise AI applications, setting a new standard...

Google DeepMind Releases RecurrentGemma: One of the Strongest 2B-Parameter Open Language Models Designed for...

0
Language models are the backbone of modern artificial intelligence systems, enabling machines to understand and generate human-like text. These models, which process and predict...

Recent articles

🐝 🐝 Join the Fastest Growing AI Research Newsletter...

X