Amazon

Graph neural networks (GNNs), referred to as neural algorithmic reasoners (NARs), have shown effectiveness in robustly solving algorithmic tasks of varying input sizes, both in and out of distribution. However, NARs are still relatively narrow forms...
The release of the Tulu 2.5 suite by the Allen Institute for AI marks a significant advancement in model training using Direct Preference Optimization (DPO) and Proximal Policy Optimization (PPO). The Tulu 2.5 suite comprises diverse...

Amazon Open-Sources Fortuna, An Open-Source Library For Uncertainty Quantification of Machine Learning (ML) Models

On examining the class probabilities predicted by a deep neural network classifier, there are times when one can observe that the likelihood of one...

Meet BinoML: A Novel Machine Learning Ranking Model for Precisely Adding Building Numbers to Unlabeled Buildings

Ever wondered how the packages that we order online are delivered within such a short time and so accurately? The availability of accurate addresses...

Amazon Researchers Release CoCoA-MT: A Dataset and Benchmark for Controlling Formality in Machine Translation

Neural machine translation (NMT) models have steadily improved over the years, and their quality is now quite close to that of human translators. Commonly,...

Amazon Research Introduces MTGenEval: A New Benchmark For Evaluating Gender Bias In Machine Translation

It has been a long-held goal of the field of computer science to develop software capable of translating written text between languages. The last...

This Artificial Intelligence (AI) Paper Presents A Study On The Model Update Regression Issue In NLP Structured Prediction Tasks

Model update regression is the term used to describe the decline in performance in some test cases following a model update, even when the...

Amazon AI Researchers Propose A New Machine Learning Framework Called ‘GRAVL-BERT’: BERT-Based Graphical Visual-Linguistic Representations For Multimodal Coreference Resolution

The use of multimodal data for AI training has gained popularity, particularly in recent years. The popularity of voice-activated screen devices like the Amazon...

AWS AI Labs Propose A Method That Predicts Bias In Face Recognition Models Using Unlabeled Data

Algorithmic bias has emerged as a major area of study in artificial intelligence in recent years. An examination of facial recognition software in 2018...

Amazon AI Researchers Propose A New Deep Learning-Based Method For Adapting An MDE Model Trained On One Labeled Dataset To Another, Unlabeled Dataset

Depth data is crucial for various robot uses, including navigation, mapping, and obstacle avoidance. Monocular depth estimation (MDE), which makes depth predictions using only...

Latest Machine Learning Research at Amazon Proposes DAEMON, a Novel Graph Neural Network based Framework for Related Product Recommendation

One primary machine learning application today is recommendation systems for e-commerce stores like Amazon. Customers can save time and have more fulfilling buying experiences...

Amazon Open-Sources ‘MINTAKA,’ a Complex, Natural, and Multilingual Question-Answering (QA) Dataset Composed of 20,000 Question-Answer Pairs

Question answering is the task of learning to predict answers to a given question using machine learning. While many cutting-edge question-answering models perform well when asked simple...

Amazon Researchers Propose ‘MiCS,’ An Artificial Intelligence (AI) System That Attains High Training Throughput and Near-Linear Scalability on The Cloud by Only Using Data...

Gigantic models are models with billions or trillions of parameters to train. Due to significant communication overheads, current general purpose...

Researchers from UC Berkeley and Amazon Introduce an Unsupervised AI Method for Synthesizing Realistic Photos from Scene Sketches

Sketching is a natural means of representing visual signals. With a few light strokes, humans can understand and envision a photo from a sketch....

Galileo Introduces Luna: An Evaluation Foundation Model to Catch Language Model Hallucinations with High...

Galileo Luna represents a significant advancement in language model evaluation. It is specifically designed to address the prevalent issue of hallucinations in large...

Yandex Introduces YaFSDP: An Open-Source AI Tool that Promises to Revolutionize LLM Training by...

Developing large language models requires substantial investments in time and GPU resources, translating directly into high costs. The larger the model, the more pronounced...

Gretel AI Releases a New Multilingual Synthetic Financial Dataset on HuggingFace 🤗 for AI...

Detecting personally identifiable information (PII) in documents involves navigating various regulations, such as the EU’s General Data Protection Regulation (GDPR) and various U.S. financial...

Snowflake AI Research Team Unveils Arctic: An Open-Source Enterprise-Grade Large Language Model (LLM) with...

Snowflake AI Research has launched Arctic, a cutting-edge open-source large language model (LLM) specifically designed for enterprise AI applications, setting a new standard...

Google DeepMind Releases RecurrentGemma: One of the Strongest 2B-Parameter Open Language Models Designed for...

Language models are the backbone of modern artificial intelligence systems, enabling machines to understand and generate human-like text. These models, which process and predict...
