Computer Vision

Facebook AI Introduces Ego4D Dataset, A Step Towards Egocentric Perception

Facebook has teamed up with 13 universities in 9 countries to create the first-person perspective datasetĀ Ego4D. It contains more than 700 project participants, wearing...

DeepMind Introduces ‘RGB-Stacking’: A Reinforcement Learning Based Approach For Tackling Robotic Stacking of Diverse Shapes

For many people stacking one thing on top of another seems to be a simple job. Even the most advanced robots, however, struggle to...

A New Apple AI Study Investigates Whether Self-Supervised and Supervised Methods Learn Similar Visual Representations

Self-supervised learning (SSL) is an ML technique that allows computers to predict unknown inputs using observed inputs. One essential goal for self-supervised learning is...

NVIDIA AI Releases StyleGAN3: Alias-Free Generative Adversarial Networks

The recent advances in the quality and resolution of Generative adversarial networks (GAN) have seen a rapid improvement. These techniques are used for various...

Facebook AI Introduces ‘3DETR’ That Increases 3D Comprehension and ‘DepthContrast’, A self-supervised Learning Mechanism That Doesnā€™t Rely On Labels

In todayā€™s world, itā€™s critical to develop systems that can understand 3D data about the world. For example, autonomous automobiles require 3D understanding to...

Researchers From Imperial College London Introduces ‘HeadGAN’, A Novel One-Shot GAN-Based Method For Talking Head Animation And Editing

While recent attempts to solve the problem of head reenactment using a single reference image have shown promising results, most of them perform poorly...

Microsoft Researchers Introduce ‘Mesh Graphormer’, A Graph-Convolution-Reinforced Transformer

While 3D human pose and mesh reconstruction from a single image is a trending area of research because of its applications for human-computer interactions,...

University of Verona Researchers Introduce ‘SEAM Match-RCNN’ and ‘MovingFashion’ Dataset For Retrieving e-Fashion in Social Media Videos Using Computer Vision

The increasing use of social media has led to an exciting new trend in e-fashion known as 'video-to shop.' The idea is that videos...

MIT Researchers Open-Sourced ‘MADDNESS’: An AI Algorithm That Speeds Up Machine Learning Using Approximate Matrix Multiplication (AMM)

Matrix multiplication is one of the essential operations in machine learning (ML). However, these operations are extensively computationally costly due to the extensive use...

NVIDIA AI Proposes A Novel AI Framework For Mixed Reality Tasks, Such As Photorealistic Virtual Object Insertion

It is often challenging to estimate albedo, normals, depth, and 3D spatially-varying lighting from a single image all at the same time. The problem...

Google AI 0pen Sources ‘FedJAX’, A JAX-based Python Library for Federated Learning Simulations

Federated learning is a machine learning environment in which multiple clients (such as mobile devices or entire enterprises, depending on the task at hand)...

Microsoft AI Research Introduces A Huge Synthetic-Face Dataset Along With A Face Analysis Method Using Synthetic Data Alone

We often forget that the most challenging part about machine learning isn't choosing a correct model; it's finding good data. There are concerns with...

Recent articles

Check Out Our Super Cool AI Research Newsletter While It's Still Free

X