Computer Vision

University of Oxford Researchers Release ‘PASS’ Dataset With 1.4M+ Images (Free From Humans) For Self-Supervised Machine Learning

The development of modern machine learning could not have happened without an extensive research dataset. For quite some time, computer vision has relied on...

Google AI Introduces Pathdreamer: A World Model For Indoor Navigation

When navigating around an unknown facility, humans use various visual, spatial, and semantic cues to help them get to their destination quickly. However, taking...

AI Research Group From NVIDIA Unveils An Advanced Framework To Estimate Physically Correct Human Motions

It is no secret that human motion synthesis has been a complex and, as of yet, unmet need. Existing techniques are limited by the...

Google AI Introduces ‘WIT’, A Wikipedia-Based Image Text Dataset For Multimodal Multilingual Machine Learning

Image and text datasets are widely used in many machine learning applications. To model the relationship between images and text, most multimodal Visio-linguistic models...

Facebook AI Releases Captum 0.4: A More Powerful Model Interpretability Library For PyTorch

Among other Machine learning (ML) techniques, deep neural networks have become crucial components for various applications, including image classification, audio recognition, and natural language...

A New Google AI Study Introduces A Mask R-CNN–Based Model For Solving Instance Segmentation Problem

Computer vision (CV) is transforming industries and making life easier for consumers. Many downstream applications, such as self-driving cars, robots, medical imaging, and photo editing,...

Tencent AI Research Unveils ‘PIRenderer’, An AI Model To Control The Generation Of Faces Via Semantic Neural Rendering

Portrait images are an essential type of photograph that can be found in everyday life. The ability to intuitively control the poses and expressions...

Facebook AI Introduces A New Image Generation Model Called ‘IC-GAN’ That Creates High-Quality Images of Unfamiliar Objects And Scenes

Generative adversarial networks (GANs) have been used for few years to generate photorealistic images of objects or scenes that are very similar in style...

Researchers Introduce OncoPetNet: A Deep Learning Based AI System For Mitotic Figure Counting in a Veterinary Diagnostic Lab

Artificial intelligence (AI) has transformed industries all over the world, including the healthcare sector. From fitness bracelets to glucose monitoring, modern technology allows anyone...

Google AI Introduces Two New Families of Neural Networks Called ‘EfficientNetV2’ and ‘CoAtNet’ For Image Recognition

Training efficiency has become a significant factor for deep learning as the neural network models, and training data size grows. GPT-3 is an excellent...

Toshiba Develops AI To The Next Level With World’s Most Accurate Highly Versatile Visual Question Answering (VQA)

Toshiba Corporation has created the world's most accurate and adaptable Visual Question Answering (VQA) AI, which can distinguish not just persons and objects in...

Israeli Researchers Unveil DeepSIM, a Neural Generative Model for Conditional Image Manipulation Based on a Single Image

In recent years, deep neural networks have been proven effective at performing image manipulation tasks for which large training datasets are available such as,...

Recent articles