Computer Vision

Researchers Propose ‘Projected-GANs’, To Improve Image Quality, Sample Efficiency, And Convergence Speed

Generative Adversarial Networks (GANs) are a novel approach to Generative Modeling using deep learning methods, such as convolutional neural networks (1). A Generative Adversarial...

Google Research Introduces ‘SCENIC’: An Open-Source JAX Library For Computer Vision Research

The field of computer vision is quickly advancing, exhibiting the great potential to address everything from global healthcare problems to transportation. Over the last...

Yale Researchers Use Machine Learning To Identify Brain Networks Predictive Of Aggression In Children

Children's mental disorders are defined as significant changes in how children learn, behave, or handle their emotions. Many times these changes create discomfort and...

Rutgers University’s AI Researchers Propose A Slot-Based Autoencoder Architecture, Called SLot Attention TransformEr (SLATE)

DALL·E has shown an impressive ability of composition-based systematic generalization in image generation, but it requires the dataset of text-image pairs and provides compositional...

Microsoft AI Research Releases ‘ORBIT’ Dataset: A Real-World Few-Shot Dataset for Teachable Object Recognition

Object recognition algorithms have come a long way in recent years, but they still require training datasets containing thousands of high-quality, annotated examples for...

ByteDance Proposes An Impressive Multi-Object Tracking Architecture

Multi-object tracking (MOT) involves identifying and following objects as they move about in videos. Currently, available methods obtain identities by associating detection boxes whose...

AWS Launches Computer Vision at the Edge with AWS Panorama Appliance

The launch of a new edge computer vision service SDK from AWS has been eagerly awaited by those who need to analyze camera images....

AI Researchers From Huawei and Shanghai Jiao Tong University Introduce ‘CIPS-3D’: A 3D-Aware Generator of GANs

The StyleGAN architecture is a great way to generate high-quality images, but it lacks the ability to control camera poses precisely. The recent NeRF...

Google AI Introduces SimVLM: Simple Visual Language Model Pre-training With Weak Supervision

The visual language modeling method has lately emerged as a feasible option for content-based image classification. In this method, each image is converted into...

MIT CSAIL, TU Wien, and IST Researchers Introduce Deep Learning Models That Require Fewer Neurons

Today's artificial intelligence technology is intended to mimic nature and replicate the same decision-making abilities that people develop naturally in a computer. Artificial neural networks,...

Facebook AI Introduces ‘Anticipative Video Transformer’ (AVT): An End-To-End Attention-Based Model For Action Anticipation In Videos

Every day, people make countless decisions based on their understanding of their surroundings as a continuous sequence of events. Artificial intelligence systems that can...

Researchers at ETH Zurich & Microsoft Introduce ‘PixLoc’: A Neural Network For Feature Alignment With A 3D Model Of The Environment

In known scenes, camera pose estimation is an intriguing task of 3D geometry recently tackled by many learning algorithms. Many of these techniques try...

Recent articles

Check Out Our Super Cool AI Research Newsletter While It's Still Free

X