Computer Vision

Google AI Releases ‘Objectron Dataset’ Consisting Of 15,000 Annotated Videos And 4M Annotated Images

Computer vision tasks have reached exceptional accuracy with new advancements in machine learning models trained with photos. Adding to these advancements, 3D object understanding...

TensorFlow Introduces Improved Iris Tracking In The Browser With TensorFlow.js Face Landmarks Detection Model

Iris tracking enables many applications, such as hands-free interfaces for assistive technologies and understanding user behavior beyond clicks and gestures. It is also a...

Introducing BVR ‘Bridging Visual Representations’: A Novel Module And Applied Plug-In Designed To Better Integrate Different Computer Vision (CV) Object Representations

Microsoft Research Asia and The Institute of Automation, CAS present a unique module based on an attention-based decoder to integrate different computer vision (CV)...

PyTorch Releases Version 1.7 With New Features Like CUDA 11, New APIs for FFTs, And Nvidia A100 Generation GPUs Support

Team PyTorch has recently released the latest version of PyTorch 1.7, with many changes included in the package.  Significant highlights of the python package are:  It...

Google Meet Introduces In-Browser Machine Learning Solution For Blurring And Replacing Background In A Live Video

Google recently announced ways to blur and replace the background in Google Meet for better focus on the person rather than the surrounding. The...

This Halloween Turn Yourself Into A Zombie With This AI Tool Using StyleGAN2

Many might be interested in an AI tool that changes your picture into a zombie with Halloween season upon us. NVIDIA in early 2019...

Google Launches rǝ: A Browser-Based Toolset To Reconstruct The 3D Structure Of Cities Using Deep Learning and Crowdsourcing

Google launched a browser-based toolset: rǝ (pronounced as re”turn"). rǝ is an open-source and scalable system running on Google Cloud and Kubernetes to reconstruct...

Hayden AI Raises $5M To Bring Safe, Healthy And Equitable Mobility To The Public In Smart Cities

Hayden AI Technologies, Inc. recently raised $5M in a recent funding round. Hayden AI will use these funds to accelerate product development, helping cities...

Facebook AI Open-Sources Graph Transformer Networks (GTN) For Automatic Differentiation With Graphs

Graph Transformer Networks (GTN) is an open-source framework with weighted finite-state transducers (WFSTs), a powerful and expressive type of graph. GTN, just like PyTorch,...

Deep4Air: A Deep Learning Based Framework For Airport Airside Surveillance

Major airports worldwide have undertaken substantial expansion programs to accommodate the steady growth in air traffic, including new runways and taxiways. The complex airside...

NVIDIA Announces ‘Cambridge-1’: UK’s Most Powerful Supercomputer For AI Healthcare Research

NVIDIA announces that it is building a supercomputer named “Cambridge-1,” claiming to be the United Kingdom’s most powerful supercomputer. It aims to help healthcare...

NVIDIA Releases Imaginaire: A Universal PyTorch Library Designed For Various GAN-Based Tasks And Methods

NVIDIA has developed a universal PyTorch library, Imaginaire, with an optimized implementation of various GAN images and video synthesis.  The Imaginaire library currently covers three types of models, providing...

Recent articles

Check Out Our Super Cool AI Research Newsletter While It's Still Free

X