Computer Vision

AI Researchers From China Propose Head Swapper (HeSer) For Few-Shot Head Swapping in the Wild

This Article Is Based On The Research Paper 'Few-Shot Head Swapping in the Wild'. All Credit For This Research Goes To The Researchers...

AI Researchers Develop ‘CogView2’: A Text-To-Image System That Achieves A 10x Speedup Over CogView

This Article Is Based On The Research Paper 'CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers'. All Credit For This Research Goes To...

Meta AI Introduces ‘Make-A-Scene’: A Deep Generative Technique Based On An Autoregressive Transformer For Text-To-Image Synthesis With Human Priors

This Article Is Based On The Research Paper 'Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors'. All Credit For This Research Goes To The Researchers...

Researchers At SenseTime Develop HuMMan: A Multi-Modal 4D Human Dataset For Versatile Sensing And Modeling

This Article Is Based On The Research Paper 'HuMMan: Multi-Modal 4D Human Dataset for Versatile Sensing and Modeling'. All Credit For This Research Goes...

TensorFlow Introduces Depth API To Convert Individual Images To 3D Photos

A depth map is an image channel in computer graphics and computer vision that provides information on the distance of the surface of objects...

AI Researchers Introduce Neural Mixtures of Planar Experts (NeurMiPs): A Novel Planar-Based Scene Representation For Modeling Geometry And Appearance

This Article Is Based On The Research Paper 'NeurMiPs: Neural Mixture of Planar Experts for View Synthesis'. All Credit For This Research Goes...

Northeastern University and Microsoft Researchers Propose A Novel Two-Branch Technique That Expands StyleGAN’s Latent Space

This Article Is Based On The Research Article 'Expanding the Latent Space of StyleGAN for Real Face Editing'. All Credit For This Research Goes...

DeepMind Introduces Flamingo: An Open-Ended Single Visual Language Model (VLM) For Multimodal Machine Learning Research

This Article Is Based On The Research Paper 'Flamingo: a Visual Language Model for Few-Shot Learning'. All Credit For This Research Goes To The...

Researchers At SenseTime Develop GNR: Generalizable Neural Performer for Human Novel View Synthesis

This Article Is Based On The Research Paper 'Generalizable Neural Performer: Learning Robust Radiance Fields for Human Novel View Synthesis'. All Credit For This...

UTokyo Researchers Introduce Novel Synthetic Training Data Called Self-Blended Images (SBIs) To Detect Deepfakes

This Article Is Based On The Research Paper 'Detecting Deepfakes with Self-Blended Images'. All Credit For This Research Goes To The Researchers Of This...

ByteDance Researchers Propose CLIP-GEN: A New Self-Supervised Deep Learning Generative Approach Based On CLIP And VQ-GAN To Generate Reliable Samples From Text Prompts

This Article Is Based On The Research Paper 'CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP'. All Credit For This Research Goes To...

This Bengaluru-based Visual Object Intelligence Platform is Making Robots Adapt to Unstructured Surroundings in Manufacturing

Machines are now used to manufacture a wide range of items....

