Computer Vision

Apple AI Researchers Develop GMPIs (Generative Multiplane Images) For Making A 2D GAN 3D-Aware

Using a given training dataset as a guide, generative adversarial networks (GANs) have achieved excellent results when sampling new pictures that are "similar" to...

The AI-Makeup Artist that Covers Your Identity: CLIP2Protect is an AI Model That Uses Text-Guided Makeup to Protect Facial Privacy

Sci-fi movies from the 90s are full of computers that show a rotating profile of a person and display all types of information about the...

Meet DragonDiffusion: A Fine-Grained Image Editing Method Enabling Drag-style Manipulation on Diffusion Models

Large-scale text-to-image (T2I) diffusion models, which aim to generate images conditioned on a given text prompt, have seen rapid development thanks to the availability of...

Meet SAM-PT: A New AI Method Extending Segment Anything Model’s (SAM) Capability to Tracking and Segmenting Anything in Dynamic Videos

Numerous applications, such as robotics, autonomous driving, and video editing, benefit from video segmentation. Deep neural networks have made great progress in the last...

HuggingFace Research Introduces LEDITS: The Next Evolution in Real-Image Editing Leveraging DDPM Inversion and Enhanced Semantic Guidance

Interest in text-guided diffusion models has surged thanks to the outstanding realism and diversity of the images they generate. With the...

Playing Where’s Waldo? in 3D: OpenMask3D is an AI Model That Can Segment Instances in 3D with Open-Vocabulary Queries

Image segmentation has come a long way in the last decade, thanks to advances in neural networks. It is now possible to segment...

Researchers from the University of Wisconsin and ByteDance Introduce PanoHead: The First 3D GAN Framework that Synthesizes View-Consistent Full Head Images with only Single-View...

In computer vision and graphics, photo-realistic portrait image synthesis has been constantly emphasized, with a wide range of downstream applications in virtual avatars, telepresence,...

Meet DiffComplete: An Interesting AI Method that can Complete 3D Objects from Incomplete Shapes

Shape completion on 3D range scans is a challenging task that involves inferring complete 3D shapes from incomplete or partial input data. Previous methods...

Meet Magic123: A Novel Image-to-3D Pipeline that Uses a Two-Stage Coarse-to-Fine Optimization Process to Produce High-Quality High-Resolution 3D Geometry and Textures

Despite only seeing the world in two dimensions, humans are adept at navigating, thinking, and interacting with their three-dimensional environment. This suggests a profoundly...

You Gotta Pump Those Dimensions: DreamEditor is an AI Model That Edits 3D Scenes Using Text-Prompts

The 3D computer vision domain has been flooded with NeRFs in recent years. They emerged as a groundbreaking technique and enabled the reconstruction and synthesis...

This AI Tool Explains How AI ‘Sees’ Images And Why It Might Mistake An Astronaut For A Shovel

It is widely recognized that artificial intelligence (AI) has made significant strides in recent years, leading to remarkable achievements and breakthrough outcomes. However, it...

Researchers From Binghamton University Introduce A Privacy-Enhancing Anonymization System (My Face, My Choice) For Everyone To Have Control Over Their Faces In Social Photo...

Anonymization is a critical problem in the context of face recognition and identification algorithms. With the increasing productization of these technologies, ethical concerns have...

Snowflake AI Research Team Unveils Arctic: An Open-Source Enterprise-Grade Large Language Model (LLM) with...

Snowflake AI Research has launched Arctic, a cutting-edge open-source large language model (LLM) specifically designed for enterprise AI applications, setting a new standard...

Google DeepMind Releases RecurrentGemma: One of the Strongest 2B-Parameter Open Language Models Designed for...

Language models are the backbone of modern artificial intelligence systems, enabling machines to understand and generate human-like text. These models, which process and predict...

Finally, the Wait is Over: Meta Unveils Llama 3, Pioneering a New Era in...

Meta has revealed its latest large language model, Meta Llama 3, which is a major breakthrough in the field of AI. This new model is not just...

TrueFoundry Releases Cognita: An Open-Source RAG Framework for Building Modular and Production-Ready Applications

The field of artificial intelligence is rapidly evolving, and taking a prototype to production can be quite challenging. However, TrueFoundry has recently introduced a new...

Meet Zamba-7B: Zyphra’s Novel AI Model That’s Small in Size and Big on Performance

In the race to create more efficient and powerful AI models, Zyphra has unveiled a significant breakthrough with its new Zamba-7B model. This compact,...
