Computer Vision

Meet ONE-PEACE: A General Representation Model Towards Unlimited Modalities Across Different Modalities

Representation models have gotten much attention in computer vision, voice, natural language processing, etc. Representation models exhibit high generalization in various downstream tasks after...

Researchers from the National University of Singapore Propose Mind-Video: A New AI Tool That Uses fMRI Data from the Brain to Recreate Video Image

Understanding human cognition has made reconstructing human vision from brain processes intriguing, especially when employing non-invasive technologies like functional Magnetic Resonance Imaging (fMRI). There...

Instant Cameras, Evolved: This Text-to-Image AI Model Can Be Personalized Quickly with Your Images

Text-to-image generation is a term we are all familiar with at this point. The era after the stable diffusion release has brought another meaning...

CMU Researchers Propose STF (Sketching the Future): A New AI Approach that Combines Zero-Shot Text-to-Video Generation with ControlNet to Improve the Output of these...

The popularity of neural network-based methods for creating new video material has increased due to the internet's explosive rise in video content. However, the...

When SAM Meets NeRF: This AI Model Can Segment Anything in 3D

We are all amazed by the generative AI advancements recently, but that does not mean we do not get any significant breakthroughs in other...

Creating Detailed 3D Models from Images: How AI Frameworks are Changing the Game

Three-dimensional (3D) modeling has become critical in various fields, such as architecture and engineering. 3D models are computer-generated objects or environments that can be...

Take Me to Another Dimension: This AI Model Can Generate Realistic Generative 3D Face Models

Generating anything, whether it’s a text or an image, in the digital world has never been easier, thanks to the advancement of neural networks...

Meet Blendify: A Python Framework Developed with a Focus on 3D Computer Vision Visualization

Computer vision is making noteworthy strides in the field of Artificial intelligence and Machine Learning. Its features, like object detection and image recognition, make...

Divide, Train, and Generate: Patch Diffusion is an AI Approach to Make Training Diffusion Models Faster and More Data-Efficient

Image generation has come a long way in the last year. The saga began with the release of Stable Diffusion, and its success has...

Meet YOLO-NAS: An Open-Sourced YOLO-based Architecture Redefining State-of-the-Art in Object Detection

Deci AI has introduced a new object detection model called YOLO-NAS. YOLO-NAS stands for "You Only Look Once - Neural Architecture Search," and it...

Moving Images with No Effort: Text2Video-Zero is an AI Model That Converts Text-to-Image Models to Zero-Shot Video Generators

We have witnessed the rise of generative AI models in the last couple of months. They went from generating low-resolution face-like images to generating...

Could It Be the Patches? This AI Approach Analyzes the Key Contributor to the Success of Vision Transformers

Convolutional neural networks (CNNs) have been the backbone of systems for computer vision tasks. They have been the go-to architecture for all types of...

Recent articles

Be the first to know the latest AI research breakthroughs.

X