Author: Niharika Singh

275 POSTS0 COMMENTS
Niharika is a Technical consulting intern at Marktechpost. She is a third year undergraduate, currently pursuing her B.Tech from Indian Institute of Technology(IIT), Kharagpur. She is a highly enthusiastic individual with a keen interest in Machine learning, Data science and AI and an avid reader of the latest developments in these fields.

Web-Scale Training Unleashed: Deepmind Introduces OWLv2 and OWL-ST, the Game-Changing Tools for Open-Vocabulary Object Detection, Powered by Unprecedented Self-Training Techniques

Open-vocabulary object detection is a critical aspect of various real-world computer vision tasks. However, the limited availability of detection training data and the fragility...

Empowering Robots with Complex Task Performance: Meta AI Develops Visual Affordance Model Using Internet Videos of Human Behavior

Meta AI, a leading artificial intelligence (AI) research organization, has recently unveiled a groundbreaking algorithm that promises to revolutionize the field of robotics. In...

Meet Google’s New Anti-Money-Laundering AI Tool for Banks

Google Cloud, a division of Alphabet, has introduced Anti Money Laundering AI for banks. The proposed AI solution is an innovative tool driven by...

Researchers from Princeton Introduce Infinigen: A Procedural Generator of Photorealistic 3D Scenes of the Natural World

The research team from Princeton University has introduced Infinigen, a groundbreaking procedural generator for photorealistic 3D scenes, in their recent paper titled "Infinite Photorealistic...

Microsoft AI Introduces an Advanced Communication Optimization Strategy Built on ZeRO for Efficient Large Model Training, Unhindered by Batch Size or Bandwidth Limitations

Microsoft researchers introduced a new system called ZeRO++ has been developed to optimize the training of large AI models, addressing the challenges of high...

Revolutionizing Text-to-Image Synthesis: UC Berkeley Researchers Utilize Large Language Models in a Two-Stage Generation Process for Enhanced Spatial and Common Sense Reasoning

Recent advancements in text-to-image generation have emerged diffusion models that can synthesize highly realistic and diverse images. However, despite their impressive capabilities, diffusion models...

Voxel51 Open-Sources VoxelGPT: An AI Assistant That Harnesses GPT-3.5’s Power to Generate Python Code for Computer Vision Dataset Analysis

Voxel51, a prominent innovator in data-centric computer vision and machine learning software, has recently introduced a remarkable breakthrough in the field of computer vision...

Meet CapPa: DeepMind’s Innovative Image Captioning Strategy Revolutionizing Vision Pre-training and Rivaling CLIP in Scalability and Learning Performance

A recent paper titled "Image Captioners Are Scalable Vision Learners Too" presents an intriguing approach called CapPa, which aims to establish image captioning as...

6 AI-Powered Features Transforming Gmail into an Efficient Email Solution

Google's Gmail has been at the forefront of harnessing the power of artificial intelligence (AI) to enhance user experience. With a history of integrating...

WAYVE Introduces GAIA-1: A New Generative AI Model for Autonomy that Creates Realistic Driving Videos by Leveraging Video, Text, and Action Inputs

The automotive industry has long pursued the goal of autonomous driving, recognizing its potential to revolutionize transportation and enhance road safety. However, developing autonomous...

Meta AI Shatters Barriers with Voicebox: An Unprecedented Generative AI Model-Revolutionizing the Field of Speech Synthesis

Meta-AI Researchers have recently achieved a significant breakthrough in generative AI for speech. They have developed Voicebox, an innovative AI model that showcases the...

AMD Unveils Advanced CPU and AI Accelerators, Taking Aim at Nvidia’s Dominance

The American semiconductor company, Advanced Micro Devices (AMD), made significant strides in the chip-making market as it unveiled its highly anticipated CPU and AI...