Technology

Meet Notus: Enhancing Language Models with Data-Driven Fine-Tuning

In the pursuit of refining language models to align more closely with user intent and elevate response quality, a new iteration emerges – Notus....

Researchers from MIT and FAIR Meta Unveil RCG (Representation-Conditioned Image Generation): A Groundbreaking AI Framework in Class-Unconditional Image Generation

How can high-quality images be generated without relying on human annotations? This paper from MIT CSAIL and FAIR Meta has addressed the challenge of...

Apple AI Research Releases MLX: An Efficient Machine Learning Framework Specifically Designed for Apple Silicon

Over the past few years, there have been significant advancements in Machine Learning (ML), with numerous frameworks and libraries developed to simplify our tasks....

Google AI Research Proposes TRICE: A New Machine Learning Algorithm for Tuning LLMs to be Better at Solving Question-Answering Tasks Using Chain-of-Thought (CoT) Prompting

The team of researchers from Google developed a new fine-tuning strategy to address the challenge of generating correct answers using LLMs. The strategy, called...

Meet MVHumanNet: A Large-Scale Dataset that Comprises Multi-View Human Action Sequences of 4,500 Human Identities

Researchers from FNii CUHKSZ, SSE CUHKSZ introduce MVHumanNet, a vast dataset for multi-view human action sequences with extensive annotations, including human masks, camera parameters,...

This AI Research Introduces a Novel Vision-Language Model (‘Dolphins’) Architected to Imbibe Human-like Abilities as a Conversational Driving Assistant

A team of researchers from the University of Wisconsin-Madison, NVIDIA, the University of Michigan, and Stanford University have developed a new vision-language model (VLM)...

Meet PyPose: A PyTorch-based Robotics-Oriented Library that Provides a Set of Tools and Algorithms for Connecting Deep Learning with Physics-based Optimization

Deep learning is finding its utility in all aspects of life. Its applications span diverse fields, from image and speech recognition to medical diagnosis...

How can the Effectiveness of Vision Transformers be Leveraged in Diffusion-based Generative Learning? This Paper from NVIDIA Introduces a Novel Artificial Intelligence Model Called...

How can the effectiveness of vision transformers be leveraged in diffusion-based generative learning? This paper from NVIDIA introduces a novel model called Diffusion Vision...

Researchers from the University of Washington and Google Unveil a Breakthrough in Image Scaling: A Groundbreaking Text-to-Image Model for Extreme Semantic Zooms and Consistent...

New text-to-image models have made tremendous strides recently, opening the door to revolutionary applications like picture creation from a single text input; in contrast...

Is Real-Time 3D Rendering on Mobile Devices Now Possible? Researchers from China Introduced VideoRF: An AI Approach to Enable Real-Time Streaming and Rendering of...

Neural Radiance Fields (NeRF) represent an innovative 3D scene depiction technique in computer graphics and computer vision. Leveraging neural networks, this method is a...

Researchers from Tongji University and Microsoft Unveil STLVQE: A Groundbreaking AI Approach to Online Video Quality Enhancement

With an increase in the use of the internet, the demand for high-quality and real-time video content and seamless experiences in applications like video...

Researchers from ETH Zürich and Max Planck Introduce ‘HOLD’: A Groundbreaking Category-Agnostic AI Method for 3D Hand-Object Reconstruction from Monocular Videos

In a groundbreaking development, researchers from ETH Zürich and the Max Planck Institute for Intelligent Systems have introduced HOLD, an innovative method designed to...

Recent articles