Speech Recognition

Researchers From Columbia University Propose ‘Neural Voice Camouflage’: An Adversarial Attack-Based Approach That Disrupts Automatic Speech Recognition Systems In Real-Time

This Article is written as a summay by Marktechpost Staff based on the Research Paper 'REAL-TIME NEURAL VOICE CAMOUFLAGE'. All Credit For This Research...

Amazon AI Researchers Propose A New Model, Called RescoreBERT, That Trains A BERT Rescoring Model With Discriminative Objective Functions And Improves ASR Rescoring

This Article is written as a summay by Marktechpost Staff based on the Research Paper 'RESCOREBERT: DISCRIMINATIVE SPEECH RECOGNITION RESCORING WITH BERT'. All Credit...

Amazon Researchers Developed a Universal Model Integration Framework That Allows To Customize Production Voice Models in a Quick and Scalable Way

This summary article is based on Amazon research 'Scalable framework lets multiple text-to-speech models coexist' Please don't forget to join our ML Subreddit Alexa and other...

Google AI Propose An Machine Learning (ML) Based Audio Separation Approach That Can Identify Birdsongs For Better Species Classification

Birds are identifiable not only by their appearance but also by their songs. We can appreciate many things around us if we listen carefully...

Meta AI Introduces AV-HuBERT: A State-Of-The-Art Self-Supervised Framework For Understanding Speech That Learns By Both Seeing And Hearing People Speak

AI is used for various speech recognition and understanding activities, ranging from enabling smart speakers to designing aids for persons who are deaf or...

New AI Research Study On The Accuracy Of Distortion Metrics For Audio Adversarial Attacks on Machine Learning Models

With recent developments in machine learning models and their impressive performance in speech recognition tasks, human-computer interaction is becoming increasingly reliant on speech communication....

Researchers At Johns Hopkins Introduce A Machine Learning Model That Can Allow Computers To Understand Human Conversation

Human conversation is dynamic, with many exceptions and unexpected ways to express oneself. In recent years, significant progress has been made to help machine...

Meta AI Develops A Conversational Parser For On-Device Voice Assistants

A variety of devices such as computers, smart speakers, cellphones, etc., utilize conversational assistants for helping users with tasks ranging from calendar management to...

Meta/Facebook AI Releases XLS-R: A Self-Supervised Multilingual Model Trained On 128 Languages For A Variety Of Speech Tasks

Talking to one another is a natural way for people to engage. With advancing speech technology, people are now interacting with devices in day...

MIT AI Researchers Introduce ‘PARP’: A Method To Improve The Efficiency And Performance Of A Neural Network

Recent developments in machine learning have enabled automated speech-recognition technologies, such as Siri, to learn the world's uncommon languages, which lack the enormous volume...

Researchers From Seoul National University, NVIDIA and Microsoft Release ‘ACAV100M’: An Automatically Curated Video Dataset For Self-Supervised Audio-Visual Learning

Audio-visual (AV) learning is defined by delivering and applying instructional content that includes both sound and visual information. The natural relationship between visual observations...

Google AI Introduces Translatotron 2 For Robust Direct Speech-To-Speech Translation

The Natural Language Processing (NLP) domain is experiencing remarkable growth in many areas, including search engines, machine translation, chatbots, home assistants and many more....

Recent articles

Check Out Our Super Cool AI Research Newsletter While It's Still Free

X