Pingchuan Ma

Training AI to read your lips — in multiple languages

While most speech recognition tools analyze audio alone, researchers have also made progress in developing visual speech recognition (VSR) models, which rely on visual input to identify what a speaker is saying.