Speaker
Description
Urban Air Mobility (UAM) is a transportation system that uses electric vertical takeoff and landing (eVTOL) aircraft to carry people or cargo within urban areas, helping to reduce congestion on the ground. These aircraft operate in a busy urban environment and may be semi- or fully autonomous. In this setting, communication between aircraft and ground needs to be automated to reduce communication errors and improve situational awareness for both pilots and ground crews.
South Korea plans to commercialize UAM by next year with the goal of reducing congestion in its capital, Seoul. We worked with Korea's second-largest corporation, SK Telecom, to combine GStreamer's excellent WebRTC support with commodity ML speech-to-text (STT), text-to-speech (TTS) and large language model (LLM) packages to create an open-source automated communication workflow for UAM.
I will discuss how we implemented and tuned this system, and also touch on one particular problem we faced: how to apply STT to a mixture of Korean and English speech.
Duration of the talk | |
---|---|
Speaker bio | Aaron is a mathematician and developer who enjoys the simple pleasure of squeezing every last ounce of performance out of both hardware and software. He works for Collabora and is based in Toronto, Canada. |