0.75162 | Mosaic | Florida International University | US | Improving Speech-based Emotion Recognition with Contextual Utterance Analysis and LLMs | Enshi Zhang |
0.75059 | Edinburgh | University of Edinburgh | UK | Context and System Fusion in Post-ASR Emotion Recognition with Large Language Models | Pavel Stepachev and Pinzhen Chen |
0.64522 | SLAM | Academia Sinica | TW | Exemplar-Based Methods for Mandarin Electrolaryngeal Speech Voice Conversion | Chia-Hua Wu |
0.58809 | TeamBlack | Columbia University | US | Post-ASR LLM-Based Speech Emotion Recognition: A fight between top LLMs | Sounak Ray |
0.5518 | GPT-3.5 Turbo | Baseline | Global | Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition | Official |