Languages, tasks and validity of audio features

rutgersnpl · November 19, 2024, 4:10pm

Hi, We were trying to convert the audio into language transcripts and we found out that there are several different languages. Are all the pre-generated acoustic features been validated in all the languages included here?

Also are the tasks in the training audio files just cookie theft or are there are a list of different tasks the participants are doing?

hannahmoro · November 21, 2024, 4:21pm

Thanks for writing in! We don’t have additional information to share about the features beyond what we have in the problem description, and similarly regarding the tasks participants were completing.

rutgersnpl · November 25, 2024, 8:30pm

We also noticed that there are several languages in this dataset, is it possible to train and test only on English language data(both linguistic and acoustic) and submit those predictions?

hannahmoro · November 26, 2024, 4:23pm

Hello! Yes, you can explore any modeling strategy that is reproducible and adheres to the challenge rules.

Topic		Replies	Views
Task-specific features PREPARE Challenge	1	86	December 5, 2024
Are combination and linguistic models are acceptable? PREPARE Challenge	1	115	November 15, 2024
Clarification about Model Features PREPARE Challenge	0	107	December 5, 2024
Use of Adult Speech Data for Pretraining and Fine-Tuning Children’s Speech Recognition Challenge	3	162	March 23, 2026
Training data - incredibly corrupt Children’s Speech Recognition Challenge	1	82	April 7, 2026

Languages, tasks and validity of audio features

Related topics