Are combination and linguistic models are acceptable?

maryamzolnoori · November 15, 2024, 1:44pm

Dear Organizer,
We currently have a pipeline designed for cognitive impairment detection that integrates acoustic transformer models with handcrafted acoustic features. In addition, the pipeline incorporates linguistic models—both handcrafted and transformer-based—to analyze syntactic and semantic aspects of language as well as speech fluency.

Could you please clarify whether you are specifically seeking a pipeline built exclusively on acoustic models, or if a pipeline that integrates linguistic models with acoustic models would also be acceptable? Thank you.

hannahmoro · November 15, 2024, 6:43pm

We encourage solvers to explore all possible features, including linguistic and semantic ones, as long as those features are derived from the acoustic data. One note to be aware of is that features should be generated automatically. So you could use a pretrained model to get text from the audio, but you should not be manually transcribing audio to text.

Does that answer your question?

Topic		Replies	Views
Languages, tasks and validity of audio features PREPARE Challenge	3	117	November 26, 2024
Clarification about Model Features PREPARE Challenge	0	107	December 5, 2024
Now that we are done, who wants to talk about what worked? Children’s Speech Recognition Challenge	10	177	April 14, 2026
Qwen_asr is not available Children’s Speech Recognition Challenge	3	266	February 20, 2026
Task-specific features PREPARE Challenge	1	86	December 5, 2024

Are combination and linguistic models are acceptable?

Related topics