New Tutorial: Finetuning Parakeet with NeMo

cszc · March 3, 2026, 8:50pm

Hi Solvers,

We’ve recently published a reference solution for the Children’s Speech Recognition Challenge: Word Track.

In the in-depth blog post and companion repo, we walk through fine-tuning NVIDIA’s Parakeet ASR model with NeMo on the competition data. The resulting model scores a word error rate (WER) of 0.2370 on the public leaderboard.
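For anyone new to the metric, here is a minimal sketch of how WER is typically computed: the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. The function name `word_error_rate` and the lowercase-plus-whitespace tokenization are assumptions for illustration. The competition's scoring may normalize text differently, so treat this as a way to build intuition rather than a replica of the leaderboard metric.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Assumption: text is lowercased and split on whitespace; the official
    competition metric may apply different normalization.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] = edit distance between the reference words seen so far
    # (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # aligning i reference words with an empty hypothesis: i deletions
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# One substitution ("sat" -> "sit") and one deletion ("the") over 6 reference words.
score = word_error_rate("the cat sat on the mat", "the cat sit on mat")
print(f"{score:.4f}")
```

Read this way, a WER of 0.2370 corresponds to roughly 24 word errors per 100 reference words. Leaderboard scores are usually aggregated over the whole test set rather than averaged per utterance, which is also an assumption here.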

In the tutorial, we:

- Demonstrate how to load and explore the data.
- Provide a basic framework for building a model.
- Demonstrate how to package your work correctly for submission.

Whether you’re just getting started or looking to benchmark your approach, this should give you a strong foundation to build on. Read the full post here:
https://drivendata.co/blog/child-asr-word-benchmark

Happy modeling!
