hey @harish5p & @leoaaa.
The reason for long queues might be that most participants use the code provided in the benchmark notebook, which does inference on CPU, instead of GPU. I just wrote about the issues:
Please make sure that your code leverages GPU for inference.