hey @jsensio.
The reason for long queues might be that most participants use the code provided in the benchmark notebook, which does inference on CPU, instead of GPU. I just wrote about the issues:
Please make sure that your code leverages GPU for inference.