After understanding the setting of this competition, I still don’t have any idea how to tackle the problem. From my point of view, it feels impossible to detect a particular animal specified in the CSV because we don’t know when or where the animal appears in the video.
For a problem like this, what is the basic approach? In other words, how can I frame the problem in a tractable way?
I joined the N+1 N+2 fish competition last month. It was my first time using an object detection network, and implementing it was hard. But it was OK because I could see what the loss should be and how I could use models to minimize it.
Any hints, comments or references would be appreciated. Thanks.