Score Normalization

greenfieldvision · September 11, 2021, 8:18pm

Hi,

I’m late to the party, but what is the official position regarding score normalization for track 2? The score normalization script (https://github.com/facebookresearch/isc2021/blob/main/scripts/score_normalization.py) implements it but produces output for track 1.

The paper states “We expect that techniques that explicitly optimize the estimation will make that normalization unnecessary”, but it seems possible to implement normalization as a postprocessing step for track 2, following the example of the score normalization script. Here’s a sketch:

The distance between a query and a reference image is
d_qr = sqrt(sum((x_qi - x_ri)^2))
If the descriptors have norm 1,
d_qr = sqrt(2 - 2x_q.x_r)
If we add 1 new dimension and set it to the per query offset for query vectors and to 0 for reference vectors,
d_qr = sqrt(2 - 2x_q.x_r + x_qn^2) = sqrt(2 - 2*(x_q.x_r - x_qn^2/2))

It would be good to know if implicit score normalization is allowed for track 2 and if so, to extend the script functionality to produce output for track 2

mike-dd · September 20, 2021, 2:34pm

Hi greenfieldvision. Apologies for the delay in responding.

I’m not sure I totally understand the question, so feel free to clarify if what follows does not help.

You will want to keep in mind the competition rules related to treating each reference image independently. See this section of the Rules if you are not already familiar with it:

…the used approach should have an equivalent result if all of the reference images are provided at once, or if they are provided in chunks over time. We restrict the focus here, and only allow approaches that treat each new image of the reference set independently and without any interaction with other reference images.

Any normalization you do should therefore depend only on each individual vector. The training set may be a useful resource if you need to compute fixed parameters for any normalization steps.

I hope this helps, but I may not be fully understanding your question. Feel free to clarify if needed.

greenfieldvision · September 20, 2021, 4:10pm

Thanks dd-mike. The question was not about whether other query or other reference images can be used when computing embeddings, but about normalization for track 2. If you look at the score normalization script, it uses training images to do normalization, as per the equation 2 in the paper (https://arxiv.org/pdf/2106.09672.pdf). However, this script only produces track 1 output and I was wondering if it’s acceptable to extend this functionality to track 2. I will take this

to mean yes.

Topic		Replies	Views
Question to orgs about the metric Image Similarity Challenge	2	468	August 4, 2021
Two question about track2 Image Similarity Challenge	12	458	September 1, 2021
What is the use of Similarity normalization Image Similarity Challenge	6	500	October 19, 2021
L2 norm in feature descriptor Video Similarity Challenge	2	240	March 4, 2023
Code execution submissions and eligibility for Phase 2 Video Similarity Challenge	3	302	February 20, 2023

Score Normalization

Related topics