Training on Reference images

gradient.machine · August 4, 2021, 4:27pm

I’m a bit confused that can I train on the reference set ? Since we need to compute the similarity between the reference and query images pair, it is necessary to learn the embeddings of the reference set.

mike-dd · August 16, 2021, 5:04pm

Hi gradient.machine. Thanks for the question.

Use of the reference images for training is discouraged but not prohibited. We would instead strongly encourage you to use the provided training set for training purposes. Check out the “Getting started” blog post for some more information about how to use the training set.

Hope this helps. Good luck!

separate · August 17, 2021, 2:45am

I don’t understand.
The rule says that “Submitted individual predictions may not take into account more than one query image or more than one reference image at a time.”
Does the model trained with reference images is able to take into account only one reference image at each prediction? The model already saw all reference images and learned from them.

mike-dd · August 17, 2021, 12:50pm

Hi separate! Thanks for the question.

The key point here is that predictions need to be independent of one another in the sense that they are not affected by the presence of other images. See the following section of the Rules:

The goal of the competition is to reflect real world settings where new images are continuously added to the reference set, or old images get removed from it. Therefore, the used approach should have an equivalent result if all of the reference images are provided at once, or if they are provided in chunks over time. We restrict the focus here, and only allow approaches that treat each new image of the reference set independently and without any interaction with other reference images.

You should also check out the “Acceptable methods” section (6.5) of the competition paper, which covers the same topic.

Please let us know if anything remains unclear.

wenhaowang · August 17, 2021, 1:33pm

I do NOT understand why you argue that training on the reference dataset is not prohibited. In the early reply, Is legal for training on the reference dataset? and the rule Competition: Facebook AI Image Similarity Challenge: Matching Track have already pointed out "Use of augmented reference images for any other reason, including model training, is prohibited ".

mike-dd · August 17, 2021, 6:34pm

Hi wenhaowang.

Note in the sentence you have quoted that augmenting reference images for training purposes is prohibited. From the Rules section you can see that we define augmenting as applying a transformation to an image to generate a new image, such as the manner in which the query set images were derived from the reference set images.

Applying these transformations to the reference images and then training is prohibited. Training using the reference images themselves without augmentation is not prohibited; but again, participants are encouraged to use the training set for these purposes.

wenhaowang · August 18, 2021, 4:04am

Ok, thanks for your reply.

wenhaowang · August 18, 2021, 4:09am

By the way, is the resize operation, such as resizing an image to 512x512, is regarded as a transformation?

wenhaowang · August 19, 2021, 5:00am

Or, whether RandomCrop is regarded as augmentation? Because for training, images with different shapes are difficult to process.

mike-dd · August 24, 2021, 1:49pm

Thanks for the question, wenhaowang.

Resizing images once as part of preprocessing to get consistent image sizes is allowed, since this is a basic requirement for many CV techniques.

Any form of augmentation is allowed during inference. For example, you could run inference on multiple augmentations of a single image and aggregate the results for that image independently of other images.

Augmentation (including resizing or cropping) for training purposes is only allowed on the training set, not the reference or query sets.

I hope this helps.

From the competition rules on augmenting:

“Augmenting” images refers to applying a transformation to an image to generate a new image, such as the manner in which the query set images were derived from the reference set images.
Augmenting reference images is permitted in the inference process of generating embeddings and matching scores, so long as each reference image is used independently without any interaction with other reference images. Use of augmented reference images for any other reason, including model training, is prohibited.

wenhaowang · August 25, 2021, 4:35pm

Thanks for your reply. It helps.

coin · August 29, 2021, 4:38am

Hi,

Are we allowed to normalized reference images for training?

For example, we first divide the image by 255, and then submit mean and divide std of each channel. If we want to train on reference dataset, are we allowed to do this normalization please?

wenhaowang · August 31, 2021, 6:48am

“Phase III: verification. (November) The organizers verify that the code submitted in advance reproduces the reported results. The participants are expected to reply to requests from the organizers to assist in this validation.
The results will be announced at the NeurIPS competition workshop.”

coin · September 1, 2021, 12:25am

But the organizer cannot catch collaborating between teams. For example, there are three people working together, but they registered the competition sperately as three teams. Thus they would have 3x submitting chances. The final rank would be inaccurate due to this. If they achieved rank-3 in phase-II, then rank-3 to rank-6 are occupied, the original rank-4 would be rank-7. Is this behavior allowed?

wenhaowang · September 1, 2021, 4:07am

I suggest the organizer @mike-dd check the duplication of the final report and codes of the competition. I think in this way, the mentioned behavior could be avoided.

separate · September 4, 2021, 9:52am

Hi @mike-dd,

Resizing images once as part of preprocessing to get consistent image sizes is allowed, since this is a basic requirement for many CV techniques.

Augmentation (including resizing or cropping) for training purposes is only allowed on the training set, not the reference or query sets.

Above two statements seem to contradict each other. Just to be clear, is ‘Resizing images once on the reference sets for training purposes’ allowed?

And another question is about the statement from the paper,

This means that even if the dataset had a single query image and a single reference image, the
score of the image pair would be the same.

For one query-reference image pair A-B, one model that trained on all reference images and another model that trained only on one reference image(B) will have different score, which seems to break the above statement.
So, can I understand that above statement(same score for different dataset size) is applied in ‘inference’ stage? And in ‘training’ stage, we are just okay to train using all reference images?

mike-dd · September 7, 2021, 2:49pm

Hi separate,

Yes to all of your questions.

Resizing images once on the reference sets for training purposes is allowed.

The requirement that an image score be unchanged by the dataset size is applicable at the inference step. It is ok to train on all reference images, but as you know participants are encouraged to use the training set for training purposes.

I hope this helps.

separate · September 7, 2021, 4:30pm

Thanks for answering repeated questions! Really appreciate it.

wenhaowang · September 7, 2021, 4:54pm

Sorry, I cannot understand. @mike-dd

I think they are incompatible.

mike-dd · September 30, 2021, 3:41pm

Hi wenhaowang,

Resizing images, if applied once to an image as part of preprocessing, is not considered augmentation. Augmentation for training purposes is only allowed on the training set, not the reference or query sets.

From the competition rules:

“Augmenting” images refers to applying a transformation to an image to generate a new image, such as the manner in which the query set images were derived from the reference set images.

Again, participants are encouraged to use the training set for training purposes, and to use the ground truth labels as a test/validation set.

Topic		Replies	Views
Is legal for training on the reference dataset? Image Similarity Challenge	1	534	July 29, 2021
Why is the use of augmented reference images prohibited? Image Similarity Challenge	0	337	October 6, 2021
Is augmenting reference images for local validation allowed? Image Similarity Challenge	7	439	October 8, 2021
ImageNet pretraining & Augmented reference set during inference Image Similarity Challenge	4	552	September 2, 2021
Can the ground-truth pairs be used for training? Image Similarity Challenge	9	955	September 10, 2021

Training on Reference images

Related Topics