I believe there are two options to aggregate the jaccard index:

- Compute the jaccard index for each image and compute the average index for all the images.
- Compute an aggregated jaccard index using the area of all the images together

From my point of view, the first option has more sense, but I just want to check which option is used in the evaluation.