Questions about tutorial code

vlucet · August 24, 2023, 5:44pm

Hi there, I really enjoyed the tutorial, it provided a very good starting point. I was wondering of anyone could enlighten me on some of the choices thatw ere made concerning adapting the resnet50 model for our dataset.

In particular, we choose to replace the final fc layer with the following layers:

model.fc = nn.Sequential(
    nn.Linear(2048, 100),  # dense layer takes a 2048-dim input and outputs 100-dim
    nn.ReLU(inplace=True),  # ReLU activation introduces non-linearity
    nn.Dropout(0.1),  # common technique to mitigate overfitting
    nn.Linear(
        100, 8
    ),  # final dense layer outputs 8-dim corresponding to our target classes
)

Why choose 100 in the first layer? why not 1000 or 50?
How common is it to add a ReLU and Dropout layer when adapting something like a resnet?
What if we were to freeze the weights for transfer learning, would we still want to have those new layers at the end of the network?
What about if I wanted to use a different model, would it also require me to add those last layers for transfer learning?

Topic		Replies	Views
Pretrained models Conser-vision Practice Area	1	624	June 22, 2022
My solution (2nd place so far) Hakuna Ma-data	3	1074	February 5, 2020
Transfer learning: What is ResNet learning? Conser-vision Practice Area	1	519	February 13, 2024
Solutions postings Clog Loss: Advance Alzheimer’s Research	4	842	August 12, 2020
My solution and code Hakuna Ma-data	2	625	February 5, 2020

Questions about tutorial code

Related topics