Index


Overall Architecture and Execution

In this part, it’s not that different from a regular Neural Network structure. The biggest difference is that the input is a Feature Map (output) from Level 1 - Patch.

I’m going to leave out majority of the code snippet in this post because it’s pretty much the same as the Level 1 - Patch network which is following the architecture shown above. The only criterion to be careful here is making sure the Feature Map can be fed to the network properly. However, due to overfitting problem in this Level, I’ve implemented additional dropout in every batch.

self.classifier = nn.Sequential(
    nn.Linear(1 * 16 * 16, 128),
    nn.ReLU(),
    nn.Dropout(0.5),

    nn.Linear(128, 128),
    nn.ReLU(),
    nn.Dropout(0.5),

    nn.Linear(128, 64),
    nn.ReLU(),
    nn.Dropout(0.5),

    nn.Linear(64, 4),
)