Shuffle Net is a good paper.
Have several forumlisms as to what makes an efficient CNN architecture.
- Less MAX (memory ON access) time
- Use same channle in and out, as it reduced MAX
- Less group convolutions as it fragments the memory of GPU
- Inplace operations are hecy (almost 20% more overhead)
- Include Shuffle Net in project (pretrained)
- Include Mobile Net in project (pretrained)
- Include DetNet in project, better backbone ?
Now with these 3 backbones
- test best speed accuracy tradeoff, it shoulkd bbe fast like shuffle net, but with more receptive field (like det net ?)
- For accuracy, port mscoco evaluation. Dont code eval yourself.
- For speed, code average of 100 forward passes, for 1 single image.
For optimsation train with adam,
- Should you freeze backbone gradients first ?, maybe you should. Add that
- Take trained object detection, freese backbone, train for lane markings task.
- Use a lightweight semantic segmentation thingy.
- See if it trains. It should at least overfit on 1 example first.