Efficient Convolutaionl Architectures

Shuffle Net is a good paper.

Have several forumlisms as to what makes an efficient CNN architecture.

Less MAX (memory ON access) time
Use same channle in and out, as it reduced MAX
Less group convolutions as it fragments the memory of GPU
Inplace operations are hecy (almost 20% more overhead)

TODO for project

Include Shuffle Net in project (pretrained)
Include Mobile Net in project (pretrained)
Include DetNet in project, better backbone ?

Now with these 3 backbones

test best speed accuracy tradeoff, it shoulkd bbe fast like shuffle net, but with more receptive field (like det net ?)
For accuracy, port mscoco evaluation. Dont code eval yourself.
For speed, code average of 100 forward passes, for 1 single image.

For optimsation train with adam,

Should you freeze backbone gradients first ?, maybe you should. Add that

For segmentation

Take trained object detection, freese backbone, train for lane markings task.
Use a lightweight semantic segmentation thingy.
See if it trains. It should at least overfit on 1 example first.