We're in a hospital dealing with an outbreak. For the first 5 days we have 1 patient on Monday, 2 on Tuesday, etc:
Fortunately, we know how to treat them: 3 doses on day 1, then 2, then 1:
And after 3 days they're cured.
How many doses do we need on each day?

| Input | Kernel | Stride | Padding | Output |
|---|---|---|---|---|
| 1 | same | |||
| 2 | same | |||
| 1 | valid | ??? |
While convolution only happens in 2D, the kernel can be thought of as a 3D volume - there's a separate trainable kernel for each channel
Pooling layers are used to reduce the dimensionality of the feature maps (aka downsampling) by taking the maximum or average of a region







"If I have seen further, it is by standing on the shoulder of giants"
-- Isaac Newton
Garbage in, garbage out
We can artificially increase diversity with data augmentation:
AutoAugment does a bunch of this automatically