Draft: Fixes 1x3 and 3x1 so 'peel' does not do too much of the border.
Two changes: i) Changes the slow path conv2d so that it can handle pad=(0,0), and ii) fixes conv 1x3 and 3x1 so that 'peel' only does the part of the border that it needs.
Two changes: i) Changes the slow path conv2d so that it can handle pad=(0,0), and ii) fixes conv 1x3 and 3x1 so that 'peel' only does the part of the border that it needs.