一、官方文檔介紹

nn.Conv2d：對由多個輸入平面（多通道）組成的輸入信號進行二維卷積

二、torch.nn.Conv2d()函數詳解

參數詳解
torch.nn.Conv2d(in_channels, out_channels, kernel_size, stride=1, padding=0, dilation=1, groups=1, bias=True)

參數	參數類型
`in_channels`	int	Number of channels in the input image	輸入圖像通道數
`out_channels`	int	Number of channels produced by the convolution	卷積產生的通道數
`kernel_size`	(int or tuple)	Size of the convolving kernel	卷積核尺寸，可以設為1個int型數或者一個(int, int)型的元組。例如(2,3)是高2寬3卷積核
`stride`	(int or tuple, optional)	Stride of the convolution. Default: 1	卷積步長，默認為1。可以設為1個int型數或者一個(int, int)型的元組。
`padding`	(int or tuple, optional)	Zero-padding added to both sides of the input. Default: 0	填充操作，控制`padding_mode`的數目。
`padding_mode`	(string, optional)	‘zeros’, ‘reflect’, ‘replicate’ or ‘circular’. Default: ‘zeros’	`padding`模式，默認為Zero-padding 。
`dilation`	(int or tuple, optional)	Spacing between kernel elements. Default: 1	擴張操作：控制kernel點（卷積核點）的間距，默認值:1。
`groups`	(int, optional)	Number of blocked connections from input channels to output channels. Default: 1	group參數的作用是控制分組卷積，默認不分組，為1組。
`bias`	(bool, optional)	If True, adds a learnable bias to the output. Default: True	為真，則在輸出中添加一個可學習的偏差。默認：True。

參數dilation——擴張卷積（也叫空洞卷積）

dilation操作動圖演示如下：
Dilated Convolution with a 3 x 3 kernel and dilation rate 2
擴張卷積核為3×3，擴張率為2

參數groups——分組卷積

Group Convolution顧名思義，則是對輸入feature map進行分組，然后每組分別卷積。假設輸入feature map的尺寸仍為C ∗ H ∗ W ，輸出feature map的數量為N個，如果設定要分成G個groups，則每組的輸入feature map數量為 $\frac{C}{G}GC ，每組的輸出feature map數量為N G \frac{N}{G}GN，每個卷積核的尺寸為C G ∗ K ∗ K \frac{C}{G}∗K∗KGC∗K∗K，卷積核的總數仍為N個，每組的卷積核數量為N G \frac{N}{G}GN ，卷積核只與其同組的輸入map進行卷積，卷積核的總參數量為N ∗ C G ∗ K ∗ K N∗\frac{C}{G}∗K∗KN∗GC∗K∗K，可見，總參數量減少為原來的1 G \frac{1}{G}G1，其連接方式如下圖右所示，group1輸出map數為2，有2個卷積核，每個卷積核的channel數為4，與group1的輸入map的channel數相同，卷積核只與同組的輸入map卷積，而不與其他組的輸入map卷積。$

三、代碼實例

import torch

x = torch.randn(3,1,5,4)
print(x)

conv = torch.nn.Conv2d(1,4,(2,3))
res = conv(x)

print(res.shape)    # torch.Size([3, 4, 4, 2])

輸入：x[ batch_size, channels, height_1, width_1 ]
batch_size，一個batch中樣本的個數 3
channels，通道數，也就是當前層的深度 1
height_1， 圖片的高 5
width_1， 圖片的寬 4

卷積操作：Conv2d[ channels, output, height_2, width_2 ]
channels，通道數，和上面保持一致，也就是當前層的深度 1
output ，輸出的深度 4【需要4個filter】
height_2，卷積核的高 2
width_2，卷積核的寬 3

輸出：res[ batch_size,output, height_3, width_3 ]
batch_size,，一個batch中樣例的個數，同上 3
output， 輸出的深度 4
height_3， 卷積結果的高度 4
width_3，卷積結果的寬度 2

一個樣本卷積示例：

親測有效！！！

免責聲明！

本站轉載的文章為個人學習借鑒使用，本站對版權不負任何法律責任。如果侵犯了您的隱私權益，請聯系本站郵箱yoyou2525@163.com刪除。

猜您在找 torch.nn.Conv2d()函數詳解關於torch.nn.Conv2d的筆記 pytorch之nn.Conv1d詳解 pytorch之nn.Conv1d詳解 Pytorch中nn.Conv2d的用法 nn.Conv2d中padding詳解 Pytorch 中nn.Conv2d的參數用法 channel含義詳解 pytorch 中 torch.nn.Linear() 詳解 Pytorch——torch.nn.Sequential()詳解 nn.Conv3d