실습 - CNN으로 MNIST분류

대부분은 15강에서 다룬 패키지와 유사함
여기서는 cnn에 집어넣을 블록과 출력(이미지를 분류하는 클래스)클래스를 어떻게 정의했는지 보는게 중요
(입력사이즈 + 2*패딩사이즈 - (커널사이즈-1)-1)/스트라이드 +1

재활용할 블록

class ConvolutionalBlock(nn.Module) :
    def __init__(self, in_channel, out_channel) :
        self.in_channel = in_channel
        self.out_channel = out_channel
        
        super().__init__()
        
        self.layer = nn.Sequential(
            nn.Conv2d(in_channel, out_channel, (3,3), padding = 1),
            nn.ReLU(),
            nn.BatchNorm2d(out_channel),
            nn.Conv2d(out_channel, out_channel, (3,3), stride = 2, padding =2),
            nn.ReLU(),
            nn.BatchNorm2d(out_channel)
            
        )
        def forward() :
            y = self.layer(x)
            
            return y

코드 설명

블록을 재활용한 모습

(입력사이즈 + 2*패딩사이즈 - (커널사이즈-1)-1)/스트라이드 +1
28+2x1-(3-1)-1 / 2 +1 → 14.5 → 14
14 +2 -2-1 / 2 +1 → 7.5 → 7

class ConvolutionalClassifier(nn.Module) :

    def __init__(self, output_size) :
        self.output_size = output_size
        
        super().__init__()
        
        self.block = nn.Sequential( # 입력사이즈 (n,1,28,28)
            ConvolutionalBlock(1,32), # 28+2x1-(3-1)-1 / 2 +1 → 14.5 → 14 (n,32,14,14)
            ConvolutionalBlock(32,64), # (n,64,7,7)
            ConvolutionalBlock(64,128), # (n, 128, 4,4)
            ConvolutionalBlock(128,256), #(n,256,2,2)
            ConvolutionalBlock(256,512), # (n, 512, 1, 1)
        )
        self.layers = nn.Sequential(
            nn.Linear(512,50),
            nn.ReLU(),
            nn.BatchNorm1d(50), 
            nn.Linear(50,output_size),
            nn.LogSoftmax(dim = -1)
        )
        def forward() :
            assert x.dim() >2
            if x.dim() == 3 :
                x = x.view(-1,1,x.size(-2), x.size(-1))
                z = self.block(x)
                y = self.layers(z.squeeze())
                
                return y

코드 설명

train.py 변경하기

링크타고 들어가면 원래 코드가 어떤지 알 수 있음

라이브러리 변경

mnist_classifier은 전부 15강에서 만들어 놓은 모듈 py형태로 저장해놓으면 끌어올 수 있다.