
Alex Krizhevsky, Ilya Sutskever, Geoffrey E Hinton
Abstract: We trained a large, deep convolutional neural network to classify the 1.2 million high-resolution images in the ImageNet LSVRC-2010 contest into the 1000 different classes. On the test data, we achieved top-1 and top-5 error rates of 37.5% and 17.0%, which is considerably better than the previous state-of-the-art. The neural network, which has 60 million parameters and 650,000 neurons, consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax. To make training faster, we used non-saturating neurons and a very efficient GPU implementation of the convolution operation. To reduce overfitting in the fully-connected layers we employed a recently-developed regularization method called “dropout” that proved to be very effective. We also entered a variant of this model in the ILSVRC-2012 competition and achieved a winning top-5 test error rate of 15.3%, compared to 26.2% achieved by the second-best entry.
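As a rough sanity check on the "60 million parameters" figure, the sketch below counts weights and biases for a network with five convolutional layers and three fully-connected layers ending in a 1000-way output. The abstract does not state the kernel sizes, channel counts, or layer widths, so every number in `CONV_LAYERS` and `FC_LAYERS` is an assumption drawn from the commonly cited description in the full paper, including the halved input channels for layers split across two GPUs. The names `CONV_LAYERS`, `FC_LAYERS`, and `count_parameters` are hypothetical and exist only for this illustration.

```python
# Layer shapes are ASSUMPTIONS (commonly cited values from the full paper);
# none of them appear in the abstract itself.

# Each conv entry: (name, kernel_height, kernel_width,
#                   input_channels_seen_by_each_kernel, number_of_kernels)
CONV_LAYERS = [
    ("conv1", 11, 11, 3, 96),
    ("conv2", 5, 5, 48, 256),    # assumed split across two GPUs: 96 / 2 inputs
    ("conv3", 3, 3, 256, 384),   # assumed to see all 256 input channels
    ("conv4", 3, 3, 192, 384),   # assumed split: 384 / 2 inputs
    ("conv5", 3, 3, 192, 256),   # assumed split: 384 / 2 inputs
]

# Each fully-connected entry: (name, input_features, output_features)
FC_LAYERS = [
    ("fc6", 6 * 6 * 256, 4096),  # assumed 6x6x256 feature map after pooling
    ("fc7", 4096, 4096),
    ("fc8", 4096, 1000),         # 1000-way output feeding the softmax
]


def count_parameters():
    """Return a dict mapping layer name to its weight + bias count."""
    counts = {}
    for name, kh, kw, in_channels, kernels in CONV_LAYERS:
        # One bias per kernel.
        counts[name] = kh * kw * in_channels * kernels + kernels
    for name, fan_in, fan_out in FC_LAYERS:
        # One bias per output unit.
        counts[name] = fan_in * fan_out + fan_out
    return counts


if __name__ == "__main__":
    counts = count_parameters()
    for name, n in counts.items():
        print(f"{name}: {n:,}")
    print(f"total: {sum(counts.values()):,}")  # total: 60,965,224
```

Under these assumed shapes the total is about 61 million, in line with the abstract's figure, and roughly 96% of the parameters sit in the three fully-connected layers, which fits the abstract's focus on reducing overfitting there.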