README.md: 4 additions & 4 deletions
@@ -2,8 +2,8 @@
[Torchvision model zoo](https://github.com/pytorch/vision/tree/master/torchvision/models) provides number of implementations of various state-of-the-art architectures, however, most of them are defined and implemented for ImageNet.
Usually it is very straightforward to use them on other datasets, but sometimes these models need manual setup.
-Unfortunately, none of the pytorch repositories with ResNets on CIFAR10 provides an implementation as described in the [original paper](https://arxiv.org/abs/1512.03385). If you just use the torchvision's models on CIFAR10 you'll get the model **that differs in number of layers and parameters**. This is unacceptable if you want to directly compare ResNet-s on CIFAR10 with original paper.
-The purpose of this repo is to provide a valid pytorch implementation of ResNet-s for CIFAR10 as described in original paper. Following models are provided:
+Unfortunately, none of the pytorch repositories with ResNets on CIFAR10 provides an implementation as described in the [original paper](https://arxiv.org/abs/1512.03385). If you just use the torchvision's models on CIFAR10 you'll get the model **that differs in number of layers and parameters**. This is unacceptable if you want to directly compare ResNet-s on CIFAR10 with the original paper.
+The purpose of this repo is to provide a valid pytorch implementation of ResNet-s for CIFAR10 as described in the original paper. Following models are provided:
| Name | # layers | # params| Test err(paper) | Test err(this impl.)|
-The implementation matches description in original paper, with comparable or better test error.
+The implementation matches description of the original paper, with comparable or better test error.
## How to run?
```bash
@@ -24,7 +24,7 @@ chmod +x run.sh && ./run.sh
```
## Details of training
-This implementation follows paper in straightforward manner with some caveats: **First**, original paper uses 45k/5k train/validation split on train data, and selects the best performing model based on the performance on the validation set. This implementation does not do any validation testing, so if you need to compare your results on ResNet head-to-head to orginal paper keep this in mind. **Second**, if you want to train ResNet1202 keep in mind that you need 16GB memory on GPU.
+This implementation follows the paper in straightforward manner with some caveats: **First**, training in the paper uses 45k/5k train/validation split on the train data, and selects the best performing model based on the performance on the validation set. This implementation does not do any validation testing, so if you need to compare your results on ResNet head-to-head to the orginal paper keep this in mind. **Second**, if you want to train ResNet1202 keep in mind that you need 16GB memory on GPU.
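
For readers who want to reproduce the 45k/5k protocol mentioned in the first caveat, the split can be sketched as an index partition over the 50,000 CIFAR10 training images. This is only an illustration and not part of this repository: the helper name `split_train_val`, the seed, and the choice of a seeded random permutation are assumptions, since the README does not say how the validation images should be picked.

```python
import random


def split_train_val(num_samples=50_000, num_val=5_000, seed=0):
    """Return (train_indices, val_indices) for a hold-out split.

    Hypothetical helper: the selection rule (seeded shuffle) is an
    assumption, not something stated in the README.
    """
    if not 0 < num_val < num_samples:
        raise ValueError("num_val must be strictly between 0 and num_samples")
    indices = list(range(num_samples))
    # A dedicated Random instance keeps the split reproducible
    # without touching the global random state.
    random.Random(seed).shuffle(indices)
    val_indices = sorted(indices[:num_val])
    train_indices = sorted(indices[num_val:])
    return train_indices, val_indices


train_idx, val_idx = split_train_val()
print(len(train_idx), len(val_idx))  # 45000 5000
```

The two index lists could then be wrapped with something like `torch.utils.data.Subset` around the CIFAR10 training set, with the best checkpoint chosen on the 5k part before reporting test error.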