rwightman / pytorch-image-models

MODEL
TOP 1 ACCURACY (CODE) | TOP 1 ACCURACY (PAPER) | TOP 5 ACCURACY (CODE) | TOP 5 ACCURACY (PAPER) | SPEED | GLOBAL RANK
Adversarial Inception V3
77.6% -- 93.7% -- 541.0 #294
CSPDarkNet-53
80.0% -- 95.1% -- 477.3 #135
CSPResNet-50
79.6% -- 94.7% -- 589.9 #178
CSPResNeXt-50
80.0% -- 94.9% -- 590.4 #155
DenseNet-121
75.6% 76.4% 92.6% 93.3% 601.8 #358
DenseNet-Blur-121D
76.6% -- 93.2% -- 571.8 #344
DLA-102
78.0% -- 94.0% -- 481.7 #274
DLA-169
78.7% -- 94.3% -- 360.4 #240
DLA-34
74.6% -- 92.1% -- 713.7 #391
DLA-46-C
64.9% -- 86.3% -- 736.1 #502
DLA-60
77.0% -- 93.3% -- 602.2 #324
DLA-X-102
78.5% -- 94.2% -- 421.2 #249
DLA-X-102 64
79.4% -- 94.6% -- 304.3 #186
DLA-X-46-C
66.0% -- 87.0% -- 727.3 #491
DLA-X-60
78.2% -- 94.0% -- 548.3 #268
DLA-X-60-C
67.9% -- 88.4% -- 731.7 #470
DPN-107
(224x224)
80.2% -- 94.9% -- 198.4 #152
DPN-107
(320x320, Mean-Max Pooling)
81.8% -- 95.9% -- 93.7 #83
DPN-131
(224x224)
79.8% 80.1% 94.7% 94.9% 216.6 #167
DPN-131
(320x320, Mean-Max Pooling)
81.4% 81.5% 95.8% 95.8% 101.7 #89
DPN-68
(224x224)
76.3% 76.4% 93.0% 93.1% 605.5 #352
DPN-68
(320x320, Mean-Max Pooling)
78.5% 78.5% 94.4% 94.5% 400.7 #223
DPN-68b
(224x224)
79.2% -- 94.4% -- 606.5 #203
DPN-68b
(320x320, Mean-Max Pooling)
80.3% -- 95.3% -- 408.0 #122
DPN-92
(224x224)
80.0% 79.3% 94.8% 94.6% 387.7 #159
DPN-92
(320x320, Mean-Max Pooling)
81.3% 81.0% 95.7% 95.5% 212.3 #96
DPN-98
(224x224)
79.7% 80.0% 94.6% 94.8% 278.7 #192
DPN-98
(320x320, Mean-Max Pooling)
81.2% 81.3% 95.6% 95.6% 147.1 #93
ECA-ResNet-101d
82.2% -- 96.1% -- 424.5 #71
ECA-ResNet-50d
80.6% -- 95.3% -- 565.7 #121
ECA-ResNet-Light
80.5% -- 95.2% -- 622.6 #124
EfficientNet-B0
77.7% 76.3% 93.5% 93.2% 705.3 #313
EfficientNet-B0
(AdvProp)
77.1% -- 93.3% -- 664.7 #337
EfficientNet-B0
(AutoAugment)
76.8% -- 93.2% -- 666.7 #332
EfficientNet-B0
(NoisyStudent)
78.7% -- 94.4% -- 639.0 #227
EfficientNet-B1
78.7% 78.8% 94.2% 94.4% 675.9 #259
EfficientNet-B1
(AdvProp)
79.3% -- 94.3% -- 648.9 #198
EfficientNet-B1
(AutoAugment)
78.8% -- 94.2% -- 650.0 #250
EfficientNet-B1
(NoisyStudent)
81.4% -- 95.7% -- 647.4 #86
EfficientNet-B2
80.4% 79.8% 95.1% 94.9% 612.9 #133
EfficientNet-B2
(288x288, 1.0 crop)
80.6% -- 95.3% -- 583.2 #116
EfficientNet-B2
(AdvProp)
80.3% -- 95.0% -- 585.7 #138
EfficientNet-B2
(AutoAugment)
80.1% -- 94.9% -- 587.9 #155
EfficientNet-B2
(NoisyStudent)
82.4% -- 96.2% -- 592.4 #66
EfficientNet-B3
82.1% 81.1% 96.0% 95.5% 473.7 #76
EfficientNet-B3
(320x320, 1.0 crop)
82.3% -- 96.1% -- 449.4 #69
EfficientNet-B3
(AdvProp)
81.8% -- 95.6% -- 399.5 #95
EfficientNet-B3
(AutoAugment)
81.6% -- 95.7% -- 401.6 #84
EfficientNet-B3
(NoisyStudent)
84.0% -- 96.9% -- 398.2 #42
EfficientNet-B4
(AdvProp)
83.3% -- 96.4% -- 249.3 #61
EfficientNet-B4
(AutoAugment)
83.0% -- 96.3% -- 248.0 #56
EfficientNet-B4
(NoisyStudent)
85.2% -- 97.5% -- 249.6 #21
EfficientNet-B5
(AdvProp)
84.3% -- 97.0% -- 130.7 #37
EfficientNet-B5
(NoisyStudent)
86.1% -- 97.8% -- 130.8 #12
EfficientNet-B5
(RandAugment)
83.8% -- 96.7% -- 130.5 #47
EfficientNet-B6
(AdvProp)
84.8% -- 97.1% -- 74.0 #31
EfficientNet-B6
(AutoAugment)
84.1% -- 96.9% -- 73.9 #40
EfficientNet-B6
(NoisyStudent)
86.4% -- 97.9% -- 74.2 #10
EfficientNet-B7
(AdvProp)
85.1% -- 97.2% -- 46.2 #22
EfficientNet-B7
(NoisyStudent)
86.8% -- 98.1% -- 46.6 #5
EfficientNet-B7
(RandAugment)
84.9% -- 97.2% -- 46.4 #33
EfficientNet-B8
(AdvProp)
85.4% -- 97.3% -- 31.6 #16
EfficientNet-B8
(RandAugment)
85.4% -- 97.4% -- 31.6 #25
EfficientNet-CondConv-B0 4 experts
77.3% -- 93.3% -- 657.3 #332
EfficientNet-CondConv-B0 8 experts
77.9% -- 93.7% -- 672.2 #290
EfficientNet-CondConv-B1 8 experts
79.3% -- 94.4% -- 528.4 #191
EfficientNet-EdgeTPU-L
80.4% -- 95.2% -- 291.6 #129
EfficientNet-EdgeTPU-M
79.3% -- 94.3% -- 537.8 #199
EfficientNet-EdgeTPU-S
78.1% -- 93.9% -- 669.6 #279
EfficientNet-L2 475
(NoisyStudent)
88.2% -- 98.6% -- 16.4 #3
EfficientNet-L2
(NoisyStudent)
88.3% -- 98.7% -- 6.0 #2
EfficientNet-Lite0
74.8% -- 92.2% -- 698.2 #385
EfficientNet-Lite1
76.7% -- 93.2% -- 676.3 #342
EfficientNet-Lite2
77.5% -- 93.7% -- 657.1 #291
EfficientNet-Lite3
79.8% -- 94.9% -- 467.0 #154
EfficientNet-Lite4
81.5% -- 95.7% -- 299.4 #87
Ensemble Adversarial Inception V3
80.0% -- 94.9% -- 293.5 #158
FBNet-C
75.1% 74.9% 92.4% -- 733.7 #378
HRNet-W18-C
76.8% -- 93.4% -- 484.3 #317
HRNet-W18-C-Small-V1
72.3% -- 90.7% -- 733.3 #430
HRNet-W18-C-Small-V2
75.1% -- 92.4% -- 649.4 #372
HRNet-W30-C
78.2% -- 94.2% -- 415.2 #246
HRNet-W32-C
78.4% -- 94.2% -- 404.2 #251
HRNet-W40-C
78.9% -- 94.5% -- 309.9 #206
HRNet-W44-C
78.9% -- 94.4% -- 295.4 #225
HRNet-W48-C
79.3% -- 94.5% -- 278.3 #192
HRNet-W64-C
79.5% -- 94.7% -- 239.8 #183
Inception ResNet V2
80.4% 80.1% 95.3% 95.1% 294.5 #126
Inception V3
78.8% 78.8% 94.4% 94.4% 540.8 #222
Inception V4
80.1% -- 95.0% -- 352.3 #148
MixNet-L
78.8% 78.9% 94.2% 94.2% 651.6 #233
MixNet-M
77.3% 77.0% 93.4% 93.3% 698.8 #314
MixNet-S
75.7% 75.8% 92.8% 92.8% 696.2 #356
MixNet-XL
80.5% -- 94.9% -- 542.3 #123
MnasNet-A1
75.5% 75.2% 92.6% 92.5% 711.3 #360
MnasNet-B1
74.7% -- 92.1% -- 702.1 #388
MobileNet V3-Large 0.75
73.4% -- 91.3% -- 728.3 #415
MobileNet V3-Large 1.0
75.8% 75.2% 92.6% -- 730.9 #358
MobileNet V3-Large Minimal 1.0
72.2% -- 90.6% -- 734.0 #432
MobileNet V3-Small 0.75
65.7% -- 86.1% -- 739.5 #499
MobileNet V3-Small 1.0
67.9% -- 87.7% -- 742.8 #477
MobileNet V3-Small Minimal 1.0
62.9% -- 84.2% -- 737.8 #510
Modified Aligned Xception
79.7% 79.8% 94.9% 94.8% 226.0 #156
NASNet-A Large
82.6% -- 96.0% -- 100.2 #72
PNASNet-5
82.8% 82.9% 96.0% 96.2% 99.5 #74
RegNetX-12GF
79.6% -- 94.7% -- 349.1 #174
RegNetX-16GF
79.8% -- 94.8% -- 300.9 #166
RegNetX-1.6GF
76.9% -- 93.4% -- 704.4 #328
RegNetX-200MF
68.8% -- 88.6% -- 726.1 #466
RegNetX-32GF
80.3% -- 95.0% -- 154.3 #139
RegNetX-3.2GF
78.1% -- 94.1% -- 631.1 #264
RegNetX-400MF
72.4% -- 90.8% -- 721.1 #427
RegNetX-4.0GF
78.5% -- 94.3% -- 566.5 #251
RegNetX-600MF
73.8% -- 91.7% -- 698.0 #411
RegNetX-6.4GF
79.1% -- 94.5% -- 468.5 #214
RegNetX-800MF
75.1% -- 92.3% -- 721.4 #377
RegNetX-8.0GF
79.2% -- 94.6% -- 458.7 #202
RegNetY-12GF
80.4% -- 95.1% -- 338.4 #133
RegNetY-16GF
80.3% -- 95.0% -- 275.9 #142
RegNetY-1.6GF
77.9% -- 93.7% -- 707.8 #296
RegNetY-200MF
70.3% -- 89.5% -- 742.6 #452
RegNetY-32GF
80.8% -- 95.2% -- 161.8 #125
RegNetY-3.2GF
82.0% -- 95.9% -- 606.2 #79
RegNetY-400MF
74.0% -- 91.8% -- 728.7 #408
RegNetY-4.0GF
79.2% -- 94.6% -- 595.7 #201
RegNetY-600MF
75.3% -- 92.5% -- 725.9 #367
RegNetY-6.4GF
79.7% -- 94.8% -- 458.6 #171
RegNetY-800MF
76.3% -- 93.1% -- 713.3 #349
RegNetY-8.0GF
79.9% -- 94.8% -- 418.2 #163
Res2Net-50 14x8s
78.1% -- 93.9% -- 545.5 #274
Res2Net-50 26x4s
78.0% -- 93.9% -- 556.0 #287
Res2Net-50 26x6s
78.6% -- 94.1% -- 466.2 #262
Res2Net-50 26x8s
79.2% -- 94.4% -- 403.4 #229
Res2Net-50 48x2s
77.5% -- 93.6% -- 579.3 #310
Res2Net-DLA-60
78.5% 79.5% 94.2% -- 544.0 #253
Res2NeXt-101 26x4s
79.2% -- 94.4% -- 404.5 #214
Res2NeXt-50
78.2% -- 93.9% -- 528.9 #269
Res2NeXt-DLA-60
78.4% -- 94.2% -- 533.1 #258
ResNeSt-101
82.9% -- 96.3% -- 317.1 #58
ResNeSt-14
75.5% -- 92.5% -- 716.8 #364
ResNeSt-200
83.9% -- 96.9% -- 134.5 #45
ResNeSt-26
78.5% -- 94.3% -- 646.3 #252
ResNeSt-269
84.5% -- 97.0% -- 63.7 #34
ResNeSt-50
81.0% -- 95.4% -- 532.1 #108
ResNeSt-50 1s4x24d
81.0% -- 95.3% -- 548.3 #115
ResNeSt-50 4s2x40d
81.1% -- 95.6% -- 455.9 #99
ResNet-101
79.3% -- 94.5% -- 469.9 #198
ResNet-101-C
79.5% -- 94.6% -- 447.2 #195
ResNet-101-D
80.4% -- 95.0% -- 465.0 #141
ResNet-101-S
80.3% -- 95.2% -- 439.5 #131
ResNet-152
79.7% -- 94.7% -- 376.1 #173
ResNet-152-C
79.9% -- 94.8% -- 366.9 #160
ResNet-152-D
80.5% -- 95.2% -- 370.5 #127
ResNet-152-S
81.0% -- 95.4% -- 356.9 #106
ResNet-18
73.3% -- 91.4% -- 701.6 #416
ResNet-18
70.8% -- 89.1% -- 728.7 #449
ResNet-26
75.3% -- 92.6% -- 702.1 #368
ResNet-26-D
76.7% -- 93.1% -- 696.9 #343
ResNet-34
74.6% -- 92.3% -- 725.7 #379
ResNet-50
77.6% -- 93.7% -- 618.9 #297
ResNet-50
81.1% -- 96.0% -- 616.5 #77
ResNet-50
(288x288 Mean-Max Pooling)
80.3% -- 95.6% -- 476.5 #134
ResNet-50
(288x288 Mean-Max Pooling)
80.1% -- 95.3% -- 475.7 #150
ResNet-50-C
78.0% -- 94.0% -- 604.7 #285
ResNet-50-D
79.1% 77.2% 94.5% 93.5% 596.8 #210
ResNet-50-S
78.7% -- 94.2% -- 563.8 #241
ResNet-Blur-50
79.3% -- 94.6% -- 590.1 #188
ResNeXt-101 32x16d
83.3% -- 96.8% -- 141.8 #52
ResNeXt-101 32x16d
84.2% -- 97.2% -- 142.0 #34
ResNeXt-101 32x16d
(288x288 Mean-Max Pooling)
82.7% -- 96.6% -- 78.1 #56
ResNeXt-101 32x16d
(288x288 Mean-Max Pooling)
85.0% -- 97.6% -- 77.9 #25
ResNeXt-101 32x32d
85.1% 85.1% 97.4% 97.5% 58.6 #23
ResNeXt-101 32x32d
(288x288 Mean-Max Pooling)
85.9% -- 97.8% -- 35.3 #13
ResNeXt-101 32x48d
85.4% 85.4% 97.6% 97.6% 30.9 #18
ResNeXt-101 32x48d
(288x288 Mean-Max Pooling)
86.1% -- 97.9% -- 18.8 #10
ResNeXt-101 32x4d
80.9% -- 95.7% -- 388.1 #87
ResNeXt-101 32x4d
80.3% -- 94.9% -- 384.5 #135
ResNeXt-101 32x4d
(288x288 Mean-Max Pooling)
82.1% -- 97.2% -- 259.2 #31
ResNeXt-101 32x8d
82.7% 82.2% 96.6% 96.4% 255.1 #62
ResNeXt-101 32x8d
81.6% -- 97.2% -- 254.6 #86
ResNeXt-101 32x8d
(288x288 Mean-Max Pooling)
85.1% -- 97.6% -- 166.4 #23
ResNeXt-101 32x8d
(288x288 Mean-Max Pooling)
83.5% -- 97.1% -- 166.2 #37
ResNeXt-101 64x4d
80.6% -- 95.0% -- 245.4 #143
ResNeXt-50 32x4d
79.4% -- 94.6% -- 532.7 #190
ResNeXt-50 32x4d
82.2% -- 96.2% -- 529.3 #68
ResNeXt-50 32x4d
(288x288 Mean-Max Pooling)
81.3% -- 96.8% -- 391.3 #50
ResNeXt-50-D 32x4d
79.7% -- 94.9% -- 523.5 #157
ReXNet-1.0x
77.9% -- 93.9% -- 718.6 #285
ReXNet-1.3x
79.5% -- 94.7% -- 717.7 #182
ReXNet-1.5x
80.3% -- 95.2% -- 705.1 #137
ReXNet-2.0x
81.6% -- 95.7% -- 636.8 #91
SelecSLS-42_B
77.2% -- 93.4% -- 703.7 #316
SelecSLS-60
78.0% -- 93.8% -- 694.3 #287
SelecSLS-60_B
78.4% -- 94.2% -- 705.2 #256
SENet-154
81.2% -- 95.4% -- 196.8 #114
SENet-154
81.3% 82.7% 95.5% 96.2% 196.9 #104
SE-ResNet-101
78.4% -- 94.3% -- 463.7 #238
SE-ResNet-152
78.7% -- 94.4% -- 356.8 #242
SE-ResNet-18
71.7% -- 90.3% -- 721.5 #441
SE-ResNet-34
74.8% -- 92.1% -- 721.0 #386
SE-ResNet-50
77.6% -- 93.7% -- 620.0 #292
SE-ResNeXt-101 32x4d
80.9% -- 95.3% -- 364.8 #120
SE-ResNeXt-101 32x4d
80.2% -- 95.0% -- 352.6 #140
SE-ResNeXt-101 64x4d
80.9% -- 95.3% -- 232.4 #111
SE-ResNeXt-26 32x4d
77.1% -- 93.3% -- 657.0 #334
SE-ResNeXt-26-D 32x4d
77.6% -- 93.6% -- 564.6 #302
SE-ResNeXt-26-T 32x4d
78.0% -- 93.7% -- 589.9 #295
SE-ResNeXt-26-TN 32x4d
78.0% -- 93.7% -- 598.4 #293
SE-ResNeXt-50 32x4d
79.1% -- 94.4% -- 519.0 #211
SE-ResNeXt-50 32x4d
79.9% -- 94.8% -- 499.1 #164
Single-Path NAS
74.1% 75.0% 91.8% 92.2% 736.9 #401
SKNet-50
80.1% -- 94.6% -- 434.6 #189
SK-ResNet-18
73.0% -- 91.2% -- 708.6 #421
SK-ResNet-34
76.9% -- 93.3% -- 661.2 #329
VoVNet-19-DW-V2
76.8% -- 93.3% -- 669.2 #336
VoVNet-39-V2
79.3% -- 94.7% -- 596.3 #193
Wide-ResNet-50
81.5% -- 95.5% -- 444.3 #88
Xception
79.0% 79.0% 94.4% 94.5% 335.6 #219
[![SotaBench](https://img.shields.io/endpoint.svg?url=https://sotabench.com/api/v0/badge/gh/rwightman/pytorch-image-models)](https://sotabench.com/user/rwightman/repos/rwightman/pytorch-image-models)

How the Repository is Evaluated
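
Each benchmark run in the sotabench.py listing in this section is described by one `_entry(...)` call, which packs the model name, the paper name and arXiv id, a batch size, a test-time-pooling flag (`ttp`), and extra arguments into a dict. The following is a minimal sketch of that pattern, assuming the helper does nothing beyond building the dict. `make_entry` is a hypothetical stand-in name, and copying `args` instead of sharing a default dict is a choice of this sketch, not of the original file. It does not load any model or call timm.

```python
# Simplified stand-in for the `_entry` helper in the listing (not the file itself).
NUM_GPU = 1
BATCH_SIZE = 256 * NUM_GPU


def make_entry(model_name, paper_model_name, paper_arxiv_id,
               batch_size=BATCH_SIZE, ttp=False, args=None, model_desc=None):
    """Return one benchmark entry as a plain dict (hypothetical helper name)."""
    return dict(
        model=model_name,
        model_description=model_desc,
        paper_model_name=paper_model_name,
        paper_arxiv_id=paper_arxiv_id,
        batch_size=batch_size,
        ttp=ttp,
        # Copy so that no two entries ever share the same dict object.
        args=dict(args or {}),
    )


# Values copied from the two DPN-107 entries in the listing.
base = make_entry('dpn107', 'DPN-107 (224x224)', '1707.01629')
pooled = make_entry('dpn107', 'DPN-107 (320x320, Mean-Max Pooling)', '1707.01629',
                    ttp=True, args=dict(img_size=320),
                    batch_size=BATCH_SIZE // 4)

for entry in (base, pooled):
    print(entry['paper_model_name'], entry['batch_size'], entry['ttp'], entry['args'])
```

The same model name appears twice with different settings: the 320x320 variant sets `ttp=True`, overrides `img_size`, and uses a quarter of the default batch size, presumably to keep the larger inputs within GPU memory.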

The full sotabench.py file:
import torch
from sotabencheval.image_classification import ImageNetEvaluator
from sotabencheval.utils import is_server
from timm import create_model
from timm.data import resolve_data_config, create_loader, DatasetTar
from timm.models import apply_test_time_pool
from tqdm import tqdm
import os

NUM_GPU = 1
BATCH_SIZE = 256 * NUM_GPU


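def _entry(model_name, paper_model_name, paper_arxiv_id, batch_size=BATCH_SIZE,
           ttp=False, args=None, model_desc=None):
    # Bundle the settings for one benchmark run. `ttp` flags test-time pooling;
    # `args` carries per-entry overrides such as img_size.
    return dict(
        model=model_name,
        model_description=model_desc,
        paper_model_name=paper_model_name,
        paper_arxiv_id=paper_arxiv_id,
        batch_size=batch_size,
        ttp=ttp,
        # Use a fresh dict per entry instead of a shared mutable default.
        args=args if args is not None else dict())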

# NOTE: For any original PyTorch models, I'll remove them from this list when you add them to
# sotabench, to avoid overlap and confusion. Please contact me.
model_list = [
    ## Weights ported by myself from other frameworks or trained myself in PyTorch
    _entry('adv_inception_v3', 'Adversarial Inception V3', '1611.01236',
           model_desc='Ported from official Tensorflow weights'),
    _entry('ens_adv_inception_resnet_v2', 'Ensemble Adversarial Inception V3', '1705.07204',
           model_desc='Ported from official Tensorflow weights'),
    _entry('dpn68', 'DPN-68 (224x224)', '1707.01629'),
    _entry('dpn68b', 'DPN-68b (224x224)', '1707.01629'),
    _entry('dpn92', 'DPN-92 (224x224)', '1707.01629'),
    _entry('dpn98', 'DPN-98 (224x224)', '1707.01629'),
    _entry('dpn107', 'DPN-107 (224x224)', '1707.01629'),
    _entry('dpn131', 'DPN-131 (224x224)', '1707.01629'),
    _entry('dpn68', 'DPN-68 (320x320, Mean-Max Pooling)', '1707.01629', ttp=True, args=dict(img_size=320)),
    _entry('dpn68b', 'DPN-68b (320x320, Mean-Max Pooling)', '1707.01629', ttp=True, args=dict(img_size=320)),
    _entry('dpn92', 'DPN-92 (320x320, Mean-Max Pooling)', '1707.01629',
           ttp=True, args=dict(img_size=320), batch_size=BATCH_SIZE//2),
    _entry('dpn98', 'DPN-98 (320x320, Mean-Max Pooling)', '1707.01629',
           ttp=True, args=dict(img_size=320), batch_size=BATCH_SIZE//2),
    _entry('dpn107', 'DPN-107 (320x320, Mean-Max Pooling)', '1707.01629',
           ttp=True, args=dict(img_size=320), batch_size=BATCH_SIZE//4),
    _entry('dpn131', 'DPN-131 (320x320, Mean-Max Pooling)', '1707.01629',
           ttp=True, args=dict(img_size=320), batch_size=BATCH_SIZE//4),
    _entry('efficientnet_b0', 'EfficientNet-B0', '1905.11946'),
    _entry('efficientnet_b1', 'EfficientNet-B1', '1905.11946'),
    _entry('efficientnet_b2', 'EfficientNet-B2', '1905.11946',
           model_desc='Trained from scratch in PyTorch w/ RandAugment'),
    _entry('efficientnet_b2a', 'EfficientNet-B2 (288x288, 1.0 crop)', '1905.11946',
           model_desc='Trained from scratch in PyTorch w/ RandAugment'),
    _entry('efficientnet_b3', 'EfficientNet-B3', '1905.11946',
           model_desc='Trained from scratch in PyTorch w/ RandAugment'),
    _entry('efficientnet_b3a', 'EfficientNet-B3 (320x320, 1.0 crop)', '1905.11946',
           model_desc='Trained from scratch in PyTorch w/ RandAugment'),
    _entry('efficientnet_es', 'EfficientNet-EdgeTPU-S', '1905.11946',
           model_desc='Trained from scratch in PyTorch w/ RandAugment'),
    _entry('efficientnet_em', 'EfficientNet-EdgeTPU-M', '1905.11946',
           model_desc='Trained from scratch in PyTorch w/ RandAugment'),

    _entry('gluon_inception_v3', 'Inception V3', '1512.00567', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnet18_v1b', 'ResNet-18', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnet34_v1b', 'ResNet-34', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnet50_v1b', 'ResNet-50', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnet50_v1c', 'ResNet-50-C', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnet50_v1d', 'ResNet-50-D', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnet50_v1s', 'ResNet-50-S', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnet101_v1b', 'ResNet-101', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnet101_v1c', 'ResNet-101-C', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnet101_v1d', 'ResNet-101-D', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnet101_v1s', 'ResNet-101-S', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnet152_v1b', 'ResNet-152', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnet152_v1c', 'ResNet-152-C', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnet152_v1d', 'ResNet-152-D', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnet152_v1s', 'ResNet-152-S', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnext50_32x4d', 'ResNeXt-50 32x4d', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnext101_32x4d', 'ResNeXt-101 32x4d', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnext101_64x4d', 'ResNeXt-101 64x4d', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_senet154', 'SENet-154', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_seresnext50_32x4d', 'SE-ResNeXt-50 32x4d', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_seresnext101_32x4d', 'SE-ResNeXt-101 32x4d', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_seresnext101_64x4d', 'SE-ResNeXt-101 64x4d', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_xception65', 'Modified Aligned Xception', '1802.02611', batch_size=BATCH_SIZE//2,
           model_desc='Ported from GluonCV Model Zoo'),

    _entry('mixnet_xl', 'MixNet-XL', '1907.09595', model_desc="My own scaling beyond paper's MixNet Large"),
    _entry('mixnet_l', 'MixNet-L', '1907.09595'),
    _entry('mixnet_m', 'MixNet-M', '1907.09595'),
    _entry('mixnet_s', 'MixNet-S', '1907.09595'),

    _entry('fbnetc_100', 'FBNet-C', '1812.03443',
           model_desc='Trained in PyTorch with RMSProp, exponential LR decay'),
    _entry('mnasnet_100', 'MnasNet-B1', '1807.11626'),
    _entry('semnasnet_100', 'MnasNet-A1', '1807.11626'),
    _entry('spnasnet_100', 'Single-Path NAS', '1904.02877',
           model_desc='Trained in PyTorch with SGD, cosine LR decay'),
    _entry('mobilenetv3_large_100', 'MobileNet V3-Large 1.0', '1905.02244',
           model_desc='Trained in PyTorch with RMSProp, exponential LR decay, and hyper-params matching '
                      'paper as closely as possible.'),

    _entry('resnet18', 'ResNet-18', '1812.01187'),
    _entry('resnet26', 'ResNet-26', '1812.01187', model_desc='Block cfg of ResNet-34 w/ Bottleneck'),
    _entry('resnet26d', 'ResNet-26-D', '1812.01187',
           model_desc='Block cfg of ResNet-34 w/ Bottleneck, deep stem, and avg-pool in downsample layers.'),
    _entry('resnet34', 'ResNet-34', '1812.01187'),
    _entry('resnet50', 'ResNet-50', '1812.01187', model_desc='Trained with AugMix + JSD loss'),
    _entry('resnet50', 'ResNet-50 (288x288 Mean-Max Pooling)', '1812.01187',
           ttp=True, args=dict(img_size=288),
           model_desc='Trained with AugMix + JSD loss'),
    _entry('resnext50_32x4d', 'ResNeXt-50 32x4d', '1812.01187'),
    _entry('resnext50d_32x4d', 'ResNeXt-50-D 32x4d', '1812.01187',
           model_desc="'D' variant (3x3 deep stem w/ avg-pool downscale). Trained with "
                      "SGD w/ cosine LR decay, random-erasing (gaussian per-pixel noise) and label-smoothing"),

    _entry('wide_resnet50_2', 'Wide-ResNet-50', '1605.07146'),

    _entry('seresnet50', 'SE-ResNet-50', '1709.01507'),
    _entry('seresnext26d_32x4d', 'SE-ResNeXt-26-D 32x4d', '1812.01187',
           model_desc='Block cfg of SE-ResNeXt-34 w/ Bottleneck, deep stem, and avg-pool in downsample layers.'),
    _entry('seresnext26t_32x4d', 'SE-ResNeXt-26-T 32x4d', '1812.01187',
           model_desc='Block cfg of SE-ResNeXt-34 w/ Bottleneck, deep tiered stem, and avg-pool in downsample layers.'),
    _entry('seresnext26tn_32x4d', 'SE-ResNeXt-26-TN 32x4d', '1812.01187',
           model_desc='Block cfg of SE-ResNeXt-34 w/ Bottleneck, deep tiered narrow stem, and avg-pool in downsample layers.'),
    _entry('seresnext50_32x4d', 'SE-ResNeXt-50 32x4d', '1709.01507'),

    _entry('skresnet18', 'SK-ResNet-18', '1903.06586'),
    _entry('skresnet34', 'SK-ResNet-34', '1903.06586'),
    _entry('skresnext50_32x4d', 'SKNet-50', '1903.06586'),

    _entry('ecaresnetlight', 'ECA-ResNet-Light', '1910.03151',
           model_desc='A tweaked ResNet50d with ECA attn.'),
    _entry('ecaresnet50d', 'ECA-ResNet-50d', '1910.03151',
           model_desc='A ResNet50d with ECA attn'),
    _entry('ecaresnet101d', 'ECA-ResNet-101d', '1910.03151',
           model_desc='A ResNet101d with ECA attn'),

    _entry('resnetblur50', 'ResNet-Blur-50', '1904.11486'),

    _entry('densenet121', 'DenseNet-121', '1608.06993'),
    _entry('densenetblur121d', 'DenseNet-Blur-121D', '1904.11486',
           model_desc='DenseNet with blur pooling and deep stem'),

    _entry('ese_vovnet19b_dw', 'VoVNet-19-DW-V2', '1911.06667'),
    _entry('ese_vovnet39b', 'VoVNet-39-V2', '1911.06667'),

    _entry('cspresnet50', 'CSPResNet-50', '1911.11929'),
    _entry('cspresnext50', 'CSPResNeXt-50', '1911.11929'),
    _entry('cspdarknet53', 'CSPDarkNet-53', '1911.11929'),

    _entry('tf_efficientnet_b0', 'EfficientNet-B0 (AutoAugment)', '1905.11946',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b1', 'EfficientNet-B1 (AutoAugment)', '1905.11946',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b2', 'EfficientNet-B2 (AutoAugment)', '1905.11946',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b3', 'EfficientNet-B3 (AutoAugment)', '1905.11946', batch_size=BATCH_SIZE//2,
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b4', 'EfficientNet-B4 (AutoAugment)', '1905.11946', batch_size=BATCH_SIZE//2,
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b5', 'EfficientNet-B5 (RandAugment)', '1905.11946', batch_size=BATCH_SIZE//4,
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b6', 'EfficientNet-B6 (AutoAugment)', '1905.11946', batch_size=BATCH_SIZE//8,
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b7', 'EfficientNet-B7 (RandAugment)', '1905.11946', batch_size=BATCH_SIZE//8,
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b8', 'EfficientNet-B8 (RandAugment)', '1905.11946', batch_size=BATCH_SIZE // 8,
           model_desc='Ported from official Google AI Tensorflow weights'),

    _entry('tf_efficientnet_b0_ap', 'EfficientNet-B0 (AdvProp)', '1911.09665',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b1_ap', 'EfficientNet-B1 (AdvProp)', '1911.09665',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b2_ap', 'EfficientNet-B2 (AdvProp)', '1911.09665',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b3_ap', 'EfficientNet-B3 (AdvProp)', '1911.09665', batch_size=BATCH_SIZE // 2,
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b4_ap', 'EfficientNet-B4 (AdvProp)', '1911.09665', batch_size=BATCH_SIZE // 2,
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b5_ap', 'EfficientNet-B5 (AdvProp)', '1911.09665', batch_size=BATCH_SIZE // 4,
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b6_ap', 'EfficientNet-B6 (AdvProp)', '1911.09665', batch_size=BATCH_SIZE // 8,
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b7_ap', 'EfficientNet-B7 (AdvProp)', '1911.09665', batch_size=BATCH_SIZE // 8,
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b8_ap', 'EfficientNet-B8 (AdvProp)', '1911.09665', batch_size=BATCH_SIZE // 8,
           model_desc='Ported from official Google AI Tensorflow weights'),

    _entry('tf_efficientnet_b0_ns', 'EfficientNet-B0 (NoisyStudent)', '1911.04252',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b1_ns', 'EfficientNet-B1 (NoisyStudent)', '1911.04252',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b2_ns', 'EfficientNet-B2 (NoisyStudent)', '1911.04252',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b3_ns', 'EfficientNet-B3 (NoisyStudent)', '1911.04252', batch_size=BATCH_SIZE // 2,
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b4_ns', 'EfficientNet-B4 (NoisyStudent)', '1911.04252', batch_size=BATCH_SIZE // 2,
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b5_ns', 'EfficientNet-B5 (NoisyStudent)', '1911.04252', batch_size=BATCH_SIZE // 4,
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b6_ns', 'EfficientNet-B6 (NoisyStudent)', '1911.04252', batch_size=BATCH_SIZE // 8,
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b7_ns', 'EfficientNet-B7 (NoisyStudent)', '1911.04252', batch_size=BATCH_SIZE // 8,
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_l2_ns_475', 'EfficientNet-L2 475 (NoisyStudent)', '1911.04252', batch_size=BATCH_SIZE // 16,
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_l2_ns', 'EfficientNet-L2 (NoisyStudent)', '1911.04252', batch_size=BATCH_SIZE // 64,
           model_desc='Ported from official Google AI Tensorflow weights'),

    _entry('tf_efficientnet_cc_b0_4e', 'EfficientNet-CondConv-B0 4 experts', '1904.04971',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_cc_b0_8e', 'EfficientNet-CondConv-B0 8 experts', '1904.04971',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_cc_b1_8e', 'EfficientNet-CondConv-B1 8 experts', '1904.04971',
           model_desc='Ported from official Google AI Tensorflow weights'),

    _entry('tf_efficientnet_es', 'EfficientNet-EdgeTPU-S', '1905.11946',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_em', 'EfficientNet-EdgeTPU-M', '1905.11946',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_el', 'EfficientNet-EdgeTPU-L', '1905.11946', batch_size=BATCH_SIZE//2,
           model_desc='Ported from official Google AI Tensorflow weights'),

    _entry('tf_efficientnet_lite0', 'EfficientNet-Lite0', '1905.11946',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_lite1', 'EfficientNet-Lite1', '1905.11946',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_lite2', 'EfficientNet-Lite2', '1905.11946',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_lite3', 'EfficientNet-Lite3', '1905.11946', batch_size=BATCH_SIZE // 2,
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_lite4', 'EfficientNet-Lite4', '1905.11946', batch_size=BATCH_SIZE // 2,
           model_desc='Ported from official Google AI Tensorflow weights'),

    _entry('tf_inception_v3', 'Inception V3', '1512.00567', model_desc='Ported from official Tensorflow weights'),
    _entry('tf_mixnet_l', 'MixNet-L', '1907.09595', model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_mixnet_m', 'MixNet-M', '1907.09595', model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_mixnet_s', 'MixNet-S', '1907.09595', model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_mobilenetv3_large_100', 'MobileNet V3-Large 1.0', '1905.02244',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_mobilenetv3_large_075', 'MobileNet V3-Large 0.75', '1905.02244',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_mobilenetv3_large_minimal_100', 'MobileNet V3-Large Minimal 1.0', '1905.02244',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_mobilenetv3_small_100', 'MobileNet V3-Small 1.0', '1905.02244',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_mobilenetv3_small_075', 'MobileNet V3-Small 0.75', '1905.02244',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_mobilenetv3_small_minimal_100', 'MobileNet V3-Small Minimal 1.0', '1905.02244',
           model_desc='Ported from official Google AI Tensorflow weights'),

    ## Cadene ported weights (to remove if Cadene adds sotabench)
    _entry('inception_resnet_v2', 'Inception ResNet V2', '1602.07261'),
    _entry('inception_v4', 'Inception V4', '1602.07261'),
    _entry('nasnetalarge', 'NASNet-A Large', '1707.07012', batch_size=BATCH_SIZE // 4),
    _entry('pnasnet5large', 'PNASNet-5', '1712.00559', batch_size=BATCH_SIZE // 4),
    _entry('xception', 'Xception', '1610.02357', batch_size=BATCH_SIZE//2),
    _entry('legacy_seresnet18', 'SE-ResNet-18', '1709.01507'),
    _entry('legacy_seresnet34', 'SE-ResNet-34', '1709.01507'),
    _entry('legacy_seresnet50', 'SE-ResNet-50', '1709.01507'),
    _entry('legacy_seresnet101', 'SE-ResNet-101', '1709.01507'),
    _entry('legacy_seresnet152', 'SE-ResNet-152', '1709.01507'),
    _entry('legacy_seresnext26_32x4d', 'SE-ResNeXt-26 32x4d', '1709.01507',
           model_desc='Block cfg of SE-ResNeXt-34 w/ Bottleneck'),
    _entry('legacy_seresnext50_32x4d', 'SE-ResNeXt-50 32x4d', '1709.01507'),
    _entry('legacy_seresnext101_32x4d', 'SE-ResNeXt-101 32x4d', '1709.01507'),
    _entry('legacy_senet154', 'SENet-154', '1709.01507'),

    ## Torchvision weights
    # _entry('densenet121'),
    # _entry('densenet161'),
    # _entry('densenet169'),
    # _entry('densenet201'),
    # _entry('inception_v3', paper_model_name='Inception V3', ),
    # _entry('tv_resnet34', , ),
    # _entry('tv_resnet50', , ),
    # _entry('resnet101', , ),
    # _entry('resnet152', , ),
    # _entry('tv_resnext50_32x4d', , ),
    # _entry('resnext101_32x8d', ),
    # _entry('wide_resnet50_2' , ),
    # _entry('wide_resnet101_2', , ),

    ## Facebook WSL weights
    _entry('ig_resnext101_32x8d', 'ResNeXt-101 32x8d', '1805.00932',
           model_desc='Weakly-Supervised pre-training on 1B Instagram hashtag dataset by Facebook Research'),
    _entry('ig_resnext101_32x16d', 'ResNeXt-101 32x16d', '1805.00932',
           model_desc='Weakly-Supervised pre-training on 1B Instagram hashtag dataset by Facebook Research'),
    _entry('ig_resnext101_32x32d', 'ResNeXt-101 32x32d', '1805.00932', batch_size=BATCH_SIZE // 2,
           model_desc='Weakly-Supervised pre-training on 1B Instagram hashtag dataset by Facebook Research'),
    _entry('ig_resnext101_32x48d', 'ResNeXt-101 32x48d', '1805.00932', batch_size=BATCH_SIZE // 4,
           model_desc='Weakly-Supervised pre-training on 1B Instagram hashtag dataset by Facebook Research'),

    _entry('ig_resnext101_32x8d', 'ResNeXt-101 32x8d (288x288 Mean-Max Pooling)', '1805.00932',
           ttp=True, args=dict(img_size=288),
           model_desc='Weakly-Supervised pre-training on 1B Instagram hashtag dataset by Facebook Research'),
    _entry('ig_resnext101_32x16d', 'ResNeXt-101 32x16d (288x288 Mean-Max Pooling)', '1805.00932',
           ttp=True, args=dict(img_size=288), batch_size=BATCH_SIZE // 2,
           model_desc='Weakly-Supervised pre-training on 1B Instagram hashtag dataset by Facebook Research'),
    _entry('ig_resnext101_32x32d', 'ResNeXt-101 32x32d (288x288 Mean-Max Pooling)', '1805.00932',
           ttp=True, args=dict(img_size=288), batch_size=BATCH_SIZE // 4,
           model_desc='Weakly-Supervised pre-training on 1B Instagram hashtag dataset by Facebook Research'),
    _entry('ig_resnext101_32x48d', 'ResNeXt-101 32x48d (288x288 Mean-Max Pooling)', '1805.00932',
           ttp=True, args=dict(img_size=288), batch_size=BATCH_SIZE // 8,
           model_desc='Weakly-Supervised pre-training on 1B Instagram hashtag dataset by Facebook Research'),

    ## Facebook SSL weights
    _entry('ssl_resnet18', 'ResNet-18', '1905.00546',
           model_desc='Semi-Supervised pre-training on YFCC100M dataset by Facebook Research'),
    _entry('ssl_resnet50', 'ResNet-50', '1905.00546',
           model_desc='Semi-Supervised pre-training on YFCC100M dataset by Facebook Research'),
    _entry('ssl_resnext50_32x4d', 'ResNeXt-50 32x4d', '1905.00546',
           model_desc='Semi-Supervised pre-training on YFCC100M dataset by Facebook Research'),
    _entry('ssl_resnext101_32x4d', 'ResNeXt-101 32x4d', '1905.00546',
           model_desc='Semi-Supervised pre-training on YFCC100M dataset by Facebook Research'),
    _entry('ssl_resnext101_32x8d', 'ResNeXt-101 32x8d', '1905.00546',
           model_desc='Semi-Supervised pre-training on YFCC100M dataset by Facebook Research'),
    _entry('ssl_resnext101_32x16d', 'ResNeXt-101 32x16d', '1905.00546',
           model_desc='Semi-Supervised pre-training on YFCC100M dataset by Facebook Research'),

    _entry('ssl_resnet50', 'ResNet-50 (288x288 Mean-Max Pooling)', '1905.00546',
           ttp=True, args=dict(img_size=288),
           model_desc='Semi-Supervised pre-training on YFCC100M dataset by Facebook Research'),
    _entry('ssl_resnext50_32x4d', 'ResNeXt-50 32x4d (288x288 Mean-Max Pooling)', '1905.00546',
           ttp=True, args=dict(img_size=288),
           model_desc='Semi-Supervised pre-training on YFCC100M dataset by Facebook Research'),
    _entry('ssl_resnext101_32x4d', 'ResNeXt-101 32x4d (288x288 Mean-Max Pooling)', '1905.00546',
           ttp=True, args=dict(img_size=288),
           model_desc='Semi-Supervised pre-training on YFCC100M dataset by Facebook Research'),
    _entry('ssl_resnext101_32x8d', 'ResNeXt-101 32x8d (288x288 Mean-Max Pooling)', '1905.00546',
           ttp=True, args=dict(img_size=288),
           model_desc='Semi-Supervised pre-training on YFCC100M dataset by Facebook Research'),
    _entry('ssl_resnext101_32x16d', 'ResNeXt-101 32x16d (288x288 Mean-Max Pooling)', '1905.00546',
           ttp=True, args=dict(img_size=288), batch_size=BATCH_SIZE // 2,
           model_desc='Semi-Supervised pre-training on YFCC100M dataset by Facebook Research'),

    ## Facebook SWSL weights
    _entry('swsl_resnet18', 'ResNet-18', '1905.00546',
           model_desc='Semi-Weakly-Supervised pre-training on 1 billion unlabelled dataset by Facebook Research'),
    _entry('swsl_resnet50', 'ResNet-50', '1905.00546',
           model_desc='Semi-Weakly-Supervised pre-training on 1 billion unlabelled dataset by Facebook Research'),
    _entry('swsl_resnext50_32x4d', 'ResNeXt-50 32x4d', '1905.00546',
           model_desc='Semi-Weakly-Supervised pre-training on 1 billion unlabelled dataset by Facebook Research'),
    _entry('swsl_resnext101_32x4d', 'ResNeXt-101 32x4d', '1905.00546',
           model_desc='Semi-Weakly-Supervised pre-training on 1 billion unlabelled dataset by Facebook Research'),
    _entry('swsl_resnext101_32x8d', 'ResNeXt-101 32x8d', '1905.00546',
           model_desc='Semi-Weakly-Supervised pre-training on 1 billion unlabelled dataset by Facebook Research'),
    _entry('swsl_resnext101_32x16d', 'ResNeXt-101 32x16d', '1905.00546',
           model_desc='Semi-Weakly-Supervised pre-training on 1 billion unlabelled dataset by Facebook Research'),

    _entry('swsl_resnet50', 'ResNet-50 (288x288 Mean-Max Pooling)', '1905.00546',
           ttp=True, args=dict(img_size=288),
           model_desc='Semi-Weakly-Supervised pre-training on 1 billion unlabelled dataset by Facebook Research'),
    _entry('swsl_resnext50_32x4d', 'ResNeXt-50 32x4d (288x288 Mean-Max Pooling)', '1905.00546',
           ttp=True, args=dict(img_size=288),
           model_desc='Semi-Weakly-Supervised pre-training on 1 billion unlabelled dataset by Facebook Research'),
    _entry('swsl_resnext101_32x4d', 'ResNeXt-101 32x4d (288x288 Mean-Max Pooling)', '1905.00546',
           ttp=True, args=dict(img_size=288),
           model_desc='Semi-Weakly-Supervised pre-training on 1 billion unlabelled dataset by Facebook Research'),
    _entry('swsl_resnext101_32x8d', 'ResNeXt-101 32x8d (288x288 Mean-Max Pooling)', '1905.00546',
           ttp=True, args=dict(img_size=288),
           model_desc='Semi-Weakly-Supervised pre-training on 1 billion unlabelled dataset by Facebook Research'),
    _entry('swsl_resnext101_32x16d', 'ResNeXt-101 32x16d (288x288 Mean-Max Pooling)', '1905.00546',
           ttp=True, args=dict(img_size=288), batch_size=BATCH_SIZE // 2,
           model_desc='Semi-Weakly-Supervised pre-training on 1 billion unlabelled dataset by Facebook Research'),

    ## DLA official impl weights (to remove if sotabench added to source)
    _entry('dla34', 'DLA-34', '1707.06484'),
    _entry('dla46_c', 'DLA-46-C', '1707.06484'),
    _entry('dla46x_c', 'DLA-X-46-C', '1707.06484'),
    _entry('dla60x_c', 'DLA-X-60-C', '1707.06484'),
    _entry('dla60', 'DLA-60', '1707.06484'),
    _entry('dla60x', 'DLA-X-60', '1707.06484'),
    _entry('dla102', 'DLA-102', '1707.06484'),
    _entry('dla102x', 'DLA-X-102', '1707.06484'),
    _entry('dla102x2', 'DLA-X-102 64', '1707.06484'),
    _entry('dla169', 'DLA-169', '1707.06484'),

    ## Res2Net official impl weights (to remove if sotabench added to source)
    _entry('res2net50_26w_4s', 'Res2Net-50 26x4s', '1904.01169'),
    _entry('res2net50_14w_8s', 'Res2Net-50 14x8s', '1904.01169'),
    _entry('res2net50_26w_6s', 'Res2Net-50 26x6s', '1904.01169'),
    _entry('res2net50_26w_8s', 'Res2Net-50 26x8s', '1904.01169'),
    _entry('res2net50_48w_2s', 'Res2Net-50 48x2s', '1904.01169'),
    _entry('res2net101_26w_4s', 'Res2Net-101 26x4s', '1904.01169'),
    _entry('res2next50', 'Res2NeXt-50', '1904.01169'),
    _entry('dla60_res2net', 'Res2Net-DLA-60', '1904.01169'),
    _entry('dla60_res2next', 'Res2NeXt-DLA-60', '1904.01169'),

    ## HRNet official impl weights
    _entry('hrnet_w18_small', 'HRNet-W18-C-Small-V1', '1908.07919'),
    _entry('hrnet_w18_small_v2', 'HRNet-W18-C-Small-V2', '1908.07919'),
    _entry('hrnet_w18', 'HRNet-W18-C', '1908.07919'),
    _entry('hrnet_w30', 'HRNet-W30-C', '1908.07919'),
    _entry('hrnet_w32', 'HRNet-W32-C', '1908.07919'),
    _entry('hrnet_w40', 'HRNet-W40-C', '1908.07919'),
    _entry('hrnet_w44', 'HRNet-W44-C', '1908.07919'),
    _entry('hrnet_w48', 'HRNet-W48-C', '1908.07919'),
    _entry('hrnet_w64', 'HRNet-W64-C', '1908.07919'),


    ## SelecSLS official impl weights
    _entry('selecsls42b', 'SelecSLS-42_B', '1907.00837',
           model_desc='Originally from https://github.com/mehtadushy/SelecSLS-Pytorch'),
    _entry('selecsls60', 'SelecSLS-60', '1907.00837',
           model_desc='Originally from https://github.com/mehtadushy/SelecSLS-Pytorch'),
    _entry('selecsls60b', 'SelecSLS-60_B', '1907.00837',
           model_desc='Originally from https://github.com/mehtadushy/SelecSLS-Pytorch'),

    ## ResNeSt official impl weights
    _entry('resnest14d', 'ResNeSt-14', '2004.08955',
           model_desc='Originally from GluonCV'),
    _entry('resnest26d', 'ResNeSt-26', '2004.08955',
           model_desc='Originally from GluonCV'),
    _entry('resnest50d', 'ResNeSt-50', '2004.08955',
           model_desc='Originally from https://github.com/zhanghang1989/ResNeSt'),
    _entry('resnest101e', 'ResNeSt-101', '2004.08955',
           model_desc='Originally from https://github.com/zhanghang1989/ResNeSt'),
    _entry('resnest200e', 'ResNeSt-200', '2004.08955',
           model_desc='Originally from https://github.com/zhanghang1989/ResNeSt'),
    _entry('resnest269e', 'ResNeSt-269', '2004.08955', batch_size=BATCH_SIZE // 2,
           model_desc='Originally from https://github.com/zhanghang1989/ResNeSt'),
    _entry('resnest50d_4s2x40d', 'ResNeSt-50 4s2x40d', '2004.08955',
           model_desc='Originally from https://github.com/zhanghang1989/ResNeSt'),
    _entry('resnest50d_1s4x24d', 'ResNeSt-50 1s4x24d', '2004.08955',
           model_desc='Originally from https://github.com/zhanghang1989/ResNeSt'),

    ## RegNet official impl weights
    _entry('regnetx_002', 'RegNetX-200MF', '2003.13678'),
    _entry('regnetx_004', 'RegNetX-400MF', '2003.13678'),
    _entry('regnetx_006', 'RegNetX-600MF', '2003.13678'),
    _entry('regnetx_008', 'RegNetX-800MF', '2003.13678'),
    _entry('regnetx_016', 'RegNetX-1.6GF', '2003.13678'),
    _entry('regnetx_032', 'RegNetX-3.2GF', '2003.13678'),
    _entry('regnetx_040', 'RegNetX-4.0GF', '2003.13678'),
    _entry('regnetx_064', 'RegNetX-6.4GF', '2003.13678'),
    _entry('regnetx_080', 'RegNetX-8.0GF', '2003.13678'),
    _entry('regnetx_120', 'RegNetX-12GF', '2003.13678'),
    _entry('regnetx_160', 'RegNetX-16GF', '2003.13678'),
    _entry('regnetx_320', 'RegNetX-32GF', '2003.13678', batch_size=BATCH_SIZE // 2),

    _entry('regnety_002', 'RegNetY-200MF', '2003.13678'),
    _entry('regnety_004', 'RegNetY-400MF', '2003.13678'),
    _entry('regnety_006', 'RegNetY-600MF', '2003.13678'),
    _entry('regnety_008', 'RegNetY-800MF', '2003.13678'),
    _entry('regnety_016', 'RegNetY-1.6GF', '2003.13678'),
    _entry('regnety_032', 'RegNetY-3.2GF', '2003.13678'),
    _entry('regnety_040', 'RegNetY-4.0GF', '2003.13678'),
    _entry('regnety_064', 'RegNetY-6.4GF', '2003.13678'),
    _entry('regnety_080', 'RegNetY-8.0GF', '2003.13678'),
    _entry('regnety_120', 'RegNetY-12GF', '2003.13678'),
    _entry('regnety_160', 'RegNetY-16GF', '2003.13678'),
    _entry('regnety_320', 'RegNetY-32GF', '2003.13678', batch_size=BATCH_SIZE // 2),

    ## ReXNet weights
    _entry('rexnet_100', 'ReXNet-1.0x', '2007.00992'),
    _entry('rexnet_130', 'ReXNet-1.3x', '2007.00992'),
    _entry('rexnet_150', 'ReXNet-1.5x', '2007.00992'),
    _entry('rexnet_200', 'ReXNet-2.0x', '2007.00992'),
]

if is_server():
    DATA_ROOT = './.data/vision/imagenet'
else:
    # local settings
    DATA_ROOT = './'
DATA_FILENAME = 'ILSVRC2012_img_val.tar'
TAR_PATH = os.path.join(DATA_ROOT, DATA_FILENAME)

for m in model_list:
    model_name = m['model']
    # create model from name
    model = create_model(model_name, pretrained=True)
    # use `p` for parameters so the loop variable `m` (the model entry) is not visually shadowed
    param_count = sum(p.numel() for p in model.parameters())
    print('Model %s, %s created. Param count: %d' % (model_name, m['paper_model_name'], param_count))

    dataset = DatasetTar(TAR_PATH)
    filenames = [os.path.splitext(f)[0] for f in dataset.filenames()]

    # get appropriate transform for model's default pretrained config
    data_config = resolve_data_config(m['args'], model=model, verbose=True)
    test_time_pool = False
    if m['ttp']:
        # test-time pooling: wrap the model and evaluate on the full image (no center-crop margin)
        model, test_time_pool = apply_test_time_pool(model, data_config)
        data_config['crop_pct'] = 1.0

    batch_size = m['batch_size']
    loader = create_loader(
        dataset,
        input_size=data_config['input_size'],
        batch_size=batch_size,
        use_prefetcher=True,
        interpolation=data_config['interpolation'],
        mean=data_config['mean'],
        std=data_config['std'],
        num_workers=6,
        crop_pct=data_config['crop_pct'],
        pin_memory=True)

    evaluator = ImageNetEvaluator(
        root=DATA_ROOT,
        model_name=m['paper_model_name'],
        paper_arxiv_id=m['paper_arxiv_id'],
        model_description=m.get('model_description', None),
    )
    model.cuda()
    model.eval()
    with torch.no_grad():
        # warmup
        input = torch.randn((batch_size,) + data_config['input_size']).cuda()
        model(input)

        bar = tqdm(desc="Evaluation", mininterval=5, total=50000)
        evaluator.reset_time()
        sample_count = 0
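        for input, target in loader:
            output = model(input)
            num_samples = len(output)
            # outputs are matched to filenames by running offset, which assumes the loader
            # yields samples in dataset order
            image_ids = [filenames[i] for i in range(sample_count, sample_count + num_samples)]
            output = output.cpu().numpy()
            evaluator.add(dict(zip(image_ids, list(output))))
            sample_count += num_samples
            bar.update(num_samples)
            # stop early once the evaluator reports cached results for this model
            if evaluator.cache_exists:
                break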

        bar.close()

    evaluator.save()
    for k, v in evaluator.results.items():
        print(k, v)
    for k, v in evaluator.speed_mem_metrics.items():
        print(k, v)
    torch.cuda.empty_cache()
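
The evaluation loop above pairs each batch of outputs with image IDs through a running `sample_count` offset. The following is a minimal, framework-free sketch of that bookkeeping, assuming batches arrive in dataset order. The function name `collect_predictions` and the toy data are hypothetical and are not part of `sotabench.py` or the sotabench evaluator.

```python
from typing import Dict, Iterable, List, Sequence


def collect_predictions(
    filenames: Sequence[str],
    batches: Iterable[Sequence[Sequence[float]]],
) -> Dict[str, List[float]]:
    """Map each image ID to its output row, assuming batches arrive in dataset order."""
    results: Dict[str, List[float]] = {}
    sample_count = 0  # running offset into `filenames`
    for batch in batches:
        num_samples = len(batch)
        image_ids = filenames[sample_count:sample_count + num_samples]
        if len(image_ids) != num_samples:
            # Fail loudly rather than silently mislabel outputs.
            raise ValueError("more outputs than filenames")
        results.update(zip(image_ids, (list(row) for row in batch)))
        sample_count += num_samples
    return results


# Hypothetical toy data: 5 images, 2 classes, batch size 2 (the last batch is short).
toy_filenames = ["img_0", "img_1", "img_2", "img_3", "img_4"]
toy_batches = [
    [[0.1, 0.9], [0.8, 0.2]],
    [[0.3, 0.7], [0.6, 0.4]],
    [[0.5, 0.5]],
]
predictions = collect_predictions(toy_filenames, toy_batches)
print(predictions["img_2"])
```

If the loader shuffled or dropped samples, the offset would no longer line up with `filenames`, so a check like the length comparison in this sketch is one way to catch that early.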

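The "(288x288 Mean-Max Pooling)" entries set `ttp=True` together with a larger `img_size`. The internals of `apply_test_time_pool` are not shown in this file, so the sketch below only illustrates the general idea of blending average and max pooling over spatial locations. The equal 0.5 weighting and the helper name `mean_max_pool` are assumptions made for illustration, not the library's implementation.

```python
from typing import List, Sequence


def mean_max_pool(logit_map: Sequence[Sequence[float]]) -> List[float]:
    """Combine per-location class scores as 0.5 * (mean + max) for each class.

    `logit_map` holds one row of class scores per spatial location.
    """
    if not logit_map:
        raise ValueError("need at least one spatial location")
    num_classes = len(logit_map[0])
    pooled = []
    for c in range(num_classes):
        scores = [location[c] for location in logit_map]
        pooled.append(0.5 * (sum(scores) / len(scores) + max(scores)))
    return pooled


# Hypothetical 2x2 grid of locations flattened to 4 rows, with 2 classes.
toy_map = [
    [1.0, 0.0],
    [3.0, 2.0],
    [2.0, 4.0],
    [2.0, 2.0],
]
pooled = mean_max_pool(toy_map)
print(pooled)
```

Both classes here have the same mean score, so only the max term separates them, which is the kind of peaked response that max pooling is meant to preserve.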

BUILD | COMMIT MESSAGE | COMMIT | RUN TIME
-- | Update vision transformers to be compatible with official code. … | rwightman 736f209 · Oct 26 2020 | unknown
-- | Add ViT to sotabench | rwightman 477a78e 7613094 · Oct 22 2020 | unknown
-- | Improve test crop for ViT models. Small now 77.85, added base we… | rwightman 27a93e9 · Oct 21 2020 | unknown
-- | Add small vision transformer weights. 77.42 top-1. | rwightman d4db9e7 · Oct 21 2020 | unknown
-- | Merge pull request #255 from mrT23/master Adding ASL (asymmetri… | rwightman ccfb575 (+5 commits) · Oct 16 2020 | 1h:29m:03s
-- | Merge pull request #250 from rwightman/vision_transformer Visio… | rwightman 70ae7f0 (+4 commits) · Oct 13 2020 | 1h:34m:50s
-- | Add Adafactor and Adahessian optimizers, cleanup optimizer arg p… | rwightman 80078c4 · Oct 09 2020 | 16h:50m:51s
-- | Add missing leaky_relu layer factory defn, update Apex/Native lo… | rwightman fcb6258 · Oct 02 2020 | 1h:27m:22s
-- | Merge pull request #244 from hollance/master Bug fix: test_time… | rwightman 186075e (+2 commits) · Oct 02 2020 | 1h:26m:15s
-- | Missed moving some seresnet -> legacy in sotabench. Check sotabe… | rwightman 4be5b51 · Sep 25 2020 | 1h:57m:21s
-- | Add DropPath (stochastic depth) to ReXNet and VoVNet. RegNet Dro… | rwightman e8ca458 e8e2d9c · Sep 24 2020 | 17h:37m:39s
-- | Add EfficientNet-EdgeTPU-M (efficientnet_em) model trained nativ… | rwightman 9c40653 · Sep 23 2020 | 1h:55m:46s
-- | Another sotabench.py debug iter | rwightman 3681c5c · Sep 18 2020 | 0h:15m:16s
-- | Sotabench debugging | rwightman 0802985 · Sep 18 2020 | 0h:13m:44s
-- | Add ResNet weights. 80.5 (top-1) ResNet-50-D, 77.1 ResNet-34-D, … | rwightman c40384f · Sep 18 2020 | 0h:13m:46s
-- | Merge pull request #237 from rwightman/utils_cleanup Utils refa… | rwightman e39bf6e (+2 commits) · Sep 11 2020 | 0h:14m:38s
-- | Update README.md | rwightman 9ce42d5 · Sep 03 2020 | 0h:12m:03s
-- | Update README.md | rwightman 0729dbe · Sep 03 2020 | unknown
-- | Updated README, add wide_resnet50_2 and seresnext50_32x4d weights | rwightman 33f8a1b · Sep 03 2020 | 0h:13m:44s
-- | Merge pull request #233 from rwightman/torchamp Native Torch AM… | rwightman 5247eb3 (+9 commits) · Sep 02 2020 | 0h:13m:50s
-- | Update README.md | rwightman 6d158ad · Aug 26 2020 | 0h:12m:38s
-- | Fix MobileNetV3 crash with global_pool='', output consistent wit… | rwightman 470220b · Aug 18 2020 | 0h:12m:32s
-- | Fix a silly bug in Sample version of EvoNorm missing x* part of … | rwightman fc8b8af · Aug 13 2020 | unknown
-- | Update README.md | rwightman fa26f6c · Aug 12 2020 | unknown
-- | Bump version to 0.2.1 and update README | rwightman f614df7 · Aug 12 2020 | unknown
-- | Merge pull request #218 from rwightman/cutmix CutMix + MixUp ov… | rwightman b423bc8 (+6 commits) · Aug 12 2020 | unknown
-- | Add CSPResNet50 weights, 79.6 top-1 at 256x256 | rwightman 0f5d9d8 · Aug 12 2020 | unknown
-- | Update test workflow | rwightman 0734c0d · Aug 11 2020 | unknown
-- | Fix a few more issues related to #216 w/ TResNet (space2depth) a… | rwightman b1b6e7c (+2 commits) · Aug 11 2020 | unknown
-- | Merge pull request #216 from yu4u/fix_default_cfgs Fix default_… | rwightman 47794d2 (+2 commits) · Aug 11 2020 | unknown
-- | Merge pull request #214 from MohamedAliRashad/patch-1 mobilenet… | rwightman 44d8ecc 078a51d · Aug 08 2020 | unknown
-- | Change default_cfg names for senet to include the legacy and mat… | rwightman d5145fa · Aug 08 2020 | 17 days, 4h:45m:39s
-- | A few typos and missed updates in changelog | rwightman 6e9d617 · Aug 05 2020 | 0h:12m:51s
-- | Fix some documentation rendering issues | rwightman 57510fd · Aug 05 2020 | 0h:11m:34s
-- | Merge pull request #175 from rwightman/features Feature extract… | rwightman 80c3051 (+50 commits) · Aug 05 2020 | 0h:11m:54s
-- | Add `adamp` and 'sgdp' optimizers. Update requirements.txt Upd… | rwightman e93e571 · Jul 25 2020 | 0h:11m:34s
-- | Add autosquash workflow | rwightman 0915bed · Jul 25 2020 | 0h:12m:37s
-- | Merge branch 'michalwols-docs' | rwightman 17f4dd2 (+2 commits) · Jul 10 2020 | 0h:14m:10s
-- | Update setup.py Exclude results from possible packaging as it h… | rwightman 31cf125 · Jul 09 2020 | 0h:13m:45s
-- | Merge pull request #183 from KushajveerSingh/results_diff Add r… | rwightman 0d5550c (+3 commits) · Jul 09 2020 | 0h:14m:36s
-- | Fix #173, lr cycle default 0 vs 1. Fix #177, mirror resnest weig… | rwightman d72ac0d · Jun 29 2020 | 0h:49m:04s
-- | Remove tests from distrib | rwightman 24e7535 · Jun 16 2020 | 0h:52m:04s
-- | Add ESE-VoVNet-19-DW weights | rwightman 328339a · Jun 15 2020 | 0h:48m:00s
-- | Fix default interpolation/crop of largest 2 ResNeSt models | rwightman 2d83752 · Jun 12 2020 | 1h:03m:43s
-- | Update README with model results and attribution. Make scheduler… | rwightman f225ae8 · Jun 12 2020 | 1h:20m:54s
-- | Merge pull request #155 from rwightman/densenet_update_and_more … | rwightman d1b5ddd (+22 commits) · Jun 11 2020 | 2h:00m:55s
-- | Update requirements so PyTorch 1.4 is min, add separate sotabenc… | rwightman 5966654 · May 24 2020 | 0h:40m:56s
-- | Update sotabench.py | rwightman d79ac48 · May 22 2020 | 0h:12m:19s
-- | Merge pull request #154 from rwightman/tests_bugfixes Add backw… | rwightman e881383 (+5 commits) · May 21 2020 | 0h:11m:09s
-- | Merge pull request #150 from rwightman/regnet Add RegNet models… | rwightman 50658b9 ea2e59c · May 18 2020 | 0h:11m:47s
-- | Merge pull request #148 from rwightman/drop_block_improve Impro… | rwightman 1904ed8 dab9935 · May 13 2020 | 0h:11m:37s
-- | Merge pull request #146 from rwightman/inceptionv3_fix Remove a… | rwightman 17270c6 63addb7 · May 12 2020 | 0h:12m:50s
-- | Merge pull request #145 from rwightman/resnest ResNeSt | rwightman c4ca016 (+3 commits) · May 12 2020 | 0h:14m:19s
-- | Refactor test indent | rwightman 9cc289f 5bd1ad1 · May 12 2020 | 0h:16m:12s
-- | Update test_inference.py Not six min | rwightman e545bb9 · May 07 2020 | 0h:09m:36s
-- | Update test_inference.py Make the timeout 5-min for now, see if… | rwightman 305a2db · May 07 2020 | 0h:18m:33s
-- | Merge pull request #143 from michalwols/master Setup Github Act… | rwightman 14e01b8 (+4 commits) · May 07 2020 | 0h:19m:11s
-- | Merge pull request #141 from Animatory/fix_HRNet Fixed HRNet mo… | rwightman f0eb021 6cc11a8 · May 05 2020 | 0h:15m:23s
-- | Merge pull request #140 from yoniaflalo/PR_MultiEpochsDataLoader… | rwightman a7f570c 3b72ebf · May 05 2020 | 0h:17m:04s
-- | -- | -- | 0h:14m:54s
-- | Fix #139. Broken SKResNets after BlurPool addition, as a plus, S… | rwightman 8d8677e · May 04 2020 | 0h:16m:12s
-- | Update README.md | rwightman 353a79a · May 03 2020 | 0h:13m:54s
-- | Bump version for Pypi release | rwightman c9b6f41 · May 03 2020 | 0h:16m:30s
-- | Add EfficientNet pruned models to results files | rwightman 31f4c12 · May 03 2020 | 0h:16m:33s
-- | Fix pruned txt files not being installed during pip install | rwightman 375f3e5 · May 03 2020 | 0h:17m:53s
-- | Merge pull request #136 from yoniaflalo/adding_effnet_pruned ad… | rwightman 9c15d57 8ec554b · May 03 2020 | 0h:12m:04s
-- | Update README.md | rwightman a4d20a1 · May 01 2020 | 0h:09m:39s
-- | Fix model create fn not passing num_classes through. Fix #135 | rwightman ea30070 · May 01 2020 | 0h:16m:44s
-- | Update results with new models | rwightman 779cb0f · May 01 2020 | 0h:16m:45s
-- | Merge branch 'master' of github.com:rwightman/pytorch-models | rwightman 2c438c4 (+9 commits) · May 01 2020 | 0h:15m:56s
-- | Merge pull request #125 from Separius/patch-1 fix typo in eca | rwightman 20290b5 a5220ad · May 01 2020 | 0h:19m:08s
-- | Merge branch 'yoniaflalo-adding_ECA_resnet' | rwightman 7a9942a (+3 commits) · May 01 2020 | 0h:17m:11s
-- | Bump version for pypi release. Fix #130 | rwightman 1d4ac1b (+6 commits) · Apr 27 2020 | 0h:15m:31s
-- | Merge pull request #122 from mrT23/master TResNet models | rwightman ebf82b8 (+3 commits) · Apr 12 2020 | 0h:49m:15s
-- | Merge pull request #123 from aclex/mobilenetv3_fix_feature_extra… | rwightman bdb165a e15f979 · Apr 12 2020 | 0h:51m:55s
-- | Remove poorly named metrics from torch imagenet example origins.… | rwightman 13cf688 · Apr 10 2020 | 0h:52m:25s
-- | Bump version for pypi | rwightman 56608c9 (+2 commits) · Apr 09 2020 | 0h:50m:09s
-- | Merge pull request #117 from VRandme/typo_eca minor PR to fix t… | rwightman 06a50a9 e01ccb8 · Apr 07 2020 | 0h:50m:19s
-- | Merge pull request #115 from rwightman/mobilenetv2-experiment M… | rwightman c99a5ab (+3 commits) · Apr 05 2020 | 0h:50m:19s
-- | Add better resnext50_32x4d weights trained by andravin | rwightman 5a16c53 · Mar 18 2020 | 0h:52m:01s
-- | Merge pull request #105 from rwightman/efficientnet-lite Effici… | rwightman 71b5cd6 (+3 commits) · Mar 18 2020 | 1h:04m:08s
-- | Merge pull request #99 from andravin/save-last Modified save_ch… | rwightman d92cc4d 7deacf5 · Mar 15 2020 | 0h:47m:33s
-- | Merge pull request #94 from rwightman/lr_noise Learning rate no… | rwightman 56e2ac3 (+4 commits) · Feb 29 2020 | 0h:52m:50s
-- | Annotate types on drop fns to avoid torchscript error | rwightman c60069c · Feb 27 2020 | 0h:52m:10s
-- | version bump for PyPi update | rwightman cc5a11a · Feb 22 2020 | 0h:48m:15s
-- | Forgot to add skresnet34 to sotabench | rwightman d77f45a · Feb 18 2020 | 0h:48m:25s
-- | Simpler approach to loading entrypoints in hubconf works properly | rwightman 6620770 · Feb 18 2020 | unknown
-- | Merge pull request #88 from rwightman/attention A lot of attent… | rwightman e0685dd (+42 commits) · Feb 18 2020 | 3h:03m:51s
-- | Add map_location='cpu' to ModelEma resume, should improve #72 | rwightman f098fda · Feb 12 2020 | 0h:47m:23s
-- | Add L2-475 PyTorch preprocessing result, update sotabench for ne… | rwightman b949699 · Feb 12 2020 | 4h:27m:05s
-- | Add ported EfficientNet-L2, B0-B7 NoisyStudent weights from TF T… | rwightman ba15ca4 · Feb 12 2020 | 0h:47m:19s
-- | Remove unused default_init for EfficientNets, experimenting with… | rwightman cade829 d0eb59e · Feb 09 2020 | 0h:43m:16s
-- | Update README.md | rwightman 5eb0e36 · Feb 06 2020 | 0h:48m:20s
-- | Add PyTorch trained EfficientNet-ES weights from Andrew Lavin | rwightman 5c4991a · Feb 06 2020 | 0h:44m:26s
-- | Indentation mistake. Fixes #81 | rwightman d66819d · Feb 04 2020 | 0h:48m:40s
-- | Merge pull request #83 from andravin/validation-batch-size-multi… | rwightman b72013d 65cda1c · Feb 04 2020 | 0h:47m:33s
-- | Bump version for PyPi update, fix few out of date README items/m… | rwightman 4808b3c · Feb 03 2020 | 0h:46m:53s
-- | Update README.md | rwightman 5c85389 · Feb 02 2020 | 0h:42m:13s
-- | Update README.md | rwightman 820b73d · Feb 02 2020 | unknown
-- | Update README.md Fix relative paths (I think) | rwightman 82c0a2f · Feb 02 2020 | unknown
-- | Add results/README.md | rwightman 1ffd2d0 · Feb 02 2020 | 0h:48m:03s
-- | Update sotabench with tf_efficientnet_b8 model | rwightman fd98fb3 (+4 commits) · Feb 01 2020 | 1h:12m:15s
-- | Add warning about using sync-bn with zero initialized BN layers.… | rwightman 5b7cc16 · Jan 31 2020 | 0h:41m:02s
-- | Update README.md Typo | rwightman b18c199 · Jan 31 2020 | unknown
-- | New ResNet50 JSD + RandAugment weights | rwightman 12dbc74 · Jan 31 2020 | 0h:51m:06s
-- | -- | -- | 0h:55m:04s
-- | Update README.md Fix typo | rwightman e39aae5 · Jan 13 2020 | 0h:05m:21s
-- | Update README.md | rwightman 7a17ee9 (+3 commits) · Jan 12 2020 | 0h:09m:57s
-- | Update README.md | rwightman 2a88412 · Jan 11 2020 | 0h:05m:23s
-- | Merge pull request #74 from rwightman/augmix-jsd AugMix, JSD lo… | rwightman d9a6a9d (+7 commits) · Jan 11 2020 | 0h:08m:27s
-- | Add tiered narrow ResNet (tn) and weights for seresnext26tn_32x4d | rwightman a28117e · Jan 11 2020 | 0h:09m:09s
-- | Update README.md | rwightman cfa951b · Jan 07 2020 | 0h:09m:41s
-- | Update README.md | rwightman 7622015 · Jan 04 2020 | 0h:09m:11s
-- | Add updated RandAugment trained EfficientNet-B0 trained weights … | rwightman ec0dd40 · Jan 03 2020 | 0h:10m:16s
-- | Plural for averaging script. | rwightman 8662454 · Jan 03 2020 | 0h:10m:16s
-- | Add checkpoint averaging script. Add headers, shebangs, exec per… | rwightman 4666cc9 40fea63 · Jan 03 2020 | 0h:11m:02s
-- | ResNet / Res2Net additions: * ResNet torchscript compat * output… | rwightman 53001dd (+2 commits) · Jan 01 2020 | 0h:46m:03s
-- | Update README.md Update readme with SelecSLS details. | rwightman a7fe891 · Dec 30 2019 | 0h:46m:43s
-- | Merge pull request #66 from rwightman/selecsls_updates SelecSLS… | rwightman e728d70 (+4 commits) · Dec 30 2019 | 0h:51m:07s
-- | Merge pull request #65 from mehtadushy/selecsls Incorporate Sel… | rwightman fb3a0f4 (+3 commits) · Dec 30 2019 | 0h:42m:33s
-- | Update README.md | rwightman 0554b79 · Dec 29 2019 | 0h:40m:38s
-- | Update sotabench.py | rwightman a4497af · Dec 28 2019 | 0h:44m:51s
-- | Update README with B3 training details | rwightman 53f578e (+3 commits) · Dec 28 2019 | 0h:56m:32s
-- | Add ResNet deep tiered stem and model weights for seresnext26t_3… | rwightman 1f4498f · Dec 28 2019 | 0h:45m:56s
-- | Add update RandAugment MixNet-XL weights | rwightman 73b7845 · Dec 24 2019 | 0h:49m:50s
-- | Merge pull request #62 from rwightman/reduce-bn Distribute Batc… | rwightman ff8688c (+4 commits) · Dec 19 2019 | 0h:46m:20s
-- | Update README.md Update latest training hparam/command line wit… | rwightman 5d7af97 · Dec 05 2019 | unknown
-- | Update README.md | rwightman 3129bdb · Dec 04 2019 | unknown
-- | New PyTorch trained EfficientNet-B2 weights with my RandAugment … | rwightman ff421e5 · Dec 04 2019 | unknown
-- | Update results-all.csv with latest models/weights | rwightman 00b9340 · Nov 29 2019 | unknown
-- | Update README.md with latest changes | rwightman 5259dbc · Nov 29 2019 | unknown
-- | Finish with HRNet, weights and models updated. Improve consisten… | rwightman 3bef524 · Nov 29 2019 | unknown
-- | Merge pull request #53 from rwightman/condconvs_and_features Ma… | rwightman 3ceeedc (+12 commits) · Nov 28 2019 | unknown
-- | Merge pull request #52 from rwightman/randaugment RandAugment a… | rwightman db04677 (+5 commits) · Nov 22 2019 | unknown
-- | Fix non-prefetch variant of Mixup. Fixes #50 | rwightman 4748c6d · Nov 02 2019 | unknown
-- | Add TF RandAug weights for B5/B7 EfficientNet models. | rwightman 0d58c50 · Oct 30 2019 | unknown
-- | Better differentiate sotabench WSL, SSL, and SWSL models via mod… | rwightman 62105ed · Oct 20 2019 | unknown
#9 | Map pretrained checkpoint to cpu to avoid issue with some pretra… | rwightman c099374 · Oct 19 2019 | unknown
#8 | Add Facebook Research Semi-Supervised and Semi-Weakly Supervised… | rwightman a9eb484 b93fcf0 · Oct 19 2019 | unknown
#7 | -- | -- | unknown
#6 | Add support for loading args from yaml file (and saving them wit… | rwightman 187ecba · Sep 09 2019 | unknown
#5 | Fix Mobilenet V3 model name for sotabench. Minor res2net cleanup. | rwightman d3ba34e · Sep 05 2019 | unknown
#4 | Silly typos. | rwightman b5a8bb5 · Sep 04 2019 | unknown
#3 | sotabench fail | rwightman 7dc5d7a · Sep 04 2019 | unknown
#2 | Merge pull request #35 from rwightman/res2net_dla Add Res2net a… | rwightman 96364fc (+2 commits) · Sep 04 2019 | unknown
#1 | -- | -- | unknown
#0 | -- | -- | unknown