YOLOX Tiny weights don't match to the network structure

## 🐛 Bug

I checked the keys from state dict of YOLOX model created with the code in this project against the weights I loaded from YOLOX project for tiny sized model and I noticed that there are fewer items in the state dict.

After quite some investigation I figured that there is no corresponding layer for these (names are from YOLOX project):
```
backbone.backbone.dark3.1.m.2.conv1.conv.weight
backbone.backbone.dark3.1.m.2.conv1.bn.weight
backbone.backbone.dark3.1.m.2.conv1.bn.bias
backbone.backbone.dark3.1.m.2.conv1.bn.running_mean
backbone.backbone.dark3.1.m.2.conv1.bn.running_var
backbone.backbone.dark3.1.m.2.conv1.bn.num_batches_tracked
backbone.backbone.dark3.1.m.2.conv2.conv.weight
backbone.backbone.dark3.1.m.2.conv2.bn.weight
backbone.backbone.dark3.1.m.2.conv2.bn.bias
backbone.backbone.dark3.1.m.2.conv2.bn.running_mean
backbone.backbone.dark3.1.m.2.conv2.bn.running_var
backbone.backbone.dark3.1.m.2.conv2.bn.num_batches_tracked
```
I pinpointed this to a specific value in the code which I think is wrong when constructing YOLOX models.

### To Reproduce

1) Download [tiny weights from YOLOX](https://github.com/Megvii-BaseDetection/YOLOX/releases/download/0.1.1rc0/yolox_tiny.pth) and load it to pytorch.
2) Create YOLOXNetwork with 80 classes, width of 24 and depth of 1 which should correspond to tiny sized YOLOX network.
3) Compare the model weights (tensors) in state dictionaries (especially the number of them).

See also the code below.

#### Code sample

```
>>> import torch
>>> tiny = torch.load('yolox_tiny.pth', map_location=torch.device('cpu'), weights_only=True)["model"]
>>> len(tiny)
462
>>> import pl_bolts.models.detection
>>> yolox = pl_bolts.models.detection.yolo.torch_networks.YOLOXNetwork(80, width=24, depth=1)
>>> len(yolox.state_dict())
450
>>>
``` 

### Expected behaviour

I would expect that the networks have matching number of tensors.

### Environment

- PyTorch Version (e.g., 1.0): 2.6.0+cpu
- OS (e.g., Linux): Fedora Workstation 41
- How you installed PyTorch (`conda`, `pip`, source): pip
- Build command you used (if compiling from source): -
- Python version: 3.13
- CUDA/cuDNN version: None
- GPU models and configuration: None
- Any other relevant information: -

### Additional context

The code I figured would be the reason for this is [here](https://github.com/Lightning-Universe/lightning-bolts/blob/2c4602aa684e7b90e7ffdcea1d3f93a20f9c2ead/src/pl_bolts/models/detection/yolo/torch_networks.py#L557). If I change that `depth * 2` to `depth * 3`, I get matching number of values (462) in the dictionary. I don't know if the current value is the same as in YOLOv5 but at least for YOLOX it is wrong apparently. I also have not checked any other sized networks, that the shapes of the tensors match, or that I could load the weights after converting the names of the layers in the state dictionary.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

YOLOX Tiny weights don't match to the network structure #1100

🐛 Bug

To Reproduce

Code sample

Expected behaviour

Environment

Additional context

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

YOLOX Tiny weights don't match to the network structure #1100

Description

🐛 Bug

To Reproduce

Code sample

Expected behaviour

Environment

Additional context

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions