CerberusDet: Unified Multi-Dataset Object Detection
The code is based on:
Install
Python>=3.8.0 is required.
Docker
Run the docker:
Data
- Use the script voc.py to download the VOC dataset
For information about the VOC dataset and its creators, visit the PASCAL VOC dataset website.
- Use the script objects365_part.py to download a subset of the Objects365 dataset with 19 animal categories:
['Monkey', 'Rabbit', 'Yak', 'Antelope', 'Pig', 'Bear', 'Deer', 'Giraffe', 'Zebra', 'Elephant',
'Lion', 'Donkey', 'Camel', 'Jellyfish', 'Other Fish', 'Dolphin', 'Crab', 'Seal', 'Goldfish']
Along with an Objects365 subset with 12 tableware categories:
[ 'Cup', 'Plate', 'Wine Glass', 'Pot', 'Knife', 'Fork', 'Spoon', 'Chopsticks',
'Cutting/chopping Board', 'Tea pot', 'Kettle', 'Tong']
To download full Objects365 dataset, set in the script objects365_part.py.
The Objects365 dataset is available for academic purposes only. For information about the dataset and its creators, visit the Objects365 dataset website.
Train
- Download YOLOv8 weights pretrained on COCO
- Run the training process on 1 GPU
- OR run the training process on several GPUs:
By default, logging is done with TensorBoard. To log to MLflow instead, set --mlflow-url.
CerberusDet model config details
Example of the model's config for 2 tasks: yolov8x_voc_obj365.yaml
- The model config is based on yolo configs, except that the `head` is divided into two sections (`head` and `neck`)
- The layers of the `neck` section can be shared between tasks or be unique
- The `head` section defines what the head will be for all tasks, but each task will always have its own unique parameters
- The `from` parameter of the first neck layer must be a positive ordinal number, specifying from which layer, starting from the beginning of the entire architecture, to take features.
- The `cerber` section is optional and defines the architecture configuration for determining the neck layers to be shared among tasks. If not specified, all layers will be shared among tasks, and only the heads will be unique.
- The CerberusDet configuration is constructed as follows:
  `cerber: List[OneBranchConfig]`, where
  `OneBranchConfig = List[cerber_layer_number, SharedTasksConfig]`, where
  `cerber_layer_number` - the layer number (counting from the end of the backbone) after which branching should occur
  `SharedTasksConfig = List[OneBranchGroupedTasks]`, where
  `OneBranchGroupedTasks = [number_of_task1_head, number_of_task2_head, ...]` - the task head numbers (essentially task IDs) that should be in the same branch and share layers thereafter
The head numbers will correspond to tasks according to the sequence in which they are listed in the data configuration.
Example for YOLO v8x:
`[[2, [[15], [13, 14]]], [6, [[13], [14]]]]` - configuration for 3 tasks. Task id=15 will have all task-specific layers, starting from the 3rd. Tasks id=13, id=14 will share layers 3-6, then after the 6th, they will have their own separate branches with all layers.
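To make the nesting of the `cerber` list easier to read, here is a minimal Python sketch that walks the example above and reports, for each task ID, the layer number after which that task sits alone in its own branch. The function name `solo_branch_layers` is hypothetical and is not part of CerberusDet. The sketch only illustrates the structure described here and is not how the repository parses its configs.

```python
from typing import Dict, List

# Hypothetical aliases that mirror the names used in the description above.
OneBranchGroupedTasks = List[int]
SharedTasksConfig = List[OneBranchGroupedTasks]


def solo_branch_layers(cerber: list) -> Dict[int, int]:
    """Map each task ID to the layer number after which it is alone in a branch.

    `cerber` is a list of [cerber_layer_number, SharedTasksConfig] pairs,
    assumed to be ordered by increasing layer number.
    """
    result: Dict[int, int] = {}
    for layer_number, groups in cerber:
        for group in groups:
            # A group with a single task ID means that task no longer shares layers.
            if len(group) == 1 and group[0] not in result:
                result[group[0]] = layer_number
    return result


example = [[2, [[15], [13, 14]]], [6, [[13], [14]]]]
print(solo_branch_layers(example))  # {15: 2, 13: 6, 14: 6}
```

The result matches the prose reading of the example: task 15 branches off after layer 2, while tasks 13 and 14 stay together until layer 6.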
Evaluation
- Download CerberusDet checkpoint (see below)
- Run script bash_scripts/val.sh
Inference
You can run inference using either the provided bash script or directly via the Python API.
1. Using Bash Script
First, download the CerberusDet checkpoint trained on VOC and parts of the Objects365 dataset (see the Pretrained Checkpoints section below).
Then, run the detection script:
2. Using Python API
You can also integrate CerberusDet into your own code. Below is an example of how to initialize the model, preprocess images, and visualize the results.
NOTE: To run inference using standard YOLOv8 checkpoints, use the `cerberusdet.yolo_wrapper.YOLOV8ForObjectDetection` class. Please ensure the following requirements are met:
Tip: Class names for specific datasets can be found in the corresponding YAML configuration files located in the `data/` directory.
Example using the VOC_07_12_best_state_dict.pt checkpoint
Pretrained Checkpoints
| Model | Train set | Size (pixels) | mAP val 50-95 | mAP val 50 | Speed V100 b32, fp16 (ms) | Params (M) | FLOPs @640 (B) |
|---|---|---|---|---|---|---|---|
| YOLOv8x | VOC | 640 | 0.758 | 0.916 | 5.6 | 68 | 257.5 |
| YOLOv8x | Objects365_animals | 640 | 0.43 | 0.548 | 5.6 | 68 | 257.5 |
| YOLOv8x | Objects365_tableware | 640 | 0.56 | 0.68 | 5.6 | 68 | 257.5 |
| YOLOv8x | Objects365_full | 640 | 0.291 | 0.381 | 5.6 | 70 | 267.0 |
| CerberusDet_v8x | VOC, Objects365_animals | 640 | 0.751, 0.432 | 0.918, 0.556 | 7.2 | 105 | 381.3 |
| CerberusDet_v8x | VOC, Objects365_animals, Objects365_tableware | 640 | 0.762, 0.421, 0.56 | 0.927, 0.541, 0.68 | 10 | 142 | 505.1 |
| CerberusDet_v8x | VOC, Objects365_full | 640 | 0.767, 0.355 | 0.932, 0.464 | 7.2 | 107 | 390.8 |
YOLOv8x models were trained with the commit: https://github.com/ultralytics/ultralytics/tree/2bc36d97ce7f0bdc0018a783ba56d3de7f0c0518
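One way to read the table is to compare each CerberusDet model with the set of single-task YOLOv8x models it replaces. The sketch below does that arithmetic with the parameter counts and speeds copied from the table. It assumes that separate models would run one after another, so their latencies add up. That assumption is for illustration only and is not a measurement from the repository. The name `relative_savings` is hypothetical.

```python
# (params in millions, latency in ms) copied from the table above.
SINGLE_TASK = {
    "VOC": (68, 5.6),
    "Objects365_animals": (68, 5.6),
    "Objects365_tableware": (68, 5.6),
    "Objects365_full": (70, 5.6),
}
CERBERUSDET = {
    ("VOC", "Objects365_animals"): (105, 7.2),
    ("VOC", "Objects365_animals", "Objects365_tableware"): (142, 10),
    ("VOC", "Objects365_full"): (107, 7.2),
}


def relative_savings(tasks):
    """Return (param_saving, latency_saving) as fractions of the separate-model totals."""
    # Assumption: separate models run sequentially, so params and latencies are summed.
    separate_params = sum(SINGLE_TASK[t][0] for t in tasks)
    separate_ms = sum(SINGLE_TASK[t][1] for t in tasks)
    params, ms = CERBERUSDET[tuple(tasks)]
    return 1 - params / separate_params, 1 - ms / separate_ms


for tasks in CERBERUSDET:
    p, s = relative_savings(tasks)
    print(f"{', '.join(tasks)}: {p:.1%} fewer params, {s:.1%} lower latency")
```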
Hyperparameter Evolution
See the launch example in bash_scripts/evolve.sh.
Notes
- To evolve hyperparameters specific to each task, specify initial parameters separately per task and append `--evolve_per_task`
- To evolve a specific set of hyperparameters, specify their names separated by commas via the `--params_to_evolve` argument, e.g. `--params_to_evolve 'box,cls,dfl'`
- Use absolute paths to configs.
- Specify the search algorithm via `--evolver`. You can use the search algorithms of the ray library (see available values here: predefined_evolvers.py), or `'yolov5'`
License
CerberusDet is released under the GNU AGPL v.3 license.
See the file LICENSE for more details.
Citing
If you use our models, code or dataset, we kindly request that you cite our paper and give the repository a ⭐
