
EasyPortrait - Face Parsing and Portrait Segmentation Dataset

We introduce EasyPortrait, a large-scale image dataset for portrait segmentation and face parsing. The proposed dataset can be used in several tasks, such as background removal in conference applications, teeth whitening, face skin enhancement, red-eye removal or eye colorization.

The EasyPortrait dataset is about 91.78 GB in size and contains 40,000 RGB images (~38.3K FullHD images) with high-quality annotated masks. The dataset is divided into training, validation and test sets by subject `user_id`. The training set includes 30,000 images, the validation set includes 4,000 images, and the test set includes 6,000 images.
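Splitting by subject means that all images of one `user_id` land in the same subset, so no person appears in both training and evaluation data. The following is a minimal sketch of such a split, not the authors' actual procedure: the function name `split_by_subject`, the seed, and the choice to apply the ratios to the number of subjects (rather than the number of images) are assumptions made for illustration.

```python
import random
from collections import defaultdict


def split_by_subject(records, ratios=(0.75, 0.10, 0.15), seed=0):
    """Assign every image of a subject to exactly one subset.

    records: iterable of (image_name, user_id) pairs.
    ratios:  fractions of subjects for train / val; the remainder goes to test.
    Returns a dict mapping "train" / "val" / "test" to lists of image names.
    """
    by_user = defaultdict(list)
    for image_name, user_id in records:
        by_user[user_id].append(image_name)

    users = sorted(by_user)  # sort first so the shuffle is reproducible
    random.Random(seed).shuffle(users)

    n_train = round(ratios[0] * len(users))
    n_val = round(ratios[1] * len(users))
    groups = {
        "train": users[:n_train],
        "val": users[n_train:n_train + n_val],
        "test": users[n_train + n_val:],
    }
    return {
        name: [img for user in ids for img in by_user[user]]
        for name, ids in groups.items()
    }


# Toy data: 20 hypothetical subjects with 5 images each.
records = [(f"img_{i}", f"user_{i % 20}") for i in range(100)]
splits = split_by_subject(records)
print({name: len(images) for name, images in splits.items()})
```

Because the ratios are applied to subjects, the image-level proportions only match them exactly when every subject contributes the same number of images.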

For more information see our paper EasyPortrait – Face Parsing and Portrait Segmentation Dataset.

🔥 Changelog

  • 2023/11/13: We release EasyPortrait 2.0. ✌️
    • 40,000 RGB images (~38.3K FullHD images)
    • Added diversity by region, race, human emotions and lighting conditions
    • The data was further cleaned and new samples were added
    • Train/val/test split: (30,000) 75% / (4,000) 10% / (6,000) 15% by subject `user_id`
    • Multi-GPU training and testing
    • Added new models for face parsing and portrait segmentation
    • Dataset size is 91.78 GB
    • 13,705 unique persons
  • 2023/02/23: EasyPortrait (Initial Dataset) 💪
    • Dataset size is 26 GB
    • 20,000 RGB images (~17.5K FullHD images) with 9 classes annotated
    • Train/val/test split: (14,000) 70% / (2,000) 10% / (4,000) 20% by subject `user_id`
    • 8,377 unique persons

The old EasyPortrait dataset is also available in the `EasyPortrait_v1` branch.

Downloads

| Link | Size |
|---|---|
| images | 91.8 GB |
| annotations | 657.1 MB |
| meta | 1.9 MB |
| train set | 68.3 GB |
| validation set | 10.7 GB |
| test set | 12.8 GB |

You can also download the EasyPortrait dataset from Kaggle.

Structure

.
├── images.zip
│   ├── train/         # Train set: 30k
│   ├── val/           # Validation set: 4k
│   ├── test/          # Test set: 6k
├── annotations.zip
│   ├── train/
│   ├── val/
│   ├── test/
├── meta.zip           # Meta-information (width, height, brightness, imhash, user_id)
...
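Given this layout, each image can be matched to its mask by file name. The sketch below assumes that the archives have been extracted into `images/` and `annotations/` directories under a common root, and that an image and its mask share the same file name without extension; both are assumptions, and the helper `pair_images_with_masks` is hypothetical rather than part of the repository.

```python
from pathlib import Path


def pair_images_with_masks(root, subset):
    """Return sorted (image_path, mask_path) pairs for one subset.

    root:   directory containing the extracted images/ and annotations/ folders
            (assumed layout).
    subset: "train", "val" or "test".
    """
    image_dir = Path(root) / "images" / subset
    mask_dir = Path(root) / "annotations" / subset

    # Index both folders by file name without extension.
    images = {p.stem: p for p in image_dir.iterdir() if p.is_file()}
    masks = {p.stem: p for p in mask_dir.glob("*.png")}

    # Keep only names present on both sides.
    common = sorted(images.keys() & masks.keys())
    return [(images[name], masks[name]) for name in common]
```

Calling `pair_images_with_masks("/path/to/easyportrait", "val")` would then return the validation pairs; images without a mask (or masks without an image) are silently skipped, which may or may not be the behaviour you want.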

Models

We provide several pre-trained models as baselines for portrait segmentation and face parsing. We use mean Intersection over Union (mIoU) as the main metric.

Portrait segmentation:

| Model Name | Parameters (M) | Input shape | mIoU |
|---|---|---|---|
| BiSeNet-V2 | 56.5 | 384 × 384 | 97.95 |
| DANet | 190.2 | 384 × 384 | 98.63 |
| DeepLabv3 | 260 | 384 × 384 | 98.63 |
| ExtremeC3Net | 0.15 | 384 × 384 | 96.54 |
| Fast SCNN | 6.13 | 384 × 384 | 97.64 |
| FCN + MobileNetv2 | 31.17 | 384 × 384 | 98.19 |
| FPN + ResNet50 | 108.91 | 1024 × 1024 | 98.54 |
| FPN + ResNet50 | 108.91 | 512 × 512 | 98.64 |
| FPN + ResNet50 | 108.91 | 384 × 384 | 98.64 |
| FPN + ResNet50 | 108.91 | 224 × 224 | 98.31 |
| SegFormer-B0 | 14.9 | 1024 × 1024 | 98.74 |
| SegFormer-B0 | 14.9 | 512 × 512 | 98.66 |
| SegFormer-B0 | 14.9 | 384 × 384 | 98.61 |
| SegFormer-B0 | 14.9 | 224 × 224 | 98.17 |
| SINet | 0.13 | 384 × 384 | 93.32 |

Face parsing:

| Model Name | Parameters (M) | Input shape | mIoU |
|---|---|---|---|
| BiSeNet-V2 | 56.5 | 384 × 384 | 76.72 |
| DANet | 190.2 | 384 × 384 | 79.3 |
| DeepLabv3 | 260 | 384 × 384 | 79.11 |
| EHANet | 44.81 | 384 × 384 | 72.56 |
| Fast SCNN | 6.13 | 384 × 384 | 67.56 |
| FCN + MobileNetv2 | 31.17 | 384 × 384 | 75.23 |
| FPN + ResNet50 | 108.91 | 1024 × 1024 | 85.37 |
| FPN + ResNet50 | 108.91 | 512 × 512 | 83.33 |
| FPN + ResNet50 | 108.91 | 384 × 384 | 81.83 |
| FPN + ResNet50 | 108.91 | 224 × 224 | 75.6 |
| SegFormer-B0 | 14.9 | 1024 × 1024 | 85.42 |
| SegFormer-B0 | 14.9 | 512 × 512 | 83.19 |
| SegFormer-B0 | 14.9 | 384 × 384 | 81.38 |
| SegFormer-B0 | 14.9 | 224 × 224 | 74.83 |
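For readers unfamiliar with the metric in the tables above: mIoU averages the per-class ratio of intersection to union between predicted and ground-truth pixels. Below is a minimal pure-Python sketch, assuming both inputs are flat sequences of class indices. The function name `mean_iou` and the decision to skip classes absent from both inputs are illustrative assumptions; this is not the evaluation code used to produce the reported numbers, which are given in percent.

```python
def mean_iou(pred, target, num_classes):
    """Mean Intersection over Union for two flat sequences of class indices.

    Classes that occur in neither `pred` nor `target` are skipped
    (an assumption; some toolkits report them as NaN instead).
    Returns a fraction in [0, 1].
    """
    intersection = [0] * num_classes
    union = [0] * num_classes
    for p, t in zip(pred, target):
        if p == t:
            # Pixel is in both the predicted and the true region of class p.
            intersection[p] += 1
            union[p] += 1
        else:
            # Pixel belongs to the predicted region of p and the true region of t.
            union[p] += 1
            union[t] += 1

    ious = [i / u for i, u in zip(intersection, union) if u > 0]
    if not ious:
        raise ValueError("no pixels to evaluate")
    return sum(ious) / len(ious)


# Toy example with 3 classes and 6 pixels.
pred = [0, 0, 1, 1, 2, 2]
target = [0, 1, 1, 1, 2, 0]
print(mean_iou(pred, target, num_classes=3))
```

Whether intersections and unions are accumulated per image or over a whole dataset changes the result, so numbers from this sketch are only comparable to published ones if the accumulation scheme matches.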

Annotations

Annotations are provided as 2D arrays stored as images in `*.png` format, with the following classes:

| Index | Class |
|---|---|
| 0 | BACKGROUND |
| 1 | PERSON |
| 2 | SKIN |
| 3 | LEFT_BROW |
| 4 | RIGHT_BROW |
| 5 | LEFT_EYE |
| 6 | RIGHT_EYE |
| 7 | LIPS |
| 8 | TEETH |
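In code, the class indices can be kept in an enumeration so that masks are handled by name instead of by magic number. The sketch below represents a mask as a list of rows of integers to stay dependency-free; in practice the PNG would be loaded with an image library. The names `FaceClass`, `to_portrait_mask` and `class_histogram` are hypothetical, and collapsing every non-background class into a single person class for portrait segmentation is an assumption about how the two-class task relates to the nine-class annotation.

```python
from enum import IntEnum


class FaceClass(IntEnum):
    """Class indices as listed in the annotation table."""
    BACKGROUND = 0
    PERSON = 1
    SKIN = 2
    LEFT_BROW = 3
    RIGHT_BROW = 4
    LEFT_EYE = 5
    RIGHT_EYE = 6
    LIPS = 7
    TEETH = 8


def to_portrait_mask(mask):
    """Collapse a face-parsing mask to background (0) vs. person (1).

    Assumption: every non-background class counts as part of the person.
    """
    return [[0 if v == FaceClass.BACKGROUND else 1 for v in row] for row in mask]


def class_histogram(mask):
    """Count pixels per class name; raises ValueError on an unknown index."""
    counts = {c.name: 0 for c in FaceClass}
    for row in mask:
        for v in row:
            counts[FaceClass(v).name] += 1
    return counts


# Tiny hypothetical 2x3 mask.
mask = [[0, 1, 2],
        [7, 8, 0]]
print(to_portrait_mask(mask))
print(class_histogram(mask))
```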

We also provide additional meta-information for the dataset in the `annotations/meta.zip` file:

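| | image_name | user_id | height | width | set | brightness |
|---|---|---|---|---|---|---|
| 0 | a753e021-... | 56... | 720 | 960 | train | 126 |
| 1 | 4ff04492-... | ba... | 1920 | 1440 | test | 173 |
| 2 | e8934c99-... | 1d... | 1920 | 1440 | val | 187 |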

where:

  • `image_name` - image file name without extension
  • `user_id` - unique anonymized user ID
  • `height` - image height
  • `width` - image width
  • `brightness` - image brightness
  • `set` - "train", "test" or "val" for train / test / val subsets respectively
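As an illustration of how these fields might be used, the sketch below parses a few rows and checks that no `user_id` appears in more than one subset. It assumes the meta-information is a comma-separated file with the column names listed above; the actual file format inside the archive is not stated here, and the sample rows, `load_meta` and `subjects_in_multiple_sets` are hypothetical.

```python
import csv
import io
from collections import Counter, defaultdict

# Hypothetical sample rows; the real data and its exact format are in the archive.
SAMPLE_META = """image_name,user_id,height,width,set,brightness
img_a,user_1,720,960,train,126
img_b,user_2,1920,1440,test,173
img_c,user_3,1920,1440,val,187
img_d,user_1,720,960,train,140
"""


def load_meta(fileobj):
    """Read meta rows and convert the numeric columns."""
    rows = []
    for row in csv.DictReader(fileobj):
        row["height"] = int(row["height"])
        row["width"] = int(row["width"])
        row["brightness"] = float(row["brightness"])
        rows.append(row)
    return rows


def subjects_in_multiple_sets(rows):
    """Return user_ids that occur in more than one subset (expected: none)."""
    sets_per_user = defaultdict(set)
    for row in rows:
        sets_per_user[row["user_id"]].add(row["set"])
    return sorted(user for user, sets in sets_per_user.items() if len(sets) > 1)


rows = load_meta(io.StringIO(SAMPLE_META))
print(Counter(row["set"] for row in rows))   # images per subset
print(subjects_in_multiple_sets(rows))       # subjects leaking across subsets
```

For the real file, `io.StringIO(SAMPLE_META)` would be replaced by an opened file object.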


Training, Evaluation and Testing on EasyPortrait

The code is based on MMSegmentation version 0.30.0.

Models were trained and evaluated on 8 NVIDIA V100 GPUs with CUDA 11.2.

For installation, follow the instructions here and use the `requirements.txt` file in our repository.

Training

For single GPU mode:

For distributed training mode:

Evaluation

For single GPU mode:

For distributed evaluation mode:

Run demo

Authors and Credits

Citation

You can cite the paper using the following BibTeX entry:

@article{EasyPortrait,
    title={EasyPortrait - Face Parsing and Portrait Segmentation Dataset},
    author={Kapitanov, Alexander and Kvanchiani, Karina and Kirillova, Sofia},
    journal={arXiv preprint arXiv:2304.13509},
    year={2023}
}

License

This work is licensed under a variant of the Creative Commons Attribution-ShareAlike 4.0 International License.

Please see the specific license.