google-research

Форк
0

README.md

Sign Language Detection

This is a TensorFlow implementation of the model proposed in Real-Time Sign Language Detection using Human Pose Estimation, published in SLRTP 2020.

This model is used in the Real-TIme Sign Language Detection for Videoconferencing demo published in ECCV 2020.

Models

This repository includes pre-trained models for both python (py) and javascript (js) .

Model Card

Usage

You can use the included models to perform inference or fine-tuning.

To load a model in python, use tensorflow.python.keras.models.load_model('models/py/model.h5').

To load a model in the browser, use tf.loadLayersModel('models/js/model.json') from tfjs.

You can use the train.py script to train the model from scratch using a tfrecord dataset file.

python -m train --dataset_path="data.tfrecord" --device="/GPU:0"

Dataset

The provided models were trained on the Public DGS Corpus.

The dataset is represented as a tfrecord file where each video has 4 properties: 1. fps:Int64List - the framerate of the video 1. pose_data:BytesList - human pose estimation, as a tensor of the shape (frames, 1, points, dimensions) 1. pose_confidence:BytesList - human pose estimation confidence, as a tensor of the shape (frames, 1, points) 1. is_signing:BytesList - a bytes object representing weather the user was signing or not in every frame.

Please see examples/create_tfrecord.py for an example of creating this record.

In this work, we use a 50:25:25 data split. The official split used in the trained models can be found in the split directory.

Citations

@inproceedings{moryossef2020sign,
  title={Real-Time Sign Language Detection using Human Pose Estimation},
  author={Amit Moryossef and Ioannis Tsochantaridis and Roee Aharoni and Sarah Ebling and S. Narayanan},
  journal={SLRTP},
  year={2020},
}


# If you are using the Public DGS Corpus
@inproceedings{hanke2020extending,
  title={{E}xtending the {P}ublic {DGS} {C}orpus in Size and Depth},
  author={Hanke, Thomas and Schulder, Marc and Konrad, Reiner and Jahn, Elena},
  booktitle={Proceedings of the LREC2020 9th Workshop on the Representation and Processing of Sign Languages: Sign Language Resources in the Service of the Language Community, Technological Challenges and Application Perspectives},
  pages={75--82},
  year={2020}
}

Использование cookies

Мы используем файлы cookie в соответствии с Политикой конфиденциальности и Политикой использования cookies.

Нажимая кнопку «Принимаю», Вы даете АО «СберТех» согласие на обработку Ваших персональных данных в целях совершенствования нашего веб-сайта и Сервиса GitVerse, а также повышения удобства их использования.

Запретить использование cookies Вы можете самостоятельно в настройках Вашего браузера.