google-research
..
/
tubevit
README.md
Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning
This is the JAX/Flax implementation of the CVPR-2023 paper "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning".
The code will be added here soon.