Video Transformer Network (https://arxiv.org/abs/2102.00719) #388
bomri wants to merge 1 commit into facebookresearch:main
bomri commented on Mar 24, 2021
- VTN model setup
- add the ability to return the entire video
- add support to return the frames index
- update defaults
- VIT_B_VTN.yaml
- adjusting the if-else in pack_pathway_output
- VTN README.md + update main README.md + update MODEL_ZOO
Hi @feichtenhofer, we recently published our work on video action recognition using Transformers (https://arxiv.org/abs/2102.00719). As PySlowFast aims to provide novel research implementations in this domain, we modified our codebase and models to make them available via this repository. We'd appreciate it if you could consider merging our pull request; we think it would be great to share it here with the community.
+1, we would also appreciate the inclusion of this model in PySlowFast.
+1 would be great to have this model as a backbone for experiments, thank you!
Hi @bomri, thanks for this pull request, and glad PySlowFast is of help for your research. We would need to do a careful review before merging this, because it adds some nontrivial overhead to the main logic, especially as it adds several functionalities and configurations to the core PySF code. Generally, we would prefer that you use a fork, which we can then link to, similar to how external projects are linked in detectron2: https://github.com/facebookresearch/detectron2/tree/master/projects#external-projects. Related to this, we will be updating the codebase around next week with some ViT baselines from concurrent work, which should hopefully provide one more base for future work on video transformers. I'm adding @haooooooqi here for further help on this pull request.
Thank you @feichtenhofer for your response.