This is a PyTorch version of fairseq, a sequence-to-sequence learning toolkit from Facebook AI Research. The original authors of this reimplementation are (in no particular order) Sergey Edunov, Myle ...
You just find this toolkit for multimodal video understanding! It contains implementation of two recent multi-modal video understanding papers VideoCLIP (EMNLP, 2021) and VLM (ACL Findings, 2021), ...
こんにちは最近 Whisper API と戯れることが多い bbz です。 そんな折、 speech-to-text な話題が twitter のタイムラインに Today we're sharing new progress on our AI speech work. Our Massively Multilingual Speech (MMS) project has now scaled ...
※ 注意点:hydraやomegaconfを自動ダウングレードしていく点を忘れずにいてください。 ※buildが終わったらpip versionを基にもどしておくことを忘れずに ...
Abstract: Predicting the products of chemical reactions is a conspicuous difficulty in organic chemistry. The model in this paper is used to predict the compounds produced under known reactants and ...