Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation

We introduce Divot, a Diffusion-Powered Video Tokenizer, which leverages the diffusion process for self-supervised video representation learning. We posit that if a video diffusion model can effectively de-noise video clips by taking the features of a video tokenizer as the condition, then the tokenizer has successfully captured robust spatial and temporal information. Additionally, the video diffusion model inherently functions as a de-tokenizer, decoding videos from their representations. Building upon the Divot tokenizer, we present Divot-LLM through video-to-text autoregression and text-to-video generation by modeling the distributions of continuous-valued Divot features with a Gaussian Mixture Model.

All models, training code and inference code are released!

TODOs

Release the pretrained tokenizer and de-tokenizer of Divot.
Release the pretrained and instruction tuned model of Divot-LLM.
Release inference code of Divot.
Release training and inference code of Divot-LLM.
Release training code of Divot.
Release de-tokenizer adaptation training code.

Introduction

We utilize the diffusion procedure to learn a video tokenizer in a self-supervised manner for unified comprehension and generation, where the spatiotemporal representations serve as the condition of a diffusion model to de-noise video clips. Additionally, the proxy diffusion model functions as a de-tokenizer to decode realistic video clips from the video representations.

After training the Divot tokenizer, video features from the tokenizer are fed into the LLM to perform next-word prediction for video comprehension, while learnable queries are input into the LLM to model the distributions of Divot features using a Gaussian Mixture Model (GMM) for video generation. During inference, video features are sampled from the predicted GMM distribution, and the de-tokenizer decodes them into videos.
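The GMM modeling and sampling steps can be illustrated with a minimal, framework-free sketch. It assumes a diagonal-covariance mixture whose parameters (mixture logits, means, and standard deviations) would be predicted by the LLM for each feature vector. The function names, the diagonal-covariance choice, and the toy numbers are hypothetical and are not taken from the Divot code base.

```python
import math
import random


def softmax(logits):
    """Turn unnormalized mixture logits into probabilities."""
    top = max(logits)
    exps = [math.exp(x - top) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def gmm_nll(target, logits, means, stds):
    """Negative log-likelihood of one feature vector under a diagonal GMM."""
    weights = softmax(logits)
    log_terms = []
    for w, mu, sd in zip(weights, means, stds):
        log_p = math.log(w)
        for x, m, s in zip(target, mu, sd):
            log_p += -0.5 * math.log(2 * math.pi) - math.log(s) - 0.5 * ((x - m) / s) ** 2
        log_terms.append(log_p)
    # Log-sum-exp over components for numerical stability.
    top = max(log_terms)
    return -(top + math.log(sum(math.exp(t - top) for t in log_terms)))


def gmm_sample(logits, means, stds, rng=random):
    """Pick a component by its weight, then sample from that Gaussian."""
    weights = softmax(logits)
    k = rng.choices(range(len(weights)), weights=weights)[0]
    return [rng.gauss(m, s) for m, s in zip(means[k], stds[k])]


if __name__ == "__main__":
    # Toy mixture: two components over a 2-dimensional feature (assumed sizes).
    logits = [0.0, 0.0]
    means = [[0.0, 0.0], [5.0, 5.0]]
    stds = [[1.0, 1.0], [1.0, 1.0]]
    print("sample:", gmm_sample(logits, means, stds, random.Random(0)))
    print("nll at a mode:", gmm_nll([0.0, 0.0], logits, means, stds))
```

In this sketch, `gmm_nll` plays the role of a training objective on a ground-truth feature, and `gmm_sample` plays the role of the inference-time draw whose output would be handed to the de-tokenizer.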

Usage

Dependencies

Python >= 3.8 (Anaconda is recommended)
PyTorch >= 2.1.0
NVIDIA GPU + CUDA
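
The requirements above can be checked before installation with a short script. This is only a sketch: the helper names `version_tuple` and `check_environment` are hypothetical, and the script simply reports whether the interpreter, the installed PyTorch version, and CUDA availability meet the listed minimums.

```python
import importlib
import re
import sys


def version_tuple(version):
    """Parse up to three leading numeric parts of a version such as '2.1.0+cu121'."""
    numbers = [int(n) for n in re.findall(r"\d+", version.split("+")[0])[:3]]
    return tuple(numbers + [0] * (3 - len(numbers)))


def check_environment():
    """Return a dict of booleans for the Python, PyTorch, and CUDA requirements."""
    report = {"python_ok": sys.version_info >= (3, 8)}
    try:
        torch = importlib.import_module("torch")
    except ImportError:
        # PyTorch is not installed, so CUDA cannot be queried either.
        report["torch_ok"] = False
        report["cuda_ok"] = False
        return report
    report["torch_ok"] = version_tuple(torch.__version__) >= (2, 1, 0)
    report["cuda_ok"] = bool(torch.cuda.is_available())
    return report


if __name__ == "__main__":
    for name, ok in check_environment().items():
        print(f"{name}: {'OK' if ok else 'NOT SATISFIED'}")
```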

Installation

Clone the repository and install the dependencies:

git clone https://github.com/TencentARC/Divot.git
cd Divot
pip install -r requirements.txt

Model Weights

We release the pre-trained tokenizer and de-tokenizer, as well as the pre-trained and instruction-tuned Divot-LLM. Please download the checkpoints and save them under the folder ./pretrained, for example ./pretrained/Divot_tokenizer_detokenizer.

You also need to download Mistral-7B-Instruct-v0.1 and CLIP-ViT-H-14-laion2B-s32B-b79K, and save them under the folder ./pretrained.
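
A quick way to confirm the layout before running inference is to list the folders that are still missing. The sketch below assumes that the two third-party models are stored in sub-folders named after the models themselves; only Divot_tokenizer_detokenizer is a folder name given above, and the helper name `missing_checkpoints` is hypothetical.

```python
from pathlib import Path

# Only the first name comes from the instructions above; the other two
# folder names are assumptions based on the model names.
EXPECTED_DIRS = (
    "Divot_tokenizer_detokenizer",
    "Mistral-7B-Instruct-v0.1",
    "CLIP-ViT-H-14-laion2B-s32B-b79K",
)


def missing_checkpoints(root="./pretrained", expected=EXPECTED_DIRS):
    """Return the expected sub-folders that do not exist under root."""
    root = Path(root)
    return [name for name in expected if not (root / name).is_dir()]


if __name__ == "__main__":
    missing = missing_checkpoints()
    if missing:
        print("Missing under ./pretrained:", ", ".join(missing))
    else:
        print("All expected checkpoint folders are present.")
```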

Inference

Video Reconstruction with Divot

python3 src/tools/eval_Divot_video_recon.py

Video Comprehension with Divot-LLM

python3 src/tools/eval_Divot_video_comp.py

Video Generation with Divot-LLM

python3 src/tools/eval_Divot_video_gen.py

Training

Pre-training

Download the checkpoints of pre-trained Mistral-7B-Instruct-v0.1 and CLIP-ViT-H-14-laion2B-s32B-b79K, and save them under the folder ./pretrained.
Prepare the training data in the format of webdataset.
Run the following script.

sh scripts/train_Divot_pretrain_comp_gen.sh
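
For readers unfamiliar with webdataset: a shard is a plain tar archive in which files that share a basename form one sample, and the file extension acts as the field key. The sketch below writes such a shard with the standard library only. The field keys "mp4" and "txt", the shard name, and the helper `write_shard` are assumptions for illustration; the keys that the Divot data loader actually expects should be taken from the training configs in the repository.

```python
import io
import tarfile


def write_shard(shard_path, samples):
    """Write (key, video_bytes, caption) triples into one tar shard.

    Each sample becomes two tar members, <key>.mp4 and <key>.txt, so that a
    webdataset-style reader can group them by their shared basename.
    """
    with tarfile.open(shard_path, "w") as tar:
        for key, video_bytes, caption in samples:
            fields = (("mp4", video_bytes), ("txt", caption.encode("utf-8")))
            for ext, payload in fields:
                info = tarfile.TarInfo(name=f"{key}.{ext}")
                info.size = len(payload)
                tar.addfile(info, io.BytesIO(payload))


if __name__ == "__main__":
    # Placeholder bytes stand in for a real encoded video file.
    write_shard("shard-000000.tar", [("clip0001", b"\x00\x01", "a cat jumps")])
```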

Instruction-tuning

Download the checkpoints of the pre-trained Divot tokenizer and Divot-LLM from Divot, and save them under the folder ./pretrained.
Prepare the instruction data in the format of webdataset (for generation) and jsonl (for comprehension, where each line stores a dictionary used to specify the video_path, question, and answer).
Run the following script.

### For video comprehension
sh scripts/train_Divot_sft_comp.sh

### For video generation
sh scripts/train_Divot_sft_gen.sh
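
The jsonl format for comprehension data can be sketched as follows, assuming the dictionary keys are spelled exactly video_path, question, and answer as in the description above. The helper names and the example record are hypothetical; the validator only checks that each line is valid JSON containing the three keys.

```python
import json

REQUIRED_KEYS = ("video_path", "question", "answer")


def write_jsonl(path, records):
    """Write one JSON dictionary per line."""
    with open(path, "w", encoding="utf-8") as f:
        for record in records:
            f.write(json.dumps(record, ensure_ascii=False) + "\n")


def validate_jsonl(path):
    """Return (line_number, missing_keys) pairs for lines lacking required keys."""
    problems = []
    with open(path, encoding="utf-8") as f:
        for number, line in enumerate(f, start=1):
            if not line.strip():
                continue  # tolerate blank lines
            record = json.loads(line)
            missing = [key for key in REQUIRED_KEYS if key not in record]
            if missing:
                problems.append((number, missing))
    return problems


if __name__ == "__main__":
    example = {
        "video_path": "videos/example.mp4",  # hypothetical path
        "question": "What is happening in the video?",
        "answer": "A person is cooking.",
    }
    write_jsonl("sft_comp_example.jsonl", [example])
    print(validate_jsonl("sft_comp_example.jsonl"))
```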

Inference with your own model

Obtain "pytorch_model.bin" with the following script.

cd train_output/sft_comp/checkpoint-xxxx
python3 zero_to_fp32.py . pytorch_model.bin

Merge your trained LoRA weights with the original LLM using the following script.

python3 src/tools/merge_agent_lora_weight.py

Load your merged model from "mistral7b_merged_xxx" together with the corresponding "agent" path. For example,

llm_cfg_path = 'configs/clm_models/mistral7b_merged_sft_comp.yaml'
agent_cfg_path = 'configs/clm_models/agent_7b_in64_out64_video_gmm_sft_comp.yaml'

License

Divot is licensed under the Apache License Version 2.0 for academic purposes only, except for the third-party components listed in License.

Acknowledgement

Our code for the Divot tokenizer and de-tokenizer is built upon DynamiCrafter. Thanks for their excellent work!

Model page on huggingface.co

https://huggingface.co/TencentARC/Divot
