zsxkib / star

STAR Video Upscaler: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

replicate.com
Total runs: 239
24-hour runs: 1
7-day runs: 9
30-day runs: 234
Model's Last Updated: January 31 2025

Readme

STAR: Spatial-Temporal Video Super-Resolution

STAR is a powerful text-guided video super-resolution model that can enhance low-quality videos while maintaining temporal consistency. It leverages text-to-video models to generate high-quality reference frames and combines them with spatial-temporal features for superior upscaling results.

More visual results can be found on our project page and video demo.

Usage

The model accepts:
  • A video file (supported formats: mp4, avi, mov)
  • An optional text prompt describing the video content
  • A target upscaling factor (default: 4x)

The model outputs an enhanced, higher-resolution version of the input video.
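
As a rough sketch of how the three inputs listed above might be passed to the hosted model, the snippet below builds an input dictionary and hands it to the `replicate` Python client. The field names `video_path`, `prompt`, and `upscale`, and the helpers `build_star_input` and `run_star`, are assumptions made for illustration. The real input names are defined by the model's API schema on replicate.com and may differ.

```python
from pathlib import Path

SUPPORTED_FORMATS = {".mp4", ".avi", ".mov"}  # formats listed in the README


def build_star_input(video, prompt=None, upscale=4):
    """Build an input dict; the key names are hypothetical, not the real schema."""
    suffix = Path(video).suffix.lower()
    if suffix not in SUPPORTED_FORMATS:
        raise ValueError(f"unsupported video format: {suffix!r}")
    payload = {"video_path": video, "upscale": upscale}
    if prompt:  # the text prompt is optional
        payload["prompt"] = prompt
    return payload


def run_star(video, prompt=None, upscale=4):
    """Run the hosted model; needs `pip install replicate` and REPLICATE_API_TOKEN."""
    import replicate  # imported lazily so build_star_input works without it

    payload = build_star_input(video, prompt, upscale)
    with open(video, "rb") as handle:
        payload["video_path"] = handle  # the client uploads open file handles
        # Depending on the client version, a ":<version id>" suffix may be required.
        return replicate.run("zsxkib/star", input=payload)
```

Calling `build_star_input("clip.mp4", prompt="a cat on a sofa")` only validates and assembles the payload, so it can be checked locally before any paid run is started with `run_star`.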

Limitations
  • For optimal results, input videos should be at least 240p resolution
  • Processing time increases with video length and resolution
  • Due to VRAM requirements, longer videos may need to be processed in segments
  • The CogVideoX-5B variant only supports 720x480 input resolution
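
The README does not say how long videos should be split, so the following is only one possible sketch of the "process in segments" idea: it computes frame ranges of a fixed length, with an optional overlap so neighbouring segments share a few frames. The function name `split_into_segments` and the segment length and overlap values are assumptions, not settings taken from the model.

```python
def split_into_segments(total_frames, segment_len, overlap=0):
    """Return (start, end) frame ranges, end exclusive, covering the whole clip."""
    if segment_len <= 0:
        raise ValueError("segment_len must be positive")
    if not 0 <= overlap < segment_len:
        raise ValueError("overlap must be in [0, segment_len)")
    step = segment_len - overlap  # how far each new segment advances
    segments = []
    start = 0
    while start < total_frames:
        end = min(start + segment_len, total_frames)
        segments.append((start, end))
        if end == total_frames:  # last segment reached the end of the clip
            break
        start += step
    return segments


if __name__ == "__main__":
    # Hypothetical 100-frame clip, 32-frame segments, 8 frames of overlap.
    print(split_into_segments(100, 32, overlap=8))
```

Each range could then be cut out with a video tool, upscaled separately, and the results joined. How the overlapping frames are blended is left open here.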
Model Versions

Two variants are available:

  1. I2VGen-XL-based:
       • Light degradation model: Best for mild quality enhancement
       • Heavy degradation model: Optimized for severely degraded videos
  2. CogVideoX-5B-based:
       • Specialized for heavy degradation scenarios
       • Fixed input resolution of 720x480
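
The resolution constraints above can be checked before a video is submitted. The sketch below is a minimal pre-flight check under stated assumptions: the variant identifiers (`i2vgen-xl-light`, `i2vgen-xl-heavy`, `cogvideox-5b`) and the function `check_resolution` are hypothetical names chosen for this example, and "240p" is read as a frame height of at least 240 pixels.

```python
COGVIDEOX_RESOLUTION = (720, 480)  # fixed input size stated in the README
MIN_RECOMMENDED_HEIGHT = 240       # "at least 240p" from the Limitations list

# Hypothetical identifiers for the variants described in the README.
KNOWN_VARIANTS = {"i2vgen-xl-light", "i2vgen-xl-heavy", "cogvideox-5b"}


def check_resolution(variant, width, height):
    """Return a list of warning strings for this variant and input size."""
    if variant not in KNOWN_VARIANTS:
        raise ValueError(f"unknown variant: {variant!r}")
    warnings = []
    if variant == "cogvideox-5b":
        if (width, height) != COGVIDEOX_RESOLUTION:
            warnings.append(
                f"CogVideoX-5B expects 720x480 input, got {width}x{height}"
            )
    elif height < MIN_RECOMMENDED_HEIGHT:
        warnings.append(
            f"input height {height} is below the recommended 240p minimum"
        )
    return warnings
```

An empty list means the input matches the documented constraints for that variant. A non-empty list suggests resizing the video or choosing a different variant first.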
Citation
@misc{xie2025starspatialtemporalaugmentationtexttovideo,
      title={STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution}, 
      author={Rui Xie and Yinhong Liu and Penghao Zhou and Chen Zhao and Jun Zhou and Kai Zhang and Zhenyu Zhang and Jian Yang and Zhenheng Yang and Ying Tai},
      year={2025},
      eprint={2501.02976},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}
License
  • I2VGen-XL-based models: MIT License
  • CogVideoX-5B-based model: CogVideoX License

Maintained by @zsxkib for Replicate integration

More Information About the star Model on replicate.com

star on replicate.com

star is an AI model hosted on replicate.com that provides the STAR Video Upscaler (Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution). It can be run directly from the zsxkib/star model page. replicate.com offers a free trial of the model as well as paid use. The model can also be called through the API from Node.js, Python, or plain HTTP.
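
For the plain HTTP route, a call might look roughly like the sketch below, which prepares (but does not send) a request to Replicate's predictions endpoint using only the Python standard library. The helper name `build_prediction_request` is made up for this example, and the version ID and input field names are placeholders that must be taken from the model page.

```python
import json
import urllib.request

API_URL = "https://api.replicate.com/v1/predictions"


def build_prediction_request(token, version, model_input):
    """Prepare a POST request that would create a prediction; nothing is sent."""
    body = json.dumps({"version": version, "input": model_input}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )


if __name__ == "__main__":
    # Placeholder values: substitute a real API token, version ID, and inputs.
    request = build_prediction_request(
        token="YOUR_API_TOKEN",
        version="MODEL_VERSION_ID",
        model_input={"video_path": "https://example.com/clip.mp4"},
    )
    print(request.get_method(), request.full_url)
```

Passing the prepared request to `urllib.request.urlopen` would submit it. The JSON response then describes the prediction, which is polled until it finishes.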

Try zsxkib star online for free

replicate.com is a platform for trying models online and calling them through an API. It hosts star and offers a free online trial, which is available at the link below.

zsxkib star on replicate.com:

https://replicate.com/zsxkib/star

Installing star

star is an open-source model available on GitHub, where anyone can find and install it for free. replicate.com also hosts an already-installed deployment that can be used directly for debugging and trial runs, and that can be reached through the API.

star on replicate.com:

https://replicate.com/zsxkib/star

star on GitHub:

https://github.com/zsxkib/STAR

Other APIs from zsxkib

replicate

📖 PuLID: Pure and Lightning ID Customization via Contrastive Alignment

Total runs: 1.2M
Run Growth: 0
Growth Rate: 0.00%
Updated: May 16 2024
replicate

Blip 3 / XGen-MM, Answers questions about images ({blip3,xgen-mm}-phi3-mini-base-r-v1)

Total runs: 1.1M
Run Growth: 100.0K
Growth Rate: 9.09%
Updated: May 13 2024
replicate

Make realistic images of real people instantly

Total runs: 803.0K
Run Growth: 31.1K
Growth Rate: 3.88%
Updated: December 11 2024
replicate

⚡️FLUX PuLID: FLUX-dev based Pure and Lightning ID Customization via Contrastive Alignment🎭

Total runs: 601.2K
Run Growth: 0
Growth Rate: 0.00%
Updated: September 16 2024
replicate

Create song covers with any RVC v2 trained AI voice from audio files.

Total runs: 564.1K
Run Growth: 2.1K
Growth Rate: 0.37%
Updated: November 15 2023
replicate

🎨 Fill in masked parts of images with FLUX.1-dev 🖌️

Total runs: 336.3K
Run Growth: 8.1K
Growth Rate: 2.42%
Updated: August 19 2024
replicate

✍️✨Prompts to auto-magically relights your images

Total runs: 183.1K
Run Growth: 14.0K
Growth Rate: 7.65%
Updated: May 21 2024
replicate

Age prediction using CLIP - Patched version of `https://replicate.com/andreasjansson/clip-age-predictor` that works with the new version of cog!

Total runs: 179.5K
Run Growth: 200
Growth Rate: 0.11%
Updated: June 06 2023
replicate

Add sound to video. An advanced AI model that synthesizes high-quality audio from video content, enabling seamless video-to-audio transformation

Total runs: 149.3K
Run Growth: 78.3K
Growth Rate: 53.34%
Updated: December 12 2024
replicate

✨DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior

Total runs: 131.7K
Run Growth: 100
Growth Rate: 0.08%
Updated: October 12 2023
replicate

allenai/Molmo-7B-D-0924, Answers questions and caption about images

Total runs: 76.6K
Run Growth: 12.0K
Growth Rate: 15.71%
Updated: September 26 2024
replicate

🎨 AnimateDiff (w/ MotionLoRAs for Panning, Zooming, etc): Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning

Total runs: 54.8K
Run Growth: 100
Growth Rate: 0.18%
Updated: October 05 2023
replicate

📽️ Increase Framerate 🎬 ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation

Total runs: 51.3K
Run Growth: 0
Growth Rate: 0.00%
Updated: October 09 2023
replicate

Jina-CLIP v2: 0.9B multimodal embedding model with 89-language multilingual support, 512x512 image resolution, and Matryoshka representations

Total runs: 17.1K
Run Growth: 16.1K
Growth Rate: 94.15%
Updated: November 28 2024
replicate

Monster Labs' Controlnet QR Code Monster v2 For SD-1.5 on top of AnimateDiff Prompt Travel (Motion Module SD 1.5 v2)

Total runs: 10.0K
Run Growth: 0
Growth Rate: 0.00%
Updated: October 31 2023
replicate

🖼️✨Background images + prompts to auto-magically relights your images (+normal maps🗺️)

Total runs: 9.2K
Run Growth: 500
Growth Rate: 5.43%
Updated: May 21 2024
replicate

Real-Time Open-Vocabulary Object Detection

Total runs: 8.4K
Run Growth: 500
Growth Rate: 5.95%
Updated: February 12 2024
replicate

Text-to-Video + Image-to-Video: Pyramid Flow Autoregressive Video Generation method based on Flow Matching

Total runs: 8.0K
Run Growth: 200
Growth Rate: 2.50%
Updated: October 10 2024
replicate

Hunyuan-Video LoRA Explorer + Trainer

Total runs: 7.8K
Run Growth: 2.1K
Growth Rate: 27.27%
Updated: January 24 2025
replicate

AuraSR v2: Second-gen GAN-based Super-Resolution for real-world applications

Total runs: 7.4K
Run Growth: 1.8K
Growth Rate: 24.32%
Updated: July 31 2024
replicate

Create your own Realistic Voice Cloning (RVC v2) dataset using a YouTube link

Total runs: 7.1K
Run Growth: 100
Growth Rate: 1.41%
Updated: November 20 2023
replicate

🎨 Fill in masked parts of images with FLUX.1-schnell 🖌️

Total runs: 6.1K
Run Growth: 400
Growth Rate: 6.56%
Updated: August 15 2024
replicate

🎨AnimateDiff Prompt Travel🧭 Seamlessly Navigate and Animate Between Text-to-Image Prompts for Dynamic Visual Narratives

Total runs: 5.6K
Run Growth: 0
Growth Rate: 0.00%
Updated: October 19 2023
replicate

FlashFace: Human Image Personalization with High-fidelity Identity Preservation

Total runs: 3.8K
Run Growth: 300
Growth Rate: 7.89%
Updated: April 22 2024
replicate

Make realistic images of real people instantly (w/ ip-adapter-plus-face_sdxl_vit-h)

Total runs: 3.6K
Run Growth: 200
Growth Rate: 5.56%
Updated: July 14 2024
replicate

🖼️ Super fast 1.5B Image Captioning/VQA Multimodal LLM (Image-to-Text) 🖋️

Total runs: 2.2K
Run Growth: 0
Growth Rate: 0.00%
Updated: February 01 2024
replicate

MimicMotion: High-quality human motion video generation with pose-guided control

Total runs: 2.2K
Run Growth: 100
Growth Rate: 4.55%
Updated: July 16 2024
replicate

Idefics3-8B-Llama3, Answers questions and caption about images

Total runs: 2.1K
Run Growth: 100
Growth Rate: 4.76%
Updated: August 15 2024
replicate

AuraSR: GAN-based Super-Resolution for real-world

Total runs: 2.1K
Run Growth: 200
Growth Rate: 9.52%
Updated: June 27 2024
replicate

Qwen 2: A 7 billion parameter language model from Alibaba Cloud, fine tuned for chat completions

Total runs: 1.6K
Run Growth: 0
Growth Rate: 0.00%
Updated: June 25 2024
replicate

Cubiq's ComfyUI InstantID node running `instantid_basic.json` example

Total runs: 1.5K
Run Growth: 0
Growth Rate: 0.00%
Updated: August 19 2024
replicate

A state-of-the-art text-to-video generation model capable of creating high-quality videos with realistic motion from text descriptions

Total runs: 1.4K
Run Growth: 440
Growth Rate: 31.43%
Updated: December 11 2024
replicate

Upscale videos + images with BSRGAN

Total runs: 1.3K
Run Growth: 1.2K
Growth Rate: 95.31%
Updated: January 29 2025
replicate

✨Stable Diffusion 3 w/ ⚡InstantX's Canny, Pose, and Tile ControlNets🖼️

Total runs: 1.1K
Run Growth: 0
Growth Rate: 0.00%
Updated: June 21 2024
replicate

🎼FluxMusic Text-to-Music Generation with Rectified Flow Transformer🎶

Total runs: 1.1K
Run Growth: 267
Growth Rate: 24.27%
Updated: September 26 2024
replicate

Image tagger fine-tuned on WaifuDiffusion w/ (SwinV2, SwinV2, ConvNext, and ViT)

Total runs: 991
Run Growth: 17
Growth Rate: 1.72%
Updated: May 22 2024
replicate

🫦 Realistic facial expression manipulation (lip-syncing) using audio or video

Total runs: 967
Run Growth: 57
Growth Rate: 6.01%
Updated: June 12 2024
replicate

🎙️Hololive text-to-speech and voice-to-voice (Japanese🇯🇵 + English🇬🇧)

Total runs: 847
Run Growth: 366
Growth Rate: 43.21%
Updated: June 03 2024
replicate

Unofficial Re-Trained AnimateAnyone (Image + DWPose Video → Animated Video of Image)

Total runs: 833
Run Growth: 0
Growth Rate: 0.00%
Updated: January 18 2024
replicate

🐲 DragGAN 🐉 - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold"

Total runs: 583
Run Growth: 0
Growth Rate: 0.00%
Updated: July 07 2023
replicate

Surrealist digital art featuring whimsical, anthropomorphic characters with exaggerated textures and vibrant color blocking

Total runs: 480
Run Growth: 0
Growth Rate: 0.00%
Updated: October 08 2024
replicate

MEMO is a state-of-the-art open-weight model for audio-driven talking video generation.

Total runs: 473
Run Growth: 176
Growth Rate: 37.29%
Updated: December 11 2024
replicate

Easily create video datasets with auto-captioning for Hunyuan-Video LoRA finetuning

Total runs: 464
Run Growth: 355
Growth Rate: 76.67%
Updated: January 23 2025
replicate

Transform your text into a beautiful two-tone color gradient that represents your emotions.

Total runs: 416
Run Growth: 0
Growth Rate: 0.00%
Updated: June 06 2023
replicate

Super High Quality Depth Maps 🗺️: An End-to-End Tile-Based Framework 🏗️ for High-Resolution Monocular Metric Depth Estimation 🔍📏

Total runs: 345
Run Growth: 0
Growth Rate: 0.00%
Updated: December 27 2023
replicate

SVFR: A Unified Framework for Generalized Video Face Restoration

Total runs: 311
Run Growth: 108
Growth Rate: 34.84%
Updated: January 14 2025
replicate

Qwen 2: A 1.5 billion parameter language model from Alibaba Cloud, fine tuned for chat completions

Total runs: 206
Run Growth: 2
Growth Rate: 0.97%
Updated: June 25 2024
replicate

Qwen 2: A 0.5 billion parameter language model from Alibaba Cloud, fine tuned for chat completions

Total runs: 196
Run Growth: 7
Growth Rate: 3.57%
Updated: June 24 2024
replicate

Convert speech in audio to text w/ `tiny`, `small`, `base`, and `large-v3` models

Total runs: 125
Run Growth: 0
Growth Rate: 0.00%
Updated: July 01 2024
replicate

Powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text prompt

Total runs: 118
Run Growth: 0
Growth Rate: 0.00%
Updated: October 23 2024
replicate

🗣️ TalkNet-ASD: Detect who is speaking in a video

Total runs: 83
Run Growth: 3
Growth Rate: 3.61%
Updated: May 01 2024
replicate

SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory

Total runs: 77
Run Growth: 26
Growth Rate: 33.77%
Updated: November 28 2024
replicate

Generate high-quality videos from text prompts using StepVideo

Total runs: 51
Run Growth: 31
Growth Rate: 63.27%
Updated: February 25 2025
replicate

A "Hello World" model for me to get to grips with `cog` and Replicate

Total runs: 44
Run Growth: 0
Growth Rate: 0.00%
Updated: June 05 2023
replicate

SAM 2: Segment Anything v2 (for in Images + Videos)

Total runs: 19
Run Growth: 0
Growth Rate: 0.00%
Updated: July 31 2024
replicate

Hibiki: High-Fidelity Simultaneous Speech-To-Speech Translation

Total runs: 8
Run Growth: 1
Growth Rate: 12.50%
Updated: February 10 2025
replicate

Remove background from images using BRIA-RMBG-2.0

Total runs: 2
Run Growth: 0
Growth Rate: 0.00%
Updated: November 25 2024