About 75,700,000 results
Open links in new tab
  1. Video-R1: Reinforcing Video Reasoning in MLLMs - GitHub

    Feb 23, 2025 · Video-R1 significantly outperforms previous models across most benchmarks. Notably, on VSI-Bench, which focuses on spatial reasoning in videos, Video-R1-7B achieves a …

  2. 知乎 - 有问题,就会有答案

    知乎,中文互联网高质量的问答社区和创作者聚集的原创内容平台,于 2011 年 1 月正式上线,以「让人们更好的分享知识、经验和见解,找到自己的解答」为品牌使命。知乎凭借认真、专业 …

  3. Wan: Open and Advanced Large-Scale Video Generative Models

    Jul 28, 2025 · Wan: Open and Advanced Large-Scale Video Generative Models We are excited to introduce Wan2.2, a major upgrade to our foundational video models. With Wan2.2, we have …

  4. hao-ai-lab/FastVideo - GitHub

    FastVideo is a unified post-training and inference framework for accelerated video generation. FastVideo features an end-to-end unified pipeline for accelerating diffusion models, starting …

  5. Generate Video Overviews in NotebookLM - Google Help

    Video Overviews, including voices and visuals, are AI-generated and may contain inaccuracies or audio glitches. NotebookLM may take a while to generate the Video Overview, feel free to …

  6. Wan: Open and Advanced Large-Scale Video Generative Models

    Feb 25, 2025 · Wan: Open and Advanced Large-Scale Video Generative Models In this repository, we present Wan2.1, a comprehensive and open suite of video foundation models …

  7. DepthAnything/Video-Depth-Anything - GitHub

    Jan 21, 2025 · ByteDance †Corresponding author This work presents Video Depth Anything based on Depth Anything V2, which can be applied to arbitrarily long videos without …

  8. GitHub - Lightricks/LTX-Video: Official repository for LTX-Video

    Nov 22, 2024 · LTX-Video is the first DiT-based video generation model that can generate high-quality videos in real-time. It can generate 30 FPS videos at 1216×704 resolution, faster than it …

  9. MAGREF: Masked Guidance for Any-Reference Video Generation

    MAGREF: Masked Guidance for Any-Reference Video Generation 🔥 News [2025.06.20] 🙏 Thanks to Kijai for developing the ComfyUI nodes for MAGREF and FP8-quantized Hugging Face mode! …

  10. WEIFENG2333/VideoCaptioner: 卡卡字幕助手 - GitHub

    About 🎬 卡卡字幕助手 | VideoCaptioner - 基于 LLM 的智能字幕助手 - 视频字幕生成、断句、校正、字幕翻译全流程处理! - A powered tool for easy and efficient video subtitling.