본문 바로가기

generation

(4)

[LLaVA-Video 논문 리뷰] - VIDEO INSTRUCTION TUNING WITH SYNTHETIC DATA *LLaVA-Video를 위한 논문 리뷰 글입니다! 궁금하신 점은 댓글로 남겨주세요! LLaVA-Video paper: https://arxiv.org/abs/2410.02713 Video Instruction Tuning With Synthetic DataThe development of video large multimodal models (LMMs) has been hindered by the difficulty of curating large amounts of high-quality raw data from the web. To address this, we propose an alternative approach by creating a high-quality synthetic dataset ..

[MeshAnything 논문 리뷰] - MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers *MeshAnything를 위한 논문 리뷰 글입니다! 궁금하신 점은 댓글로 남겨주세요! MeshAnything paper: https://arxiv.org/abs/2406.10163 MeshAnything: Artist-Created Mesh Generation with Autoregressive TransformersRecently, 3D assets created via reconstruction and generation have matched the quality of manually crafted assets, highlighting their potential for replacement. However, this potential is largely unrealized because thes..

[LGM 논문 리뷰] Large Multi-View Gaussian Model for High-Resolution 3D Content Creation *LGM를 위한 논문 리뷰 글입니다! 궁금하신 점은 댓글로 남겨주세요! LGM github: LGM (kiui.moe) LGMLGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation Arxiv 2024 Jiaxiang Tang1, Zhaoxi Chen2, Xiaokang Chen1, Tengfei Wang3, Gang Zeng1, Ziwei Liu2 1 Peking University 2 S-Lab, Nanyang Technological University 3 Shanghai AI Lame.kiui.moeContents1. Simple Introduction2. Background Knowledge: Gaussia..

[Tune-A-Video 논문 리뷰] One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation *Tune-A-Video를 위한 논문 리뷰 글입니다! 궁금하신 점은 댓글로 남겨주세요! Tune-A-Video paper: [2212.11565] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation (arxiv.org) Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation To replicate the success of text-to-image (T2I) generation, recent works employ large-scale video datasets to train a text-to-video (T..

이전 1 다음

티스토리툴바