We present Hunyuan-DiT, a text-to-image diffusion transformer with fine-grained understanding of both English and Chinese. To construct Hunyuan-DiT, we carefully designed the transformer structure, ...
We introduce JavisDiT, a novel & SoTA Joint Audio-Video Diffusion Transformer designed for synchronized audio-video generation (JAVG) from open-ended user prompts. We hope to set a new standard for ...
Abstract: Given the rapid increase of textual data in various fields, text summarization has become essential for efficient information handling. Over recent decades, numerous methods have been ...
Abstract: The extraction of crucial information from large texts is making significant progress and is a necessity in all professions. In this era of social media, people seek quick and short bits of ...