🏎️
Rush...
superhero-7/README.md

Hi there 👋

  • 😄 Former intern at BAAI and Zhipu AI, where my core work centered on the training of image foundation models.
  • 👯 Previously a member of ByteDance’s Intelligent Creation Lab, with a focus on the DreamID series (encompassing DreamID, DreamID-V, and DreamID-Omni); currently part of ByteDance’s Seed Vision Application team, dedicated to the development of Seedance 2.0.
  • 🧠 Research interests: Large Multimodal Models (multimodal generation, understanding, agents, acceleration, and efficient inference) and product-related topics in multimodality.
  • ⚡ Open to collaborations and discussions on multimodal technology and product innovation.
  • 💬 Reach me via fulong_ye@163.com or yefulong@bytedance.com.

Pinned

  1. AltDiffusion (Public)

    Source code for paper: "AltDiffusion: A multilingual Text-to-Image diffusion model"

    Python · 44 stars · 3 forks

  2. FlagAI-Open/FlagAI (Public)

    FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model.

    Python · 3.9k stars · 422 forks

  3. DreamID (Public)

    HTML · 105 stars · 6 forks

  4. bytedance/DreamID-V (Public)

    DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer

    Python · 558 stars · 82 forks

  5. DreamID-Omni (Public)

    Forked from Guoxu1233/DreamID-Omni

    DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation