Skip to content

Pinned Loading

  1. DepR DepR Public

    (ICCV 2025) DepR: Depth Guided Single-view Scene Reconstruction with Instance-level Diffusion

    Jupyter Notebook 24

  2. OverLayBench OverLayBench Public

    (NeurIPS 2025 D&B Track) OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps

    Python 25 2

  3. BLIVA BLIVA Public

    (AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions

    Python 260 25

  4. TokenCompose TokenCompose Public

    (CVPR 2024) đź§© TokenCompose: Text-to-Image Diffusion with Token-level Supervision

    Jupyter Notebook 136 5

  5. CoaT CoaT Public

    (ICCV 2021 Oral) CoaT: Co-Scale Conv-Attentional Image Transformers

    Jupyter Notebook 235 31

  6. TESTR TESTR Public

    (CVPR 2022) Text Spotting Transformers

    Python 190 23

Repositories

Showing 10 of 23 repositories

Most used topics

Loading…