Dear @alan-cooney @neelnanda-io ! :)
I forked the repo and extended it to visualize multimodal attention patterns in vision-language-models. I would love to contribute the component to the main CircuitsVis, because it could foster more interpretability research into multimodal models.
- Here is an HTML example link for the new component
- I created a blog post showing how to use the component to find new research hypotheses
- This is the link to the GitHub fork. At the moment, the component sits on top of the AttentionPatterns component, but it would probably be safer to contribute it as a separate React component.
- I would also be very happy to receive feedback on the component and/or the ideas in the blog post.
Below is a screenshot of the new component. Let me know what you think :)
