Implement mixed-modal early-fusion architecture #5

@vdyma

Description

Add the ability to process the following modalities:

  • Image
  • Audio

Modify the architecture to process these modalities with early fusion, as in Chameleon.
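
To make the request concrete, here is a minimal sketch of what early fusion could look like at the token level, assuming each modality is first quantized into discrete codes by its own tokenizer (as Chameleon does for images). Everything named here is hypothetical and not part of this repository: the vocabulary sizes, the `UnifiedVocab` and `fuse` names, and the `<boi>`/`<eoi>`/`<boa>`/`<eoa>` sentinel tokens are illustrative only, and audio tokenization is an extension beyond what Chameleon itself covers.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Hypothetical sentinel tokens marking where an image or audio span begins and ends.
SPECIAL_TOKENS = ["<boi>", "<eoi>", "<boa>", "<eoa>"]


@dataclass(frozen=True)
class UnifiedVocab:
    """One shared ID space: [text | image codes | audio codes | specials]."""
    text_size: int
    image_size: int
    audio_size: int

    @property
    def image_offset(self) -> int:
        return self.text_size

    @property
    def audio_offset(self) -> int:
        return self.text_size + self.image_size

    @property
    def special_offset(self) -> int:
        return self.audio_offset + self.audio_size

    @property
    def size(self) -> int:
        return self.special_offset + len(SPECIAL_TOKENS)

    def special(self, name: str) -> int:
        return self.special_offset + SPECIAL_TOKENS.index(name)


def _shift(codes: List[int], offset: int, limit: int, kind: str) -> List[int]:
    """Map per-modality codes into the shared ID space, validating their range."""
    for code in codes:
        if not 0 <= code < limit:
            raise ValueError(f"{kind} code {code} outside [0, {limit})")
    return [offset + code for code in codes]


def fuse(vocab: UnifiedVocab, segments: List[Tuple[str, List[int]]]) -> List[int]:
    """Interleave text, image, and audio codes into one flat token sequence.

    The resulting sequence is what a single transformer would consume, so all
    modalities share the same embedding table and attention layers from the
    first layer onward (early fusion).
    """
    sequence: List[int] = []
    for kind, codes in segments:
        if kind == "text":
            sequence += _shift(codes, 0, vocab.text_size, kind)
        elif kind == "image":
            sequence.append(vocab.special("<boi>"))
            sequence += _shift(codes, vocab.image_offset, vocab.image_size, kind)
            sequence.append(vocab.special("<eoi>"))
        elif kind == "audio":
            sequence.append(vocab.special("<boa>"))
            sequence += _shift(codes, vocab.audio_offset, vocab.audio_size, kind)
            sequence.append(vocab.special("<eoa>"))
        else:
            raise ValueError(f"unknown modality: {kind}")
    return sequence


# Example with made-up sizes: 1000 text tokens, 512 image codes, 256 audio codes.
vocab = UnifiedVocab(text_size=1000, image_size=512, audio_size=256)
tokens = fuse(vocab, [("text", [1, 2]), ("image", [0, 5]), ("audio", [3])])
```

In this sketch the model's embedding table would have `vocab.size` rows, and no modality-specific encoder sits between the token sequence and the transformer. The actual tokenizers for image and audio, and the codebook sizes, remain open design choices for this issue.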

Labels

enhancement (New feature or request)
