Skip to content

Enhance the transformer to efficiently process large context sizes #4

@vdyma

Description

@vdyma

One of the ideas is to implement multi-scale architecture such as Megabye.

Metadata

Metadata

Assignees

Labels

enhancementNew feature or request

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions