Skip to content
This repository was archived by the owner on Mar 12, 2026. It is now read-only.
This repository was archived by the owner on Mar 12, 2026. It is now read-only.

Sample identifier should not depend only on HF datasets #16

@damian1996

Description

@damian1996

Maybe we should consider other sources for datasets not only Hugging Face?
At this moment we have something like this:

class SampleIdentifier(BaseModel):
    dataset_name: str = Field(..., description='Name of the source dataset')
    hf_repo: str = Field(..., description='HuggingFace repository identifier')
    hf_split: HfSplit = Field(..., description='HuggingFace split identifier')
    hf_index: int = Field(..., description='Index in the HuggingFace dataset')

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions