Is there a way to build an index while keeping it on disk (GraphIndexBuilder + OnDiskGraphIndex)? #125

@eolivelli

I am writing a POC to integrate JVector into HerdDB.

This is my work, for reference: diennea/herddb#814

This issue asks whether there is a good way to have a GraphIndexBuilder backed by an OnDiskGraphIndex.
In HerdDB the index is always "open for writes", and it seems that GraphIndexBuilder currently keeps everything on the heap.

My current plan is to "flush" the index to disk periodically (during a checkpoint), but that doesn't seem efficient, and it will lead to unwanted behaviour in the service (large bursts of disk writes). Usually a checkpoint in HerdDB only flushes a small amount of metadata containing the list of "active pages".
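
To make the concern concrete, here is a minimal sketch of the periodic-flush approach. `HeapGraphCheckpoint` is a hypothetical stand-in and not a JVector class, and the file layout is invented for illustration only. It shows that every checkpoint rewrites the whole graph, so the bytes written grow with the index size rather than with the number of changes since the last checkpoint.

```java
import java.io.BufferedOutputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical stand-in for a heap-resident graph index; NOT part of JVector.
final class HeapGraphCheckpoint {
    // node id -> neighbor ids, all held on the heap
    private final Map<Integer, int[]> adjacency = new TreeMap<>();

    void addNode(int node, int... neighbors) {
        adjacency.put(node, neighbors.clone());
    }

    // Serializes the ENTIRE graph to a temp file, then moves it over the target.
    // Returns the number of bytes written by this checkpoint.
    long checkpoint(Path target) throws IOException {
        Path tmp = target.resolveSibling(target.getFileName() + ".tmp");
        try (DataOutputStream out = new DataOutputStream(
                new BufferedOutputStream(Files.newOutputStream(tmp)))) {
            out.writeInt(adjacency.size());
            for (Map.Entry<Integer, int[]> e : adjacency.entrySet()) {
                out.writeInt(e.getKey());          // node id
                out.writeInt(e.getValue().length); // neighbor count
                for (int n : e.getValue()) {
                    out.writeInt(n);               // neighbor id
                }
            }
        }
        Files.move(tmp, target, StandardCopyOption.REPLACE_EXISTING);
        return Files.size(target);
    }
}
```

Adding a single node between two checkpoints still rewrites all previously written nodes, which is the "big writes to disk" behaviour I would like to avoid.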
