Integrate PyArrow and zero-copy IPC for sharing larger data #9

@shababo

Part of the initial idea behind Ether was to dynamically communicate between instances and processes for multiple forms of data. So far we have implemented small message passing/function calling with ZMQ, but we also want to handle larger data efficiently. I'm pretty sure the path to doing this is PyArrow/Apache Arrow with zero-copy IPC between processes.

BONUS: This could help unlock integration with Hugging Face Datasets and other users of Arrow.
