This is the GitHub organization for Hyperparam, where we share open-source contributions to the AI and data engineering communities. AI needs lots of data, so we're building tools for working with massive text datasets in the browser.
🦜 Hyparquet — Parquet file parser for loading datasets in the browser.
🐤 Hyparquet Writer — Parquet file writer in JavaScript.
⛄ Icebird — Apache Iceberg table reader in JavaScript.
🐿️ Squirreling — Async SQL engine for querying large datasets in the browser.
🏛️ HighTable — Windowed table component for viewing arbitrarily large datasets.
🔍 HypGrep — Full-text search for Parquet with a compact n-gram index.
📐 HypVector — Store and query embedding vectors directly from Parquet files.
🦙 HyLlama — Parse metadata from llama.cpp GGUF files in JavaScript.
👀 Hyperparam CLI — Scalable dataset viewer for machine learning datasets.
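
To give a feel for the n-gram indexing idea mentioned for HypGrep, here is a minimal in-memory sketch of a trigram inverted index in plain JavaScript. It is not HypGrep's implementation or API: the function names (`ngrams`, `buildIndex`, `search`) and the sample rows are hypothetical, and it makes no attempt at compact storage or at reading Parquet.

```javascript
// Illustrative only: a generic trigram index, not HypGrep's actual code.

// Split text into a set of lowercase character n-grams (trigrams by default).
function ngrams(text, n = 3) {
  const s = text.toLowerCase()
  const grams = new Set()
  for (let i = 0; i + n <= s.length; i++) grams.add(s.slice(i, i + n))
  return grams
}

// Build an inverted index: n-gram -> set of row numbers that contain it.
function buildIndex(rows, n = 3) {
  const index = new Map()
  rows.forEach((text, row) => {
    for (const gram of ngrams(text, n)) {
      if (!index.has(gram)) index.set(gram, new Set())
      index.get(gram).add(row)
    }
  })
  return index
}

// Return row numbers whose text contains the query (case-insensitive).
function search(index, rows, query, n = 3) {
  const q = query.toLowerCase()
  const grams = [...ngrams(q, n)]
  let candidates
  if (grams.length === 0) {
    // Query is shorter than n, so the index cannot help: scan every row.
    candidates = rows.map((_, i) => i)
  } else {
    // Intersect the posting lists of all query n-grams.
    candidates = [...(index.get(grams[0]) ?? [])]
    for (const gram of grams.slice(1)) {
      const posting = index.get(gram) ?? new Set()
      candidates = candidates.filter(r => posting.has(r))
    }
  }
  // Sharing every n-gram does not guarantee a match, so verify each candidate.
  return candidates.filter(r => rows[r].toLowerCase().includes(q))
}

// Hypothetical sample data standing in for a text column.
const rows = ['Parquet file parser', 'Apache Iceberg table reader', 'Async SQL engine']
const index = buildIndex(rows)
console.log(search(index, rows, 'table'))
```

The index only narrows the set of rows to check; the final substring test is what guarantees correctness, which is why the candidates are verified against the original text.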