Skip to content

document and explain output format #26

@Gldkslfmsd

Description

@Gldkslfmsd

hi,

can you please document and explain the output format? Because @pe-trik 's OmniSTEval uses it as one supported output format.
I want to use OmniSTEval for evaluating output of my project without merging my code with simulstream (unless I'll be convinced that it is most reasonable way).

I saw one example here: https://raw.githubusercontent.com/pe-trik/OmniSTEval/refs/heads/main/examples/simulstream_example/simulstream_log.jsonl

{"model_loading_time": 22.951433420181274}
{"id": 0, "metadata": {"wav_name": "//storage/MT/sara/MUSTC/en-de/data/tst-COMMON_wav/ted_1096.wav"}}
{"id": 0, "total_audio_processed": 1.0, "computation_time": 0.4919137954711914, "generated_tokens": [], "deleted_tokens": []}
{"id": 0, "total_audio_processed": 2.0, "computation_time": 0.1842632293701172, "generated_tokens": [], "deleted_tokens": []}
{"id": 0, "total_audio_processed": 3.0, "computation_time": 0.18722200393676758, "generated_tokens": [], "deleted_tokens": []}
{"id": 0, "total_audio_processed": 4.0, "computation_time": 0.19884991645812988, "generated_tokens": [], "deleted_tokens": []}
...
{"id": 0, "total_audio_processed": 14.0, "computation_time": 0.2456223964691162, "generated_tokens": ["\u2581Zur", "\u00fcck"], "deleted_tokens": []}
{"id": 0, "total_audio_processed": 15.0, "computation_time": 0.24962759017944336, "generated_tokens": ["\u2581in", "\u2581New", "\u2581York", ",", "\u2581ich", "\u2581bin"], "deleted_tokens": []}
  • is model_loading time really used?
  • what is total_audio_processed? is it necessary to have it when the output is empty?
  • what if it is a cascade, and we don't know how many audio ASR processed at the time when LLM translated a piece?
  • do the generated tokens need to be as BPE tokens? What if a model does not use BPE tokens, but full text?

I would suggest this format for STEval tools. What would you think, @pe-trik and @mgaido91 ?

{ document: "doc1.wav", updates: [{ "emission_time": 1.34343, "text": "prefix", "unconfirmed_text": suffix"}, ...] }
{ document: "doc2.wav", updates: [{ "emission_time": 1.3223213, "text": "prefix", "unconfirmed_text": suffix"}, ...] }

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions