Tokenizing a file with a lot of nested open arrays takes almost a minute #55

When I tokenized the files in the https://github.com/Kotlin/kotlinx.serialization repo, the `cl100k_base` tokenizer was very slow on two of them:
- [n_structure_open_array_object.json](https://github.com/Kotlin/kotlinx.serialization/blob/master/formats/json-tests/jvmTest/resources/spec_cases/n_structure_open_array_object.json) took 53.7s to tokenize
- [n_structure_100000_opening_arrays.json](https://github.com/Kotlin/kotlinx.serialization/blob/master/formats/json-tests/jvmTest/resources/spec_cases/n_structure_100000_opening_arrays.json) took 6.9s to tokenize

The rest of the files usually took less than a millisecond each.
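
To reproduce without cloning the repo, here is a minimal timing sketch. It assumes (not verified against the files above) that the two inputs are essentially a long run of unclosed `[` characters and a long run of the unclosed `[{"":` pattern. The helper names and `placeholder_encode` are hypothetical stand-ins, not part of any tokenizer library; swap in the real `cl100k_base` encode call to measure the actual slowdown.

```python
import time
from typing import Callable, Sequence


def make_open_arrays(depth: int) -> str:
    """Build `depth` unclosed '[' characters (assumed shape of the opening-arrays file)."""
    return "[" * depth


def make_open_array_objects(depth: int) -> str:
    """Build `depth` repeats of the unclosed '[{"":' pattern (assumed shape of the other file)."""
    return '[{"":' * depth


def time_encode(encode: Callable[[str], Sequence], text: str) -> float:
    """Return the wall-clock seconds a single encode(text) call takes."""
    start = time.perf_counter()
    encode(text)
    return time.perf_counter() - start


def placeholder_encode(text: str) -> list:
    """Stand-in only: returns UTF-8 bytes so the script runs without a tokenizer installed."""
    return list(text.encode("utf-8"))


if __name__ == "__main__":
    # Depths are illustrative; raise them to see how the timing scales with input size.
    cases = [
        ("open arrays", make_open_arrays(100_000)),
        ("open array objects", make_open_array_objects(50_000)),
    ]
    for name, text in cases:
        seconds = time_encode(placeholder_encode, text)
        print(f"{name}: {len(text)} chars, {seconds:.4f}s")
```

Timing the same call at a few different depths should show whether the cost grows faster than linearly with input length.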
