Memory leak in parse_response usage of pydantic by avilaton · Pull Request #3068 · openai/openai-python

avilaton · 2026-04-07T18:51:31Z

I understand that this repository is auto-generated and my pull request may not be merged

Changes being requested

Address a memory leak in parse_response, detected while running the OpenAI client in a web server context (gunicorn, gevent).

Additional context & links

Problem

client.responses.parse() can trigger sustained memory growth on pydantic >= 2.11 (see #1181).

The cause is that parse_response() passes subscripted runtime generic aliases (for example ParsedResponse[T]) to construct_type_unchecked(). In pydantic v2, this can trigger repeated runtime generic specialization and schema work in a hot path.
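To make the mechanism concrete, here is a toy model of "specialization on every call". It is only a sketch: ToyGenericModel and ToyParsedResponse are hypothetical stand-ins, and pydantic's real generic machinery is more involved than this and is not reproduced here. The point is only that subscripting in a hot path can do per-call work that the bare class does not.

```python
from typing import Any, List


class ToyGenericModel:
    """Toy stand-in for a generic model base (NOT pydantic's implementation).

    Every subscription builds a brand-new subclass, with no cache.
    """

    def __class_getitem__(cls, item: Any) -> type:
        label = getattr(item, "__name__", repr(item))
        return type(f"{cls.__name__}[{label}]", (cls,), {"__toy_arg__": item})


class ToyParsedResponse(ToyGenericModel):
    """Hypothetical model name, used only for illustration."""


def hot_path_subscripted() -> type:
    # Subscripting here runs the specialization code on every call.
    return ToyParsedResponse[int]


def hot_path_bare() -> type:
    # The bare class is a single, already-built object.
    return ToyParsedResponse


# Hold references so the freshly built classes stay alive and comparable.
specialized: List[type] = [hot_path_subscripted() for _ in range(3)]
distinct_specializations = len(set(specialized))
bare_is_stable = hot_path_bare() is hot_path_bare()
```

In this toy, three calls to the subscripted path produce three separate class objects, while the bare path always returns the same one.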

Fix

Use the non-subscripted runtime classes in parse_response() when calling construct_type_unchecked():

ParsedResponseOutputText[TextFormatT] → ParsedResponseOutputText
ParsedResponseOutputMessage[TextFormatT] → ParsedResponseOutputMessage
ParsedResponse[TextFormatT] → ParsedResponse
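The substitutions above amount to replacing each subscripted alias with its origin class. The PR writes the bare class names directly; the sketch below shows the same idea in general form using the standard library's typing.get_origin. ToyParsedResponse and runtime_class are hypothetical names, not part of the SDK.

```python
from typing import Any, Generic, TypeVar, get_origin

TextFormatT = TypeVar("TextFormatT")


class ToyParsedResponse(Generic[TextFormatT]):
    """Hypothetical stand-in for a generic response model."""


def runtime_class(tp: Any) -> Any:
    """Return the unsubscripted class behind a generic alias.

    get_origin() returns None for a bare class, so that case is passed through.
    """
    origin = get_origin(tp)
    return tp if origin is None else origin


# Both spellings resolve to the same runtime class object.
from_alias = runtime_class(ToyParsedResponse[int])
from_bare = runtime_class(ToyParsedResponse)
```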

Why this is safe

construct_type_unchecked() constructs models loosely and does not require runtime generic specialization for correctness here. parse_text() still produces the typed parsed payload, and the return type for callers remains ParsedResponse[TextFormatT].
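A minimal sketch of that separation between static and runtime types, under stated assumptions: toy_construct_unchecked is a hypothetical loose constructor (it is not the SDK's construct_type_unchecked), ToyParsed stands in for ParsedResponse, and text_format plays the role of parse_text(). The runtime object is built from the bare class, while the annotation still promises ToyParsed[T] to callers.

```python
from typing import Any, Callable, Dict, Generic, TypeVar, cast

T = TypeVar("T")


class ToyParsed(Generic[T]):
    """Hypothetical generic container; attributes are set loosely."""

    raw_text: str
    parsed: T


def toy_construct_unchecked(type_: Any, values: Dict[str, Any]) -> Any:
    """Hypothetical loose constructor: sets attributes without validation."""
    obj = type_.__new__(type_)
    obj.__dict__.update(values)
    return obj


def toy_parse_response(text_format: Callable[[str], T], raw_text: str) -> "ToyParsed[T]":
    # The typed payload is produced separately from the container.
    parsed_value = text_format(raw_text)
    # Only the bare class is passed at runtime; nothing is subscripted here.
    built = toy_construct_unchecked(
        ToyParsed, {"raw_text": raw_text, "parsed": parsed_value}
    )
    # A string target keeps cast() from evaluating ToyParsed[T] on each call.
    return cast("ToyParsed[T]", built)


result = toy_parse_response(int, "42")
```

A type checker sees result as ToyParsed[int], while type(result) at runtime is the plain ToyParsed class.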

Tests

Added a regression test in tests/lib/responses/test_parsing.py that fails if parse_response() routes through _validate_non_model_type.
Added an opt-in memory characterization test (OPENAI_RUN_MEMORY_TESTS=1) in tests/lib/responses/test_parsing.py.
Ran responses suites against the mock server: 158 passed.
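For readers who want to reproduce an opt-in memory check of this kind, here is a rough sketch built on the standard library's tracemalloc. It is not the test added in this PR: traced_growth, the 64 KiB threshold, and the placeholder workload are assumptions, and the sketch uses unittest rather than the repository's own test runner. Only the OPENAI_RUN_MEMORY_TESTS=1 gate comes from the description above.

```python
import os
import tracemalloc
import unittest
from typing import Callable

# Opt-in gate: the memory test is skipped unless the variable is set to "1".
RUN_MEMORY_TESTS = os.environ.get("OPENAI_RUN_MEMORY_TESTS") == "1"


def traced_growth(fn: Callable[[], object], iterations: int = 200) -> int:
    """Return the net traced bytes still allocated after calling fn repeatedly."""
    fn()  # warm-up, so one-time caches are not counted as growth
    tracemalloc.start()
    try:
        before, _peak = tracemalloc.get_traced_memory()
        for _ in range(iterations):
            fn()
        after, _peak = tracemalloc.get_traced_memory()
    finally:
        tracemalloc.stop()
    return after - before


@unittest.skipUnless(RUN_MEMORY_TESTS, "set OPENAI_RUN_MEMORY_TESTS=1 to run")
class MemoryCharacterization(unittest.TestCase):
    def test_steady_state(self) -> None:
        # Placeholder workload; a real test would call the parsing path here.
        growth = traced_growth(lambda: bytearray(1024))
        # Assumed threshold, chosen only for illustration.
        self.assertLess(growth, 64 * 1024)
```

A workload that retains its allocations (for example, appending each buffer to a list) shows growth roughly proportional to the iteration count, while one that discards them stays near zero.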

savvasp-123 · 2026-04-13T18:32:42Z

This is a real issue

afurm · 2026-04-13T19:03:09Z

cast(Any, ParsedResponseOutputText) bypasses the type checker entirely, making it impossible to catch accidental type mismatches in refactors. A safer pattern would be to define non-generic aliases at the module level — e.g., _ParsedResponseOutputTextBase = ParsedResponseOutputText.__pydantic_generic_metadata__['args'][0] — and use those directly. This preserves type-checker coverage while avoiding the subscripted generic path that triggers the pydantic overhead. Is there a reason cast(Any, ...) was preferred over a named type alias here?

avilaton · 2026-04-13T19:47:07Z

No preference. We are just trying to contain a memory leak that caused us a lot of headaches, which are much worse than any type-checking error we might have wanted to avoid. Any solution to the problem works; we are only consumers of the library and wanted to raise the issue. Thanks for replying. We did see at least 3 memory-related issues, and this one made it to production for us before we noticed the OOMKilled errors on our pods.

savvasp-123 · 2026-04-13T19:49:37Z

I tried this PR and it didn't seem to work for me. Perhaps I did something incorrectly. But there are definitely memory leak issues with the Responses async .parse()

fix(responses): avoid runtime generic specialization in parse_response

ca13a91

avilaton requested a review from a team as a code owner April 7, 2026 18:51

avilaton changed the title ~~fix(responses): avoid runtime generic specialization in parse_response~~ Memory leak in parse_response usage of pydantic Apr 7, 2026

avilaton mentioned this pull request on Apr 10, 2026 in #2860, "fix(structured outputs): resolve memory leak in parse methods" (Merged)

avilaton wants to merge 1 commit into openai:main from avilaton:fix/responses-parse-memory-leak