[C API] Add `PyTupleWriter` API

# Feature or enhancement

Hi,

Creating a tuple with the current C API has multiple issues:

* `PyTuple_SetItem()` and `PyTuple_SET_ITEM()` modify an **immutable** tuple.
* `PyTuple_New()` creates an incomplete object: items are set to `NULL`. This is bad:
  * https://github.com/capi-workgroup/problems/issues/56 
  * https://github.com/capi-workgroup/api-evolution/issues/36
* `PyTuple_New()` tracks directly the tuple in the garbage collector. For example, `gc.get_objects()` gives access to the incomplete tuple. Using the tuple, like calling `repr(tuple)`, can crash Python.
* [`_PyTuple_Resize()`](https://docs.python.org/dev/c-api/tuple.html#c._PyTuple_Resize) is private which is surprising for a documented API. It stays private because it has issues.
* `_PyTuple_Resize()` modifies an **immutable** tuple.
* `_PyTuple_Resize()` must not be used of the refcount is greater than `1`: the API is fragile.

I propose adding a new efficient `PyTupleWriter` API: work on a temporary "writer" object, and then call `Finish()` on it to get the tuple.

I already proposed a similar API in 2023: [`_PyTupleBuilder`](https://github.com/python/cpython/pull/107139). Since that, Python C API got the [`PyUnicodeWriter` API](https://docs.python.org/dev/c-api/unicode.html#pyunicodewriter) and the [`PyBytesWriter` API](https://docs.python.org/dev/c-api/bytes.html#pybyteswriter) which are efficient writers for `str` and `bytes` objects, and the C API Working Group was created. The proposed API is now public and allocates the structure on the heap memory to hide the implementation details (the structure).

Mark Shannon asked if it would be possible to work on a list and then convert the list to a tuple, but it's [less efficient](https://github.com/python/cpython/issues/107137#issuecomment-1689718305). Tuples are commonly used in Python, and so creating a tuple should be efficient.

Mark Shannon also tried to initialize tuple items to `None` instead of `NULL` in `PyTuple_New()`. His attempt failed because of implementation issues. Also, this change only fix some of the issues that I listed, not all of them.

An alternative is to fill a C array of Python objects, call the new `PyTuple_FromArray()`, and then call `Py_DECREF()` on the array items. It requires to allocate and deallocate an array, and call `Py_DECREF()` on items. It can be less efficient.


### Linked PRs
* gh-139891
* gh-140129

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[C API] Add `PyTupleWriter` API #139888

Feature or enhancement

Linked PRs

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Uh oh!

[C API] Add PyTupleWriter API #139888

Description

Feature or enhancement

Linked PRs

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions

[C API] Add `PyTupleWriter` API #139888