SFT (local backend) #530
base: main
Conversation
Move batching and shuffling logic from SFTConfig into iterator functions. train_sft now accepts Iterable[List[Trajectory]] instead of individual trajectories, simplifying the API and making batch management more explicit.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
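For context, a rough sketch of what the caller side might look like under the new signature; the `art.Trajectory` import path, the `batched_trajectories` helper, and the `model.train_sft` entry point are assumptions, not code from this PR:

```python
import random
from typing import Iterable, List

from art import Trajectory  # assumed import path


def batched_trajectories(
    trajectories: List[Trajectory], batch_size: int, seed: int = 0
) -> Iterable[List[Trajectory]]:
    """Shuffle once and yield fixed-size batches (hypothetical helper)."""
    rng = random.Random(seed)
    shuffled = list(trajectories)
    rng.shuffle(shuffled)
    for i in range(0, len(shuffled), batch_size):
        yield shuffled[i : i + batch_size]


# Usage (assumed signature):
# await model.train_sft(batched_trajectories(trajectories, batch_size=2))
```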
src/art/types.py
Outdated
class SFTConfig(pydantic.BaseModel):
    learning_rate: float = 1e-4
- Remove custom_lr_schedule
- Make learning_rate: float | list[float] (see the sketch below)
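A minimal sketch of the suggested shape, purely illustrative (the default value is an assumption):

```python
import pydantic


class SFTConfig(pydantic.BaseModel):
    # Either a single constant learning rate, or one learning rate per batch;
    # this replaces the separate custom_lr_schedule field.
    learning_rate: float | list[float] = 1e-4
```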
    Used to identify where assistant turns begin (train on responses only).
    """

    instruction_part: str
Can we keep this class empty for now? I'm not sure instruction_part and response_part are a good fit for an experimental feature.
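If the fields are dropped for now, the experimental config could simply be an empty placeholder; a sketch with a hypothetical class name:

```python
import pydantic


class ChatTemplateConfig(pydantic.BaseModel):  # hypothetical name for the class above
    """Experimental; intentionally left empty until the field set is settled."""
```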
src/art/local/backend.py
Outdated
batch_size = 2  # Default to 2 for SFT

# Determine learning rates
if config.custom_lr_schedule and len(config.custom_lr_schedule) > 0:
- Refactor/Remove custom_lr_schedule; learning_rate is float | list[float]
- Add validation for num_learning_rate == num_batches (see the sketch below)
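A hedged sketch of the requested validation; `resolve_learning_rates` is a hypothetical helper name:

```python
def resolve_learning_rates(
    learning_rate: float | list[float], num_batches: int
) -> list[float]:
    """Expand a scalar LR to one per batch, or validate a per-batch schedule."""
    if isinstance(learning_rate, float):
        return [learning_rate] * num_batches
    if len(learning_rate) != num_batches:
        raise ValueError(
            f"custom learning-rate schedule has {len(learning_rate)} entries "
            f"but there are {num_batches} batches"
        )
    return list(learning_rate)
```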
src/art/unsloth/service.py
Outdated
# Save checkpoint after training
# Name checkpoint by final training step: starting_step + num_batches
final_step = get_step_from_dir(self.output_dir) + len(sft_batches)
The checkpoint step should still be incremented by 1. Checkpoint step != gradient step.
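So the line above would plausibly become something like the following (a sketch, reusing the names from the diff):

```python
# One checkpoint per train_sft call: advance the checkpoint step by 1,
# even though the call may have run len(sft_batches) gradient steps.
final_step = get_step_from_dir(self.output_dir) + 1
```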
    response_part="<|im_start|>assistant\n",
),
# Qwen 3 models (with thinking tokens)
"Qwen/Qwen3-8B": ModelConfig(
- How did we decide to support all of these models?
- Prefer to keep it simple and start with the models that are widely used in the OpenPipe Platform and ART?
- Research the Qwen chat template; IIRC, <think></think> only shows up in the last turn. We may need to remove <think></think> from response_part for Qwen (see the sketch below).
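A sketch of what that could look like for the Qwen 3 entry; `ModelConfig` and the field names come from the diff, while the exact `instruction_part` string is an assumption:

```python
# Qwen 3 models: <think></think> typically appears only in the final assistant
# turn, so match on the plain assistant header rather than including it.
"Qwen/Qwen3-8B": ModelConfig(
    instruction_part="<|im_start|>user\n",
    response_part="<|im_start|>assistant\n",  # no <think></think> suffix
),
```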
    progress_bar.close()


def iterate_file(
Have iterate_file take in an epoch parameter. See the following PR for reference.
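A minimal sketch of `iterate_file` taking an `epoch` argument; the per-epoch shuffle and the use of `json.loads` in place of `_parse_jsonl_line` are assumptions:

```python
import json
import random
from typing import Iterator


def iterate_file(path: str, epoch: int) -> Iterator[dict]:
    """Yield parsed JSONL records, reshuffled deterministically for each epoch."""
    with open(path) as f:
        lines = [line for line in f if line.strip()]
    random.Random(epoch).shuffle(lines)
    for line in lines:
        yield json.loads(line)
```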
        yield _parse_jsonl_line(line)


async def train_sft_from_file(
Modify this so the user can keep training running after closing their laptop (see the sketch after this list):
- iterate_file(file, epoch)
- Write to local disk
- Upload to a wandb artifact
- Calculate the lr
- Call train_sft(url, lr)
- Monitor training status
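A hedged sketch of that sequencing; every helper below (`write_batches_to_disk`, `upload_to_wandb_artifact`, `resolve_learning_rates`, `poll_training_status`) and the `train_sft(url, lrs)` signature are hypothetical and only illustrate the flow:

```python
async def train_sft_from_file(
    model, file_path: str, epochs: int, base_lr: float = 1e-4
) -> None:
    """Run SFT from a JSONL file so training survives the client disconnecting."""
    for epoch in range(epochs):
        # 1. Read this epoch's batches from the file.
        batches = list(iterate_file(file_path, epoch))
        # 2. Persist locally, then upload as a W&B artifact so the backend can
        #    fetch the data independently of the client machine.
        local_path = write_batches_to_disk(batches, epoch)
        artifact_url = upload_to_wandb_artifact(local_path)
        # 3. Compute the learning-rate schedule for this epoch.
        lrs = resolve_learning_rates(base_lr, num_batches=len(batches))
        # 4. Hand off by URL and poll status instead of streaming from the client.
        job = await model.train_sft(artifact_url, lrs)
        await poll_training_status(job)
```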