Poor reproduction performance #3

Description

@serenity-ruochen

Thank you for the outstanding work. When I tried to reproduce the paper, I obtained the results below, which are far from the R2R-CE performance reported in your Table 1. Could you help me figure out the cause? Thank you!

2026-06-14 17:27:36,855 Episodes evaluated: 1839
2026-06-14 17:27:36,855 Average episode steps_taken: 324.125612
2026-06-14 17:27:36,855 Average episode distance_to_goal: 8.485219
2026-06-14 17:27:36,855 Average episode success: 0.132137
2026-06-14 17:27:36,855 Average episode oracle_success: 0.275693
2026-06-14 17:27:36,855 Average episode path_length: 40.218249
2026-06-14 17:27:36,855 Average episode spl: 0.062156
100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 1839/1839 [21:41:55<00:00, 42.48s/it]
Evaluation completed!
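
To make the comparison with Table 1 easier, here is a minimal sketch that pulls the averaged metrics out of the log above and prints the headline ones as percentages. The helper name `parse_metrics` and the regular expression are my own assumptions for illustration; they are not part of the repository's evaluation code.

```python
import re

# Log lines copied from the evaluation output above.
LOG = """\
2026-06-14 17:27:36,855 Episodes evaluated: 1839
2026-06-14 17:27:36,855 Average episode steps_taken: 324.125612
2026-06-14 17:27:36,855 Average episode distance_to_goal: 8.485219
2026-06-14 17:27:36,855 Average episode success: 0.132137
2026-06-14 17:27:36,855 Average episode oracle_success: 0.275693
2026-06-14 17:27:36,855 Average episode path_length: 40.218249
2026-06-14 17:27:36,855 Average episode spl: 0.062156
"""

# Hypothetical pattern: matches "Average episode <name>: <value>".
PATTERN = re.compile(r"Average episode (\w+): ([0-9.]+)")


def parse_metrics(log_text):
    """Return a dict mapping metric name to its float value."""
    return {name: float(value) for name, value in PATTERN.findall(log_text)}


metrics = parse_metrics(LOG)

# Print the metrics usually reported as percentages.
for name in ("success", "oracle_success", "spl"):
    print(f"{name}: {100 * metrics[name]:.2f}%")
```

For the log above this gives roughly 13.21% success, 27.57% oracle success, and 6.22% SPL, which are the numbers I am comparing against the R2R-CE row of Table 1.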
