Is Equation 10 in the paper correct?

Thanks for your remarkable work.

I have a question about Eq. 10 in your [arXiv paper](https://www.arxiv.org/pdf/2502.11494).

<img width="935" alt="Image" src="https://github.com/user-attachments/assets/ab5543ba-6eaa-4df6-93bf-27620c17ee12" />

The equation computes the retained set using pivot tokens. I understand that retained tokens should have low similarity with the pivots, but the **latter union** operation confuses me: it means that if a token has a low duplication score with any single pivot, it will be retained even if it has high duplication scores with the other pivots. Is that reasonable? It seems that the correct way is to use an intersection, so that only tokens with a low duplication score against all pivots are retained.
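To make the difference concrete, here is a minimal sketch of how I read the two options. It assumes Eq. 10 builds one set per pivot (tokens whose duplication score with that pivot is at most a threshold) and then combines those sets. The score values, the threshold `EPSILON`, and the function names are all hypothetical and are not taken from the paper or its code.

```python
# Hypothetical duplication scores: scores[token][i] is the score between
# that token and pivot i. Values are made up for illustration only.
scores = {
    "t0": [0.9, 0.1],   # duplicates pivot 0, differs from pivot 1
    "t1": [0.2, 0.3],   # differs from both pivots
    "t2": [0.8, 0.95],  # duplicates both pivots
}

EPSILON = 0.5  # assumed threshold for a "low" duplication score


def retained_union(scores, eps):
    """Union over pivots: keep a token if it is low against ANY pivot."""
    return {tok for tok, row in scores.items() if any(s <= eps for s in row)}


def retained_intersection(scores, eps):
    """Intersection over pivots: keep a token only if it is low against ALL pivots."""
    return {tok for tok, row in scores.items() if all(s <= eps for s in row)}


print(sorted(retained_union(scores, EPSILON)))
print(sorted(retained_intersection(scores, EPSILON)))
```

With the union, `t0` is kept even though it nearly duplicates pivot 0. With the intersection, only `t1` is kept.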

I hope to hear back from you. Thanks.
