Hi Yeon, may I ask the questions about the ranking strategy? From my understanding, the direct rank for contrastive learning actually has little help for the SimCSE model, however, it largely enhances the performance when we conduct the distillation. I am very glad if you could correct me if anything I misunderstood.
Does this mean that the ranking strategy is actually good for distilling information from teachers, what kind of information is unsupervised distilled?
Hi Yeon, may I ask the questions about the ranking strategy? From my understanding, the direct rank for contrastive learning actually has little help for the SimCSE model, however, it largely enhances the performance when we conduct the distillation. I am very glad if you could correct me if anything I misunderstood.
Does this mean that the ranking strategy is actually good for distilling information from teachers, what kind of information is unsupervised distilled?