Would you be able to publish the sentence-level human judgement data for calculate correlation of Automatic Metrics. Thank you.