nice job! In stage2, does `depth_pred = self.model.decode_depth(rgb_latent)` output the square sqrt of the depth, or is it the depth itself?