The e_control parameter fixed in the [...] is not used correctly in the VarianceAdaptor.forward function of work/FastSpeech2/model/modules.py: p_control is passed to get_energy_embedding in its place.
Currently:
if self.energy_feature_level == "phoneme_level":
    energy_prediction, energy_embedding = self.get_energy_embedding(
        x, energy_target, src_mask, p_control
    )
    x = x + energy_embedding
and
if self.energy_feature_level == "frame_level":
    energy_prediction, energy_embedding = self.get_energy_embedding(
        x, energy_target, mel_mask, p_control
    )
    x = x + energy_embedding
Fix:
if self.energy_feature_level == "phoneme_level":
    energy_prediction, energy_embedding = self.get_energy_embedding(
        x, energy_target, src_mask, e_control
    )
    x = x + energy_embedding
and
if self.energy_feature_level == "frame_level":
    energy_prediction, energy_embedding = self.get_energy_embedding(
        x, energy_target, mel_mask, e_control
    )
    x = x + energy_embedding
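
To illustrate why the argument matters, here is a minimal, torch-free sketch. It assumes (this is not shown in the snippets above) that get_energy_embedding multiplies the predicted energy by its control argument when no target is given, and then quantizes the result into bins. The bin boundaries, the toy helper, and the forward_buggy / forward_fixed names are hypothetical and exist only for this example.

```python
from bisect import bisect_left

# Hypothetical energy bin boundaries, for illustration only.
ENERGY_BINS = [0.25, 0.5, 0.75]


def get_energy_embedding(prediction, target, control):
    """Toy stand-in: returns (prediction, bin indices used for the lookup)."""
    if target is not None:
        # Training path: bins come from the ground truth, control is unused.
        bins = [bisect_left(ENERGY_BINS, t) for t in target]
    else:
        # Inference path (assumed behavior): scale the prediction by control.
        prediction = [p * control for p in prediction]
        bins = [bisect_left(ENERGY_BINS, p) for p in prediction]
    return prediction, bins


def forward_buggy(raw, p_control=1.0, e_control=1.0):
    # Mirrors the current code: the pitch control is passed for energy.
    return get_energy_embedding(raw, None, p_control)


def forward_fixed(raw, p_control=1.0, e_control=1.0):
    # Mirrors the proposed fix: the energy control is passed for energy.
    return get_energy_embedding(raw, None, e_control)


raw = [0.2, 0.4]
buggy_pred, buggy_bins = forward_buggy(raw, e_control=2.0)
fixed_pred, fixed_bins = forward_fixed(raw, e_control=2.0)
```

Under these assumptions, the buggy call leaves the energy untouched when only e_control is changed, while the fixed call scales it and selects different bins.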