Why clamp inputs to all tanh calls?

https://github.com/dalab/hyperbolic_nn/blob/45be2f6698706bba48bb015e7a8ed32408d0cf6e/util.py#L26


We are trying to reimplement the layers proposed by the [Hyperbolic Neural Networks]( https://arxiv.org/abs/1805.09112) paper. We use float64 instead of float32 for the entire model and inputs. Hence, we avoid numerical instability. However, if we do not clamp the inputs to the tanh functions between (-15, 15), the network does not seem to train at all. It would be great if you could provide a reason for doing this and for picking the value of 15. 

PS: I really liked the paper and thank you for making the code available. 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Why clamp inputs to all tanh calls? #1

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Uh oh!

Why clamp inputs to all tanh calls? #1

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions