Book ; Online: Optimization Dynamics of Equivariant and Augmented Neural Networks
2023
Abstract: We investigate the optimization of multilayer perceptrons on symmetric data. We compare the strategy of constraining the architecture to be equivariant to that of using augmentation. We show that, under natural assumptions on the loss and non-linearities, ...
Abstract | We investigate the optimization of multilayer perceptrons on symmetric data. We compare the strategy of constraining the architecture to be equivariant to that of using augmentation. We show that, under natural assumptions on the loss and non-linearities, the sets of equivariant stationary points are identical for the two strategies, and that the set of equivariant layers is invariant under the gradient flow for augmented models. Finally, we show that stationary points may be unstable for augmented training although they are stable for the equivariant models. Comment: v2: Revised manuscript. Mostly small edits, apart from new experiments (see Appendix E) |
---|---|
Keywords | Computer Science - Machine Learning ; Mathematics - Optimization and Control ; 68T07 ; 20C35 ; 37N40 |
Publishing date | 2023-03-23 |
Publishing country | us |
Document type | Book ; Online |
Database | BASE - Bielefeld Academic Search Engine (life sciences selection) |
Full text online
More links
Kategorien
Inter-library loan at ZB MED
Your chosen title can be delivered directly to ZB MED Cologne location if you are registered as a user at ZB MED Cologne.