Tensor dropout for robust learning

Kolbeinsson, A.; Kossaifi, J.; Panagakis, Y.; Bulat, A.; Kumar, A.A.; Tzoulaki, I.; Matthews, P.M.

doi:10.1109/JSTSP.2021.3064182

Μονάδα:

Ερευνητικό υλικό ΕΚΠΑ

Τίτλος:

Tensor dropout for robust learning

Γλώσσες Τεκμηρίου:

Αγγλικά

Περίληψη:

CNNs achieve high levels of performance by leveraging deep, over-parametrized neural architectures, trained on large datasets. However, they exhibit limited generalization abilities outside their training domain and lack robustness to corruptions such as noise and adversarial attacks. To improve robustness and obtain more computationally and memory efficient models, better inductive biases are needed. To provide such inductive biases, tensor layers have been successfully proposed to leverage multi-linear structure through higher-order computations. In this paper, we propose tensor dropout, a randomization technique that can be applied to tensor factorizations, such as those parametrizing tensor layers. In particular, we study tensor regression layers, parametrized by low-rank weight tensors and augmented with our proposed tensor dropout. We empirically show that our approach improves generalization for image classification on ImageNet and CIFAR-100. We also establish state-of-the-art accuracy for phenotypic trait prediction on the largest available dataset of brain MRI (U.K. Biobank), where multi-linear structure is paramount. In all cases, we demonstrate superior performance and significantly improved robustness, both to noisy inputs and to adversarial attacks. We establish the theoretical validity of our approach and the regularizing effect of tensor dropout by demonstrating the link between randomized tensor regression with tensor dropout and deterministic regularized tensor regression. © 2007-2012 IEEE.

Έτος δημοσίευσης:

2021

Συγγραφείς:

Kolbeinsson, A.
Kossaifi, J.
Panagakis, Y.
Bulat, A.
Kumar, A.A.
Tzoulaki, I.
Matthews, P.M.

Περιοδικό:

IEEE Journal on Selected Topics in Signal Processing

Εκδότης:

Institute of Electrical and Electronics Engineers, Inc. (IEEE)

Τόμος:

Αριθμός / τεύχος:

Σελίδες:

630-640

Λέξεις-κλειδιά:

Image enhancement; Large dataset; Magnetic resonance imaging, Generalization ability; Linear structures; Memory efficient; Neural architectures; Phenotypic traits; Randomization techniques; State of the art; Tensor factorization, Tensors

Επίσημο URL (Εκδότης):

https://doi.org/10.1109/JSTSP.2021.3064182

DOI:

10.1109/JSTSP.2021.3064182

Μόνιμη διεύθυνση:

https://pergamos.lib.uoa.gr/uoa/dl/object/3034967

Tensor dropout for robust learning

Το ψηφιακό υλικό του τεκμηρίου δεν είναι διαθέσιμο.