Deakin University
Browse

File(s) under embargo

Sparse subnetwork inference for neural network epistemic uncertainty estimation with improved Hessian approximation

Version 2 2024-06-01, 02:28
Version 1 2024-04-08, 04:05
journal contribution
posted on 2024-06-01, 02:28 authored by Yinsong ChenYinsong Chen, Samson YuSamson Yu, Jason K Eshraghian, Chee Peng Lim
Despite significant advances in deep neural networks across diverse domains, challenges persist in safety-critical contexts, including domain shift sensitivity and unreliable uncertainty estimation. To address these issues, this study investigates Bayesian learning for uncertainty handling in modern neural networks. However, the high-dimensional, non-convex nature of the posterior distribution poses practical limitations for epistemic uncertainty estimation. The Laplace approximation, as a cost-efficient Bayesian method, offers a practical solution by approximating the posterior as a multivariate normal distribution but faces computational bottlenecks in precise covariance matrix computation and storage. This research employs subnetwork inference, utilizing only a subset of the parameter space for Bayesian inference. In addition, a Kronecker-factored and low-rank representation is explored to reduce space complexity and computational costs. Several corrections are introduced to converge the approximated curvature to the exact Hessian matrix. Numerical results demonstrate the effectiveness and competitiveness of this method, whereas qualitative experiments highlight the impact of Hessian approximation granularity and parameter space utilization in Bayesian inference on mitigating overconfidence in predictions and obtaining high-quality uncertainty estimates.

History

Journal

APL Machine Learning

Volume

2

Location

Melville, NY

eISSN

2770-9019

Language

eng

Publication classification

C1 Refereed article in a scholarly journal

Issue

2

Publisher

AIP Publishing

Usage metrics

    Research Publications

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC