Monitizer: Automating Design and Evaluation of Neural Network Monitors

Azeem, Muqsit; Grobelna, Marta; Kanav, Sudeep; Kretinsky, Jan; Mohr, Stefanie; Rieder, Sabine

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2405

Computer Science > Machine Learning

Title: Monitizer: Automating Design and Evaluation of Neural Network Monitors

Authors: Muqsit Azeem, Marta Grobelna, Sudeep Kanav, Jan Kretinsky, Stefanie Mohr, Sabine Rieder

(Submitted on 16 May 2024)

Abstract: The behavior of neural networks (NNs) on previously unseen types of data (out-of-distribution or OOD) is typically unpredictable. This can be dangerous if the network's output is used for decision-making in a safety-critical system. Hence, detecting that an input is OOD is crucial for the safe application of the NN. Verification approaches do not scale to practical NNs, making runtime monitoring more appealing for practical use. While various monitors have been suggested recently, their optimization for a given problem, as well as comparison with each other and reproduction of results, remain challenging. We present a tool for users and developers of NN monitors. It allows for (i) application of various types of monitors from the literature to a given input NN, (ii) optimization of the monitor's hyperparameters, and (iii) experimental evaluation and comparison to other approaches. Besides, it facilitates the development of new monitoring approaches. We demonstrate the tool's usability on several use cases of different types of users as well as on a case study comparing different approaches from recent literature.

Comments:	accepted at CAV 2024
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Software Engineering (cs.SE)
Cite as:	arXiv:2405.10350 [cs.LG]
	(or arXiv:2405.10350v1 [cs.LG] for this version)

Submission history

From: Stefanie Mohr [view email]
[v1] Thu, 16 May 2024 13:19:51 GMT (486kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2405.10350

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Monitizer: Automating Design and Evaluation of Neural Network Monitors

Submission history