Using Nagios to monitor the Telescope Manager (TM) of the Square Kilometre Array (SKA)

Matteo Canzari,Matteo Di Carlo,Mauro Dolci,Riccardo Smareglia
DOI: https://doi.org/10.48550/arXiv.1902.07575
2019-02-20
Abstract:SKA (Square Kilometer Array), currently under design, will be a huge radio-astronomical facility, whose management will be performed by a suite of software applications called Telescope Manager (SKA TM) via the TANGO framework. In order to ensure the proper and uninterrupted operation of TM, a local monitoring and control system (<a class="link-external link-http" href="http://TM.LMC" rel="external noopener nofollow">this http URL</a>) is being developed, with the goal to perform monitoring, lifecycle control and fault management of TM. For the monitoring activity, central in <a class="link-external link-http" href="http://TM.LMC" rel="external noopener nofollow">this http URL</a>, Nagios (automated by the lifecycle management tool Chef) has been proposed as main toolkit to check resources, services and status of every TM application both at generic and performance level: for this latter purpose, a custom agent has been developed. This led to an integrated fault management module, based on Nagios-Chef integration, which can efficiently handle any abnormal situation
Instrumentation and Methods for Astrophysics
What problem does this paper attempt to address?