ASPCS
 
Back to Volume
Paper: Empowering SKA Data Challenges: A Homogeneous Platform for Enhanced Collaboration and Scalability Fully Aligned with Open Science
Volume: 541, ADASS XXXIII
Page: 361
Authors: Manuel Parra-Royón; James Collinson; Jesús Sánchez-Castañeda; Pablo Llopis-Sanmillán; Carolina Lindqvist; MA Mendoza; Susana Sánchez-Expósito; Laura Darriba; Javier Moldón; Julián Garrido; Jesús Salgado; Lourdes Verdes-Montenegro
DOI: 10.26624/PONQ1847
Abstract: The Square Kilometre Array Observatory (SKAO) is leading a collaborative effort to build and operate an advanced radio telescope. The SKAO Science Data Challenges (SDCs) are competitions aimed at fostering innovative data analysis techniques for the substantial data expected from the SKAO. Traditionally, these challenges have relied on computing resources from various scientific institutions, resulting in an uneven user experience due to differing configurations of Virtual Machines (VMs) or high-performance computing (HPC) resources. To address this, our proposal suggests implementing a uniform platform for computing and storage resources to ensure fairness, scalability, and consistent collaboration among teams. This homogeneous setup would simplify resource management, support, and evaluation, ultimately enhancing efficiency and producing dependable results. In this context, JupyterHub facilitates the provisioning of compute resources via Kubernetes, offering user demand scaling and centralized authentication. We present the configuration process of a Kubernetes cluster and the setup for BinderHub/JupyterHub, along with a practical use case demonstrating data analysis and workflow in radio astronomy, leveraging Dask for parallel and distributed computing on large cloud-based Kubernetes clusters. To test this environment we have deploy it in several nodes within the SRCNet cloud platforms, ESP-SRC (Spain), CHSRC (Switzerland) and SKAO (UK), along with the resources of the CESGA supercomputing center (Spain).
Back to Volume