The architecture and administration of the ATLAS online computing system

2006 
The needs of ATLAS experiment at the upcoming LHC accelerator, CERN, in terms of data transmission rates and processing power require a large cluster of computers (of the order of thousands) administrated and exploited in a coherent and optimal manner. Requirements like stability, robustness and fast recovery in case of failure impose a server- client system architecture with servers distributed in a tree like structure and clients booted from the network. For security reasons, the system should be accessible only through an application gateway and, also to ensure the autonomy of the system, the network services should be provided internally by dedicated machines in synchronization with CERN IT depart me nt ' s central services. The paper describes a small scale implementation of the system architecture that fits the given requirements and constraints. Emphasis is put on the mechanisms and tools used to net boot the clients via the “Boot With Me” project and to synchronize information within the cluster via the “Nile” tool.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    1
    References
    6
    Citations
    NaN
    KQI
    []