Monitoring and control of large-scale distributed systems

2016 
An important part of managing large-scale, distributed computing systems is a monitoring service that is able to monitor and track in real-time many site facilities, networks, and tasks in progress. The monitoring information gathered is essential for developing the required higher level services, the components that provide decision support and some degree of automated decisions and for maintaining and optimizing workflow in large-scale distributed systems. Our strategy in trying to satisfy the demands of data intensive applications was to move to more synergetic relationships between the applications, computing and storage facilities and the network infrastructure. These orchestration and global optimization functions are performed by higher-level agent-based services which are able to collaborate and cooperate in performing a wide range of distributed information-gathering and processing tasks.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    0
    References
    1
    Citations
    NaN
    KQI
    []