1
Monitoring and Workflow management Monitoring and Workflow management in large distributed systems in large distributed systems
March 2011
Monitoring and Workflow management Monitoring and Workflow - - PowerPoint PPT Presentation
Monitoring and Workflow management Monitoring and Workflow management in large distributed systems in large distributed systems March 2011 1 The MonALISA Framework The MonALISA Framework MonALISA is a Dynamic, Distributed Service System
1
March 2011
Iosif Legrand March 2011
2 2
any type of information from different systems, to analyze it in near real time and to provide support for automated control decisions and global
multi-threaded, self-describing agent-based subsystems which are registered as dynamic services, and are able to collaborate and cooperate in performing a wide range of monitoring tasks. These agents can analyze and process the information, in a distributed way, and to provide optimization decisions in large scale distributed applications.
3
3
Regional or Global High Level Regional or Global High Level Services, Services, Repositories & Clients Repositories & Clients Secure and reliable communication Secure and reliable communication Dynamic load balancing Dynamic load balancing Scalability & Replication Scalability & Replication AAA for Clients AAA for Clients Distributed Dynamic Distributed Dynamic Registration and Discovery- Registration and Discovery- based on a lease based on a lease mechanism and remote events mechanism and remote events
JINI-Lookup Services Secure & Public MonALISA services Proxies HL services Agents Network of
Distributed System for gathering and Distributed System for gathering and analyzing information based on analyzing information based on mobile agents: mobile agents: Customized aggregation, Triggers, Customized aggregation, Triggers, Actions Actions
Iosif Legrand March 2011
4
4
Data Store Data Cache Service & DB Configuration Control (SSL)
Predicates & Agents Data (via ML Proxy)
Applications
Clients or Higher Level Services
WS Clients and service
Web Service WSDL SOAP
Lookup Service Lookup Service
Registration D i s c
e r y
Postgres AGENTS AGENTS FILTERS / TRIGGERS FILTERS / TRIGGERS
Monitoring Modules Monitoring Modules
Collects any type of information
Dynamic Loading
Push and Pull
Iosif Legrand March 2011
5
Lookup Service
MonALISA Service
Lookup Service Client (other service)
Discovery Registration (signed certificate)
MonALISA Service MonALISA Service
Services Proxy Multiplexer Services Proxy Multiplexer Client (other service)
Admin SSL connection Trust keystore AAA services Client authentication
Data Data Filters & Agents Filters & Agents
Trust keystore
Application Applications
Iosif Legrand March 2011
6
6
TOPOLOGY JOBS ACCOUNTING
Running Jobs
Iosif Legrand March 2011
7
CMS is using MonALISA and ApMon to monitor all the production and analysis
More than 3 years continuous operation without any problems
Organize and structure Monitoring Information
Rate of collected monitoring values Total Collected values Collected ~5* 1010 monitoring values in the last 12 months Rates up to more than 6000 values per second Lost in UDP < 5*10-6
Iosif Legrand March 2011
8
User-level task monitoring
Organize and structure Monitoring Information
Iosif Legrand March 2011
9
User-level task monitoring
Organize and structure Monitoring Information
Iosif Legrand March 2011
10
10
Long History DB
LCG Tools
MonALISA @Site
ApMon AliEn Job Agent ApMon AliEn Job Agent ApMon AliEn Job Agent
MonALISA @CERN MonALISA LCG Site
ApMon AliEn CE ApMon AliEn SE ApMon Cluster Monitor ApMon AliEn TQ ApMon AliEn Job Agent ApMon AliEn Job Agent ApMon AliEn Job Agent ApMon AliEn CE ApMon AliEn SE ApMon Cluster Monitor ApMon AliEn IS ApMon AliEn Optimizers ApMon AliEn Brokers ApMon MySQL Servers ApMon CastorGrid Scripts ApMon API Services
A g g r e g a t e d D a t a rss vsz c p u t i m e run time job slots f r e e s p a c e n r .
f i l e s
files
Q u e u e d J
A g e n t s
cpu ksi2k job status d i s k u s e d
processes
l
d net In/out jobs status sockets migrated mbytes active sessions MyProxy status Alerts Actions
Iosif Legrand March 2011
11
Iosif Legrand March 2011
12
Iosif Legrand March 2011
Iosif Legrand August 2009
14
Iosif Legrand March 2011
15
Iosif Legrand March 2011
16
Two levels of decisions:
local (autonomous), global (correlations).
Actions triggered by:
values above/below given thresholds, absence/presence of values, correlations between any values.
Action types:
alerts (emails/instant msg/atom feeds), automatic charts annotations in the repository, running custom code, like securely
connectivity – optimize traffic, submit jobs, (re)start global service.
ML Service ML Service ML Service ML Service
Actions based on Actions based on global information global information
Actions based on Actions based on local information local information
Sensors Sensors Local Local decisions decisions Global Global decisions decisions
Global ML Services
Iosif Legrand March 2011
17
ALICE: Automatic job submission ALICE: Automatic job submission Restarting Services Restarting Services
17
MySQL daemon is automatically restarted when it runs out of memory Trigger: threshold on VSZ memory usage ALICE Production jobs queue is kept full by the automatic submission Trigger: threshold on the number of aliprod waiting jobs Administrators are kept up-to-date on the services’ status Trigger: presence/absence of monitored information
Iosif Legrand March 2011
18
resubmit error jobs until a target completion percentage is reached, submit new jobs when necessary (watching the task queue size for each service account)
restart site services, whenever tests of VoBox services fail but the central services are OK, send email notifications / add chart annotations when a problem was not solved by a restart dynamically modify the DNS aliases of central services for an efficient load-balancing.
Iosif Legrand March 2011
19
Topology & Status & Peering Topology & Status & Peering Real Time Topology for L2 Circuits
Iosif Legrand March 2011
Iosif Legrand August 2009
20
Artur Barczyk, 04/16/2009
AMS-GVA(GEANT) CHI-NYC (Qwest) GVA – NYC (GC) GVA – NYC (Colt) Ref @ CERN) CHI-GVA (Qwest) CHI-GVA (GC) AMS-NYC(GC)
0-95% 95-97% 97-98% 98-99% 99-100% 100%
99.5% 97.9% 99.9% 96.6% 99.3% 98.9% 99.5%
P1 P1
work K
21
Iosif Legrand March 2011
22
ALARMS and Automatic notifications for USLHCnet ALARMS and Automatic notifications for USLHCnet
Iosif Legrand March 2011
Iosif Legrand August 2009
23
24
Iosif Legrand March 2011
25
∈
T u v
) , (
Resilient Overlay Network that optimize real-time communication
Iosif Legrand March 2011
26
Frequent measurements of RTT, jitter, traffic and lost packages The MST is recreated in ~ 1 S case on communication problems.
Iosif Legrand March 2011
27
Iosif Legrand March 2011
28
Glimmerglass Switch Example
Iosif Legrand March 2011
29
29
Internet
A
>FDT A/fileX B/path/ OS path available Configuring interfaces Starting Data Transfer
Monitor Control TL1 Optical Switch MonALISA Service MonALISA Distributed Service System
B
OS Agent
A c t i v e l i g h t p a t h
Regular IP path
Real time monitoring APPLICATION
LISA AGENT
LISA sets up
LISA APPLICATION “use eth1.2, …”
LISA LISA Agent Agent
DATA
CREATES AN END TO END PATH < 1s
Detects errors and automatically recreate the Detects errors and automatically recreate the path in less than the TCP timeout path in less than the TCP timeout
Iosif Legrand March 2011
30
CERN Geneva CALTECH Pasadena Starlight Manlan USLHCnet Internet2
“Fiber cut” simulations The traffic moves from one transatlantic line to the other one FDT transfer (CERN – CALTECH) continues uninterrupted TCP fully recovers in ~ 20s
1 2 3 4
FDT Transfer
4 Fiber cuts simulations
200+ MBytes/sec From a 1U Node 4 fiber cut emulations
Iosif Legrand March 2011
The MonALISA package includes:
in each state, LM sensors, APC UPSs), log files tailing
New modules can be easly added by implementing a simple Java interface. Filters can be used to generate new aggregate data. The Service can also react to the monitoring data it receives (actions alarms). MonALISA can run code as distributed agents for global optimization
Iosif Legrand March 2011
Iosif Legrand March 2011
32
32
Major Communities
ALICE CMS ATLAS PANDA EVO LGC RUSSIA OSG MXG RoEduNet USLHCNET ULTRALIGHT Enlightened
U
MonALISA Today Running 24 X 7 at ~360 Sites
Collecting ~ 2 million “persistent” parameters in real-time 80 million “volatile” parameters per day Update rate of ~25,000 parameter updates/sec Monitoring 40,000 computers > 100 WAN Links > 8,000 complete end-to-end
network path measurements
Tens of Thousands of Grid jobs
running concurrently
Controls jobs summation, different central services for the Grid,
EVO topology, FDT …
The MonALISA repository system serves
~8 million user requests per year.