GridICE: Requirements, Architecture and Experience of a Monitoring - - PowerPoint PPT Presentation

gridice requirements architecture and experience of a
SMART_READER_LITE
LIVE PREVIEW

GridICE: Requirements, Architecture and Experience of a Monitoring - - PowerPoint PPT Presentation

Enabling Grids for E-sciencE GridICE: Requirements, Architecture and Experience of a Monitoring Tool for Grid Systems Sergio Andreozzi INFN-CNAF (Italy) sergio.andreozzi@cnaf.infn.it CHEP2006, Mumbai (India), 13-17 February 2006


slide-1
SLIDE 1

INFSO-RI-508833

Enabling Grids for E-sciencE

www.eu-egee.org

GridICE: Requirements, Architecture and Experience of a Monitoring Tool for Grid Systems

Sergio Andreozzi INFN-CNAF (Italy)

sergio.andreozzi@cnaf.infn.it

CHEP2006, Mumbai (India), 13-17 February 2006

slide-2
SLIDE 2

CHEP2006, Mumbai (India), 13-17 February 2005 2

Enabling Grids for E-sciencE

INFSO-RI-508833

Outline

  • Monitoring a Grid

– general introduction – users’ viewpoints

  • GridICE

– architecture – implementation – experience

slide-3
SLIDE 3

CHEP2006, Mumbai (India), 13-17 February 2005 3

Enabling Grids for E-sciencE

INFSO-RI-508833

  • We need monitoring functionalities:

– to observe the composition, state and features of available resources – to analyze their behavior and performance – to detect and prevent fault situations

  • In the context of Grid computing, two important categories
  • f monitoring systems are:

– Application monitoring – Infrastructure monitoring

What is Monitoring

We focus on

slide-4
SLIDE 4

CHEP2006, Mumbai (India), 13-17 February 2005 4

Enabling Grids for E-sciencE

INFSO-RI-508833

Main Phases of the Monitoring Process

Generation Distributing Presenting Processing

sensors enquiring entities and encoding the measurements according to a schema transmission of the events from the source to any interested parties abstract the huge number of received events in order to enable the consumer to draw conclusions about the operation of the monitored system e.g., filtering according to some predefined criteria, or summarising a group of events

slide-5
SLIDE 5

CHEP2006, Mumbai (India), 13-17 February 2005 5

Enabling Grids for E-sciencE

INFSO-RI-508833

Different Viewpoints /1 We focus on the following categories of users:

– VO manager

actual set of resources accessible to VO members

– Grid operator

all resources under responsibility of a Grid Operator Center

– Site administrator

site resources offered to a Grid

slide-6
SLIDE 6

CHEP2006, Mumbai (India), 13-17 February 2005 6

Enabling Grids for E-sciencE

INFSO-RI-508833

The GridICE Monitoring Tool

slide-7
SLIDE 7

CHEP2006, Mumbai (India), 13-17 February 2005 7

Enabling Grids for E-sciencE

INFSO-RI-508833

Overview

  • GridICE:

– a distributed monitoring tool for Grid systems

started in late 2002 (EU-DataTAG project) is evolving in the context of EU-EGEE

– fully integrated with the LCG-2.x Middleware

Metering and publishing of data can be configured via LCG standard installation mechanisms Self-configurable collection and presentation

  • just give the URL of the root Grid Information Service (GIS)

– using W3C standards to offer easy access to monitoring data

slide-8
SLIDE 8

CHEP2006, Mumbai (India), 13-17 February 2005 8

Enabling Grids for E-sciencE

INFSO-RI-508833

Taxonomy

  • Taxonomy of monitoring systems [2]

– S = sensor C = consumer P = publisher – R = republisher H = hierarchy of republishers

  • GridICE is a 2nd level of monitoring systems with a

centralized republisher

slide-9
SLIDE 9

CHEP2006, Mumbai (India), 13-17 February 2005 9

Enabling Grids for E-sciencE

INFSO-RI-508833

Sensor (S) and Publisher (P)

  • Sensor and publisher are in the scope of each

administrative domain:

– Sensor

Extension of the GLUE Schema-based information already available in the GIS adopted by LCG (i.e., Globus MDS 2.x):

  • fabric-level information
  • job monitoring
  • summary info for computing resources
  • network connectivity from a Grid viewpoint [4]

– Publisher

Adopt the available Grid Information Service in LCG

slide-10
SLIDE 10

CHEP2006, Mumbai (India), 13-17 February 2005 10

Enabling Grids for E-sciencE

INFSO-RI-508833

Metering and Site Level Distribution

  • f extended information

Sensor Local Publisher Monitored Entity Site Persistent Storage Site Consumer Site Publisher Site Collector Site – administrative domain default choice – CERN Lemon

slide-11
SLIDE 11

CHEP2006, Mumbai (India), 13-17 February 2005 11

Enabling Grids for E-sciencE

INFSO-RI-508833

Central Republisher (R)

Persistent storage Discovery Consumers Scheduler HTML Data aggregation and abstraction XML Notification Charts

slide-12
SLIDE 12

CHEP2006, Mumbai (India), 13-17 February 2005 12

Enabling Grids for E-sciencE

INFSO-RI-508833

Screenshots

slide-13
SLIDE 13

CHEP2006, Mumbai (India), 13-17 February 2005 13

Enabling Grids for E-sciencE

INFSO-RI-508833

Current Project Status

– Integrated and deployed with LCG 2.x – Installed servers are monitoring Grid resources in the scope of:

  • Next steps:
  • Security and Privacy concerns
  • Dealing with heterogeneous publisher interfaces
  • Adopt new data-warehousing features available

in the open source software

  • New service-specific sensors

EGEE EGEE-SWE RDIG EGEE-SEE Grid.it GILDA CMS ATLAS EUMedGrid EUChinaGRID BalticGrid EELA BeGrid

slide-14
SLIDE 14

CHEP2006, Mumbai (India), 13-17 February 2005 14

Enabling Grids for E-sciencE

INFSO-RI-508833

Conclusion

  • Monitoring of Grid systems is a complex activity in metering,

distributing, processing and presenting

  • GridICE has been designed as an infrastructure monitoring tool

for Grid systems

  • Requirements have been considered from three main different

categories of users

  • The experience in production environments is positive

– a stable service working in 24*7

  • Future work targeted at dealing with multiple producer interfaces,

improving security aspects and extending the set of measurements

slide-15
SLIDE 15

CHEP2006, Mumbai (India), 13-17 February 2005 15

Enabling Grids for E-sciencE

INFSO-RI-508833

References

[1] S. Andreozzi, N. De Bortoli, S. Fantinel, A. Ghiselli, G. L. Rubini, G. Tortone, M. C. Vistoli GridICE: a monitoring service for Grid systems, Future Generation Computer System 21 (2005) 559–571 [2] S. Zanikolas, R. Sakellariou, A taxonomy of grid monitoring systems, Future Generation Computer Systems 21 (2005) 163–188 [3] S. Andreozzi, N. De Bortoli, S. Fantinel, G.L. Rubini, G. Tortone. Design and Implementation of a Notification Model for Grid Monitoring Events. CHEP04, Interlaken (CH), Sep 2004 [4] S. Andreozzi, A. Ciuffoletti, A. Ghiselli, C. Vistoli. Monitoring the Connectivity of a Grid. In Proceedings of the 2nd International Workshop on Middleware for Grid Computing (MGC 2004) in conjunction with the 5th ACM/IFIP/USENIX International Middleware Conference, Toronto, Canada, October 2004.

Dissemination: http://grid.infn.it/gridice