Performability at Yahoo Search Amr Awadallah and a bunch of other - - PowerPoint PPT Presentation

performability at yahoo search
SMART_READER_LITE
LIVE PREVIEW

Performability at Yahoo Search Amr Awadallah and a bunch of other - - PowerPoint PPT Presentation

Performability at Yahoo Search Amr Awadallah and a bunch of other yahoos amr@yahoo-inc.com Now, A word from our sponsor What is Yahoo Search ? Web Results (Served by Google) Direct Display (Yahoo Content) Inside Yahoo (Yahoo


slide-1
SLIDE 1

Performability at Yahoo Search

Amr Awadallah and a bunch of other yahoos amr@yahoo-inc.com

slide-2
SLIDE 2

Now, A word from our sponsor ☺

  • What is Yahoo Search ?
  • Web Results (Served by Google)
  • Direct Display (Yahoo Content)
  • Inside Yahoo (Yahoo Self Promotion)
  • Sponsored Listings (Overture)
  • Media Ads (e.g. North Banner)
slide-3
SLIDE 3

What to Measure?

Yahoo! Search

users user actions: search, click value extraction user satisfaction Revenue Retention Rate, Increased Usage, Word of Mouth CPU, Harddisk space, Memory Map, Core Dumps, Net IO, QPS, Latency, PVs, Clicks, …

slide-4
SLIDE 4

The Holy Grail: Real-time CTR

  • CTR = Click Through Rate = Clicks/Pages
  • Advantages:
  • Does not change significantly from week to week

(filters out seasonal effects)

  • Very sensitive to any small problem taking place
  • Quickly deviates from norm in case of faults
  • Can be done at many levels of granulity

(e.g. total CTR, Web CTR, Sponsored CTR, Per-Server CTR, … )

slide-5
SLIDE 5

Search Machines Click Machines

Capture

Count Collectors

Nagios

Aggregate Process Report

Real-time Trending Pagers

Quick grep agent parses apache logs on the fly and sends messages to the count collectors every 5 minutes.

slide-6
SLIDE 6

Examples:

  • CSI: Crash Scene Investigation ☺
  • Forensic evidence tend to disappear over time

(CYOA principle)

slide-7
SLIDE 7

One Caveat:

  • Instrumentation for real-time metrics adds

another point of failure, specially click tracking

What next?

  • Accumulate human knowledge into rule-based

systems that can follow the same diagnosis steps that a human goes through to locate the reason for the fault.

  • Can we expand RT-CTR to other Internet Apps?
slide-8
SLIDE 8