Skip to content

Files

53 lines (42 loc) · 3.59 KB
·

metrics_events.md

File metadata and controls

53 lines (42 loc) · 3.59 KB
·

Metrics and events

Metrics

Here's the list of metrics sent by the controller and the injector.

Controller

  • chaos.controller.restart increments when the controller is restarted
  • chaos.controller.reconcile increments when the reconcile loop is called
  • chaos.controller.reconcile.duration is the time passed in the reconcile loop
  • chaos.controller.inject.duration is the time it took to fully inject the disruption since its creation
  • chaos.controller.cleanup.duration is the time it took to fully cleanup the disruption since its deletion
  • chaos.controller.pods.created increments when a chaos pod is created
  • chaos.controller.pods.gauge is the total count of existing chaos pods
  • chaos.controller.informed increments when the pod informer receives an event to process before reconciliation
  • chaos.controller.orphan.found increments when a chaos pod without a corresponding disruption resource is found
  • chaos.controller.selector.cache.triggered signals a selector cache trigger
  • chaos.controller.selector.cache.gauge reports how many caches are still in the cache array to prevent leaks
  • chaos.controller.disruption.completed_duration is the complete life time of the disruption, from creation to deletion
  • chaos.controller.disruption.ongoing_duration is the duration of the disruption so far, from creation to now
  • chaos.controller.disruptions.stuck_on_removal increments when a disruption is stuck on removal
  • chaos.controller.disruptions.stuck_on_removal_total is the total count of existing disruption being flagged as stuck on removal
  • chaos.controller.disruptions.gauge is the total count of existing disruption
  • chaos.controller.disruptions.count increments when a disruption is finished
  • chaos.controller.watcher.calls_total increments each time any watcher handles an OnChange event
  • chaos.cron.controller.schedule.too_late increments each time a DisruptionCron has missed the time to schedule its disruption
  • chaos.cron.controller.schedule.target_missing increments each time a DisruptionCron cannot find its target
  • chaos.cron.controller.schedule.missing_target_found increments each time a DisruptionCron which couldn't find its target, is now able to find it
  • chaos.cron.controller.schedule.missing_target_deleted increments each time a DisruptionCron self deletes because its target was missing for too long
  • chaos.cron.controller.schedule.next_scheduled is the time between now and when the next disruption for this DisruptionCron should run
  • chaos.cron.controller.schedule.disruption_scheduled increments each time a DisruptionCron schedules a child disruption
  • chaos.cron.controller.schedule.paused increments each time a DisruptionCron reconciles while in a paused state

Admission webhooks

  • chaos.controller.validation.failed increments when a disruption fails to be validated from the admission webhook
  • chaos.controller.validation.created increments when a disruption is created
  • chaos.controller.validation.updated increments when a disruption is updated
  • chaos.controller.validation.deleted increments when a disruption is deleted

Injector

  • chaos.injector.injected increments when a disruption is injected
  • chaos.injector.cleaned increments when a disruption is cleaned
  • chaos.injector.reinjected increments when a disruption is reinjected
  • chaos.injector.cleaned_for_reinjection increments when a disruption is cleaned after a reinjection

Events

The chaos-controller can send multiple events on targeted resources and on the disruption itself.

The list can be found at api/v1beta1/events.go