Now let’s move forward; here comes the fun part.
Please make sure you place all your config files in a dedicated directory. We will refer to this
configuration folder as $BPMON_BASE
.
Note that you can export the BPMON_BASE
environment variable or use the -b/--base
flag to point
BPMON
to your own configuration directory (by default BPMON
expects to find its configuration in
the current directory).
Prepare the directory structure
In $BPMON_BASE
run:
mkdir bp.d && mkdir runners
Generate the Main Configuration
BPMON provides a couple of sub commands that allow you to manage your main configuration file.
When starting a new setup from scratch bpmon config init
comes handy. This will print an annotated
configuration file prefilled with defaults to the standard output:
$ bpmon config init
# The default section is - as the name suggests - read by default. Note that you can
# define as many sections as your want. You can for examlpe define a 'test' section
# that refers to a test connection string for icinga and/or influx. To reduce
# boilerplate consider anchors (http://yaml.org/spec/1.2/spec.html#id2765878)
#
# default: &anchor_to_default
# ...
# test:
# << *anchor_to_default
#
# Sections other than 'default' can be used via the -s/--section flag.
default:
# global_recipients will be added to the repicients list of all BP
global_recipients: []
# First BPMON needs to have access to your Icinga2 API. Learn more on by reading
# https://docs.icinga.com/icinga2/latest/doc/module/icinga2/chapter/icinga2-api.
checker:
# kind defines the checker implementation to be used by BPMON. Currently
# only icinga is implemented.
kind: icinga
# The connection string describes how to connect to your Icinga API. The
# string needs to follow the pattern:
# [protocol]://[user]:[passwd]@[hostname]:[port]
connection: ""
# BPMON verifies if a https connection is trusted. If you wont to trust a
# connection with an invalid certificate you have to set this to true.
tls_skip_verify: false
# timeout defines how long BPMON waits for each request to the checker to
# recieve a response. The string is parsed as a goland duration, refer to
# its documentation for more details:
# https://golang.org/pkg/time/#ParseDuration
timeout: 10s
# The connection to the InfluxDB is required in order to persist the the state, eg.
# the write subcommand.
store:
# kind defines the store implementation to be used by BPMON. Currently
# only influx is implemented.
kind: influx
# The connection string describes how to connect to your Influx Database.
# The string needs to follow the pattern:
# [protocol]://[user]:[passwd]@[hostname]:[port]
connection: ""
# timeout is read as a go (golang) duration, please refer to
# https://golang.org/pkg/time/#Duration for a detailed explanation.
timeout: 10s
# save_ok tells BPMON which data points should be persisted if the state is 'ok'.
# By default 'OK' states aro only saved to InfluxDB if its an BP measurement.
# That means that 'OK' states for KPIs and SVCs will not be saved for the sake of
# of storage required. 'OK' states of BPs are saved as 'heart beat' of BPMON.
save_ok:
- BP
# This will tell BPMON to compare the current status against the last
# status saved in InfluxDB and adds some values to the measurement
# accordingly. This then allows to generate reports such as 'Tell me
# only when a status is changed from good to bad'. This only runs against
# types listed in 'save_ok' since only these are persisted 'correctly'.
get_last_status: true
# if debug is set to true all queries generated and executed by bpmon will
# be logged to stdout.
debug: false
# BPMON verifies if a https connection is trusted. If you wont to trust a
# connection with an invalid certificate you have to set this to true
tls_skip_verify: false
# Define your office hours et al. according to your service level
# agreements (SLA). You will reference themlater in your BP definitions.
availabilities: {}
# Extend the default rules. The default rules are provided by the checker implementation
# and can be reviewed via bpmon config print.
rules: {}
# dashboard configures the dashboard subcommand.
dashboard:
# listener tells the dashboard where to bind. This string
# should match the pattern [ip]:[port].
listener: 127.0.0.1:8910
# static is the path to the directory that sould be served
# at the root of the server. This should contain the UI of the
# Dashboard
static: ""
# grant_write is a list of recipients which are allowed to access the annotate
# endpoint via POST request.
grant_write: []
# env allows you to setup your configuration file structure according to your
# requirements.
env:
# runners is the directory where your custom runners are stored. The path must be
# relative to your base directory (-b/--base). The path must exist.
runner: runners/
# bp is the directory where your buisness process definitions are stored. The path must be
# relative to your base directory (-b/--base). The path must exist.
bp: bp.d/
Pipe this output in a file called config.yaml
.
bpmon config init > $BPMON_BASE/config.yaml
Connect to Icinga and Influx database
To get some insights on what can be configured please read the comment in
this generated file. For now we only need to setup the checker
and store
parts of the configuration to get things started.
In default.checker.connection
add the connection string to access your icinga API…
In default.store
we have two options:
- If you have an Influx database ready paste the connection string at
default.store.connection
. - If you don’t want to persist historical data right now set
default.store.get_last_status
to false. Addhttp://in.existent
atdefault.store.connection
.
Define an availability
Often we have some time slots in which the availability of a system is guaranteed. Add those time slots to your main configuration in default.availabilities
:
---
default:
...
availabilities:
high:
monday: [ "allday" ]
tuesday: [ "allday" ]
wednesday: [ "allday" ]
thursday: [ "allday" ]
friday: [ "allday" ]
saturday: [ "allday" ]
sunday: [ "allday" ]
medium:
monday: [ "06:00:00-20:00:00" ]
tuesday: [ "06:00:00-20:00:00" ]
wednesday: [ "06:00:00-20:00:00" ]
thursday: [ "06:00:00-20:00:00" ]
friday: [ "06:00:00-20:00:00" ]
saturday: [ "06:00:00-20:00:00" ]
sunday: [ "06:00:00-20:00:00" ]
low:
monday: [ "08:00:00-12:00:00", "13:30:00-17:00:00" ]
tuesday: [ "08:00:00-12:00:00", "13:30:00-17:00:00" ]
wednesday: [ "08:00:00-12:00:00", "13:30:00-17:00:00" ]
thursday: [ "08:00:00-12:00:00", "13:30:00-17:00:00" ]
friday: [ "08:00:00-12:00:00", "13:30:00-17:00:00" ]
...
In this case we have three availabilities defined: ‘high’, ‘medium’, ‘low’. Name yours however your want, just make sure the names make sense to you.
That’s it for the main configuration! Let’s move on…