Welcome to Pandora FMS Community!

Find answers, ask questions, and connect with our community around the world.

  • Alert configuration

    Posted by seinloft on November 19, 2008 at 04:19

    Dear all,

    I’ve just installed our new PandoraFMS server. Since we’ve used V1.3.1 now for a long time we configured a new server with V2 especially for WMI querys.

    But my first problem is in configuring alerts. Since we have 12 locations connected via MPLS network or VPN over the internet with sometimes heavy load during network backuptimes, we receive many false positive alerts.

    I played a bit arround with configuration and found some points to start with, but indeed not everything is as we expect it to be.

    Here you find the current configuration:

    Pandora console:
    Pandora FMS v2.0 – Build PC081030

    pandora_server.conf
    * changed icmp_checks to 10

    Config.pm
    * changed networktimeout to 10

    Example:
    agent modul host alive interval 60 sec
    alert time threshold 1 day
    alert min. number of alerts 3

    Situation:
    * flapping line or instable connection or heavy load on line with QoS (ICMP has a very low priority in WAN)
    * each time icmp ping is returning 0 -> talerta_agente_modulo.internal_counter is increased
    * talerta_agente_modulo.internal_counter > talerta_agente_modulo.min_alerts alert is fired

    Comments:
    now I receive alerts once a day if icmp ping reports 4 times 0 e.g. 08:00am, 12:00pm, 02:00pm and 04:00pm, but what i wanted is that I receive an alert while this happens successively and not spread over the whole day.

    I read the documentation for “Time threshold” and decreased it to 10 min. but now in case of a nightly error my mailbox is overloaded every 10 min with an alert, because I’m not able to validate the error.

    I’ve checked the database and my first impression of how to solve my problem is that in case of icmp ping returns 1 ( good ) the internal_counter should be set to 0.

    Is this a possible solution for my problem or am I understanding the way of alerts in PandoraFMS completely wrong?

    Proposals and suggestions are welcome.

    Kind regards,

    Sascha

    manu replied 16 years, 4 months ago 3 Members · 9 Replies
  • 9 Replies
  • manu

    Member
    November 19, 2008 at 05:21
    0 Karma points
    Community rank: tentacle-noob-1 Tentacle noob
    Like it
    Up
    0
    Down
    Drop it
    ::

    Hi Sascha,

    The solution is to use the Threshold as you did already and you might want to play with the Min Number of Alerts as well, so you can set something like: Min Number of Alerts = 2, so this means that the alert has to be fired twice to be able to receive the big alerts (let’s call it that way), which is the email, for instance.

    We have fixed some bugs in 2.0 already. They’re in the branches in the SVN, so you have 2 ways to keep your PandoraFMS up to date, one is to download the branches and do the update manually, and the other one is using the Update Manager from the Webconsole and just with a click.

    Cheers!
    Manuel.

  • seinloft

    Member
    November 21, 2008 at 04:36
    0 Karma points
    Community rank: tentacle-noob-1 Tentacle noob
    Like it
    Up
    0
    Down
    Drop it
    ::

    Hi Manuel,

    thanks for the quick replay, but unfortunately I did not come any further with my problem.

    So I’ve added a screenshot to clarify my problem.

    Agent ICMP ( Host alive ) reported 4 times timeout so alert has been fired.
    Alert is defined as threshold 1 day – min # alerts 3.

    During 4 hours it seems that we have 3 times conquestion on the WAN and the ICMP packages gets
    dropped or take a long time to replay.

    But the server is still up all the time.

    Is it possible for me to configure the alerts in the following way:

    only fire an alert if 3 time successively the “Host alive” returns 0?

    Means that the database record in talerta_agente_modulo.internal_counter
    is set to 0 for the specific id_agente_modulo when the “Host alive” returns 1.

    By the way, I’m currently using the thrunk version of Pandora FMS v2.1-dev – Build PC081016.
    But I had the same issue with the latest branch version.

    Thanks for helping,

    Sascha

  • manu

    Member
    November 21, 2008 at 04:54
    0 Karma points
    Community rank: tentacle-noob-1 Tentacle noob
    Like it
    Up
    0
    Down
    Drop it
    ::

    [cite]Posted By: seinloft[/cite]
    Is it possible for me to configure the alerts in the following way:

    only fire an alert if 3 time successively the “Host alive” returns 0?

    To do this:

    Min Value: 0 Max Value:0

    Min Number Alerts: 3 Max Number of alerts: 1
    Threshold 1 day

    Let me know how that goes. The key here could be the recovery, do you have recovery active? If so, the recover will set the counter to 0 everytime the condition is not satisfy no matter if the alert was fired or not.

  • seinloft

    Member
    November 21, 2008 at 05:24
    0 Karma points
    Community rank: tentacle-noob-1 Tentacle noob
    Like it
    Up
    0
    Down
    Drop it
    ::

    That’s exact what I’ve configured 3 – 1 – 1 day and recovery enabled!

    I’m testing for ping “host active” every 120 sec. But internal_counter will only be set to 0
    after alert was fired and not if ping “host active” is successfull.

    So 4 times a day a ping was missed the alert is fired even if in the meantime the ping “host active” reports OK.

    Current behaviour:

    30 times OK
    1 time NOK -> internal_counter++ => 1
    30 times OK
    1 time NOK -> internal_counter++ => 2
    30 times OK
    1 time NOK -> internal_counter++ => 3
    30 times OK
    1 time NOK -> internal_counter++ => 4, internal_counter > min_alerts -> alert fired
    1 time OK -> recovery alert fired -> internal_counter = 0

    What I would like to have is:

    30 times OK
    1 time NOK -> internal_counter++ => 1
    1 time NOK -> internal_counter++ => 2
    1 time OK -> internal_counter = 0

    Perhaps I can do this in the code myself if this is not the desired behaviour.
    But right now I was not able to dected the part of code where this could be done.

    Thanks,

    Sascha

  • manu

    Member
    November 21, 2008 at 05:32
    0 Karma points
    Community rank: tentacle-noob-1 Tentacle noob
    Like it
    Up
    0
    Down
    Drop it
    ::

    Can you try to disable the alert recovery and see if it works as you expect?

  • seinloft

    Member
    December 1, 2008 at 02:39
    0 Karma points
    Community rank: tentacle-noob-1 Tentacle noob
    Like it
    Up
    0
    Down
    Drop it
    ::

    Hi,

    any news for my problem?

    I really would like to make the changed myself, but unfortunately I’m not able to find the
    exact part of the Perl script to make the change.

    Thanks for replaying.

    Sascha

  • rnovoa

    Member
    December 1, 2008 at 04:38
    0 Karma points
    Community rank: tentacle-noob-1 Tentacle noob
    Like it
    Up
    0
    Down
    Drop it
    ::

    Hi Sascha,

    We will be making those changes to the alert engine soon, but if you want to give it a try yourself, the file you are looking for is ‘pandora_server/lib/PandoraFMS/DB.pm’ 🙂

  • seinloft

    Member
    December 1, 2008 at 05:41
    0 Karma points
    Community rank: tentacle-noob-1 Tentacle noob
    Like it
    Up
    0
    Down
    Drop it
    ::

    Thank you very much.

    Since I came from C# and Java, Perl is not my prefered language for development. So I’ll wait for the changes in the trunk version.
    Is it possible to let me know when it is done, or do I just have to look at the ChangeLog file?

    Thanks,

    Sascha

  • manu

    Member
    December 1, 2008 at 06:22
    0 Karma points
    Community rank: tentacle-noob-1 Tentacle noob
    Like it
    Up
    0
    Down
    Drop it
    ::

    You can join the developers list and follow the commits
    https://lists.sourceforge.net/lists/listinfo/pandora-develop

Start of Discussion
1 of 9 replies November 2008
Now