groupmonitor.lua is sometimes faulty - Printable Version +- Logic Machine Forum (https://forum.logicmachine.net) +-- Forum: LogicMachine eco-system (https://forum.logicmachine.net/forumdisplay.php?fid=1) +--- Forum: Scripting (https://forum.logicmachine.net/forumdisplay.php?fid=8) +--- Thread: groupmonitor.lua is sometimes faulty (/showthread.php?tid=5046) |
groupmonitor.lua is sometimes faulty - rbourgeon - 20.10.2023 Hello, In my company we handle a pretty big pool of LogicMachines and we noticed that sometimes, event-based script that are active, don't trigger when they should. Here are more details about the issue:
RE: groupmonitor.lua is sometimes faulty - admin - 23.10.2023 Have you tested this with latest firmware? The most likely cause is that LM is overloaded. Run this script and check the last column (drops) values. If at least one row has a non-zero value it means that a certain system part was unable to process incoming data quick enough. Receive buffer got full and certain messages got dropped. Code: res = io.readfile('/proc/net/udp') To prevent this scripts must use grp.checkwrite/grp.checkupdate where applicable. Modbus send delta values and KNX filtering tables should be adjusted. RE: groupmonitor.lua is sometimes faulty - gjniewenhuijse - 23.10.2023 i See a lot of drops. * string: sl local_address rem_address st tx_queue rx_queue tr tm->when retrnsmt uid timeout inode ref pointer drops 88: 0100007F:EAD0 FFFFFF7F:1A40 01 00000000:00000000 00:00000000 00000000 0 0 492 2 cf9c6f00 0 111: 0100007F:03E7 00000000:0000 07 00000000:00000000 00:00000000 00000000 0 0 500 2 cf9c7400 0 113: 00000000:14E9 00000000:0000 07 00000000:00000000 00:00000000 00000000 65534 0 1148 2 cf32ef00 2401 148: 0100007F:0B0C 00000000:0000 07 00000000:00000000 00:00000000 00000000 0 0 836 2 cf9c7900 0 159: 0100007F:2717 00000000:0000 07 00000000:00000B00 00:00000000 00000000 0 0 914 2 cf9c6500 0 180: FFFFFF7F:1A2C 00000000:0000 07 00000000:00000000 00:00000000 00000000 0 0 993 2 cf32e280 0 200: FFFFFF7F:1A40 00000000:0000 07 00000000:00000000 00:00000000 00000000 0 0 65729512 2 cc8b6f00 0 200: FFFFFF7F:1A40 00000000:0000 07 00000000:00000000 00:00000000 00000000 0 0 1020 2 cf32ea00 0 200: FFFFFF7F:1A40 00000000:0000 07 00000000:00000000 00:00000000 00000000 0 0 1015 2 cf32ec80 663 200: FFFFFF7F:1A40 00000000:0000 07 00000000:00000000 00:00000000 00000000 0 0 1006 2 cf32e780 0 200: FFFFFF7F:1A40 00000000:0000 07 00000000:00000000 00:00000000 00000000 0 0 998 2 cf32e500 0 200: FFFFFF7F:1A40 00000000:0000 07 00000000:00000000 00:00000000 00000000 0 0 960 2 cf32e000 0 200: FFFFFF7F:1A40 00000000:0000 07 00000000:00050180 00:00000000 00000000 0 0 938 2 cf9c6280 6853867 200: FFFFFF7F:1A40 00000000:0000 07 00000000:00000000 00:00000000 00000000 0 0 837 2 cf9c7b80 0 200: FFFFFF7F:1A40 00000000:0000 07 00000000:00000000 00:00000000 00000000 0 0 507 2 cf9c7680 0 200: FFFFFF7F:1A40 00000000:0000 07 00000000:00000000 00:00000000 00000000 0 0 491 2 cf9c6c80 0 200: FFFFFF7F:1A40 00000000:0000 07 00000000:00000000 00:00000000 00000000 0 0 485 2 cf9c6780 66 223: 00000000:0E57 00000000:0000 07 00000000:00000000 00:00000000 00000000 0 0 494 2 cf9c7180 0 241: 0100007F:8B69 FFFFFF7F:04B0 01 00000000:00000000 00:00000000 00000000 0 0 486 2 cf9c6a00 0 RE: groupmonitor.lua is sometimes faulty - rbourgeon - 23.10.2023 (23.10.2023, 08:08)admin Wrote: Have you tested this with latest firmware? I haven't encountered this issue yet on LMs that are equipped with 2023 firmware. Thanks for the idea of looking if there was any drop in UDP packets; next time I encounter this bug, I will check it immediately. RE: groupmonitor.lua is sometimes faulty - admin - 23.10.2023 @gjniewenhuijse, your system looks overloaded too. Another indication is non-zero rx_queue value. It means that messages cannot be processes quickly enough and are queued. You can run this script, it will log all processes that have at least one message dropped. Code: inodes = {} Consider installing Group monitor app and check for duplicates and unnecessary object traffic that causes such overloads. RE: groupmonitor.lua is sometimes faulty - gjniewenhuijse - 24.10.2023 important output: string: lua /lib/genohm-scada/core/scripting-resident.lua 159, drops: 6908586 script 159: i checked every 10 seconds the knx bus with this script Code: -- resident knx connection handeler RE: groupmonitor.lua is sometimes faulty - admin - 24.10.2023 Such monitoring script must use 0 sleep time. RE: groupmonitor.lua is sometimes faulty - rbourgeon - 04.06.2024 lua /lib/genohm-scada/core/scripting-resident.lua 210, drops: 246 lua /lib/genohm-scada/core/groupmonitor.lua, drops: 1694 The groupmonitor.lua drops really are a critical problem. Sometimes I do a grp.write() on a object, a script that listens to the localbus sees the telegram on the bus, but the object is never updated in the database!! So I can't even ensure that when I do a grp.write() on an object and then a grp.getvalue() on the same object 30 seconds later, I will effectively read the value that I wrote 30 seconds ago. This is really critical. What could be done to ensure it? RE: groupmonitor.lua is sometimes faulty - admin - 04.06.2024 The only thing that you can do is to reduce the overall system load (telegram count) because you are hitting the system limits. |