Через примерно полгода эксплуатации TAU72.IP с asterisk (SIP, H.323 отключен) начали с периодичностью от 2 раз в сутки до 1 раза в 2-3 суток отваливаться абонентские порты. Со стороны asterisk это выглядит очень похоже на описанное в теме viewtopic.php?f=7&t=5167, в логах asterisk примерно следующее:
Код: Выделить всё
[2015-12-18 16:22:04] NOTICE[1156] chan_sip.c: Peer '296' is now UNREACHABLE! Last qualify: 30
[2015-12-18 16:22:05] NOTICE[1156] chan_sip.c: Peer '127' is now UNREACHABLE! Last qualify: 20
[2015-12-18 16:22:06] NOTICE[1156] chan_sip.c: Peer '176' is now UNREACHABLE! Last qualify: 27
[2015-12-18 16:22:07] NOTICE[1156] chan_sip.c: Peer '175' is now UNREACHABLE! Last qualify: 26
[2015-12-18 16:22:08] NOTICE[1156] chan_sip.c: Peer '177' is now UNREACHABLE! Last qualify: 27
[2015-12-18 16:22:09] NOTICE[1156] chan_sip.c: Peer '188' is now Lagged. (6102ms / 5000ms)
[2015-12-18 16:22:14] NOTICE[1156] chan_sip.c: Peer '296' is now Reachable. (27ms / 5000ms)
[2015-12-18 16:22:15] NOTICE[1156] chan_sip.c: Peer '127' is now Reachable. (18ms / 5000ms)
[2015-12-18 16:22:16] NOTICE[1156] chan_sip.c: Peer '176' is now Reachable. (24ms / 5000ms)
[2015-12-18 16:22:18] NOTICE[1156] chan_sip.c: Peer '175' is now Reachable. (29ms / 5000ms)
[2015-12-18 16:22:18] NOTICE[1156] chan_sip.c: Peer '177' is now Reachable. (24ms / 5000ms)
[2015-12-18 16:22:20] NOTICE[1156] chan_sip.c: Peer '188' is now Reachable. (26ms / 5000ms)
Со стороны абонентов выглядит как разрыв всех активных соединений и полное отсутствие реакции шлюза (включая dialtone) около 15 секунд.
В dmesg шлюза при этом появляется следующее:
Код: Выделить всё
tau72: unhandled page fault (11) at 0x00000018, code 0x017
pgd = c1c3c000
[00000018] *pgd=03920031, *pte=00000000, *ppte=00000000
Pid: 585, comm: tau72
CPU: 0 Not tainted (2.6.22.19-4.03.0-c300evm #290)
PC is at 0xf2310
LR is at 0xf2248
pc : [<000f2310>] lr : [<000f2248>] psr: 20000010
sp : bd7ffa28 ip : 00000000 fp : 00000000
r10: 00300178 r9 : 002c6d74 r8 : 00000000
r7 : 00848700 r6 : 00000000 r5 : 00000384 r4 : 0071d680
r3 : 00000001 r2 : 002c6d74 r1 : 00000004 r0 : 00000000
Flags: nzCv IRQs on FIQs on Mode USER_32 Segment user
Control: 00c5387f Table: 02c3c008 DAC: 00000015
[<c0026ca8>] (show_regs+0x0/0x4c) from [<c002b428>] (__do_user_fault+0x9c/0xa8)
r4:c1df7d60
[<c002b38c>] (__do_user_fault+0x0/0xa8) from [<c002b654>] (do_page_fault+0x1a4/0x218)
r7:c25e1278 r6:c1d4f3f4 r5:00000017 r4:ffffffec
[<c002b4b0>] (do_page_fault+0x0/0x218) from [<c0025224>] (do_DataAbort+0x3c/0xa4)
[<c00251e8>] (do_DataAbort+0x0/0xa4) from [<c0025e00>] (ret_from_exception+0x0/0x10)
Exception stack(0xc22e3fb0 to 0xc22e3ff8)
3fa0: 00000000 00000004 002c6d74 00000001
3fc0: 0071d680 00000384 00000000 00848700 00000000 002c6d74 00300178 00000000
3fe0: 00000000 bd7ffa28 000f2248 000f2310 20000010 ffffffff
r8:00000000 r7:00848700 r6:00000000 r5:00000384 r4:ffffffff
В syslog несколько больше информации:
Код: Выделить всё
18-12-2015 16:21:55 kernel: tau72: unhandled page fault (11) at 0x00000018, code 0x017
18-12-2015 16:21:55 kernel: pgd = c1c3c000
18-12-2015 16:21:55 kernel: [00000018] *pgd=03920031, *pte=00000000, *ppte=00000000
18-12-2015 16:21:55 kernel: Pid: 585, comm: tau72
18-12-2015 16:21:55 kernel: CPU: 0 Not tainted (2.6.22.19-4.03.0-c300evm #290)
18-12-2015 16:21:55 kernel: PC is at 0xf2310
18-12-2015 16:21:55 kernel: LR is at 0xf2248
18-12-2015 16:21:55 kernel: pc : [<000f2310>] lr : [<000f2248>] psr: 20000010
18-12-2015 16:21:55 kernel: sp : bd7ffa28 ip : 00000000 fp : 00000000
18-12-2015 16:21:55 kernel: r10: 00300178 r9 : 002c6d74 r8 : 00000000
18-12-2015 16:21:55 kernel: r7 : 00848700 r6 : 00000000 r5 : 00000384 r4 : 0071d680
18-12-2015 16:21:55 kernel: r3 : 00000001 r2 : 002c6d74 r1 : 00000004 r0 : 00000000
18-12-2015 16:21:55 kernel: Flags: nzCv IRQs on FIQs on Mode USER_32 Segment user
18-12-2015 16:21:55 kernel: Control: 00c5387f Table: 02c3c008 DAC: 00000015
18-12-2015 16:21:55 kernel: [<c0026ca8>] (show_regs+0x0/0x4c) from [<c002b428>] (__do_user_fault+0x9c/0xa8)
18-12-2015 16:21:55 kernel: r4:c1df7d60
18-12-2015 16:21:55 kernel: [<c002b38c>] (__do_user_fault+0x0/0xa8) from [<c002b654>] (do_page_fault+0x1a4/0x218)
18-12-2015 16:21:55 kernel: r7:c25e1278 r6:c1d4f3f4 r5:00000017 r4:ffffffec
18-12-2015 16:21:55 kernel: [<c002b4b0>] (do_page_fault+0x0/0x218) from [<c0025224>] (do_DataAbort+0x3c/0xa4)
18-12-2015 16:21:55 kernel: [<c00251e8>] (do_DataAbort+0x0/0xa4) from [<c0025e00>] (ret_from_exception+0x0/0x10)
18-12-2015 16:21:55 kernel: Exception stack(0xc22e3fb0 to 0xc22e3ff8)
18-12-2015 16:21:55 kernel: 3fa0: 00000000 00000004 002c6d74 00000001
18-12-2015 16:21:55 kernel: 3fc0: 0071d680 00000384 00000000 00848700 00000000 002c6d74 00300178 00000000
18-12-2015 16:21:55 kernel: 3fe0: 00000000 bd7ffa28 000f2248 000f2310 20000010 ffffffff
18-12-2015 16:21:55 kernel: r8:00000000 r7:00848700 r6:00000000 r5:00000384 r4:ffffffff
18-12-2015 16:21:55 kernel: csme: skb_recv_datagram() failed
18-12-2015 16:21:56 tau72-1: 2.13.1-d0cdbae|(55.754)[app:WARN]H323 aliase not specified, gatekeeper will not be used!
18-12-2015 16:21:56 tau72-1: 2.13.1-d0cdbae|(55.758)[app:WARN]unknown read parameter name: "use_user_to_dial_conference" with value: "0"
18-12-2015 16:21:56 tau72-1: 2.13.1-d0cdbae|(55.767)[app:WARN]unknown read parameter name: "use_user_to_dial_conference" with value: "0"
18-12-2015 16:21:56 tau72-1: 2.13.1-d0cdbae|(55.774)[app:WARN]unknown read parameter name: "use_user_to_dial_conference" with value: "0"
18-12-2015 16:21:56 tau72-1: 2.13.1-d0cdbae|(55.782)[app:WARN]unknown read parameter name: "use_user_to_dial_conference" with value: "0"
18-12-2015 16:21:56 tau72-1: 2.13.1-d0cdbae|(55.790)[app:WARN]unknown read parameter name: "use_user_to_dial_conference" with value: "0"
18-12-2015 16:21:56 tau72-1: 2.13.1-d0cdbae|(55.798)[app:WARN]unknown read parameter name: "use_user_to_dial_conference" with value: "0"
18-12-2015 16:21:56 tau72-1: 2.13.1-d0cdbae|(55.806)[app:WARN]unknown read parameter name: "use_user_to_dial_conference" with value: "0"
18-12-2015 16:21:56 tau72-1: 2.13.1-d0cdbae|(55.814)[app:WARN]unknown read parameter name: "use_user_to_dial_conference" with value: "0"
18-12-2015 16:22:01 tau72-1: 2.13.1-d0cdbae|(00.421)[app:ERR]vapi: Master Dev. InitDevice failed Status CNF_ERROR_CSME_MCP_ACTIVE, try recovery
18-12-2015 16:22:01 tau72-1: 2.13.1-d0cdbae|(00.845)[app:ERR]vapi_chan_rd_blocked() Create conn 37 error -68058 (VAPI_ERR_-68058)
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.740)[app:info]hio: ports initialize - starting...
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.742)[app:info]Port 0: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.744)[app:info]Port 1: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.746)[app:info]Port 2: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.748)[app:info]Port 3: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.751)[app:info]Port 4: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.753)[app:info]Port 5: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.754)[app:info]Port 6: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.756)[app:info]Port 7: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.758)[app:info]Port 8: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.761)[app:info]Port 9: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.763)[app:info]Port 10: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.765)[app:info]Port 11: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.766)[app:info]Port 12: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.768)[app:info]Port 13: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.771)[app:info]Port 14: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.773)[app:info]Port 15: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.774)[app:info]Port 16: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.776)[app:info]Port 17: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.778)[app:info]Port 18: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.781)[app:info]Port 19: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.783)[app:info]Port 20: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.785)[app:info]Port 21: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.787)[app:info]Port 22: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.788)[app:info]Port 23: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.792)[app:info]Port 24: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.795)[app:info]Port 25: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.797)[app:info]Port 26: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.799)[app:info]Port 27: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.801)[app:info]Port 28: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.803)[app:info]Port 29: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.805)[app:info]Port 30: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.808)[app:info]Port 31: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.811)[app:info]Port 32: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.813)[app:info]Port 33: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.815)[app:info]Port 34: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.817)[app:info]Port 35: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.820)[app:info]Port 36: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.822)[app:info]Port 37: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.824)[app:info]Port 39: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.827)[app:info]Port 40: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.829)[app:info]Port 41: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.831)[app:info]Port 42: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.833)[app:info]Port 43: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.836)[app:info]Port 44: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.838)[app:info]Port 45: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.840)[app:info]Port 46: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.842)[app:info]Port 47: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.845)[app:info]Port 48: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.847)[app:info]Port 49: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.850)[app:info]Port 50: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.852)[app:info]Port 51: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.853)[app:info]Port 52: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.856)[app:info]Port 54: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.857)[app:info]Port 55: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.860)[app:info]Port 56: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.862)[app:info]Port 57: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.865)[app:info]Port 58: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.867)[app:info]Port 59: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.870)[app:info]Port 60: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.872)[app:info]Port 61: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.874)[app:info]Port 62: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.875)[app:info]Port 63: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.877)[app:info]Port 64: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.879)[app:info]Port 65: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.881)[app:info]Port 66: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:06 tau72-1: 2.13.1-d0cdbae|(05.883)[app:info]Port 67: flash=[100;1000] tx gain=0 rx gain=-70 power_mode="normal"
18-12-2015 16:22:09 tau72-1: 2.13.1-d0cdbae|(08.271)[app:info]hio: all ports init complete
18-12-2015 16:22:09 tau72-1: 2.13.1-d0cdbae|(08.274)[app:info]hio: Fun Temp Threshold = 45
Если я правильно понял лог, то процесс tau72 упал и был перезапущен, что собственно и вызвало разрывы соединений.
При этом в течение ~5 месяцев эксплуатации шлюза никаких проблем не возникало.
Текущая версия ПО шлюза: 2.13.1-d0cdbae Sat Mar 14 09:19:59 2015
В настройках стоит постоянная работа вентиляторов, температура шлюза по его датчикам не превышает 28 градусов.
Информация из оболочки шлюза:
Код: Выделить всё
[admin@tau72-1:/root]free
total used free shared buffers
Mem: 44640 29600 15040 0 8
Swap: 0 0 0
Total: 44640 29600 15040
[admin@tau72-1:/root]df
Filesystem 1K-blocks Used Available Use% Mounted on
/dev/root 16384 11988 4396 73% /
/dev/mtdblock6 3036 1 3035 0% /fsext
Прошу посоветовать, как можно избавиться от этого поведения.
Заранее спасибо.