Hi All,
After developing an SMSC as an R&D project, I have run into some issues. My software mainly receives the SMS, generates the SRI, and handles the forwardSM. The average load is around 300 messages per second. Under this load, the "Out of MSG count" reported by gctload -t1 increases gradually, and the map process consumes 100% CPU along with the application. When I check gctload -t3, messages are queued for module 0x00. The only way to recover is to restart the whole process.
GCTLOAD System status
System restart time: 2013-05-30 06:03:08.587
Congestion module Id: 0x21
GCTLIB library: V1.45
Internal system error: 0
GCTLIB Atomic: Enabled
Timed licenses in use: No
Partition[0]
Parameter size: 320
MSGs in partition: 15000
MSGs allocated : 3237
MSGs free: 11763
Maximum MSGs allocated: 3237
Max alloc since reset: 3237
Time of last max: 2013-05-30 07:43:03.909
Out of MSG count: 237366
Congestion onset: 12000
Congestion abate: 6000
Congestion status: 0
Congestion count: 0
Cong count since reset: 0
Last congestion onset: 0000-00-00 00:00:00.000
Partition[1]
Parameter size: 4200
MSGs in partition: 4000
MSGs allocated : 0
MSGs free: 4000
Maximum MSGs allocated: 3351
Max alloc since reset: 3351
Time of last max: 2013-05-30 06:35:14.325
Out of MSG count: 0
Congestion onset: 3200
Congestion abate: 1600
Congestion status: 0
Congestion count: 2
Cong count since reset: 2
Last congestion onset: 2013-05-30 06:35:14.207
GCTLOAD Message Queue Status
LOCAL=0x00, MSGs queued=1229, last read by pid=9739
LOCAL=0x14, MSGs queued=1, last read by pid=9747
LOCAL=0x15, MSGs queued=91, last read by pid=9748
LOCAL=0x2d, MSGs queued=0, last read by pid=9761
LOCAL=0x33, MSGs queued=0, last read by pid=9746
LOCAL=0x55, MSGs queued=0, last read by pid=9749
LOCAL=0x56, MSGs queued=0, last read by pid=9751
LOCAL=0x57, MSGs queued=0, last read by pid=9752
LOCAL=0xcf, MSGs queued=0, last read by pid=9741
LOCAL=0xd0, MSGs queued=0, last read by pid=9750
LOCAL=0xd1, MSGs queued=1916, last read by pid=9743
LOCAL=0xd2, MSGs queued=0, last read by pid=9745
LOCAL=0xef, MSGs queued=2, last read by pid=9742
(Modules 0x55, 0x56 and 0x57 were newly created for application use.)
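To keep an eye on which queues back up over time, the queue status above can be parsed with something like the rough sketch below. It is plain Python and not part of the DSI tools. The function names and the threshold of 100 are my own assumptions, and the regular expression only assumes the line format shown in the output above. Run against a few of the lines above, it flags modules 0xd1 and 0x00.

```python
import re

# Matches one line of the queue status output, in the format pasted above.
QUEUE_LINE = re.compile(
    r"LOCAL=0x([0-9a-fA-F]+),\s*MSGs queued=(\d+),\s*last read by pid=(\d+)"
)


def parse_queue_status(text):
    """Return a list of (module_id, queued, pid) tuples found in text."""
    rows = []
    for line in text.splitlines():
        m = QUEUE_LINE.search(line)
        if m:
            rows.append((int(m.group(1), 16), int(m.group(2)), int(m.group(3))))
    return rows


def backlogged(rows, threshold=100):
    """Modules with more than `threshold` queued messages, deepest first.

    The default threshold is an arbitrary, hypothetical value.
    """
    return sorted(
        (r for r in rows if r[1] > threshold),
        key=lambda r: r[1],
        reverse=True,
    )


# A few lines copied from the output above.
SAMPLE = """\
LOCAL=0x00, MSGs queued=1229, last read by pid=9739
LOCAL=0x14, MSGs queued=1, last read by pid=9747
LOCAL=0x15, MSGs queued=91, last read by pid=9748
LOCAL=0xd1, MSGs queued=1916, last read by pid=9743
"""

if __name__ == "__main__":
    for module_id, queued, pid in backlogged(parse_queue_status(SAMPLE)):
        print(f"module 0x{module_id:02x}: {queued} queued (pid {pid})")
```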
Output of top
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
9761 root 25 0 410m 67m 23m S 1640.0 0.2 34:57.93 smppgw64
9747 root 25 0 42812 30m 22m R 100.0 0.1 2:10.73 tcap
9739 root 25 0 24104 18m 18m R 99.5 0.1 2:03.71 tim
9741 root 25 0 24596 11m 11m R 99.5 0.0 2:01.70 s7_mgt
9742 root 25 0 24148 22m 22m R 99.5 0.1 5:00.04 s7_log
9743 root 25 0 27288 25m 22m R 99.5 0.1 2:13.38 sctp
9745 root 25 0 37092 24m 22m R 99.5 0.1 2:08.34 m3ua
9746 root 25 0 35492 23m 22m R 50.7 0.1 1:18.11 sccp
9748 root 25 0 46932 34m 22m R 50.7 0.1 1:49.14 map
22122 mysql 15 0 7392m 4.9g 4756 S 44.9 15.5 37012:07 mysqld
4245 root 18 0 70596 6592 1396 D 2.0 0.0 0:12.26 sendmail
6447 root 15 0 3820 620 492 S 2.0 0.0 38:16.41 acpid
(smppgw64 is my application.)
Please help me solve this issue, as it happens frequently.
Thanks
Prabath