
Opened 7 years ago

Closed 6 years ago

Last modified 3 years ago

#7807 closed defect (no_response)

ppp won't die

Reported by: Brian May <brian@…>
Owned by: developers
Priority: response-needed
Milestone: Chaos Calmer 15.05
Component: kernel
Version: Backfire 10.03.1 RC5
Keywords:
Cc:

Description

Copied from <https://forum.openwrt.org/viewtopic.php?id=26228>:

Hello,

Just trying out openwrt for the first time, in particular on my D-Link 502t.

I am using the 10.03.1-rc1, because the 10.03 image wasn't usable for me.

I have noticed some quirks though. Perhaps the most serious one is that PPP won't die gracefully for any reason. As a result I have to power the system off and back on every time I want to restart it (or if it restarts itself).

Before PPP shutdown, everything works fine, I can ping remote hosts, DNS works - everything that I would expect in fact.

I get the following messages logged:

Aug 22 21:01:24 OpenWrt daemon.notice pppd[563]: restoring old default route to br-lan [202.173.153.97]
Aug 22 21:01:24 OpenWrt daemon.debug pppd[563]: Script /etc/ppp/ip-down started (pid 822)
Aug 22 21:01:24 OpenWrt daemon.debug pppd[563]: sent [LCP TermReq id=0x2 "User request"]
Aug 22 21:01:24 OpenWrt daemon.debug pppd[563]: rcvd [LCP TermAck id=0x2]
Aug 22 21:01:24 OpenWrt daemon.notice pppd[563]: Connection terminated.
Aug 22 21:01:34 OpenWrt user.emerg kernel: unregister_netdevice: waiting for pppoa-wan to become free. Usage count = 1
Aug 22 21:01:44 OpenWrt user.emerg kernel: unregister_netdevice: waiting for pppoa-wan to become free. Usage count = 1
Aug 22 21:01:54 OpenWrt user.emerg kernel: unregister_netdevice: waiting for pppoa-wan to become free. Usage count = 1

kill -9 of the pppd process doesn't work.

I am guessing this means there is a bug somewhere inside the kernel, and the kernel is blocked waiting for itself to free pppoa-wan, which will never become free because it is blocked. Or something like that.
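
For anyone triaging a similar hang, here is a minimal sketch (not part of the original report) that pulls the stuck interface name out of kernel log lines like the ones above. The function name `stuck_ifaces` is made up for illustration, and it assumes the message wording matches the log shown in this ticket.

```shell
# Hypothetical helper: read kernel log text on stdin and print, once each,
# every interface named in an
# "unregister_netdevice: waiting for <iface> to become free" line.
stuck_ifaces() {
    sed -n 's/.*unregister_netdevice: waiting for \([^ ]*\) to become free.*/\1/p' | sort -u
}
```

On the router it would typically be fed from the kernel log, for example `dmesg | stuck_ifaces` or `logread | stuck_ifaces`.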

root@OpenWrt:~# uname -a
Linux OpenWrt 2.6.32.16 #1 Mon Aug 9 01:08:19 PDT 2010 mips GNU/Linux

Any workarounds? Apart from the obvious one of using PPP persist to attempt to ensure PPP doesn't die or exit?

Brian May

Attachments (0)

Change History (26)

comment:1 Changed 7 years ago by Brian May <brian@…>

At the time of the crash, strace shows:

write(5, "sent [LCP TermReq id=0x3 \"User r"..., 40) = 40
write(5, "\n", 1)                       = 1
write(10, "\300!\5\3\0\20User request", 18) = 18
gettimeofday({1281333655, 532712}, NULL) = 0
gettimeofday({1281333655, 533680}, NULL) = 0
read(10, 0x498c92, 1502)                = -1 EAGAIN (Resource temporarily unavailable)
read(11, 0x498c92, 1502)                = -1 EAGAIN (Resource temporarily unavailable)
rt_sigprocmask(SIG_BLOCK, NULL, [], 16) = 0
rt_sigprocmask(SIG_BLOCK, [HUP INT TERM USR2 CHLD], NULL, 16) = 0
rt_sigprocmask(SIG_UNBLOCK, [HUP INT TERM USR2 CHLD], NULL, 16) = 0
gettimeofday({1281333655, 542938}, NULL) = 0
sysinfo({uptime=188, loads=[7072, 10048, 4480] totalram=12787712, freeram=1146880, sharedram=0, bufferram=1212416} totalswap=0, freeswap=0, procs=29}) = 0
gettimeofday({1281333655, 547761}, NULL) = 0
_newselect(12, [10 11], NULL, [10 11], {2, 984951}

i.e. it appears to be hanging inside _newselect.

I am a bit puzzled why unregister_netdevice(...) is getting called; userland doesn't seem to be doing anything that I can see to trigger this.

comment:2 Changed 7 years ago by Brian May <brian@…>

Hmmm. Occasionally it works. Here is the strace when it works:

write(6, "sent [LCP TermReq id=0x3 \"User r"..., 40) = 40
write(6, "\n", 1)                       = 1
write(11, "\300!\5\3\0\20User request", 18) = 18
gettimeofday({1281333563, 110846}, NULL) = 0
gettimeofday({1281333563, 111823}, NULL) = 0
read(11, 0x498c92, 1502)                = -1 EAGAIN (Resource temporarily unavailable)
read(12, 0x498c92, 1502)                = -1 EAGAIN (Resource temporarily unavailable)
rt_sigprocmask(SIG_BLOCK, NULL, [], 16) = 0
rt_sigprocmask(SIG_BLOCK, [HUP INT TERM USR2 CHLD], NULL, 16) = 0
rt_sigprocmask(SIG_UNBLOCK, [HUP INT TERM USR2 CHLD], NULL, 16) = 0
gettimeofday({1281333563, 121233}, NULL) = 0
sysinfo({uptime=96, loads=[31232, 15328, 5760] totalram=12787712, freeram=892928, sharedram=0, bufferram=1212416} totalswap=0, freeswap=0, procs=29}) = 0
gettimeofday({1281333563, 123565}, NULL) = 0
_newselect(13, [11 12], NULL, [11 12], {2, 987281} <unfinished ...>
+++ killed by SIGKILL +++

I don't understand what kills the process with SIGKILL; this seems to be the only difference compared to when it crashes.

comment:3 Changed 7 years ago by rpc@…

I see much the same thing on a WR1043ND with a Sagem Fast 800 USB modem.

The problem occurs in the following situations:
1. The ADSL line is not connected to the Sagem Fast 800 modem.
2. During operation, the provider momentarily resets the line (ADSL error).
3. The ISP changes the IP address (I am not 100% sure about this one).

Jan  1 01:47:35 OpenWrt user.info kernel: usb 1-1: new full speed USB device using ar71xx-ehci and address 2
Jan  1 01:47:36 OpenWrt user.info kernel: usb 1-1: configuration #1 chosen from 1 choice
Jan  1 01:47:36 OpenWrt user.info kernel: usb 1-1: [ueagle-atm] ADSL device founded vid (0X1110) pid (0X9042) Rev (0X3000): Eagle IV
Jan  1 01:47:36 OpenWrt user.info kernel: usb 1-1: reset full speed USB device using ar71xx-ehci and address 2
Jan  1 01:47:36 OpenWrt user.info kernel: usb 1-1: [ueagle-atm] pre-firmware device, uploading firmware
Jan  1 01:47:36 OpenWrt user.info kernel: usb 1-1: [ueagle-atm] loading firmware ueagle-atm/eagleIV.fw
Jan  1 01:47:36 OpenWrt user.info kernel: usb 1-1: firmware: requesting ueagle-atm/eagleIV.fw
Jan  1 01:47:37 OpenWrt user.info kernel: usb 1-1: [ueagle-atm] firmware uploaded
Jan  1 01:47:37 OpenWrt user.info kernel: usb 1-1: USB disconnect, address 2
Jan  1 01:47:39 OpenWrt user.info kernel: usb 1-1: new full speed USB device using ar71xx-ehci and address 3
Jan  1 01:47:39 OpenWrt user.info kernel: usb 1-1: configuration #1 chosen from 1 choice
Jan  1 01:47:39 OpenWrt user.info kernel: usb 1-1: [ueagle-atm] ADSL device founded vid (0X1110) pid (0X9041) Rev (0X3029): Eagle IV
Jan  1 01:47:39 OpenWrt user.info kernel: usb 1-1: reset full speed USB device using ar71xx-ehci and address 3
Jan  1 01:47:39 OpenWrt user.info kernel: usb 1-1: [ueagle-atm] (re)booting started
Jan  1 01:47:41 OpenWrt user.info kernel: usb 1-1: firmware: requesting ueagle-atm/DSP4p.bin
Jan  1 01:47:41 OpenWrt daemon.info pppd[1636]: Plugin pppoatm.so loaded.
Jan  1 01:47:41 OpenWrt daemon.notice pppd[1636]: pppd 2.4.4 started by root, uid 0
Jan  1 01:47:41 OpenWrt daemon.info pppd[1636]: Using interface pppoa-wan
Jan  1 01:47:41 OpenWrt daemon.notice pppd[1636]: Connect: pppoa-wan <--> 0.35
Jan  1 01:47:42 OpenWrt user.info kernel: usb 1-1: [ueagle-atm] ATU-R firmware version : a10050b
Jan  1 01:47:42 OpenWrt user.info kernel: usb 1-1: firmware: requesting ueagle-atm/CMV4p.bin.v2
Jan  1 01:47:42 OpenWrt user.info kernel: usb 1-1: [ueagle-atm] modem started, waiting synchronization...
Jan  1 01:48:12 OpenWrt daemon.warn pppd[1636]: LCP: timeout sending Config-Requests
Jan  1 01:48:12 OpenWrt daemon.notice pppd[1636]: Connection terminated.
Jan  1 01:48:22 OpenWrt user.emerg kernel: unregister_netdevice: waiting for pppoa-wan to become free. Usage count = 1
Jan  1 01:48:32 OpenWrt user.emerg kernel: unregister_netdevice: waiting for pppoa-wan to become free. Usage count = 1
Jan  1 01:48:42 OpenWrt user.emerg kernel: unregister_netdevice: waiting for pppoa-wan to become free. Usage count = 1

At this point I am running Backfire 10.03.1-rc2. I had the same message on 10.03.1-rc1 and 10.03.1. On Backfire 10.03 it also occurred, but very rarely.

comment:4 Changed 7 years ago by Brian May <brian@…>

After upgrading to 10.03.1-rc2 I haven't encountered this problem again. Fingers crossed...

comment:5 Changed 7 years ago by jow

  • Resolution set to fixed
  • Status changed from new to closed

I assume it is permanently fixed.

comment:6 Changed 7 years ago by ddxx0n

  • Resolution fixed deleted
  • Status changed from closed to reopened

Unfortunately, I have encountered a similar (or the same) bug even with recent trunk versions (running 26660 just now on ar71xx (WNDR3700v1) w/ 2.6.37.6).

My DSL line gets dropped by the provider every 24 hours, and sometimes pppd fails to reconnect and freezes; even "killall -9 pppd" doesn't work. The only way to resolve this is to reboot the router. Obviously, this is rather serious since I cannot rely on my router to stay connected 24/7.

The same error occurs *every* time when manually issuing "ifdown wan", "killall pppd" or "killall -HUP pppd". The only change is that the STAT of pppd in "ps ax" changes from S to D and pppd becomes frozen. In each case, dmesg shows ad infinitum:

unregister_netdevice: waiting for pppoe-wan to become free. Usage count = 1
unregister_netdevice: waiting for pppoe-wan to become free. Usage count = 1
unregister_netdevice: waiting for pppoe-wan to become free. Usage count = 1
unregister_netdevice: waiting for pppoe-wan to become free. Usage count = 1
unregister_netdevice: waiting for pppoe-wan to become free. Usage count = 1
unregister_netdevice: waiting for pppoe-wan to become free. Usage count = 1
unregister_netdevice: waiting for pppoe-wan to become free. Usage count = 1
unregister_netdevice: waiting for pppoe-wan to become free. Usage count = 1
unregister_netdevice: waiting for pppoe-wan to become free. Usage count = 1
unregister_netdevice: waiting for pppoe-wan to become free. Usage count = 1
unregister_netdevice: waiting for pppoe-wan to become free. Usage count = 1
unregister_netdevice: waiting for pppoe-wan to become free. Usage count = 1
unregister_netdevice: waiting for pppoe-wan to become free. Usage count = 1

You get the picture. Some insight on this would be very much appreciated.

comment:7 Changed 7 years ago by ddxx0n

Addition: on my system pppd fails to restart every time, not just sometimes as written above. Basically, it is impossible to kill pppd and I have to reboot the router to redial the connection.

I will have to revert to a trunk revision that is some weeks older; it used to work there.

comment:8 Changed 7 years ago by peter@…

I have exactly the same problem: pppd fails to restart and hangs. The only way to resolve this is a reboot -f; a normal reboot won't work. This is a big problem for me as I use HSDPA connections, which rarely connect in one shot, meaning most of the time the unit just hangs.

comment:9 Changed 7 years ago by jow

  • What's the full "ps w" output at the time pppd is in D state?
  • What kernel modules are loaded?

Here's something that looks similar: https://bugzilla.redhat.com/show_bug.cgi?id=665364

comment:10 Changed 7 years ago by peter@…

The problem in my case seems to be the package qos-scripts. Uninstalling it had no effect, but doing a fresh install of OpenWrt and installing everything but qos-scripts did the trick. So in my case it's SOLVED :-)

comment:11 Changed 7 years ago by jow

I also suspect some qos related kmods, maybe in conjunction with iptables or ip rules. Though we need to narrow down which one exactly...

comment:12 Changed 7 years ago by jow

  • Priority changed from normal to lowest

comment:13 Changed 7 years ago by jow

  • Priority changed from lowest to response-needed

comment:14 Changed 7 years ago by ddxx0n

Sorry it took a while for me to realize that something is happening in this ticket; I don't run a website change watcher and don't have a Trac account here to notify me.

... the output of the 'ps w' command with the dead pppd:

  PID USER       VSZ STAT COMMAND
    1 root      1448 S    init
    2 root         0 SW   [kthreadd]
    3 root         0 SW   [ksoftirqd/0]
    4 root         0 SW   [kworker/0:0]
    6 root         0 SW   [rcu_kthread]
    7 root         0 SW<  [khelper]
    8 root         0 SW   [kworker/u:1]
   53 root         0 SW   [sync_supers]
   55 root         0 SW   [bdi-default]
   57 root         0 SW<  [kblockd]
   89 root         0 SW   [kswapd0]
  149 root         0 SW<  [ar71xx-spi]
  162 root         0 SW   [mtdblock0]
  167 root         0 SW   [mtdblock1]
  172 root         0 SW   [mtdblock2]
  177 root         0 SW   [mtdblock3]
  182 root         0 SW   [mtdblock4]
  187 root         0 SW   [mtdblock5]
  192 root         0 SW   [mtdblock6]
  398 root         0 SWN  [jffs2_gcd_mtd4]
  422 root         0 SW   [khubd]
  431 root         0 SW   [scsi_eh_0]
  432 root         0 SW   [usb-storage]
  434 root         0 SW   [kworker/u:2]
  457 root      1448 S    init
  480 root      1436 S    syslogd -s 16 -O /usb/root/var/log/messages -S
  482 root      1436 S    klogd
  495 root       816 S    /sbin/hotplug2 --override --persistent --set-worker
  649 root      1564 D    /usr/sbin/pppd plugin rp-pppoe.so mtu 1492 mru 1492
  698 root         0 SW<  [cfg80211]
 1271 root       776 S    yaddns -f /tmp/yaddns.conf -p /var/run/yaddns.pid -D
 1292 root         0 SW   [flush-8:0]
 1293 root         0 SW   [jbd2/sda4-8]
 1294 root         0 SW<  [ext4-dio-unwrit]
 1572 root      3296 S    hostapd -P /var/run/wifi-phy0.pid -B /var/run/hostap
 2222 root      1424 S    /usr/sbin/uhttpd -f -h /www -r PHOENIX -x /cgi-bin -
 2226 root      1152 S    /usr/sbin/uhttpd -f -h /www/nzbgetweb -r PHOENIX -t
 2229 root      1180 S    xinetd
 2255 nobody     928 S    dnsmasq -i br-lan -K -D -f -y -Z -b -E --enable-tftp
 2284 root      2624 S    ntpd -g -p /var/run/ntpd.pid
 2301 root      1516 S    /usr/sbin/collectd -f
 3016 root      1444 S    watchdog -t 5 /dev/watchdog
 3251 root         0 SW   [kworker/0:2]
 5109 root      1448 S    telnetd -i -l /usr/local/bin/login.sh
 5110 root      1456 S    /bin/ash --login
 5141 root         0 Z    [ip-down]
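
As a rough aid for reading listings like this one, the following sketch filters BusyBox-style "ps w" output down to processes in D (uninterruptible sleep) or Z (zombie) state. The function name `blocked_procs` is hypothetical, and the column layout (PID USER VSZ STAT COMMAND) is assumed to match the listing above.

```shell
# Hypothetical helper: read "ps w" output on stdin and print PID, STAT and
# the command name for every process whose STAT column starts with D or Z.
# The header line is ignored naturally because "STAT" matches neither letter.
blocked_procs() {
    awk '$4 ~ /^[DZ]/ { print $1, $4, $5 }'
}
```

Applied to the listing above (`ps w | blocked_procs`), this should pick out pppd (PID 649, state D) and the ip-down script (PID 5141, state Z).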

... the loaded kernel modules 'lsmod':

BusyBox v1.18.4 (2011-04-30 23:47:58 CEST) built-in shell (ash)
Enter 'help' for a list of built-in commands.

  _______                     ________        __
 |       |.-----.-----.-----.|  |  |  |.----.|  |_
 |   -   ||  _  |  -__|     ||  |  |  ||   _||   _|
 |_______||   __|_____|__|__||________||__|  |____|
          |__| W I R E L E S S   F R E E D O M
 ATTITUDE ADJUSTMENT (bleeding edge, r26802) ----------
  * 1/4 oz Vodka      Pour all ingredents into mixing
  * 1/4 oz Gin        tin with ice, strain into glass.
  * 1/4 oz Amaretto
  * 1/4 oz Triple sec
  * 1/4 oz Peach schnapps
  * 1/4 oz Sour mix
  * 1 splash Cranberry juice
 -----------------------------------------------------
root@PHOENIX:/# lsmod
Module                  Size  Used by    Not tainted
sch_red                 3456  2
sch_sfq                 4592  2
sch_hfsc               13712  1
cls_fw                  3152  4
sch_ingress              848  1
act_mirred              2480  1
act_connmark             768  1
em_u32                   496  0
cls_u32                 5760  1
ifb                     2304  0
sg                     18704  0
gpio_buttons            1920  0
ums_usbat               7296  0
ums_sddr55              4736  0
ums_sddr09              8704  0
ums_karma               1456  0
ums_jumpshot            3568  0
ums_isd200              4864  0
ums_freecom             1872  0
ums_datafab             4624  0
ums_cypress             2064  0
ums_alauda              8256  0
leds_wndr3700_usb        672  0
xt_TPROXY               2016  0
xt_socket               1584  0
nf_tproxy_core           656  1 xt_TPROXY,[permanent]
nf_conntrack_netlink    10624  0
nfnetlink_queue         4800  0
nfnetlink_log           5152  0
nfnetlink               1616  3 nf_conntrack_netlink,nfnetlink_queue,nfnetlink_log
xt_CHAOS                1696  4
xt_TARPIT               1776  1
xt_DELUDE               1360  1
xt_condition            1584  0
ipt_ULOG                3888  7
ip_queue                3808  0
nf_nat_tftp              400  0
nf_conntrack_tftp       2352  1 nf_nat_tftp
nf_nat_snmp_basic       6928  0
nf_nat_sip              5088  0
nf_conntrack_sip       15920  1 nf_nat_sip
nf_nat_rtsp             2976  0
nf_conntrack_rtsp       3888  1 nf_nat_rtsp
nf_nat_pptp             1312  0
nf_conntrack_pptp       3088  1 nf_nat_pptp
nf_nat_h323             4560  0
nf_conntrack_h323      33008  1 nf_nat_h323
nf_nat_proto_gre         784  1 nf_nat_pptp
nf_conntrack_proto_gre     2304  1 nf_conntrack_pptp
nf_nat_amanda            608  0
nf_conntrack_amanda     1552  1 nf_nat_amanda
nf_nat_irc               784  0
nf_conntrack_irc        2496  1 nf_nat_irc
nf_nat_ftp               976  0
nf_conntrack_ftp        4416  1 nf_nat_ftp
xt_policy               1856  0
xt_esp                   672  0
ipt_ah                   608  0
xt_iprange               864  0
xt_HL                   1200  0
xt_hl                    720  0
ipt_ECN                 1280  0
xt_CLASSIFY              448  0
xt_time                 1472  0
xt_tcpmss                896  0
xt_statistic             720  0
xt_mark                  592 55
xt_length                592  2
ipt_ecn                  880  0
xt_DSCP                 1296  0
xt_dscp                  912  0
xt_string                688  0
xt_layer7               8976 28
xt_quota                 672  0
xt_pkttype               544  0
xt_owner                 608  0
compat_xtables          1536  3 xt_CHAOS,xt_TARPIT,xt_DELUDE
ipt_REDIRECT             592  0
ipt_NETMAP               592  0
ipt_MASQUERADE           944  1
iptable_nat             2608  1
nf_nat                 10512 13 nf_nat_tftp,nf_nat_sip,nf_nat_rtsp,nf_nat_pptp,nf_nat_h323,nf_nat_proto_gre,nf_nat_amanda,nf_nat_irc,nf_nat_ftp,ipt_REDIRECT,ipt_NETMAP,ipt_MASQUERADE,iptable_nat
xt_recent               5744  6
xt_helper                800  0
xt_connmark              960  0
xt_connbytes            1312  0
xt_conntrack            1648  0
xt_NOTRACK               464  2
iptable_raw              560  1
xt_state                 624  7
nf_conntrack_ipv4       4336 38 iptable_nat,nf_nat
nf_defrag_ipv4           624  3 xt_TPROXY,xt_socket,nf_conntrack_ipv4
nf_conntrack           37808 31 act_connmark,nf_conntrack_netlink,nf_nat_tftp,nf_conntrack_tftp,nf_nat_snmp_basic,nf_nat_sip,nf_conntrack_sip,nf_nat_rtsp,nf_conntrack_rtsp,nf_nat_pptp,nf_conntrack_pptp,nf_nat_h323,nf_conntrack_h323,nf_conntrack_proto_gre,nf_nat_amanda,nf_conntrack_amanda,nf_nat_irc,nf_conntrack_irc,nf_nat_ftp,nf_conntrack_ftp,xt_layer7,ipt_MASQUERADE,iptable_nat,nf_nat,xt_helper,xt_connmark,xt_connbytes,xt_conntrack,xt_NOTRACK,xt_state,nf_conntrack_ipv4
ipt_REJECT              1664  3
xt_TCPMSS               1872  2
ipt_LOG                 5840  1
xt_comment               416  0
xt_multiport            1104 25
xt_mac                   544  0
xt_limit                 960  7
iptable_mangle           848  1
iptable_filter           624  1
ip_tables               8512  4 iptable_nat,iptable_raw,iptable_mangle,iptable_filter
xt_tcpudp               1568 41
x_tables               10064 50 xt_TPROXY,xt_socket,xt_CHAOS,xt_condition,ipt_ULOG,xt_policy,xt_esp,ipt_ah,xt_iprange,xt_HL,xt_hl,ipt_ECN,xt_CLASSIFY,xt_time,xt_tcpmss,xt_statistic,xt_mark,xt_length,ipt_ecn,xt_DSCP,xt_dscp,xt_string,xt_layer7,xt_quota,xt_pkttype,xt_owner,compat_xtables,ipt_REDIRECT,ipt_NETMAP,ipt_MASQUERADE,iptable_nat,xt_recent,xt_helper,xt_connmark,xt_connbytes,xt_conntrack,xt_NOTRACK,iptable_raw,xt_state,ipt_REJECT,xt_TCPMSS,ipt_LOG,xt_comment,xt_multiport,xt_mac,xt_limit,iptable_mangle,iptable_filter,ip_tables,xt_tcpudp
tun                    11568  0
ppp_async               5968  0
loop                   10624  0
ath9k                  73264  0
ath9k_common            1168  1 ath9k
ath9k_hw              237120  2 ath9k,ath9k_common
ath                    11360  2 ath9k,ath9k_hw
nls_utf8                 784  0
nls_iso8859_15          3312  0
nls_iso8859_1           2800  0
nls_cp850               3568  0
nls_cp437               4336  0
mac80211              190480  1 ath9k
ts_fsm                  2496  0
ts_bm                   1360  0
ts_kmp                  1296  5
crc_ccitt                944  1 ppp_async
cfg80211              117328  2 ath9k,mac80211
compat                   656  0
xts                     1920  0
gf128mul                6672  1 xts
arc4                     768  4
aes_generic            29840  2
crypto_blkcipher        9056  1 xts
cryptomgr               1808  0
aead                    3568  1 cryptomgr
crypto_hash             7824  0
crypto_algapi           9008  7 xts,arc4,aes_generic,crypto_blkcipher,cryptomgr,aead,crypto_hash
pppoe                   7200  2
pppox                   1152  1 pppoe
ppp_generic            18384  7 ppp_async,pppoe,pppox
slhc                    4000  1 ppp_generic
usb_storage            32624 12 ums_usbat,ums_sddr55,ums_sddr09,ums_karma,ums_jumpshot,ums_isd200,ums_freecom,ums_datafab,ums_cypress,ums_alauda
ohci_hcd               16560  0
ehci_hcd               32128  0
sd_mod                 21504  3
ext4                  220496  1
jbd2                   36240  1 ext4
usbcore                95024 14 ums_usbat,ums_sddr55,ums_sddr09,ums_karma,ums_jumpshot,ums_isd200,ums_freecom,ums_datafab,ums_cypress,ums_alauda,usb_storage,ohci_hcd,ehci_hcd
scsi_mod               67776  4 sg,ums_cypress,usb_storage,sd_mod
nls_base                4544  6 nls_utf8,nls_iso8859_15,nls_iso8859_1,nls_cp850,nls_cp437,usbcore
mbcache                 3504  1 ext4
crc16                    944  1 ext4
leds_gpio               1584  0
button_hotplug          2672  0
gpio_keys_polled        2080  0
input_polldev           2064  2 gpio_buttons,gpio_keys_polled
input_core             19792  5 gpio_buttons,button_hotplug,gpio_keys_polled,input_polldev
root@PHOENIX:/#

I'll have a look if getting rid of qos helps.

comment:15 Changed 7 years ago by ddxx0n

Like peter, uninstalling the qos stuff did not help:

opkg remove qos-scripts luci-app-qos
mv /etc/qos /etc/qos.disabled
opkg remove kmod-ifb

these are the remaining kernel modules, pppd still broken after killall -HUP pppd:

root@PHOENIX:/# lsmod
Module                  Size  Used by    Not tainted
sg                     18704  0
gpio_buttons            1920  0
ums_usbat               7296  0
ums_sddr55              4736  0
ums_sddr09              8704  0
ums_karma               1456  0
ums_jumpshot            3568  0
ums_isd200              4864  0
ums_freecom             1872  0
ums_datafab             4624  0
ums_cypress             2064  0
ums_alauda              8256  0
leds_wndr3700_usb        672  0
xt_TPROXY               2016  0
xt_socket               1584  0
nf_tproxy_core           656  1 xt_TPROXY,[permanent]
nf_conntrack_netlink    10624  0
nfnetlink_queue         4800  0
nfnetlink_log           5152  0
nfnetlink               1616  3 nf_conntrack_netlink,nfnetlink_queue,nfnetlink_log
xt_CHAOS                1696  4
xt_TARPIT               1776  1
xt_DELUDE               1360  1
xt_condition            1584  0
ipt_ULOG                3888  7
ip_queue                3808  0
nf_nat_tftp              400  0
nf_conntrack_tftp       2352  1 nf_nat_tftp
nf_nat_snmp_basic       6928  0
nf_nat_sip              5088  0
nf_conntrack_sip       15920  1 nf_nat_sip
nf_nat_rtsp             2976  0
nf_conntrack_rtsp       3888  1 nf_nat_rtsp
nf_nat_pptp             1312  0
nf_conntrack_pptp       3088  1 nf_nat_pptp
nf_nat_h323             4560  0
nf_conntrack_h323      33008  1 nf_nat_h323
nf_nat_proto_gre         784  1 nf_nat_pptp
nf_conntrack_proto_gre     2304  1 nf_conntrack_pptp
nf_nat_amanda            608  0
nf_conntrack_amanda     1552  1 nf_nat_amanda
nf_nat_irc               784  0
nf_conntrack_irc        2496  1 nf_nat_irc
nf_nat_ftp               976  0
nf_conntrack_ftp        4416  1 nf_nat_ftp
xt_policy               1856  0
xt_esp                   672  0
ipt_ah                   608  0
xt_iprange               864  0
xt_HL                   1200  0
xt_hl                    720  0
ipt_ECN                 1280  0
xt_CLASSIFY              448  0
xt_time                 1472  0
xt_tcpmss                896  0
xt_statistic             720  0
xt_mark                  592 55
xt_length                592  2
ipt_ecn                  880  0
xt_DSCP                 1296  0
xt_dscp                  912  0
xt_string                688  0
xt_layer7               8976 28
xt_quota                 672  0
xt_pkttype               544  0
xt_owner                 608  0
compat_xtables          1536  3 xt_CHAOS,xt_TARPIT,xt_DELUDE
ipt_REDIRECT             592  0
ipt_NETMAP               592  0
ipt_MASQUERADE           944  1
iptable_nat             2608  1
nf_nat                 10512 13 nf_nat_tftp,nf_nat_sip,nf_nat_rtsp,nf_nat_pptp,nf_nat_h323,nf_nat_proto_gre,nf_nat_amanda,nf_nat_irc,nf_nat_ftp,ipt_REDIRECT,ipt_NETMAP,ipt_MASQUERADE,iptable_nat
xt_recent               5744  6
xt_helper                800  0
xt_connmark              960  0
xt_connbytes            1312  0
xt_conntrack            1648  0
xt_NOTRACK               464  2
iptable_raw              560  1
xt_state                 624  7
nf_conntrack_ipv4       4336 38 iptable_nat,nf_nat
nf_defrag_ipv4           624  3 xt_TPROXY,xt_socket,nf_conntrack_ipv4
nf_conntrack           37808 30 nf_conntrack_netlink,nf_nat_tftp,nf_conntrack_tftp,nf_nat_snmp_basic,nf_nat_sip,nf_conntrack_sip,nf_nat_rtsp,nf_conntrack_rtsp,nf_nat_pptp,nf_conntrack_pptp,nf_nat_h323,nf_conntrack_h323,nf_conntrack_proto_gre,nf_nat_amanda,nf_conntrack_amanda,nf_nat_irc,nf_conntrack_irc,nf_nat_ftp,nf_conntrack_ftp,xt_layer7,ipt_MASQUERADE,iptable_nat,nf_nat,xt_helper,xt_connmark,xt_connbytes,xt_conntrack,xt_NOTRACK,xt_state,nf_conntrack_ipv4
ipt_REJECT              1664  3
xt_TCPMSS               1872  2
ipt_LOG                 5840  1
xt_comment               416  0
xt_multiport            1104 25
xt_mac                   544  0
xt_limit                 960  7
iptable_mangle           848  1
iptable_filter           624  1
ip_tables               8512  4 iptable_nat,iptable_raw,iptable_mangle,iptable_filter
xt_tcpudp               1568 41
x_tables               10064 50 xt_TPROXY,xt_socket,xt_CHAOS,xt_condition,ipt_ULOG,xt_policy,xt_esp,ipt_ah,xt_iprange,xt_HL,xt_hl,ipt_ECN,xt_CLASSIFY,xt_time,xt_tcpmss,xt_statistic,xt_mark,xt_length,ipt_ecn,xt_DSCP,xt_dscp,xt_string,xt_layer7,xt_quota,xt_pkttype,xt_owner,compat_xtables,ipt_REDIRECT,ipt_NETMAP,ipt_MASQUERADE,iptable_nat,xt_recent,xt_helper,xt_connmark,xt_connbytes,xt_conntrack,xt_NOTRACK,iptable_raw,xt_state,ipt_REJECT,xt_TCPMSS,ipt_LOG,xt_comment,xt_multiport,xt_mac,xt_limit,iptable_mangle,iptable_filter,ip_tables,xt_tcpudp
tun                    11568  0
ppp_async               5968  0
loop                   10624  0
ath9k                  73264  0
ath9k_common            1168  1 ath9k
ath9k_hw              237120  2 ath9k,ath9k_common
ath                    11360  2 ath9k,ath9k_hw
nls_utf8                 784  0
nls_iso8859_15          3312  0
nls_iso8859_1           2800  0
nls_cp850               3568  0
nls_cp437               4336  0
mac80211              190480  1 ath9k
ts_fsm                  2496  0
ts_bm                   1360  0
ts_kmp                  1296  5
crc_ccitt                944  1 ppp_async
cfg80211              117328  2 ath9k,mac80211
compat                   656  0
xts                     1920  0
gf128mul                6672  1 xts
arc4                     768  4
aes_generic            29840  3
crypto_blkcipher        9056  1 xts
cryptomgr               1808  0
aead                    3568  1 cryptomgr
crypto_hash             7824  0
crypto_algapi           9008  7 xts,arc4,aes_generic,crypto_blkcipher,cryptomgr,aead,crypto_hash
pppoe                   7200  2
pppox                   1152  1 pppoe
ppp_generic            18384  7 ppp_async,pppoe,pppox
slhc                    4000  1 ppp_generic
usb_storage            32624 12 ums_usbat,ums_sddr55,ums_sddr09,ums_karma,ums_jumpshot,ums_isd200,ums_freecom,ums_datafab,ums_cypress,ums_alauda
ohci_hcd               16560  0
ehci_hcd               32128  0
sd_mod                 21504  3
ext4                  220496  1
jbd2                   36240  1 ext4
usbcore                95024 14 ums_usbat,ums_sddr55,ums_sddr09,ums_karma,ums_jumpshot,ums_isd200,ums_freecom,ums_datafab,ums_cypress,ums_alauda,usb_storage,ohci_hcd,ehci_hcd
scsi_mod               67776  4 sg,ums_cypress,usb_storage,sd_mod
nls_base                4544  6 nls_utf8,nls_iso8859_15,nls_iso8859_1,nls_cp850,nls_cp437,usbcore
mbcache                 3504  1 ext4
crc16                    944  1 ext4
leds_gpio               1584  0
button_hotplug          2672  0
gpio_keys_polled        2080  0
input_polldev           2064  2 gpio_buttons,gpio_keys_polled
input_core             19792  5 gpio_buttons,button_hotplug,gpio_keys_polled,input_polldev

I'll try reinstalling OpenWrt w/o qos-scripts.

comment:16 Changed 7 years ago by anonymous

OK, I've found two things, (1) and (2). I am not yet ready to admit that reinstalling the OS should be the solution on Linux, so I removed and reinstalled various packages, sometimes running 'firstboot' to reset everything to the known broken state.

... removing these qos/ppp related packages didn't do anything: 'opkg remove kmod-ifb kmod-ipt-queue luci-app-qos qos-scripts luci-app-multiwan multiwan tc'

... but 'opkg remove iptables-mod-ipopt kmod-ipt-ipopt' did, by disabling ppp entirely: there seems to be a missing package dependency, since ppp depends on slhc, which comes from kmod-ipt-ipopt. (1) You might want to fix this in the Makefile of ppp or kmod-ppp, or this could happen:

root@PHOENIX:/lib/modules/2.6.38.4# insmod ./ppp_generic.ko
insmod: can't insert './ppp_generic.ko': unknown symbol in module, or unknown parameter

root@PHOENIX:/lib/modules/2.6.38.4# dmesg
ppp_generic: Unknown symbol slhc_toss (err 0)
ppp_generic: Unknown symbol slhc_compress (err 0)
ppp_generic: Unknown symbol slhc_free (err 0)
ppp_generic: Unknown symbol slhc_uncompress (err 0)
ppp_generic: Unknown symbol slhc_remember (err 0)
ppp_generic: Unknown symbol slhc_init (err 0)
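
To summarize this kind of failure quickly, a sketch like the following lists which module is missing which symbol. The function name `unknown_symbols` is hypothetical, and the parsing assumes the "module: Unknown symbol name" wording shown in the dmesg output above.

```shell
# Hypothetical helper: read kernel log text on stdin and print
# "<module> <symbol>" for every "Unknown symbol" line.
unknown_symbols() {
    awk '{
        for (i = 2; i + 2 <= NF; i++) {
            if ($i == "Unknown" && $(i + 1) == "symbol") {
                m = $(i - 1)
                sub(/:$/, "", m)     # strip the trailing colon from the module name
                print m, $(i + 2)
            }
        }
    }'
}
```

Running `dmesg | unknown_symbols` after a failed insmod shows at a glance that every unresolved symbol here starts with slhc_, which points at the slhc module not being loaded.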

By this pure accident, I discovered that loading the whole ppp kernel module chain *last* circumvents the mysterious second lock which breaks the hangup of pppd. (2) Thus, my current workaround is to move ppp_generic.ko somewhere else and manually insmod it via a script after the system boot is complete.

Of course, this still doesn't tell us where this second ppp lock originates from.

comment:17 Changed 7 years ago by ddxx0n

It was interesting that peter's router only reboots with 'reboot -f' - this is true with my setup, too, and might be caused by the same problem that hangs pppd.

For what it's worth, here's my workaround if anyone has got the same problem and recompiling w/o qos is not wanted or doesn't help. Insert this in rc.local:

if [ -f /lib/modules/ppp_generic.ko ]; then
        insmod /lib/modules/ppp_generic.ko
        ifup wan
else
        mv /lib/modules/$(uname -r)/ppp_generic.ko /lib/modules
        reboot -f
fi
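
A small guard could keep a script like this from calling insmod again when the module is already loaded. This is only an illustrative sketch and not part of the workaround as posted; the function name `module_loaded` is made up, and it assumes the usual /proc/modules layout with the module name in the first column.

```shell
# Hypothetical helper: succeed (exit status 0) if the module named in $1
# appears in a /proc/modules-style listing. The optional second argument
# names the file to read and defaults to /proc/modules.
module_loaded() {
    awk -v m="$1" '$1 == m { found = 1 } END { exit !found }' "${2:-/proc/modules}"
}
```

With this in place, the insmod line could be written as `module_loaded ppp_generic || insmod /lib/modules/ppp_generic.ko`.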

comment:18 Changed 7 years ago by ddxx0n

Even recompiling w/o qos scripts didn't help with my system, thus I am stuck with my workaround mentioned above for the time being.

If anyone has an idea on how to determine what locks ppp apart from pppd I'm open to suggestions...

comment:19 Changed 7 years ago by jow

Is this still the case with RC5?

comment:20 Changed 7 years ago by nbd

  • Resolution set to no_response
  • Status changed from reopened to closed

comment:21 Changed 6 years ago by jow

  • Resolution no_response deleted
  • Status changed from closed to reopened

This issue has been observed again on RC5, using a Huawei E620 modem connected to a TP-Link TL-WR1043ND running kernel 2.6.39.4. No further services running; IPv6 has been ruled out as the source. No QoS installed. Occurs with both the option and usb-serial drivers.

comment:22 Changed 6 years ago by jow

  • Component changed from packages to kernel
  • Priority changed from response-needed to normal
  • Version changed from Backfire 10.03.1 RC1 to Backfire 10.03.1 RC5

comment:23 Changed 6 years ago by Nilfred <nilfred@…>

#10095 may help? It is about terminating between some LCP events.

comment:24 Changed 6 years ago by nbd

  • Priority changed from normal to response-needed

Is this issue still present on 10.03.1 and trunk?

comment:25 Changed 6 years ago by nbd

  • Resolution set to no_response
  • Status changed from reopened to closed

comment:26 Changed 4 years ago by jow

  • Milestone changed from Backfire 10.03.2 to Chaos Calmer (trunk)

Milestone Backfire 10.03.2 deleted
