crash on NetBSD

Tracy Di Marco White gendalia at gmail.com
Sat Jan 13 09:47:05 CET 2007


Our webservers use arla to access the space they provide.  A few days
ago I salvaged several volumes, and the process left hundreds of
__ORPHANFILE_.#*.#* in the root of the volumes involved.  Since that
day, two of our webservers have started having arla fail. The two
machines involved are running NetBSD 2.0.2/i386, and both are now
running arla 0.43.  They were running 0.37, previously, upgrading did
not change the results in this case, although it did dramatically
reduce the number of "nnpfs_send_message_wakeup_vmultiple" in syslog.

Any ideas why we have a one to one match of volumes with these files,
and volumes that cause arlad to exit (so far).

-Tracy

[Fri Jan 12 08:52:02 2007] [error] (19)Operation not supported by
device: access to /~username/research.htm failed

I have a couple traces:

#0  0x0806dadb in getentry (page0=0x48589000, num=1166, npages=19)
    at fdir.c:130
130          if (page->header.pg_tag != htons(AFSDIRMAGIC) &&
(gdb) bt
#0  0x0806dadb in getentry (page0=0x48589000, num=1166, npages=19)
    at fdir.c:130
#1  0x0806db6e in find_entry (page0=0x48589000, name=0x8112038 "WWW",
    npages=19) at fdir.c:151
#2  0x0806dbe4 in find_by_name (page0=0x48589000, name=0x8112038 "WWW",
    fid=0x483d7d90, dir=0x8271d2c, npages=19) at fdir.c:174
#3  0x0806de08 in fdir_lookup (the_fbuf=0x483d7880, dir=0x8271d2c,
    name=0x8112038 "WWW", file=0x483d7d90) at fdir.c:252
#4  0x0804a8a5 in adir_lookup (centry=0x8271d24, name=0x8112038 "WWW",
    file=0x483d7d90) at adir.c:89
#5  0x0805c367 in cm_lookup (entry=0x483d7d10, name=0x8112038 "WWW",
    res=0x483d7d90, ce=0x483d7d1c, follow_mount_point=1) at inter.c:494
#6  0x0805fca5 in nnpfs_message_getnode (fd=9, h=0x8112010, size=296)
    at messages.c:539
#7  0x0806c09e in nnpfs_message_receive (fd=9, h=0x8112010, size=296)
    at nnpfs.c:203
#8  0x0805e1a9 in process_message (msg_length=296, msg=0x8112010 "(\001",
    self=0x8112000) at kernel.c:104
#9  0x0805e4d7 in sub_thread (v_myself=0x8112000) at kernel.c:215
#10 0x0808d6a4 in Create_Process_Part2 () at lwp_asm.c:753
#11 0xefeeedec in ?? ()
#12 0x0808f4cd in IOMGR_Select (nfds=10, readfds=0xbfbffaa0, writefds=0x0,
    exceptfds=0x0, timeout=0x0) at iomgr.c:761
---Type <return> to continue, or q <return> to quit---
#13 0x0805ed03 in kernel_interface (args=0xbfbffaf4) at kernel.c:493
#14 0x0806d076 in arla_start (device_file=0xbfbffc68 "/dev/nnpfs0",
    cache_dir=0x80a1491 "/vice") at arlad.c:168
#15 0x0806d1e0 in main (argc=0, argv=0xbfbffb9c) at arlad.c:278
#16 0x0804a5a2 in ___start ()

#0  0x0806dc71 in used_slot (page=0x482e6000, off=1) at fdir.c:192
192         return page->header.pg_bitmap[off / 8] & (1 << (off % 8));
(gdb) bt
#0  0x0806dc71 in used_slot (page=0x482e6000, off=1) at fdir.c:192
#1  0x0806dcd1 in first_slotp (page=0x482e6000, off=0) at fdir.c:203
#2  0x0806dfc2 in fdir_readdir (the_fbuf=0x483d7c10,
    func=0x806c94f <write_dirent>, arg=0x483d7c30, dir=
        {Cell = 1, fid = {Volume = 537946712, Vnode = 1, Unique = 1}},
    offset=0x0) at fdir.c:324
#3  0x08068cb6 in conv_dir_sub (e=0x824992c, ce=0x80e0054, tokens=1,
    cache_handle=0x483d7cd0, cache_name=0x483d7df0 "02/2C@",
    cache_name_sz=256, func=0x806c94f <write_dirent>,
    flush_func=0x806c898 <flushbuf>, blocksize=512) at subr.c:126
#4  0x0806cac9 in conv_dir (e=0x824992c, ce=0x80e0054, tokens=1,
    cache_handle=0x483d7cd0, cache_name=0x483d7df0 "02/2C@", cache_name_sz=256)
    at bsd-subr.c:109
#5  0x08063987 in nnpfs_message_open (fd=9, h=0x8114010, size=48)
    at messages.c:1969
#6  0x0806c09e in nnpfs_message_receive (fd=9, h=0x8114010, size=48)
    at nnpfs.c:203
#7  0x0805e1a9 in process_message (msg_length=48, msg=0x8114010 "0",
    self=0x8114000) at kernel.c:104
#8  0x0805e4d7 in sub_thread (v_myself=0x8114000) at kernel.c:215
#9  0x0808d6a4 in Create_Process_Part2 () at lwp_asm.c:753
#10 0xefeeedec in ?? ()
#11 0x0808f4cd in IOMGR_Select (nfds=10, readfds=0xbfbff6b0, writefds=0x0,
---Type <return> to continue, or q <return> to quit---
    exceptfds=0x0, timeout=0x0) at iomgr.c:761
#12 0x0805ed03 in kernel_interface (args=0xbfbff704) at kernel.c:493
#13 0x0806d076 in arla_start (device_file=0xbfbff898 "/dev/nnpfs0",
    cache_dir=0x80a1491 "/vice") at arlad.c:168
#14 0x0806d1e0 in main (argc=0, argv=0xbfbff7a8) at arlad.c:278
#15 0x0804a5a2 in ___start ()


More information about the Arla-drinkers mailing list