Arla on FreeBSD

Kostik Belousov kostikbel at gmail.com
Thu Feb 15 11:45:37 CET 2007


On Thu, Feb 15, 2007 at 11:16:00AM +0100, Tomas Olsson wrote:
> Kostik Belousov <kostikbel at gmail.com> writes:
> > > I'm already funded and can work full time on this, but a FreeBSD hacker
> > > would help a lot.  Any volunteers?
> > 
> > Sorry for me pointing out obvious, why not continue to use fs@ as place
> > where to ask ?
> >
> You're very right, I'm just to shy to do it... Thanks.
> 
> Anyway;
> 
> Arla is built around a "small" caching fs driver (nnpfs) servicing user
> requests by asking the 'arlad' daemon for help or just operating on local
> files created/fetched by arlad. They communicate over a char device.
> 
> A simple read would be handled as such:
>         getnode/getdata rpc to arlad
>         installnode/installdata + wakeup msgs from arlad
>         VOP_READ() on newly fetched cache file
> 
> Subsequent reads on the same data would skip the rpc part, unless arlad has
> invalidated the node.
> 
> Previously, there was a 1:1 mapping between nnpfs vnode and cache file. The
> installdata message was then handled by fetching the cache file's vnode (in
> arlad's context), storing it in the nnpfs_node for future reference/access.
> Now we ended up with one cache file per "block" (large) of data, and
> decided that it would be better to open/access/close the cache "block" file
> on each access.  The closest we could get to the olden ways was to open the
> directory where a node's cache blocks reside, in arlad's context.
> 
> The interesting part is how we open and access the cache files, and from
> what context. arlad is in chroot() to avoid recursive lookups across /, and
> it seems like a good idea to avoid such lookups now too.
> 
> So the main question is how to properly do VOP_{LOOKUP,CREATE,WRITE} etc on
> cache files in this dual context world, without mixing identities in bad
> ways or confusing the OS too much.
> 
> The currently messed up code lives in
> http://cvsweb.stacken.kth.se/cvsweb.cgi/arla/nnpfs/bsd/
> 
> Most interesting is nnpfs_vnodeops-common.c (nnpfs_write_common()) and
> nnpfs_blocks.c (open_file())

I made really quick look at the places you mentioned. I have some
comment for open_file(). For FreeBSD >= 6.x, the right way to open vnode
from the kernel code is to use vn_open() (and then vn_close()) API.
Something along the lines (this is for already existing file):

	td = curthread;
	NDINIT(&nd, LOOKUP, FOLLOW | MPSAFE, UIO_USERSPACE, fname, td);
	flags = FREAD | FWRITE;
	error = vn_open(&nd, &flags, 0, -1);
	if (error)
		return (error);
	vfslocked = NDHASGIANT(&nd);
	NDFREE(&nd, NDF_ONLY_PNBUF);
	vp = nd.ni_vp;
vp is now locked, shall be unlocked by VOP_UNLOCK() before returning to
usermode. Giant is conditionally locked based on MP-safeness of the fs vp
belongs to. When Giant-protected region shall be leaved, use
	VFS_UNLOCK_GIANT(vfslocked);

To close the vnode, use
	vn_close(vp, FREAD|FWRITE, td->td_ucred, td);

See, for instance, kern/kern_ktrace.c, ufs/ufs/ufs_quota.c or
security/audit/audit_syscalls.c for real code that does this.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 187 bytes
Desc: not available
Url : http://lists.stacken.kth.se/pipermail/arla-drinkers/attachments/20070215/d635a7e4/attachment.bin


More information about the Arla-drinkers mailing list