|  | ROMFS - ROM FILE SYSTEM | 
|  |  | 
|  | This is a quite dumb, read only filesystem, mainly for initial RAM | 
|  | disks of installation disks.  It has grown up by the need of having | 
|  | modules linked at boot time.  Using this filesystem, you get a very | 
|  | similar feature, and even the possibility of a small kernel, with a | 
|  | file system which doesn't take up useful memory from the router | 
|  | functions in the basement of your office. | 
|  |  | 
|  | For comparison, both the older minix and xiafs (the latter is now | 
|  | defunct) filesystems, compiled as module need more than 20000 bytes, | 
|  | while romfs is less than a page, about 4000 bytes (assuming i586 | 
|  | code).  Under the same conditions, the msdos filesystem would need | 
|  | about 30K (and does not support device nodes or symlinks), while the | 
|  | nfs module with nfsroot is about 57K.  Furthermore, as a bit unfair | 
|  | comparison, an actual rescue disk used up 3202 blocks with ext2, while | 
|  | with romfs, it needed 3079 blocks. | 
|  |  | 
|  | To create such a file system, you'll need a user program named | 
|  | genromfs.  It is available via anonymous ftp on sunsite.unc.edu and | 
|  | its mirrors, in the /pub/Linux/system/recovery/ directory. | 
|  |  | 
|  | As the name suggests, romfs could be also used (space-efficiently) on | 
|  | various read-only media, like (E)EPROM disks if someone will have the | 
|  | motivation.. :) | 
|  |  | 
|  | However, the main purpose of romfs is to have a very small kernel, | 
|  | which has only this filesystem linked in, and then can load any module | 
|  | later, with the current module utilities.  It can also be used to run | 
|  | some program to decide if you need SCSI devices, and even IDE or | 
|  | floppy drives can be loaded later if you use the "initrd"--initial | 
|  | RAM disk--feature of the kernel.  This would not be really news | 
|  | flash, but with romfs, you can even spare off your ext2 or minix or | 
|  | maybe even affs filesystem until you really know that you need it. | 
|  |  | 
|  | For example, a distribution boot disk can contain only the cd disk | 
|  | drivers (and possibly the SCSI drivers), and the ISO 9660 filesystem | 
|  | module.  The kernel can be small enough, since it doesn't have other | 
|  | filesystems, like the quite large ext2fs module, which can then be | 
|  | loaded off the CD at a later stage of the installation.  Another use | 
|  | would be for a recovery disk, when you are reinstalling a workstation | 
|  | from the network, and you will have all the tools/modules available | 
|  | from a nearby server, so you don't want to carry two disks for this | 
|  | purpose, just because it won't fit into ext2. | 
|  |  | 
|  | romfs operates on block devices as you can expect, and the underlying | 
|  | structure is very simple.  Every accessible structure begins on 16 | 
|  | byte boundaries for fast access.  The minimum space a file will take | 
|  | is 32 bytes (this is an empty file, with a less than 16 character | 
|  | name).  The maximum overhead for any non-empty file is the header, and | 
|  | the 16 byte padding for the name and the contents, also 16+14+15 = 45 | 
|  | bytes.  This is quite rare however, since most file names are longer | 
|  | than 3 bytes, and shorter than 15 bytes. | 
|  |  | 
|  | The layout of the filesystem is the following: | 
|  |  | 
|  | offset	    content | 
|  |  | 
|  | +---+---+---+---+ | 
|  | 0	| - | r | o | m |  \ | 
|  | +---+---+---+---+	The ASCII representation of those bytes | 
|  | 4	| 1 | f | s | - |  /	(i.e. "-rom1fs-") | 
|  | +---+---+---+---+ | 
|  | 8	|   full size	|	The number of accessible bytes in this fs. | 
|  | +---+---+---+---+ | 
|  | 12	|    checksum	|	The checksum of the FIRST 512 BYTES. | 
|  | +---+---+---+---+ | 
|  | 16	| volume name	|	The zero terminated name of the volume, | 
|  | :               :	padded to 16 byte boundary. | 
|  | +---+---+---+---+ | 
|  | xx	|     file	| | 
|  | :    headers	: | 
|  |  | 
|  | Every multi byte value (32 bit words, I'll use the longwords term from | 
|  | now on) must be in big endian order. | 
|  |  | 
|  | The first eight bytes identify the filesystem, even for the casual | 
|  | inspector.  After that, in the 3rd longword, it contains the number of | 
|  | bytes accessible from the start of this filesystem.  The 4th longword | 
|  | is the checksum of the first 512 bytes (or the number of bytes | 
|  | accessible, whichever is smaller).  The applied algorithm is the same | 
|  | as in the AFFS filesystem, namely a simple sum of the longwords | 
|  | (assuming bigendian quantities again).  For details, please consult | 
|  | the source.  This algorithm was chosen because although it's not quite | 
|  | reliable, it does not require any tables, and it is very simple. | 
|  |  | 
|  | The following bytes are now part of the file system; each file header | 
|  | must begin on a 16 byte boundary. | 
|  |  | 
|  | offset	    content | 
|  |  | 
|  | +---+---+---+---+ | 
|  | 0	| next filehdr|X|	The offset of the next file header | 
|  | +---+---+---+---+	  (zero if no more files) | 
|  | 4	|   spec.info	|	Info for directories/hard links/devices | 
|  | +---+---+---+---+ | 
|  | 8	|     size      |	The size of this file in bytes | 
|  | +---+---+---+---+ | 
|  | 12	|   checksum	|	Covering the meta data, including the file | 
|  | +---+---+---+---+	  name, and padding | 
|  | 16	| file name     |	The zero terminated name of the file, | 
|  | :               :	padded to 16 byte boundary | 
|  | +---+---+---+---+ | 
|  | xx	| file data	| | 
|  | :		: | 
|  |  | 
|  | Since the file headers begin always at a 16 byte boundary, the lowest | 
|  | 4 bits would be always zero in the next filehdr pointer.  These four | 
|  | bits are used for the mode information.  Bits 0..2 specify the type of | 
|  | the file; while bit 4 shows if the file is executable or not.  The | 
|  | permissions are assumed to be world readable, if this bit is not set, | 
|  | and world executable if it is; except the character and block devices, | 
|  | they are never accessible for other than owner.  The owner of every | 
|  | file is user and group 0, this should never be a problem for the | 
|  | intended use.  The mapping of the 8 possible values to file types is | 
|  | the following: | 
|  |  | 
|  | mapping		spec.info means | 
|  | 0	hard link	link destination [file header] | 
|  | 1	directory	first file's header | 
|  | 2	regular file	unused, must be zero [MBZ] | 
|  | 3	symbolic link	unused, MBZ (file data is the link content) | 
|  | 4	block device	16/16 bits major/minor number | 
|  | 5	char device		    - " - | 
|  | 6	socket		unused, MBZ | 
|  | 7	fifo		unused, MBZ | 
|  |  | 
|  | Note that hard links are specifically marked in this filesystem, but | 
|  | they will behave as you can expect (i.e. share the inode number). | 
|  | Note also that it is your responsibility to not create hard link | 
|  | loops, and creating all the . and .. links for directories.  This is | 
|  | normally done correctly by the genromfs program.  Please refrain from | 
|  | using the executable bits for special purposes on the socket and fifo | 
|  | special files, they may have other uses in the future.  Additionally, | 
|  | please remember that only regular files, and symlinks are supposed to | 
|  | have a nonzero size field; they contain the number of bytes available | 
|  | directly after the (padded) file name. | 
|  |  | 
|  | Another thing to note is that romfs works on file headers and data | 
|  | aligned to 16 byte boundaries, but most hardware devices and the block | 
|  | device drivers are unable to cope with smaller than block-sized data. | 
|  | To overcome this limitation, the whole size of the file system must be | 
|  | padded to an 1024 byte boundary. | 
|  |  | 
|  | If you have any problems or suggestions concerning this file system, | 
|  | please contact me.  However, think twice before wanting me to add | 
|  | features and code, because the primary and most important advantage of | 
|  | this file system is the small code.  On the other hand, don't be | 
|  | alarmed, I'm not getting that much romfs related mail.  Now I can | 
|  | understand why Avery wrote poems in the ARCnet docs to get some more | 
|  | feedback. :) | 
|  |  | 
|  | romfs has also a mailing list, and to date, it hasn't received any | 
|  | traffic, so you are welcome to join it to discuss your ideas. :) | 
|  |  | 
|  | It's run by ezmlm, so you can subscribe to it by sending a message | 
|  | to romfs-subscribe@shadow.banki.hu, the content is irrelevant. | 
|  |  | 
|  | Pending issues: | 
|  |  | 
|  | - Permissions and owner information are pretty essential features of a | 
|  | Un*x like system, but romfs does not provide the full possibilities. | 
|  | I have never found this limiting, but others might. | 
|  |  | 
|  | - The file system is read only, so it can be very small, but in case | 
|  | one would want to write _anything_ to a file system, he still needs | 
|  | a writable file system, thus negating the size advantages.  Possible | 
|  | solutions: implement write access as a compile-time option, or a new, | 
|  | similarly small writable filesystem for RAM disks. | 
|  |  | 
|  | - Since the files are only required to have alignment on a 16 byte | 
|  | boundary, it is currently possibly suboptimal to read or execute files | 
|  | from the filesystem.  It might be resolved by reordering file data to | 
|  | have most of it (i.e. except the start and the end) laying at "natural" | 
|  | boundaries, thus it would be possible to directly map a big portion of | 
|  | the file contents to the mm subsystem. | 
|  |  | 
|  | - Compression might be an useful feature, but memory is quite a | 
|  | limiting factor in my eyes. | 
|  |  | 
|  | - Where it is used? | 
|  |  | 
|  | - Does it work on other architectures than intel and motorola? | 
|  |  | 
|  |  | 
|  | Have fun, | 
|  | Janos Farkas <chexum@shadow.banki.hu> |