| Linus Torvalds | 1da177e | 2005-04-16 15:20:36 -0700 | [diff] [blame] | 1 | USING VFAT | 
|  | 2 | ---------------------------------------------------------------------- | 
|  | 3 | To use the vfat filesystem, use the filesystem type 'vfat'.  i.e. | 
|  | 4 | mount -t vfat /dev/fd0 /mnt | 
|  | 5 |  | 
|  | 6 | No special partition formatter is required.  mkdosfs will work fine | 
|  | 7 | if you want to format from within Linux. | 
|  | 8 |  | 
|  | 9 | VFAT MOUNT OPTIONS | 
|  | 10 | ---------------------------------------------------------------------- | 
|  | 11 | umask=###     -- The permission mask (for files and directories, see umask(1)). | 
|  | 12 | The default is the umask of current process. | 
|  | 13 |  | 
|  | 14 | dmask=###     -- The permission mask for the directory. | 
|  | 15 | The default is the umask of current process. | 
|  | 16 |  | 
|  | 17 | fmask=###     -- The permission mask for files. | 
|  | 18 | The default is the umask of current process. | 
|  | 19 |  | 
|  | 20 | codepage=###  -- Sets the codepage number for converting to shortname | 
|  | 21 | characters on FAT filesystem. | 
|  | 22 | By default, FAT_DEFAULT_CODEPAGE setting is used. | 
|  | 23 |  | 
|  | 24 | iocharset=name -- Character set to use for converting between the | 
|  | 25 | encoding is used for user visible filename and 16 bit | 
|  | 26 | Unicode characters. Long filenames are stored on disk | 
|  | 27 | in Unicode format, but Unix for the most part doesn't | 
|  | 28 | know how to deal with Unicode. | 
|  | 29 | By default, FAT_DEFAULT_IOCHARSET setting is used. | 
|  | 30 |  | 
| Alexey Dobriyan | 4de151d | 2006-03-22 00:13:35 +0100 | [diff] [blame] | 31 | There is also an option of doing UTF-8 translations | 
| Linus Torvalds | 1da177e | 2005-04-16 15:20:36 -0700 | [diff] [blame] | 32 | with the utf8 option. | 
|  | 33 |  | 
|  | 34 | NOTE: "iocharset=utf8" is not recommended. If unsure, | 
|  | 35 | you should consider the following option instead. | 
|  | 36 |  | 
| Alexey Dobriyan | 4de151d | 2006-03-22 00:13:35 +0100 | [diff] [blame] | 37 | utf8=<bool>   -- UTF-8 is the filesystem safe version of Unicode that | 
| Paolo Ornati | 670e9f3 | 2006-10-03 22:57:56 +0200 | [diff] [blame] | 38 | is used by the console.  It can be enabled for the | 
| Linus Torvalds | 1da177e | 2005-04-16 15:20:36 -0700 | [diff] [blame] | 39 | filesystem with this option. If 'uni_xlate' gets set, | 
| Alexey Dobriyan | 4de151d | 2006-03-22 00:13:35 +0100 | [diff] [blame] | 40 | UTF-8 gets disabled. | 
| Linus Torvalds | 1da177e | 2005-04-16 15:20:36 -0700 | [diff] [blame] | 41 |  | 
|  | 42 | uni_xlate=<bool> -- Translate unhandled Unicode characters to special | 
|  | 43 | escaped sequences.  This would let you backup and | 
|  | 44 | restore filenames that are created with any Unicode | 
|  | 45 | characters.  Until Linux supports Unicode for real, | 
|  | 46 | this gives you an alternative.  Without this option, | 
|  | 47 | a '?' is used when no translation is possible.  The | 
|  | 48 | escape character is ':' because it is otherwise | 
|  | 49 | illegal on the vfat filesystem.  The escape sequence | 
|  | 50 | that gets used is ':' and the four digits of hexadecimal | 
|  | 51 | unicode. | 
|  | 52 |  | 
|  | 53 | nonumtail=<bool> -- When creating 8.3 aliases, normally the alias will | 
|  | 54 | end in '~1' or tilde followed by some number.  If this | 
|  | 55 | option is set, then if the filename is | 
|  | 56 | "longfilename.txt" and "longfile.txt" does not | 
|  | 57 | currently exist in the directory, 'longfile.txt' will | 
|  | 58 | be the short alias instead of 'longfi~1.txt'. | 
|  | 59 |  | 
| OGAWA Hirofumi | 28ec039 | 2007-05-08 00:31:01 -0700 | [diff] [blame] | 60 | usefree       -- Use the "free clusters" value stored on FSINFO. It'll | 
|  | 61 | be used to determine number of free clusters without | 
|  | 62 | scanning disk. But it's not used by default, because | 
|  | 63 | recent Windows don't update it correctly in some | 
|  | 64 | case. If you are sure the "free clusters" on FSINFO is | 
|  | 65 | correct, by this option you can avoid scanning disk. | 
|  | 66 |  | 
| Linus Torvalds | 1da177e | 2005-04-16 15:20:36 -0700 | [diff] [blame] | 67 | quiet         -- Stops printing certain warning messages. | 
|  | 68 |  | 
|  | 69 | check=s|r|n   -- Case sensitivity checking setting. | 
|  | 70 | s: strict, case sensitive | 
|  | 71 | r: relaxed, case insensitive | 
|  | 72 | n: normal, default setting, currently case insensitive | 
|  | 73 |  | 
|  | 74 | shortname=lower|win95|winnt|mixed | 
|  | 75 | -- Shortname display/create setting. | 
|  | 76 | lower: convert to lowercase for display, | 
|  | 77 | emulate the Windows 95 rule for create. | 
|  | 78 | win95: emulate the Windows 95 rule for display/create. | 
|  | 79 | winnt: emulate the Windows NT rule for display/create. | 
|  | 80 | mixed: emulate the Windows NT rule for display, | 
|  | 81 | emulate the Windows 95 rule for create. | 
|  | 82 | Default setting is `lower'. | 
|  | 83 |  | 
|  | 84 | <bool>: 0,1,yes,no,true,false | 
|  | 85 |  | 
|  | 86 | TODO | 
|  | 87 | ---------------------------------------------------------------------- | 
|  | 88 | * Need to get rid of the raw scanning stuff.  Instead, always use | 
|  | 89 | a get next directory entry approach.  The only thing left that uses | 
|  | 90 | raw scanning is the directory renaming code. | 
|  | 91 |  | 
|  | 92 |  | 
|  | 93 | POSSIBLE PROBLEMS | 
|  | 94 | ---------------------------------------------------------------------- | 
|  | 95 | * vfat_valid_longname does not properly checked reserved names. | 
|  | 96 | * When a volume name is the same as a directory name in the root | 
|  | 97 | directory of the filesystem, the directory name sometimes shows | 
|  | 98 | up as an empty file. | 
|  | 99 | * autoconv option does not work correctly. | 
|  | 100 |  | 
|  | 101 | BUG REPORTS | 
|  | 102 | ---------------------------------------------------------------------- | 
|  | 103 | If you have trouble with the VFAT filesystem, mail bug reports to | 
|  | 104 | chaffee@bmrc.cs.berkeley.edu.  Please specify the filename | 
|  | 105 | and the operation that gave you trouble. | 
|  | 106 |  | 
|  | 107 | TEST SUITE | 
|  | 108 | ---------------------------------------------------------------------- | 
|  | 109 | If you plan to make any modifications to the vfat filesystem, please | 
|  | 110 | get the test suite that comes with the vfat distribution at | 
|  | 111 |  | 
|  | 112 | http://bmrc.berkeley.edu/people/chaffee/vfat.html | 
|  | 113 |  | 
|  | 114 | This tests quite a few parts of the vfat filesystem and additional | 
|  | 115 | tests for new features or untested features would be appreciated. | 
|  | 116 |  | 
|  | 117 | NOTES ON THE STRUCTURE OF THE VFAT FILESYSTEM | 
|  | 118 | ---------------------------------------------------------------------- | 
|  | 119 | (This documentation was provided by Galen C. Hunt <gchunt@cs.rochester.edu> | 
|  | 120 | and lightly annotated by Gordon Chaffee). | 
|  | 121 |  | 
|  | 122 | This document presents a very rough, technical overview of my | 
|  | 123 | knowledge of the extended FAT file system used in Windows NT 3.5 and | 
|  | 124 | Windows 95.  I don't guarantee that any of the following is correct, | 
|  | 125 | but it appears to be so. | 
|  | 126 |  | 
|  | 127 | The extended FAT file system is almost identical to the FAT | 
|  | 128 | file system used in DOS versions up to and including 6.223410239847 | 
|  | 129 | :-).  The significant change has been the addition of long file names. | 
|  | 130 | These names support up to 255 characters including spaces and lower | 
|  | 131 | case characters as opposed to the traditional 8.3 short names. | 
|  | 132 |  | 
|  | 133 | Here is the description of the traditional FAT entry in the current | 
|  | 134 | Windows 95 filesystem: | 
|  | 135 |  | 
|  | 136 | struct directory { // Short 8.3 names | 
|  | 137 | unsigned char name[8];          // file name | 
|  | 138 | unsigned char ext[3];           // file extension | 
|  | 139 | unsigned char attr;             // attribute byte | 
|  | 140 | unsigned char lcase;		// Case for base and extension | 
|  | 141 | unsigned char ctime_ms;		// Creation time, milliseconds | 
|  | 142 | unsigned char ctime[2];		// Creation time | 
|  | 143 | unsigned char cdate[2];		// Creation date | 
|  | 144 | unsigned char adate[2];		// Last access date | 
|  | 145 | unsigned char reserved[2];	// reserved values (ignored) | 
|  | 146 | unsigned char time[2];          // time stamp | 
|  | 147 | unsigned char date[2];          // date stamp | 
|  | 148 | unsigned char start[2];         // starting cluster number | 
|  | 149 | unsigned char size[4];          // size of the file | 
|  | 150 | }; | 
|  | 151 |  | 
|  | 152 | The lcase field specifies if the base and/or the extension of an 8.3 | 
|  | 153 | name should be capitalized.  This field does not seem to be used by | 
|  | 154 | Windows 95 but it is used by Windows NT.  The case of filenames is not | 
|  | 155 | completely compatible from Windows NT to Windows 95.  It is not completely | 
|  | 156 | compatible in the reverse direction, however.  Filenames that fit in | 
|  | 157 | the 8.3 namespace and are written on Windows NT to be lowercase will | 
|  | 158 | show up as uppercase on Windows 95. | 
|  | 159 |  | 
|  | 160 | Note that the "start" and "size" values are actually little | 
|  | 161 | endian integer values.  The descriptions of the fields in this | 
|  | 162 | structure are public knowledge and can be found elsewhere. | 
|  | 163 |  | 
|  | 164 | With the extended FAT system, Microsoft has inserted extra | 
|  | 165 | directory entries for any files with extended names.  (Any name which | 
|  | 166 | legally fits within the old 8.3 encoding scheme does not have extra | 
|  | 167 | entries.)  I call these extra entries slots.  Basically, a slot is a | 
|  | 168 | specially formatted directory entry which holds up to 13 characters of | 
|  | 169 | a file's extended name.  Think of slots as additional labeling for the | 
|  | 170 | directory entry of the file to which they correspond.  Microsoft | 
|  | 171 | prefers to refer to the 8.3 entry for a file as its alias and the | 
|  | 172 | extended slot directory entries as the file name. | 
|  | 173 |  | 
|  | 174 | The C structure for a slot directory entry follows: | 
|  | 175 |  | 
|  | 176 | struct slot { // Up to 13 characters of a long name | 
|  | 177 | unsigned char id;               // sequence number for slot | 
|  | 178 | unsigned char name0_4[10];      // first 5 characters in name | 
|  | 179 | unsigned char attr;             // attribute byte | 
|  | 180 | unsigned char reserved;         // always 0 | 
|  | 181 | unsigned char alias_checksum;   // checksum for 8.3 alias | 
|  | 182 | unsigned char name5_10[12];     // 6 more characters in name | 
|  | 183 | unsigned char start[2];         // starting cluster number | 
|  | 184 | unsigned char name11_12[4];     // last 2 characters in name | 
|  | 185 | }; | 
|  | 186 |  | 
|  | 187 | If the layout of the slots looks a little odd, it's only | 
|  | 188 | because of Microsoft's efforts to maintain compatibility with old | 
|  | 189 | software.  The slots must be disguised to prevent old software from | 
|  | 190 | panicking.  To this end, a number of measures are taken: | 
|  | 191 |  | 
|  | 192 | 1) The attribute byte for a slot directory entry is always set | 
|  | 193 | to 0x0f.  This corresponds to an old directory entry with | 
|  | 194 | attributes of "hidden", "system", "read-only", and "volume | 
|  | 195 | label".  Most old software will ignore any directory | 
|  | 196 | entries with the "volume label" bit set.  Real volume label | 
|  | 197 | entries don't have the other three bits set. | 
|  | 198 |  | 
|  | 199 | 2) The starting cluster is always set to 0, an impossible | 
|  | 200 | value for a DOS file. | 
|  | 201 |  | 
|  | 202 | Because the extended FAT system is backward compatible, it is | 
|  | 203 | possible for old software to modify directory entries.  Measures must | 
|  | 204 | be taken to ensure the validity of slots.  An extended FAT system can | 
|  | 205 | verify that a slot does in fact belong to an 8.3 directory entry by | 
|  | 206 | the following: | 
|  | 207 |  | 
|  | 208 | 1) Positioning.  Slots for a file always immediately proceed | 
|  | 209 | their corresponding 8.3 directory entry.  In addition, each | 
|  | 210 | slot has an id which marks its order in the extended file | 
|  | 211 | name.  Here is a very abbreviated view of an 8.3 directory | 
|  | 212 | entry and its corresponding long name slots for the file | 
|  | 213 | "My Big File.Extension which is long": | 
|  | 214 |  | 
|  | 215 | <proceeding files...> | 
|  | 216 | <slot #3, id = 0x43, characters = "h is long"> | 
|  | 217 | <slot #2, id = 0x02, characters = "xtension whic"> | 
|  | 218 | <slot #1, id = 0x01, characters = "My Big File.E"> | 
|  | 219 | <directory entry, name = "MYBIGFIL.EXT"> | 
|  | 220 |  | 
|  | 221 | Note that the slots are stored from last to first.  Slots | 
|  | 222 | are numbered from 1 to N.  The Nth slot is or'ed with 0x40 | 
|  | 223 | to mark it as the last one. | 
|  | 224 |  | 
|  | 225 | 2) Checksum.  Each slot has an "alias_checksum" value.  The | 
|  | 226 | checksum is calculated from the 8.3 name using the | 
|  | 227 | following algorithm: | 
|  | 228 |  | 
|  | 229 | for (sum = i = 0; i < 11; i++) { | 
|  | 230 | sum = (((sum&1)<<7)|((sum&0xfe)>>1)) + name[i] | 
|  | 231 | } | 
|  | 232 |  | 
|  | 233 | 3) If there is free space in the final slot, a Unicode NULL (0x0000) | 
|  | 234 | is stored after the final character.  After that, all unused | 
|  | 235 | characters in the final slot are set to Unicode 0xFFFF. | 
|  | 236 |  | 
|  | 237 | Finally, note that the extended name is stored in Unicode.  Each Unicode | 
|  | 238 | character takes two bytes. |