| Sukadev Bhattiprolu | 784c4d8 | 2009-01-02 13:42:34 +0000 | [diff] [blame] | 1 |  | 
 | 2 | To support containers, we now allow multiple instances of devpts filesystem, | 
 | 3 | such that indices of ptys allocated in one instance are independent of indices | 
 | 4 | allocated in other instances of devpts. | 
 | 5 |  | 
 | 6 | To preserve backward compatibility, this support for multiple instances is | 
 | 7 | enabled only if: | 
 | 8 |  | 
 | 9 | 	- CONFIG_DEVPTS_MULTIPLE_INSTANCES=y, and | 
 | 10 | 	- '-o newinstance' mount option is specified while mounting devpts | 
 | 11 |  | 
 | 12 | IOW, devpts now supports both single-instance and multi-instance semantics. | 
 | 13 |  | 
 | 14 | If CONFIG_DEVPTS_MULTIPLE_INSTANCES=n, there is no change in behavior and | 
 | 15 | this referred to as the "legacy" mode. In this mode, the new mount options | 
 | 16 | (-o newinstance and -o ptmxmode) will be ignored with a 'bogus option' message | 
 | 17 | on console. | 
 | 18 |  | 
 | 19 | If CONFIG_DEVPTS_MULTIPLE_INSTANCES=y and devpts is mounted without the | 
 | 20 | 'newinstance' option (as in current start-up scripts) the new mount binds | 
 | 21 | to the initial kernel mount of devpts. This mode is referred to as the | 
 | 22 | 'single-instance' mode and the current, single-instance semantics are | 
 | 23 | preserved, i.e PTYs are common across the system. | 
 | 24 |  | 
 | 25 | The only difference between this single-instance mode and the legacy mode | 
 | 26 | is the presence of new, '/dev/pts/ptmx' node with permissions 0000, which | 
 | 27 | can safely be ignored. | 
 | 28 |  | 
 | 29 | If CONFIG_DEVPTS_MULTIPLE_INSTANCES=y and 'newinstance' option is specified, | 
 | 30 | the mount is considered to be in the multi-instance mode and a new instance | 
 | 31 | of the devpts fs is created. Any ptys created in this instance are independent | 
 | 32 | of ptys in other instances of devpts. Like in the single-instance mode, the | 
 | 33 | /dev/pts/ptmx node is present. To effectively use the multi-instance mode, | 
 | 34 | open of /dev/ptmx must be a redirected to '/dev/pts/ptmx' using a symlink or | 
 | 35 | bind-mount. | 
 | 36 |  | 
 | 37 | Eg: A container startup script could do the following: | 
 | 38 |  | 
 | 39 | 	$ chmod 0666 /dev/pts/ptmx | 
 | 40 | 	$ rm /dev/ptmx | 
 | 41 | 	$ ln -s pts/ptmx /dev/ptmx | 
 | 42 | 	$ ns_exec -cm /bin/bash | 
 | 43 |  | 
 | 44 | 	# We are now in new container | 
 | 45 |  | 
 | 46 | 	$ umount /dev/pts | 
 | 47 | 	$ mount -t devpts -o newinstance lxcpts /dev/pts | 
 | 48 | 	$ sshd -p 1234 | 
 | 49 |  | 
 | 50 | where 'ns_exec -cm /bin/bash' calls clone() with CLONE_NEWNS flag and execs | 
 | 51 | /bin/bash in the child process.  A pty created by the sshd is not visible in | 
 | 52 | the original mount of /dev/pts. | 
 | 53 |  | 
 | 54 | User-space changes | 
 | 55 | ------------------ | 
 | 56 |  | 
 | 57 | In multi-instance mode (i.e '-o newinstance' mount option is specified at least | 
 | 58 | once), following user-space issues should be noted. | 
 | 59 |  | 
 | 60 | 1. If -o newinstance mount option is never used, /dev/pts/ptmx can be ignored | 
 | 61 |    and no change is needed to system-startup scripts. | 
 | 62 |  | 
 | 63 | 2. To effectively use multi-instance mode (i.e -o newinstance is specified) | 
 | 64 |    administrators or startup scripts should "redirect" open of /dev/ptmx to | 
 | 65 |    /dev/pts/ptmx using either a bind mount or symlink. | 
 | 66 |  | 
 | 67 | 	$ mount -t devpts -o newinstance devpts /dev/pts | 
 | 68 |  | 
 | 69 |    followed by either | 
 | 70 |  | 
 | 71 | 	$ rm /dev/ptmx | 
 | 72 | 	$ ln -s pts/ptmx /dev/ptmx | 
 | 73 | 	$ chmod 666 /dev/pts/ptmx | 
 | 74 |    or | 
 | 75 | 	$ mount -o bind /dev/pts/ptmx /dev/ptmx | 
 | 76 |  | 
 | 77 | 3. The '/dev/ptmx -> pts/ptmx' symlink is the preferred method since it | 
 | 78 |    enables better error-reporting and treats both single-instance and | 
 | 79 |    multi-instance mounts similarly. | 
 | 80 |  | 
 | 81 |    But this method requires that system-startup scripts set the mode of | 
 | 82 |    /dev/pts/ptmx correctly (default mode is 0000). The scripts can set the | 
 | 83 |    mode by, either | 
 | 84 |  | 
 | 85 |    	- adding ptmxmode mount option to devpts entry in /etc/fstab, or | 
 | 86 | 	- using 'chmod 0666 /dev/pts/ptmx' | 
 | 87 |  | 
 | 88 | 4. If multi-instance mode mount is needed for containers, but the system | 
 | 89 |    startup scripts have not yet been updated, container-startup scripts | 
 | 90 |    should bind mount /dev/ptmx to /dev/pts/ptmx to avoid breaking single- | 
 | 91 |    instance mounts. | 
 | 92 |  | 
 | 93 |    Or, in general, container-startup scripts should use: | 
 | 94 |  | 
 | 95 | 	mount -t devpts -o newinstance -o ptmxmode=0666 devpts /dev/pts | 
 | 96 | 	if [ ! -L /dev/ptmx ]; then | 
 | 97 | 		mount -o bind /dev/pts/ptmx /dev/ptmx | 
 | 98 | 	fi | 
 | 99 |  | 
 | 100 |    When all devpts mounts are multi-instance, /dev/ptmx can permanently be | 
 | 101 |    a symlink to pts/ptmx and the bind mount can be ignored. | 
 | 102 |  | 
 | 103 | 5. A multi-instance mount that is not accompanied by the /dev/ptmx to | 
 | 104 |    /dev/pts/ptmx redirection would result in an unusable/unreachable pty. | 
 | 105 |  | 
 | 106 | 	mount -t devpts -o newinstance lxcpts /dev/pts | 
 | 107 |  | 
 | 108 |    immediately followed by: | 
 | 109 |  | 
 | 110 | 	open("/dev/ptmx") | 
 | 111 |  | 
 | 112 |     would create a pty, say /dev/pts/7, in the initial kernel mount. | 
 | 113 |     But /dev/pts/7 would be invisible in the new mount. | 
 | 114 |  | 
 | 115 | 6. The permissions for /dev/pts/ptmx node should be specified when mounting | 
 | 116 |    /dev/pts, using the '-o ptmxmode=%o' mount option (default is 0000). | 
 | 117 |  | 
 | 118 | 	mount -t devpts -o newinstance -o ptmxmode=0644 devpts /dev/pts | 
 | 119 |  | 
 | 120 |    The permissions can be later be changed as usual with 'chmod'. | 
 | 121 |  | 
 | 122 | 	chmod 666 /dev/pts/ptmx | 
 | 123 |  | 
 | 124 | 7. A mount of devpts without the 'newinstance' option results in binding to | 
 | 125 |    initial kernel mount.  This behavior while preserving legacy semantics, | 
 | 126 |    does not provide strict isolation in a container environment. i.e by | 
 | 127 |    mounting devpts without the 'newinstance' option, a container could | 
 | 128 |    get visibility into the 'host' or root container's devpts. | 
 | 129 |     | 
 | 130 |    To workaround this and have strict isolation, all mounts of devpts, | 
 | 131 |    including the mount in the root container, should use the newinstance | 
 | 132 |    option. |