/*****************************************/
Kernel Connector.
/*****************************************/

Kernel connector is a new netlink-based module for easy-to-use
communication between userspace and kernel space.

The connector driver makes it possible to connect various agents over a
netlink-based network.  An agent must register a callback and an
identifier.  When the driver receives a special netlink message with a
matching identifier, the corresponding callback is called.

From the userspace point of view it is quite straightforward:

	socket();
	bind();
	send();
	recv();

But if kernelspace wants to use the full power of such connections, the
driver writer must create special sockets and must know about struct
sk_buff handling.  Connector allows any kernelspace agent to use
netlink-based networking for inter-process communication in a
significantly easier way:

int cn_add_callback(struct cb_id *id, char *name, void (*callback) (void *));
int cn_netlink_send(struct cn_msg *msg, u32 __group, int gfp_mask);

struct cb_id
{
	__u32			idx;
	__u32			val;
};

idx and val are unique identifiers which must be registered in
connector.h for in-kernel usage.  void (*callback) (void *) is the
callback function which is called when a message with the above idx.val
is received by the connector core.  The argument of that function must
be cast to struct cn_msg *.

struct cn_msg
{
	struct cb_id 		id;

	__u32			seq;
	__u32			ack;

	__u32			len;		/* Length of the following data */
	__u8			data[0];
};

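As an illustration only, a message with attached data could be assembled
in userspace roughly as follows.  The structures are local copies of the
definitions above, and cn_msg_alloc() is a hypothetical helper, not part
of the connector API.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

/* Local userspace copies of the structures shown above. */
struct cb_id {
	uint32_t idx;
	uint32_t val;
};

struct cn_msg {
	struct cb_id id;
	uint32_t seq;
	uint32_t ack;
	uint32_t len;		/* Length of the following data */
	uint8_t data[];		/* C99 flexible array member */
};

/*
 * Hypothetical helper: allocate a header plus len bytes of payload and
 * copy the payload in.  Returns NULL if the allocation fails.
 */
static struct cn_msg *cn_msg_alloc(uint32_t idx, uint32_t val, uint32_t seq,
				   const void *payload, uint32_t len)
{
	struct cn_msg *msg;

	assert(payload != NULL || len == 0);

	msg = calloc(1, sizeof(*msg) + len);
	if (!msg)
		return NULL;

	msg->id.idx = idx;
	msg->id.val = val;
	msg->seq = seq;
	msg->ack = 0;
	msg->len = len;
	if (len)
		memcpy(msg->data, payload, len);
	return msg;
}
```

The caller owns the returned buffer and releases it with free().  The
header and the data are contiguous, so the whole message occupies
sizeof(struct cn_msg) + len bytes.
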
/*****************************************/
Connector interfaces.
/*****************************************/

int cn_add_callback(struct cb_id *id, char *name, void (*callback) (void *));

Registers a new callback with the connector core.

struct cb_id *id 		- unique connector user identifier.
			  	  It must be registered in connector.h for legal in-kernel users.
char *name 			- symbolic name of the connector callback.
void (*callback) (void *)	- connector callback.
				  Its argument must be cast to struct cn_msg *.

void cn_del_callback(struct cb_id *id);

Unregisters a callback from the connector core.

struct cb_id *id 		- unique connector user identifier.

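The registration semantics can be modelled in plain userspace C.  The
sketch below is not the kernel implementation: the model_* names, the
fixed-size table and the return values are assumptions made for the
example.  It only shows how an idx.val pair selects a callback and how
the callback casts its argument to struct cn_msg *.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

struct cb_id { uint32_t idx; uint32_t val; };
struct cn_msg { struct cb_id id; uint32_t seq, ack, len; };

#define MODEL_MAX_CALLBACKS 16	/* arbitrary limit of this model */

struct model_entry {
	struct cb_id id;
	const char *name;
	void (*callback)(void *);
	int used;
};

static struct model_entry model_table[MODEL_MAX_CALLBACKS];
static uint32_t model_last_seq;	/* written by the sample callback */

static struct model_entry *model_find(const struct cb_id *id)
{
	for (size_t i = 0; i < MODEL_MAX_CALLBACKS; i++)
		if (model_table[i].used &&
		    model_table[i].id.idx == id->idx &&
		    model_table[i].id.val == id->val)
			return &model_table[i];
	return NULL;
}

/* Returns 0 on success, -1 if the id is taken or the table is full. */
static int model_add_callback(const struct cb_id *id, const char *name,
			      void (*callback)(void *))
{
	assert(id != NULL && callback != NULL);
	if (model_find(id))
		return -1;
	for (size_t i = 0; i < MODEL_MAX_CALLBACKS; i++) {
		if (!model_table[i].used) {
			model_table[i].id = *id;
			model_table[i].name = name;
			model_table[i].callback = callback;
			model_table[i].used = 1;
			return 0;
		}
	}
	return -1;
}

static void model_del_callback(const struct cb_id *id)
{
	struct model_entry *e = model_find(id);

	if (e)
		e->used = 0;
}

/* Returns 1 if a callback was called, 0 if nobody registered msg->id. */
static int model_dispatch(struct cn_msg *msg)
{
	struct model_entry *e = model_find(&msg->id);

	if (!e)
		return 0;
	e->callback(msg);
	return 1;
}

/* Sample callback: the void * argument is cast to struct cn_msg *. */
static void model_sample_callback(void *data)
{
	struct cn_msg *msg = data;

	model_last_seq = msg->seq;
}
```

After model_add_callback(&id, "sample", model_sample_callback), a
message carrying the same id reaches the callback through
model_dispatch(); after model_del_callback(&id) it is no longer
delivered.
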
int cn_netlink_send(struct cn_msg *msg, u32 __group, int gfp_mask);

Sends a message to the specified group.  It can be safely called from
softirq context, but may silently fail under strong memory pressure.
If there are no listeners for the given group, -ESRCH can be returned.

struct cn_msg *			- message header (with attached data).
u32 __group			- destination group.
				  If __group is zero, the appropriate group is
				  searched for among all registered connector users,
				  and the message is delivered to the group which was
				  created for the user with the same ID as in msg.
				  If __group is not zero, the message is delivered
				  to the specified group.
int gfp_mask			- GFP mask.

Note: When registering a new callback user, the connector core assigns
the user a netlink group which is equal to its id.idx.

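The group selection rule described for __group can be written down as a
small userspace function.  This is a sketch of the rule as documented
here, not the kernel code: model_pick_group() is a hypothetical name,
and returning 0 when no user matches is a convention of the example.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

struct cb_id { uint32_t idx; uint32_t val; };

/*
 * users[] stands for the ids of all registered connector users.
 * Returns the destination group, or 0 if group is zero and no
 * registered user has the same id as the message.
 */
static uint32_t model_pick_group(const struct cb_id *users, size_t nr_users,
				 const struct cb_id *msg_id, uint32_t group)
{
	assert(msg_id != NULL);

	if (group)
		return group;	/* an explicit group is used as is */

	for (size_t i = 0; i < nr_users; i++)
		if (users[i].idx == msg_id->idx &&
		    users[i].val == msg_id->val)
			return users[i].idx;	/* group equals id.idx */

	return 0;
}
```
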
/*****************************************/
Protocol description.
/*****************************************/

The current framework offers a transport layer with a fixed header.  The
recommended protocol which uses such a header is as follows:

msg->seq and msg->ack are used to determine message genealogy.  When
someone sends a message, it puts a locally unique sequence number and a
random acknowledge number there.  The sequence number may be copied into
nlmsghdr->nlmsg_seq too.

The sequence number is incremented with each message sent.

If we expect a reply to our message, then the sequence number in the
received message MUST be the same as in the original message, and the
acknowledge number MUST be the same + 1.

If we receive a message and its sequence number is not equal to the one
we are expecting, then it is a new message.  If we receive a message and
its sequence number is the same as the one we are expecting, but its
acknowledge number is not equal to the acknowledge number in the
original message + 1, then it is a new message.

Obviously, the protocol header contains the above id.

Connector allows event notification in the following form: a kernel
driver or userspace process can ask connector to notify it when
selected ids are turned on or off (that is, when their callbacks are
registered or unregistered).  This is done by sending a special command
to the connector driver (it also registers itself with id={-1, -1}).

As an example of usage, Documentation/connector now contains cn_test.c,
a testing module which uses connector to request notification and to
send messages.

/*****************************************/
Reliability.
/*****************************************/

Netlink itself is not a reliable protocol.  That means that messages can
be lost due to memory pressure or overflow of the receiving process's
queue, so the caller is warned and must be prepared.  That is why struct
cn_msg [the main connector message header] contains u32 seq and u32 ack
fields.

/*****************************************/
Userspace usage.
/*****************************************/

2.6.14 has a new netlink socket implementation, which by default does not
allow sending data to netlink groups other than 1.
So, to use a netlink socket (for example, through connector) with a
different group number, the userspace application must subscribe to
that group first.  This can be achieved with the following pseudocode:

s = socket(PF_NETLINK, SOCK_DGRAM, NETLINK_CONNECTOR);
if (s == -1) {
	perror("socket");
	return -1;
}

/* Clear the whole address first, including the padding field. */
memset(&l_local, 0, sizeof(l_local));
l_local.nl_family = AF_NETLINK;
l_local.nl_groups = 12345;
l_local.nl_pid = 0;

if (bind(s, (struct sockaddr *)&l_local, sizeof(struct sockaddr_nl)) == -1) {
	perror("bind");
	close(s);
	return -1;
}

{
	int on = l_local.nl_groups;

	/* Subscribe to the group. */
	if (setsockopt(s, 270, 1, &on, sizeof(on)) == -1)
		perror("setsockopt");
}

Where 270 above is SOL_NETLINK, and 1 is the NETLINK_ADD_MEMBERSHIP socket
option.  To drop a multicast subscription, one should call the above socket
option with the NETLINK_DROP_MEMBERSHIP parameter, which is defined as 2.

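With the named constants from the system headers, the subscription could
be wrapped as shown below.  This is a sketch for Linux only;
cn_open_socket() and cn_set_membership() are hypothetical helper names,
and SOL_NETLINK is defined by hand in case the C library headers do not
provide it.

```c
#include <assert.h>
#include <string.h>
#include <unistd.h>
#include <sys/socket.h>
#include <linux/netlink.h>

#ifndef SOL_NETLINK
#define SOL_NETLINK 270		/* missing from older C library headers */
#endif

/* Returns a bound connector socket, or -1 on error (errno is set). */
static int cn_open_socket(void)
{
	struct sockaddr_nl l_local;
	int s;

	s = socket(PF_NETLINK, SOCK_DGRAM, NETLINK_CONNECTOR);
	if (s == -1)
		return -1;

	memset(&l_local, 0, sizeof(l_local));
	l_local.nl_family = AF_NETLINK;
	l_local.nl_groups = 0;	/* groups are joined with setsockopt() */
	l_local.nl_pid = 0;	/* let the kernel pick the port id */

	if (bind(s, (struct sockaddr *)&l_local, sizeof(l_local)) == -1) {
		close(s);
		return -1;
	}
	return s;
}

/* Join (join != 0) or leave (join == 0) a group.  Returns 0 or -1. */
static int cn_set_membership(int s, unsigned int group, int join)
{
	int opt = join ? NETLINK_ADD_MEMBERSHIP : NETLINK_DROP_MEMBERSHIP;

	assert(group != 0);	/* netlink groups are numbered from 1 */
	return setsockopt(s, SOL_NETLINK, opt, &group, sizeof(group));
}
```

Both helpers can fail, for example when connector is not available in
the running kernel or when the process lacks the required privileges, so
the caller should check the return values.
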
The 2.6.14 netlink code only allows selecting a group which is less than or
equal to the maximum group number, which is set at netlink_kernel_create()
time.  In the case of connector it is CN_NETLINK_USERS + 0xf, so if you want
to use group number 12345, you must increase CN_NETLINK_USERS to that number.
The additional 0xf numbers are allocated for use by non-in-kernel users.

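The limit can be expressed as a one-line check.  The function below is a
hypothetical illustration; the value of CN_NETLINK_USERS is passed in as
a parameter because it depends on the kernel being built.

```c
#include <assert.h>
#include <stdint.h>

/*
 * Returns 1 if 'group' can be selected when the connector socket was
 * created with cn_netlink_users + 0xf groups, 0 otherwise.
 */
static int model_group_allowed(uint32_t group, uint32_t cn_netlink_users)
{
	/* 64-bit arithmetic so that the sum cannot overflow. */
	uint64_t max_group = (uint64_t)cn_netlink_users + 0xf;

	return group != 0 && group <= max_group;
}
```

For example, with CN_NETLINK_USERS equal to 1 the highest usable group
is 16, so group 12345 is rejected until CN_NETLINK_USERS is raised.
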
Due to this limitation, group 0xffffffff does not work now, so one cannot
use the add/remove connector group notifications, but as far as I know,
only the cn_test.c test module used them.

Some work in the netlink area is still being done, so things may change in
the 2.6.15 timeframe.  If that happens, the documentation will be updated
for that kernel.