Blame - Documentation/DocBook/media/v4l/selection-api.xml - android_kernel_oneplus_sm8150

Tomasz Stanislawski	8af4922	2011-08-19 07:00:04 -0300	[diff] [blame]	1	<section id="selection-api">
				2
				3	<title>Experimental API for cropping, composing and scaling</title>
				4
				5	<note>
				6	<title>Experimental</title>
				7
				8	<para>This is an <link linkend="experimental">experimental</link>
				9	interface and may change in the future.</para>
				10	</note>
				11
				12	<section>
				13	<title>Introduction</title>
				14
				15	<para>Some video capture devices can sample a subsection of a picture and
				16	shrink or enlarge it to an image of arbitrary size. Next, the devices can
				17	insert the image into a larger one. Some video output devices can crop part of an
				18	input image, scale it up or down and insert it at an arbitrary scan line and
				19	horizontal offset into a video signal. We call these abilities cropping,
				20	scaling and composing.</para>
				21
				22	<para>On a video <emphasis>capture</emphasis> device the source is a video
				23	signal, and the cropping target determines the area actually sampled. The sink
				24	is an image stored in a memory buffer. The composing area specifies which part
				25	of the buffer is actually written to by the hardware. </para>
				26
				27	<para>On a video <emphasis>output</emphasis> device the source is an image in a
				28	memory buffer, and the cropping target is a part of an image to be shown on a
				29	display. The sink is the display or the graphics screen. The application may
				30	select the part of display where the image should be displayed. The size and
				31	position of such a window is controlled by the compose target.</para>
				32
				33	<para>Rectangles for all cropping and composing targets are defined even if the
				34	device supports neither cropping nor composing. Their size and position
				35	will be fixed in such a case. If the device does not support scaling then the
				36	cropping and composing rectangles have the same size.</para>
				37
				38	</section>
				39
				40	<section>
				41	<title>Selection targets</title>
				42
				43	<figure id="sel-targets-capture">
				44	<title>Cropping and composing targets</title>
				45	<mediaobject>
				46	<imageobject>
				47	<imagedata fileref="selection.png" format="PNG" />
				48	</imageobject>
				49	<textobject>
				50	<phrase>Targets used by a cropping, composing and scaling
				51	process</phrase>
				52	</textobject>
				53	</mediaobject>
				54	</figure>
Sylwester Nawrocki	9080d5d	2012-03-09 06:46:28 -0300	[diff] [blame]	55
				56	<para>For a complete list of the available selection targets see table <xref
				57	linkend="v4l2-sel-target"/>.</para>
				58
Tomasz Stanislawski	8af4922	2011-08-19 07:00:04 -0300	[diff] [blame]	59	</section>
				60
				61	<section>
				62
				63	<title>Configuration</title>
				64
				65	<para>Applications can use the <link linkend="vidioc-g-selection">selection
				66	API</link> to select an area in a video signal or a buffer, and to query for
				67	default settings and hardware limits.</para>
				68
				69	<para>Video hardware can have various cropping, composing and scaling
				70	limitations. It may only scale up or down, support only discrete scaling
				71	factors, or have different scaling abilities in the horizontal and vertical
				72	directions. Also it may not support scaling at all. At the same time the
				73	cropping/composing rectangles may have to be aligned, and both the source and
				74	the sink may have arbitrary upper and lower size limits. Therefore, as usual,
				75	drivers are expected to adjust the requested parameters and return the actual
				76	values selected. An application can control the rounding behaviour using <link
				77	linkend="v4l2-sel-flags"> constraint flags </link>.</para>
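<para>As a minimal sketch of this adjust-and-report behaviour, the helper below
requests a crop rectangle on a capture device with the
<constant>V4L2_SEL_FLAG_GE</constant> constraint flag and copies back whatever
rectangle the driver actually selected. The name
<function>set_crop_at_least</function> is hypothetical, and a single-planar
video capture device is assumed.</para>

<programlisting><![CDATA[
#include <string.h>
#include <sys/ioctl.h>
#include <linux/videodev2.h>

/*
 * Hypothetical helper: ask for a crop rectangle that is not smaller than *r.
 * On success *r is overwritten with the rectangle chosen by the driver.
 * Returns 0 on success and -1 on failure (errno is set by ioctl()).
 */
static int set_crop_at_least(int fd, struct v4l2_rect *r)
{
	struct v4l2_selection sel;

	memset(&sel, 0, sizeof(sel));
	sel.type = V4L2_BUF_TYPE_VIDEO_CAPTURE;
	sel.target = V4L2_SEL_TGT_CROP;
	sel.flags = V4L2_SEL_FLAG_GE;	/* round up, never down */
	sel.r = *r;

	if (ioctl(fd, VIDIOC_S_SELECTION, &sel) == -1)
		return -1;

	*r = sel.r;	/* report the adjusted rectangle to the caller */
	return 0;
}
]]></programlisting>

<para>An application would pass the file descriptor of an open video device and
then compare the returned rectangle with the requested one to see how the
driver rounded it.</para>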
				78
				79	<section>
				80
				81	<title>Configuration of video capture</title>
				82
				83	<para>See figure <xref linkend="sel-targets-capture" /> for examples of the
				84	selection targets available for a video capture device. It is recommended to
				85	configure the cropping targets before the composing targets.</para>
				86
				87	<para>The range of coordinates of the top left corner, width and height of
				88	areas that can be sampled is given by the <constant> V4L2_SEL_TGT_CROP_BOUNDS
				89	</constant> target. It is recommended for the driver developers to put the
				90	top/left corner at position <constant> (0,0) </constant>. The rectangle's
Tomasz Stanislawski	aa73ab9	2011-12-09 13:45:47 -0300	[diff] [blame]	91	coordinates are expressed in pixels.</para>
Tomasz Stanislawski	8af4922	2011-08-19 07:00:04 -0300	[diff] [blame]	92
				93	<para>The top left corner, width and height of the source rectangle, that is
Sylwester Nawrocki	c133482	2012-05-20 11:17:12 -0300	[diff] [blame^]	94	the area actually sampled, is given by the <constant> V4L2_SEL_TGT_CROP
Tomasz Stanislawski	8af4922	2011-08-19 07:00:04 -0300	[diff] [blame]	95	</constant> target. It uses the same coordinate system as <constant>
				96	V4L2_SEL_TGT_CROP_BOUNDS </constant>. The active cropping area must lie
				97	completely inside the capture boundaries. The driver may further adjust the
				98	requested size and/or position according to hardware limitations.</para>
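<para>The containment rule can also be checked on the application side before a
rectangle is submitted. The sketch below assumes that the outer rectangle was
obtained using the <constant>V4L2_SEL_TGT_CROP_BOUNDS</constant> target;
<function>rect_inside</function> is a hypothetical helper and not part of the
V4L2 API.</para>

<programlisting><![CDATA[
#include <stdbool.h>
#include <stdint.h>
#include <linux/videodev2.h>

/* Hypothetical helper: true when 'inner' lies completely inside 'outer'. */
static bool rect_inside(const struct v4l2_rect *inner,
			const struct v4l2_rect *outer)
{
	/* Widen to 64 bits so that left + width cannot overflow. */
	int64_t in_right = (int64_t)inner->left + (int64_t)inner->width;
	int64_t in_bottom = (int64_t)inner->top + (int64_t)inner->height;
	int64_t out_right = (int64_t)outer->left + (int64_t)outer->width;
	int64_t out_bottom = (int64_t)outer->top + (int64_t)outer->height;

	return inner->left >= outer->left && inner->top >= outer->top &&
	       in_right <= out_right && in_bottom <= out_bottom;
}
]]></programlisting>

<para>A driver may still adjust a rectangle that passes this check, so the
values returned by the ioctl remain authoritative.</para>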
				99
				100	<para>Each capture device has a default source rectangle, given by the
				101	<constant> V4L2_SEL_TGT_CROP_DEFAULT </constant> target. This rectangle shall
				102	cover what the driver writer considers the complete picture. Drivers shall set
				103	the active crop rectangle to the default when the driver is first loaded, but
				104	not later.</para>
				105
				106	<para>The composing targets refer to a memory buffer. The limits of composing
				107	coordinates are obtained using <constant> V4L2_SEL_TGT_COMPOSE_BOUNDS
Tomasz Stanislawski	aa73ab9	2011-12-09 13:45:47 -0300	[diff] [blame]	108	</constant>. All coordinates are expressed in pixels. The rectangle's top/left
				109	corner must be located at position <constant> (0,0) </constant>. The width and
				110	height are equal to the image size set by <constant> VIDIOC_S_FMT </constant>.
				111	</para>
Tomasz Stanislawski	8af4922	2011-08-19 07:00:04 -0300	[diff] [blame]	112
				113	<para>The part of a buffer into which the image is inserted by the hardware is
Sylwester Nawrocki	c133482	2012-05-20 11:17:12 -0300	[diff] [blame^]	114	controlled by the <constant> V4L2_SEL_TGT_COMPOSE </constant> target.
Tomasz Stanislawski	8af4922	2011-08-19 07:00:04 -0300	[diff] [blame]	115	The rectangle's coordinates are also expressed in the same coordinate system as
				116	the bounds rectangle. The composing rectangle must lie completely inside the bounds
				117	rectangle. The driver must adjust the composing rectangle to fit to the
				118	bounding limits. Moreover, the driver can perform other adjustments according
				119	to hardware limitations. The application can control rounding behaviour using
				120	<link linkend="v4l2-sel-flags"> constraint flags </link>.</para>
				121
				122	<para>For capture devices the default composing rectangle is queried using
				123	<constant> V4L2_SEL_TGT_COMPOSE_DEFAULT </constant>. It is usually equal to the
				124	bounding rectangle.</para>
				125
				126	<para>The part of a buffer that is modified by the hardware is given by
				127	<constant> V4L2_SEL_TGT_COMPOSE_PADDED </constant>. It contains all pixels
Sylwester Nawrocki	c133482	2012-05-20 11:17:12 -0300	[diff] [blame^]	128	defined using <constant> V4L2_SEL_TGT_COMPOSE </constant> plus all
Tomasz Stanislawski	8af4922	2011-08-19 07:00:04 -0300	[diff] [blame]	129	padding data modified by hardware during the insertion process. All pixels outside
				130	this rectangle <emphasis>must not</emphasis> be changed by the hardware. The
				131	content of pixels that lie inside the padded area but outside the active area is
				132	undefined. The application can use the padded and active rectangles to detect
				133	where the rubbish pixels are located and remove them if needed.</para>
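<para>One way an application might locate such pixels is sketched below. It
assumes the two rectangles were already queried using the
<constant>V4L2_SEL_TGT_COMPOSE</constant> and
<constant>V4L2_SEL_TGT_COMPOSE_PADDED</constant> targets; the helper names are
hypothetical.</para>

<programlisting><![CDATA[
#include <stdbool.h>
#include <stdint.h>
#include <linux/videodev2.h>

/* Hypothetical helper: true when pixel (x, y) lies inside rectangle r. */
static bool point_in_rect(int64_t x, int64_t y, const struct v4l2_rect *r)
{
	return x >= r->left && x < (int64_t)r->left + (int64_t)r->width &&
	       y >= r->top && y < (int64_t)r->top + (int64_t)r->height;
}

/*
 * Hypothetical helper: true when pixel (x, y) was touched by the hardware
 * but carries no image data, i.e. it lies inside the padded rectangle and
 * outside the active compose rectangle.
 */
static bool is_padding_pixel(int64_t x, int64_t y,
			     const struct v4l2_rect *active,
			     const struct v4l2_rect *padded)
{
	return point_in_rect(x, y, padded) && !point_in_rect(x, y, active);
}
]]></programlisting>

<para>Pixels for which <function>is_padding_pixel</function> returns true have
undefined content and can be overwritten or skipped by the application.</para>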
				134
				135	</section>
				136
				137	<section>
				138
				139	<title>Configuration of video output</title>
				140
				141	<para>For output devices targets and ioctls are used similarly to the video
				142	capture case. The <emphasis> composing </emphasis> rectangle refers to the
				143	insertion of an image into a video signal. The cropping rectangles refer to a
				144	memory buffer. It is recommended to configure the composing targets before
				145	the cropping targets.</para>
				146
				147	<para>The cropping targets refer to the memory buffer that contains an image to
				148	be inserted into a video signal or graphical screen. The limits of cropping
				149	coordinates are obtained using <constant> V4L2_SEL_TGT_CROP_BOUNDS </constant>.
Tomasz Stanislawski	aa73ab9	2011-12-09 13:45:47 -0300	[diff] [blame]	150	All coordinates are expressed in pixels. The top/left corner is always point
				151	<constant> (0,0) </constant>. The width and height are equal to the image size
				152	specified using the <constant> VIDIOC_S_FMT </constant> ioctl.</para>
Tomasz Stanislawski	8af4922	2011-08-19 07:00:04 -0300	[diff] [blame]	153
				154	<para>The top left corner, width and height of the source rectangle, that is
				155	the area from which image data are processed by the hardware, is given by the
Sylwester Nawrocki	c133482	2012-05-20 11:17:12 -0300	[diff] [blame^]	156	<constant> V4L2_SEL_TGT_CROP </constant> target. Its coordinates are expressed
Tomasz Stanislawski	8af4922	2011-08-19 07:00:04 -0300	[diff] [blame]	157	in the same coordinate system as the bounds rectangle. The active cropping
				158	area must lie completely inside the crop boundaries and the driver may further
				159	adjust the requested size and/or position according to hardware
				160	limitations.</para>
				161
				162	<para>For output devices the default cropping rectangle is queried using
				163	<constant> V4L2_SEL_TGT_CROP_DEFAULT </constant>. It is usually equal to the
				164	bounding rectangle.</para>
				165
				166	<para>The part of a video signal or graphics display where the image is
Tomasz Stanislawski	aa73ab9	2011-12-09 13:45:47 -0300	[diff] [blame]	167	inserted by the hardware is controlled by the <constant>
Sylwester Nawrocki	c133482	2012-05-20 11:17:12 -0300	[diff] [blame^]	168	V4L2_SEL_TGT_COMPOSE </constant> target. The rectangle's coordinates
Tomasz Stanislawski	aa73ab9	2011-12-09 13:45:47 -0300	[diff] [blame]	169	are expressed in pixels. The composing rectangle must lie completely inside the
				170	bounds rectangle. The driver must adjust the area to fit to the bounding
Tomasz Stanislawski	8af4922	2011-08-19 07:00:04 -0300	[diff] [blame]	171	limits. Moreover, the driver can perform other adjustments according to
				172	hardware limitations. </para>
				173
				174	<para>The device has a default composing rectangle, given by the <constant>
				175	V4L2_SEL_TGT_COMPOSE_DEFAULT </constant> target. This rectangle shall cover what
				176	the driver writer considers the complete picture. It is recommended for the
				177	driver developers to put the top/left corner at position <constant> (0,0)
				178	</constant>. Drivers shall set the active composing rectangle to the default
				179	one when the driver is first loaded.</para>
				180
				181	<para>The devices may introduce additional content to the video signal other than
				182	an image from memory buffers. It includes borders around an image. However,
				183	such a padded area is a driver-dependent feature not covered by this document.
				184	Driver developers are encouraged to keep the padded rectangle equal to the active one.
				185	The padded target is accessed by the <constant> V4L2_SEL_TGT_COMPOSE_PADDED
				186	</constant> identifier. It must contain all pixels from the <constant>
Sylwester Nawrocki	c133482	2012-05-20 11:17:12 -0300	[diff] [blame^]	187	V4L2_SEL_TGT_COMPOSE </constant> target.</para>
Tomasz Stanislawski	8af4922	2011-08-19 07:00:04 -0300	[diff] [blame]	188
				189	</section>
				190
				191	<section>
				192
Sylwester Nawrocki	9080d5d	2012-03-09 06:46:28 -0300	[diff] [blame]	193	<title>Scaling control</title>
Tomasz Stanislawski	8af4922	2011-08-19 07:00:04 -0300	[diff] [blame]	194
				195	<para>An application can detect if scaling is performed by comparing the width
Sylwester Nawrocki	c133482	2012-05-20 11:17:12 -0300	[diff] [blame^]	196	and the height of rectangles obtained using <constant> V4L2_SEL_TGT_CROP
				197	</constant> and <constant> V4L2_SEL_TGT_COMPOSE </constant> targets. If
Tomasz Stanislawski	8af4922	2011-08-19 07:00:04 -0300	[diff] [blame]	198	these are not equal then scaling is applied. The application can compute
				199	the scaling ratios using these values.</para>
				200
				201	</section>
				202
				203	</section>
				204
				205	<section>
				206
Sylwester Nawrocki	9080d5d	2012-03-09 06:46:28 -0300	[diff] [blame]	207	<title>Comparison with old cropping API</title>
Tomasz Stanislawski	8af4922	2011-08-19 07:00:04 -0300	[diff] [blame]	208
				209	<para>The selection API was introduced to cope with deficiencies of the previous
				210	<link linkend="crop"> API </link>, which was designed to control simple capture
				211	devices. Later the cropping API was adopted by video output drivers. There the ioctls
				212	are used to select a part of the display where the video signal is inserted. This
				213	should be considered an abuse of the API because the described operation is
				214	actually composing. The selection API makes a clear distinction between
				215	composing and cropping operations by setting the appropriate targets. The V4L2
				216	API lacks any support for composing to and cropping from an image inside a
				217	memory buffer. The application could configure a capture device to fill only a
				218	part of an image by abusing the V4L2 API. Cropping a smaller image from a larger
				219	one is achieved by setting the field <structfield>
				220	&v4l2-pix-format;::bytesperline </structfield>. Introducing image offsets
				221	could be done by modifying the field <structfield> &v4l2-buffer;::m.userptr
				222	</structfield> before calling <constant> VIDIOC_QBUF </constant>. Those
				223	operations should be avoided because they are not portable (endianness), and do
				224	not work for macroblock and Bayer formats and mmap buffers. The selection API
				225	deals with configuration of buffer cropping/composing in a clear, intuitive and
				226	portable way. Next, with the selection API the concepts of the padded target
				227	and constraint flags are introduced. Finally, <structname> &v4l2-crop;
				228	</structname> and <structname> &v4l2-cropcap; </structname> have no reserved
				229	fields. Therefore there is no way to extend their functionality. The new
				230	<structname> &v4l2-selection; </structname> provides plenty of room for future
				231	extensions. Driver developers are encouraged to implement only the selection API.
				232	The former cropping API would then be simulated using the new one.</para>
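<para>The correspondence between the two APIs can be sketched as follows. A
legacy crop request on a capture queue maps to the
<constant>V4L2_SEL_TGT_CROP</constant> target, while on an output queue it maps
to <constant>V4L2_SEL_TGT_COMPOSE</constant>, because there the old ioctls
selected a part of the display. The helper name
<function>legacy_crop_target</function> is hypothetical and only the
single-planar capture and output buffer types are handled; this is an
illustration under those assumptions, not the kernel's implementation.</para>

<programlisting><![CDATA[
#include <linux/videodev2.h>

/*
 * Hypothetical helper: pick the selection target that corresponds to a
 * legacy crop request on the given buffer type.
 * Returns 0 and stores the target on success, -1 for unhandled types.
 */
static int legacy_crop_target(unsigned int buf_type, unsigned int *target)
{
	switch (buf_type) {
	case V4L2_BUF_TYPE_VIDEO_CAPTURE:
		/* Old crop on capture selects the sampled area. */
		*target = V4L2_SEL_TGT_CROP;
		return 0;
	case V4L2_BUF_TYPE_VIDEO_OUTPUT:
		/* Old crop on output selected a display area: composing. */
		*target = V4L2_SEL_TGT_COMPOSE;
		return 0;
	default:
		return -1;
	}
}
]]></programlisting>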
				233
				234	</section>
				235
				236	<section>
				237	<title>Examples</title>
				238	<example>
				239	<title>Resetting the cropping parameters</title>
				240
				241	<para>(A video capture device is assumed; change <constant>
				242	V4L2_BUF_TYPE_VIDEO_CAPTURE </constant> for other devices; change the target to the
				243	<constant> V4L2_SEL_TGT_COMPOSE_* </constant> family to configure the composing
				244	area)</para>
				245
				246	<programlisting>
				247
				248	&v4l2-selection; sel = {
				249	.type = V4L2_BUF_TYPE_VIDEO_CAPTURE,
				250	.target = V4L2_SEL_TGT_CROP_DEFAULT,
				251	};
				252	ret = ioctl(fd, &VIDIOC-G-SELECTION;, &amp;sel);
				253	if (ret)
				254		exit(-1);
Sylwester Nawrocki	c133482	2012-05-20 11:17:12 -0300	[diff] [blame^]	255	sel.target = V4L2_SEL_TGT_CROP;
Tomasz Stanislawski	8af4922	2011-08-19 07:00:04 -0300	[diff] [blame]	256	ret = ioctl(fd, &VIDIOC-S-SELECTION;, &amp;sel);
				257	if (ret)
				258		exit(-1);
				259
				260	</programlisting>
				261	</example>
				262
				263	<example>
				264	<title>Simple downscaling</title>
				265	<para>Setting a composing area on an output device to <emphasis> at most
				266	</emphasis> half the size of the bounds, centered on the display.</para>
				267	<programlisting>
				268
				269	&v4l2-selection; sel = {
				270	.type = V4L2_BUF_TYPE_VIDEO_OUTPUT,
				271	.target = V4L2_SEL_TGT_COMPOSE_BOUNDS,
				272	};
				273	struct v4l2_rect r;
				274
				275	ret = ioctl(fd, &VIDIOC-G-SELECTION;, &amp;sel);
				276	if (ret)
				277		exit(-1);
				278	/* set a compose rectangle half the size of the bounds, centered inside them */
				279	r.width = sel.r.width / 2;
				280	r.height = sel.r.height / 2;
				281	r.left = sel.r.left + sel.r.width / 4;
				282	r.top = sel.r.top + sel.r.height / 4;
				283	sel.r = r;
Sylwester Nawrocki	c133482	2012-05-20 11:17:12 -0300	[diff] [blame^]	284	sel.target = V4L2_SEL_TGT_COMPOSE;
Tomasz Stanislawski	8af4922	2011-08-19 07:00:04 -0300	[diff] [blame]	285	sel.flags = V4L2_SEL_FLAG_LE;
				286	ret = ioctl(fd, &VIDIOC-S-SELECTION;, &amp;sel);
				287	if (ret)
				288		exit(-1);
				289
				290	</programlisting>
				291	</example>
				292
				293	<example>
				294	<title>Querying for scaling factors</title>
				295	<para>A video output device is assumed; change <constant>
				296	V4L2_BUF_TYPE_VIDEO_OUTPUT </constant> for other devices.</para>
				297	<programlisting>
				298
				299	&v4l2-selection; compose = {
				300	.type = V4L2_BUF_TYPE_VIDEO_OUTPUT,
Sylwester Nawrocki	c133482	2012-05-20 11:17:12 -0300	[diff] [blame^]	301	.target = V4L2_SEL_TGT_COMPOSE,
Tomasz Stanislawski	8af4922	2011-08-19 07:00:04 -0300	[diff] [blame]	302	};
				303	&v4l2-selection; crop = {
				304	.type = V4L2_BUF_TYPE_VIDEO_OUTPUT,
Sylwester Nawrocki	c133482	2012-05-20 11:17:12 -0300	[diff] [blame^]	305	.target = V4L2_SEL_TGT_CROP,
Tomasz Stanislawski	8af4922	2011-08-19 07:00:04 -0300	[diff] [blame]	306	};
				307	double hscale, vscale;
				308
				309	ret = ioctl(fd, &VIDIOC-G-SELECTION;, &amp;compose);
				310	if (ret)
				311		exit(-1);
				312	ret = ioctl(fd, &VIDIOC-G-SELECTION;, &amp;crop);
				313	if (ret)
				314		exit(-1);
				315
				316	/* computing scaling factors */
				317	hscale = (double)compose.r.width / crop.r.width;
				318	vscale = (double)compose.r.height / crop.r.height;
				319
				320	</programlisting>
				321	</example>
				322
				323	</section>
				324
				325	</section>