Copyright © 2009 LinuxTV Developers
Table of Contents
List of Figures
List of Tables
magic fieldid
fieldList of Examples
V4L2_PIX_FMT_BGR24 4 × 4 pixel
imageV4L2_PIX_FMT_SBGGR8 4 × 4
pixel imageV4L2_PIX_FMT_SGBRG8 4 × 4
pixel imageV4L2_PIX_FMT_SGRBG8 4 ×
4 pixel imageV4L2_PIX_FMT_SBGGR16 4 × 4
pixel imageV4L2_PIX_FMT_GREY 4 × 4
pixel imageV4L2_PIX_FMT_Y16 4 × 4
pixel imageV4L2_PIX_FMT_YUYV 4 × 4
pixel imageV4L2_PIX_FMT_UYVY 4 × 4
pixel imageV4L2_PIX_FMT_YVYU 4 × 4
pixel imageV4L2_PIX_FMT_VYUY 4 × 4
pixel imageV4L2_PIX_FMT_Y41P 8 × 4
pixel imageV4L2_PIX_FMT_YVU420 4 × 4
pixel imageV4L2_PIX_FMT_YVU410 4 × 4
pixel imageV4L2_PIX_FMT_YUV422P 4 × 4
pixel imageV4L2_PIX_FMT_YUV411P 4 × 4
pixel imageV4L2_PIX_FMT_NV12 4 × 4
pixel imageV4L2_PIX_FMT_NV16 4 × 4
pixel imageThis document covers the Linux Kernel to Userspace API's used by video and radio straming devices, including video cameras, analog and digital TV receiver cards, AM/FM receiver cards, streaming capture devices.
It is divided into three parts.
The first part covers radio, capture, cameras and analog TV devices.
The second part covers the API used for digital TV and Internet reception via one of the several digital tv standards. While it is called as DVB API, in fact it covers several different video standards including DVB-T, DVB-S, DVB-C and ATSC. The API is currently being updated to documment support also for DVB-S2, ISDB-T and ISDB-S.
The third part covers other API's used by all media infrastructure devices
For additional information and for the latest development code, see: http://linuxtv.org.
For discussing improvements, reporting troubles, sending new drivers, etc, please mail to: Linux Media Mailing List (LMML)..
Copyright © 1999, 2000, 2001, 2002, 2003, 2004, 2005, 2006, 2007, 2008, 2009 Bill Dirks, Michael H. Schimek, Hans Verkuil, Martin Rubli, Andy Walls, Mauro Carvalho Chehab
| Revision History | ||
|---|---|---|
| Revision 2.6.32 | 2009-08-31 | mcc |
| Now, revisions will match the kernel version where the V4L2 API changes will be used by the Linux Kernel. Also added Remote Controller chapter. | ||
| Revision 0.29 | 2009-08-26 | ev |
| Added documentation for string controls and for FM Transmitter controls. | ||
| Revision 0.28 | 2009-08-26 | gl |
| Added V4L2_CID_BAND_STOP_FILTER documentation. | ||
| Revision 0.27 | 2009-08-15 | mcc |
| Added libv4l and Remote Controller documentation; added v4l2grab and keytable application examples. | ||
| Revision 0.26 | 2009-07-23 | hv |
| Finalized the RDS capture API. Added modulator and RDS encoder capabilities. Added support for string controls. | ||
| Revision 0.25 | 2009-01-18 | hv |
| Added pixel formats VYUY, NV16 and NV61, and changed the debug ioctls VIDIOC_DBG_G/S_REGISTER and VIDIOC_DBG_G_CHIP_IDENT. Added camera controls V4L2_CID_ZOOM_ABSOLUTE, V4L2_CID_ZOOM_RELATIVE, V4L2_CID_ZOOM_CONTINUOUS and V4L2_CID_PRIVACY. | ||
| Revision 0.24 | 2008-03-04 | mhs |
| Added pixel formats Y16 and SBGGR16, new controls and a camera controls class. Removed VIDIOC_G/S_MPEGCOMP. | ||
| Revision 0.23 | 2007-08-30 | mhs |
| Fixed a typo in VIDIOC_DBG_G/S_REGISTER. Clarified the byte order of packed pixel formats. | ||
| Revision 0.22 | 2007-08-29 | mhs |
| Added the Video Output Overlay interface, new MPEG controls, V4L2_FIELD_INTERLACED_TB and V4L2_FIELD_INTERLACED_BT, VIDIOC_DBG_G/S_REGISTER, VIDIOC_(TRY_)ENCODER_CMD, VIDIOC_G_CHIP_IDENT, VIDIOC_G_ENC_INDEX, new pixel formats. Clarifications in the cropping chapter, about RGB pixel formats, the mmap(), poll(), select(), read() and write() functions. Typographical fixes. | ||
| Revision 0.21 | 2006-12-19 | mhs |
| Fixed a link in the VIDIOC_G_EXT_CTRLS section. | ||
| Revision 0.20 | 2006-11-24 | mhs |
| Clarified the purpose of the audioset field in struct v4l2_input and v4l2_output. | ||
| Revision 0.19 | 2006-10-19 | mhs |
| Documented V4L2_PIX_FMT_RGB444. | ||
| Revision 0.18 | 2006-10-18 | mhs |
| Added the description of extended controls by Hans Verkuil. Linked V4L2_PIX_FMT_MPEG to V4L2_CID_MPEG_STREAM_TYPE. | ||
| Revision 0.17 | 2006-10-12 | mhs |
| Corrected V4L2_PIX_FMT_HM12 description. | ||
| Revision 0.16 | 2006-10-08 | mhs |
| VIDIOC_ENUM_FRAMESIZES and VIDIOC_ENUM_FRAMEINTERVALS are now part of the API. | ||
| Revision 0.15 | 2006-09-23 | mhs |
| Cleaned up the bibliography, added BT.653 and BT.1119. capture.c/start_capturing() for user pointer I/O did not initialize the buffer index. Documented the V4L MPEG and MJPEG VID_TYPEs and V4L2_PIX_FMT_SBGGR8. Updated the list of reserved pixel formats. See the history chapter for API changes. | ||
| Revision 0.14 | 2006-09-14 | mr |
| Added VIDIOC_ENUM_FRAMESIZES and VIDIOC_ENUM_FRAMEINTERVALS proposal for frame format enumeration of digital devices. | ||
| Revision 0.13 | 2006-04-07 | mhs |
| Corrected the description of struct v4l2_window clips. New V4L2_STD_ and V4L2_TUNER_MODE_LANG1_LANG2 defines. | ||
| Revision 0.12 | 2006-02-03 | mhs |
| Corrected the description of struct v4l2_captureparm and v4l2_outputparm. | ||
| Revision 0.11 | 2006-01-27 | mhs |
| Improved the description of struct v4l2_tuner. | ||
| Revision 0.10 | 2006-01-10 | mhs |
| VIDIOC_G_INPUT and VIDIOC_S_PARM clarifications. | ||
| Revision 0.9 | 2005-11-27 | mhs |
| Improved the 525 line numbering diagram. Hans Verkuil and I rewrote the sliced VBI section. He also contributed a VIDIOC_LOG_STATUS page. Fixed VIDIOC_S_STD call in the video standard selection example. Various updates. | ||
| Revision 0.8 | 2004-10-04 | mhs |
| Somehow a piece of junk slipped into the capture example, removed. | ||
| Revision 0.7 | 2004-09-19 | mhs |
| Fixed video standard selection, control enumeration, downscaling and aspect example. Added read and user pointer i/o to video capture example. | ||
| Revision 0.6 | 2004-08-01 | mhs |
| v4l2_buffer changes, added video capture example, various corrections. | ||
| Revision 0.5 | 2003-11-05 | mhs |
| Pixel format erratum. | ||
| Revision 0.4 | 2003-09-17 | mhs |
| Corrected source and Makefile to generate a PDF. SGML fixes. Added latest API changes. Closed gaps in the history chapter. | ||
| Revision 0.3 | 2003-02-05 | mhs |
| Another draft, more corrections. | ||
| Revision 0.2 | 2003-01-15 | mhs |
| Second draft, with corrections pointed out by Gerd Knorr. | ||
| Revision 0.1 | 2002-12-01 | mhs |
| First draft, based on documentation by Bill Dirks and discussions on the V4L mailing list. | ||
Table of Contents
Table of Contents
Programming a V4L2 device consists of these steps:
Opening the device
Changing device properties, selecting a video and audio input, video standard, picture brightness a. o.
Negotiating a data format
Negotiating an input/output method
The actual input/output loop
Closing the device
In practice most steps are optional and can be executed out of order. It depends on the V4L2 device type, you can read about the details in Chapter 4, Interfaces. In this chapter we will discuss the basic concepts applicable to all devices.
V4L2 drivers are implemented as kernel modules, loaded manually by the system administrator or automatically when a device is first opened. The driver modules plug into the "videodev" kernel module. It provides helper functions and a common application interface specified in this document.
Each driver thus loaded registers one or more device nodes
with major number 81 and a minor number between 0 and 255. Assigning
minor numbers to V4L2 devices is entirely up to the system administrator,
this is primarily intended to solve conflicts between devices.[1] The module options to select minor numbers are named
after the device special file with a "_nr" suffix. For example "video_nr"
for /dev/video video capture devices. The number is
an offset to the base minor number associated with the device type.
[2] When the driver supports multiple devices of the same
type more than one minor number can be assigned, separated by commas:
> insmod mydriver.o video_nr=0,1 radio_nr=0,1
In /etc/modules.conf this may be
written as:
alias char-major-81-0 mydriver alias char-major-81-1 mydriver alias char-major-81-64 mydriveroptions mydriver video_nr=0,1 radio_nr=0,1
![]()
When an application attempts to open a device special file with major number 81 and minor number 0, 1, or 64, load "mydriver" (and the "videodev" module it depends upon). | |
Register the first two video capture devices with minor number 0 and 1 (base number is 0), the first two radio device with minor number 64 and 65 (base 64). |
When no minor number is given as module option the driver supplies a default. Chapter 4, Interfaces recommends the base minor numbers to be used for the various device types. Obviously minor numbers must be unique. When the number is already in use the offending device will not be registered.
By convention system administrators create various
character device special files with these major and minor numbers in
the /dev directory. The names recomended for the
different V4L2 device types are listed in Chapter 4, Interfaces.
The creation of character special files (with mknod) is a privileged operation and devices cannot be opened by major and minor number. That means applications cannot reliable scan for loaded or installed drivers. The user must enter a device name, or the application can try the conventional device names.
Under the device filesystem (devfs) the minor number
options are ignored. V4L2 drivers (or by proxy the "videodev" module)
automatically create the required device files in the
/dev/v4l directory using the conventional device
names above.
Devices can support several related functions. For example video capturing, video overlay and VBI capturing are related because these functions share, amongst other, the same video input and tuner frequency. V4L and earlier versions of V4L2 used the same device name and minor number for video capturing and overlay, but different ones for VBI. Experience showed this approach has several problems[3], and to make things worse the V4L videodev module used to prohibit multiple opens of a device.
As a remedy the present version of the V4L2 API relaxed the concept of device types with specific names and minor numbers. For compatibility with old applications drivers must still register different minor numbers to assign a default function to the device. But if related functions are supported by the driver they must be available under all registered minor numbers. The desired function can be selected after opening the device as described in Chapter 4, Interfaces.
Imagine a driver supporting video capturing, video
overlay, raw VBI capturing, and FM radio reception. It registers three
devices with minor number 0, 64 and 224 (this numbering scheme is
inherited from the V4L API). Regardless if
/dev/video (81, 0) or
/dev/vbi (81, 224) is opened the application can
select any one of the video capturing, overlay or VBI capturing
functions. Without programming (e. g. reading from the device
with dd or cat)
/dev/video captures video images, while
/dev/vbi captures raw VBI data.
/dev/radio (81, 64) is invariable a radio device,
unrelated to the video functions. Being unrelated does not imply the
devices can be used at the same time, however. The open()
function may very well return an EBUSY error code.
Besides video input or output the hardware may also support audio sampling or playback. If so, these functions are implemented as OSS or ALSA PCM devices and eventually OSS or ALSA audio mixer. The V4L2 API makes no provisions yet to find these related devices. If you have an idea please write to the linux-media mailing list: http://www.linuxtv.org/lists.php.
In general, V4L2 devices can be opened more than once. When this is supported by the driver, users can for example start a "panel" application to change controls like brightness or audio volume, while another application captures video and audio. In other words, panel applications are comparable to an OSS or ALSA audio mixer application. When a device supports multiple functions like capturing and overlay simultaneously, multiple opens allow concurrent use of the device by forked processes or specialized applications.
Multiple opens are optional, although drivers should
permit at least concurrent accesses without data exchange, i. e. panel
applications. This implies open() can return an EBUSY error code when the
device is already in use, as well as ioctl() functions initiating
data exchange (namely the VIDIOC_S_FMT ioctl), and the read()
and write() functions.
Mere opening a V4L2 device does not grant exclusive access.[4] Initiating data exchange however assigns the right to read or write the requested type of data, and to change related properties, to this file descriptor. Applications can request additional access privileges using the priority mechanism described in the section called “Application Priority”.
V4L2 drivers should not support multiple applications reading or writing the same data stream on a device by copying buffers, time multiplexing or similar means. This is better handled by a proxy application in user space. When the driver supports stream sharing anyway it must be implemented transparently. The V4L2 API does not specify how conflicts are solved.
Because V4L2 covers a wide variety of devices not all aspects of the API are equally applicable to all types of devices. Furthermore devices of the same type have different capabilities and this specification permits the omission of a few complicated and less important parts of the API.
The VIDIOC_QUERYCAP ioctl is available to check if the kernel
device is compatible with this specification, and to query the functions and I/O
methods supported by the device. Other features can be queried
by calling the respective ioctl, for example VIDIOC_ENUMINPUT
to learn about the number, types and names of video connectors on the
device. Although abstraction is a major objective of this API, the
ioctl also allows driver specific applications to reliable identify
the driver.
All V4L2 drivers must support
VIDIOC_QUERYCAP. Applications should always call
this ioctl after opening the device.
When multiple applications share a device it may be desirable to assign them different priorities. Contrary to the traditional "rm -rf /" school of thought a video recording application could for example block other applications from changing video controls or switching the current TV channel. Another objective is to permit low priority applications working in background, which can be preempted by user controlled applications and automatically regain control of the device at a later time.
Since these features cannot be implemented entirely in user
space V4L2 defines the VIDIOC_G_PRIORITY and VIDIOC_S_PRIORITY
ioctls to request and query the access priority associate with a file
descriptor. Opening a device assigns a medium priority, compatible
with earlier versions of V4L2 and drivers not supporting these ioctls.
Applications requiring a different priority will usually call
VIDIOC_S_PRIORITY after verifying the device with
the VIDIOC_QUERYCAP ioctl.
Ioctls changing driver properties, such as VIDIOC_S_INPUT,
return an EBUSY error code after another application obtained higher priority.
An event mechanism to notify applications about asynchronous property
changes has been proposed but not added yet.
Video inputs and outputs are physical connectors of a device. These can be for example RF connectors (antenna/cable), CVBS a.k.a. Composite Video, S-Video or RGB connectors. Only video and VBI capture devices have inputs, output devices have outputs, at least one each. Radio devices have no video inputs or outputs.
To learn about the number and attributes of the
available inputs and outputs applications can enumerate them with the
VIDIOC_ENUMINPUT and VIDIOC_ENUMOUTPUT ioctl, respectively. The
struct v4l2_input returned by the VIDIOC_ENUMINPUT
ioctl also contains signal status information applicable when the
current video input is queried.
The VIDIOC_G_INPUT and VIDIOC_G_OUTPUT ioctl return the
index of the current video input or output. To select a different
input or output applications call the VIDIOC_S_INPUT and
VIDIOC_S_OUTPUT ioctl. Drivers must implement all the input ioctls
when the device has one or more inputs, all the output ioctls when the
device has one or more outputs.
Example 1.1. Information about the current video input
struct v4l2_input input; int index; if (-1 == ioctl (fd,VIDIOC_G_INPUT, &index)) { perror ("VIDIOC_G_INPUT"); exit (EXIT_FAILURE); } memset (&input, 0, sizeof (input)); input.index = index; if (-1 == ioctl (fd,VIDIOC_ENUMINPUT, &input)) { perror ("VIDIOC_ENUMINPUT"); exit (EXIT_FAILURE); } printf ("Current input: %s\n", input.name);
Example 1.2. Switching to the first video input
int index;
index = 0;
if (-1 == ioctl (fd, VIDIOC_S_INPUT, &index)) {
perror ("VIDIOC_S_INPUT");
exit (EXIT_FAILURE);
}
Audio inputs and outputs are physical connectors of a device. Video capture devices have inputs, output devices have outputs, zero or more each. Radio devices have no audio inputs or outputs. They have exactly one tuner which in fact is an audio source, but this API associates tuners with video inputs or outputs only, and radio devices have none of these.[5] A connector on a TV card to loop back the received audio signal to a sound card is not considered an audio output.
Audio and video inputs and outputs are associated. Selecting
a video source also selects an audio source. This is most evident when
the video and audio source is a tuner. Further audio connectors can
combine with more than one video input or output. Assumed two
composite video inputs and two audio inputs exist, there may be up to
four valid combinations. The relation of video and audio connectors
is defined in the audioset field of the
respective struct v4l2_input or struct v4l2_output, where each bit represents
the index number, starting at zero, of one audio input or output.
To learn about the number and attributes of the
available inputs and outputs applications can enumerate them with the
VIDIOC_ENUMAUDIO and VIDIOC_ENUMAUDOUT ioctl, respectively. The
struct v4l2_audio returned by the VIDIOC_ENUMAUDIO ioctl
also contains signal status information applicable when the current
audio input is queried.
The VIDIOC_G_AUDIO and VIDIOC_G_AUDOUT ioctl report
the current audio input and output, respectively. Note that, unlike
VIDIOC_G_INPUT and VIDIOC_G_OUTPUT these ioctls return a structure
as VIDIOC_ENUMAUDIO and
VIDIOC_ENUMAUDOUT do, not just an index.
To select an audio input and change its properties
applications call the VIDIOC_S_AUDIO ioctl. To select an audio
output (which presently has no changeable properties) applications
call the VIDIOC_S_AUDOUT ioctl.
Drivers must implement all input ioctls when the device
has one or more inputs, all output ioctls when the device has one
or more outputs. When the device has any audio inputs or outputs the
driver must set the V4L2_CAP_AUDIO flag in the
struct v4l2_capability returned by the VIDIOC_QUERYCAP ioctl.
Example 1.3. Information about the current audio input
struct v4l2_audio audio;
memset (&audio, 0, sizeof (audio));
if (-1 == ioctl (fd, VIDIOC_G_AUDIO, &audio)) {
perror ("VIDIOC_G_AUDIO");
exit (EXIT_FAILURE);
}
printf ("Current input: %s\n", audio.name);
Example 1.4. Switching to the first audio input
struct v4l2_audio audio;
memset (&audio, 0, sizeof (audio)); /* clear audio.mode, audio.reserved */
audio.index = 0;
if (-1 == ioctl (fd, VIDIOC_S_AUDIO, &audio)) {
perror ("VIDIOC_S_AUDIO");
exit (EXIT_FAILURE);
}
Video input devices can have one or more tuners
demodulating a RF signal. Each tuner is associated with one or more
video inputs, depending on the number of RF connectors on the tuner.
The type field of the respective
struct v4l2_input returned by the VIDIOC_ENUMINPUT ioctl is set to
V4L2_INPUT_TYPE_TUNER and its
tuner field contains the index number of
the tuner.
Radio devices have exactly one tuner with index zero, no video inputs.
To query and change tuner properties applications use the
VIDIOC_G_TUNER and VIDIOC_S_TUNER ioctl, respectively. The
struct v4l2_tuner returned by VIDIOC_G_TUNER also
contains signal status information applicable when the tuner of the
current video input, or a radio tuner is queried. Note that
VIDIOC_S_TUNER does not switch the current tuner,
when there is more than one at all. The tuner is solely determined by
the current video input. Drivers must support both ioctls and set the
V4L2_CAP_TUNER flag in the struct v4l2_capability
returned by the VIDIOC_QUERYCAP ioctl when the device has one or
more tuners.
Video output devices can have one or more modulators, uh,
modulating a video signal for radiation or connection to the antenna
input of a TV set or video recorder. Each modulator is associated with
one or more video outputs, depending on the number of RF connectors on
the modulator. The type field of the
respective struct v4l2_output returned by the VIDIOC_ENUMOUTPUT ioctl is
set to V4L2_OUTPUT_TYPE_MODULATOR and its
modulator field contains the index number
of the modulator. This specification does not define radio output
devices.
To query and change modulator properties applications use
the VIDIOC_G_MODULATOR and VIDIOC_S_MODULATOR ioctl. Note that
VIDIOC_S_MODULATOR does not switch the current
modulator, when there is more than one at all. The modulator is solely
determined by the current video output. Drivers must support both
ioctls and set the V4L2_CAP_MODULATOR flag in
the struct v4l2_capability returned by the VIDIOC_QUERYCAP ioctl when the
device has one or more modulators.
To get and set the tuner or modulator radio frequency
applications use the VIDIOC_G_FREQUENCY and VIDIOC_S_FREQUENCY
ioctl which both take a pointer to a struct v4l2_frequency. These ioctls
are used for TV and radio devices alike. Drivers must support both
ioctls when the tuner or modulator ioctls are supported, or
when the device is a radio device.
Video devices typically support one or more different video
standards or variations of standards. Each video input and output may
support another set of standards. This set is reported by the
std field of struct v4l2_input and
struct v4l2_output returned by the VIDIOC_ENUMINPUT and
VIDIOC_ENUMOUTPUT ioctl, respectively.
V4L2 defines one bit for each analog video standard
currently in use worldwide, and sets aside bits for driver defined
standards, e. g. hybrid standards to watch NTSC video tapes on PAL TVs
and vice versa. Applications can use the predefined bits to select a
particular standard, although presenting the user a menu of supported
standards is preferred. To enumerate and query the attributes of the
supported standards applications use the VIDIOC_ENUMSTD ioctl.
Many of the defined standards are actually just variations of a few major standards. The hardware may in fact not distinguish between them, or do so internal and switch automatically. Therefore enumerated standards also contain sets of one or more standard bits.
Assume a hypothetic tuner capable of demodulating B/PAL, G/PAL and I/PAL signals. The first enumerated standard is a set of B and G/PAL, switched automatically depending on the selected radio frequency in UHF or VHF band. Enumeration gives a "PAL-B/G" or "PAL-I" choice. Similar a Composite input may collapse standards, enumerating "PAL-B/G/H/I", "NTSC-M" and "SECAM-D/K".[6]
To query and select the standard used by the current video
input or output applications call the VIDIOC_G_STD and
VIDIOC_S_STD ioctl, respectively. The received
standard can be sensed with the VIDIOC_QUERYSTD ioctl. Note parameter of all these ioctls is a pointer to a v4l2_std_id type (a standard set), not an index into the standard enumeration.[7] Drivers must implement all video standard ioctls
when the device has one or more video inputs or outputs.
Special rules apply to USB cameras where the notion of video standards makes little sense. More generally any capture device, output devices accordingly, which is
incapable of capturing fields or frames at the nominal rate of the video standard, or
where timestamps refer to the instant the field or frame was received by the driver, not the capture time, or
where sequence numbers refer to the frames received by the driver, not the captured frames.
Here the driver shall set the
std field of struct v4l2_input and struct v4l2_output
to zero, the VIDIOC_G_STD,
VIDIOC_S_STD,
VIDIOC_QUERYSTD and
VIDIOC_ENUMSTD ioctls shall return the
EINVAL error code.[8]
Example 1.5. Information about the current video standard
v4l2_std_id std_id; struct v4l2_standard standard; if (-1 == ioctl (fd,VIDIOC_G_STD, &std_id)) { /* Note when VIDIOC_ENUMSTD always returns EINVAL this is no video device or it falls under the USB exception, and VIDIOC_G_STD returning EINVAL is no error. */ perror ("VIDIOC_G_STD"); exit (EXIT_FAILURE); } memset (&standard, 0, sizeof (standard)); standard.index = 0; while (0 == ioctl (fd,VIDIOC_ENUMSTD, &standard)) { if (standard.id & std_id) { printf ("Current video standard: %s\n", standard.name); exit (EXIT_SUCCESS); } standard.index++; } /* EINVAL indicates the end of the enumeration, which cannot be empty unless this device falls under the USB exception. */ if (errno == EINVAL || standard.index == 0) { perror ("VIDIOC_ENUMSTD"); exit (EXIT_FAILURE); }
Example 1.6. Listing the video standards supported by the current input
struct v4l2_input input; struct v4l2_standard standard; memset (&input, 0, sizeof (input)); if (-1 == ioctl (fd,VIDIOC_G_INPUT, &input.index)) { perror ("VIDIOC_G_INPUT"); exit (EXIT_FAILURE); } if (-1 == ioctl (fd,VIDIOC_ENUMINPUT, &input)) { perror ("VIDIOC_ENUM_INPUT"); exit (EXIT_FAILURE); } printf ("Current input %s supports:\n", input.name); memset (&standard, 0, sizeof (standard)); standard.index = 0; while (0 == ioctl (fd,VIDIOC_ENUMSTD, &standard)) { if (standard.id & input.std) printf ("%s\n", standard.name); standard.index++; } /* EINVAL indicates the end of the enumeration, which cannot be empty unless this device falls under the USB exception. */ if (errno != EINVAL || standard.index == 0) { perror ("VIDIOC_ENUMSTD"); exit (EXIT_FAILURE); }
Example 1.7. Selecting a new video standard
struct v4l2_input input; v4l2_std_id std_id; memset (&input, 0, sizeof (input)); if (-1 == ioctl (fd,VIDIOC_G_INPUT, &input.index)) { perror ("VIDIOC_G_INPUT"); exit (EXIT_FAILURE); } if (-1 == ioctl (fd,VIDIOC_ENUMINPUT, &input)) { perror ("VIDIOC_ENUM_INPUT"); exit (EXIT_FAILURE); } if (0 == (input.std & V4L2_STD_PAL_BG)) { fprintf (stderr, "Oops. B/G PAL is not supported.\n"); exit (EXIT_FAILURE); } /* Note this is also supposed to work when only B or G/PAL is supported. */ std_id = V4L2_STD_PAL_BG; if (-1 == ioctl (fd,VIDIOC_S_STD, &std_id)) { perror ("VIDIOC_S_STD"); exit (EXIT_FAILURE); }
Devices typically have a number of user-settable controls such as brightness, saturation and so on, which would be presented to the user on a graphical user interface. But, different devices will have different controls available, and furthermore, the range of possible values, and the default value will vary from device to device. The control ioctls provide the information and a mechanism to create a nice user interface for these controls that will work correctly with any device.
All controls are accessed using an ID value. V4L2 defines
several IDs for specific purposes. Drivers can also implement their
own custom controls using V4L2_CID_PRIVATE_BASE
and higher values. The pre-defined control IDs have the prefix
V4L2_CID_, and are listed in Table 1.1, “Control IDs”. The ID is used when querying the attributes of
a control, and when getting or setting the current value.
Generally applications should present controls to the user without assumptions about their purpose. Each control comes with a name string the user is supposed to understand. When the purpose is non-intuitive the driver writer should provide a user manual, a user interface plug-in or a driver specific panel application. Predefined IDs were introduced to change a few controls programmatically, for example to mute a device during a channel switch.
Drivers may enumerate different controls after switching the current video input or output, tuner or modulator, or audio input or output. Different in the sense of other bounds, another default and current value, step size or other menu items. A control with a certain custom ID can also change name and type.[9] Control values are stored globally, they do not change when switching except to stay within the reported bounds. They also do not change e. g. when the device is opened or closed, when the tuner radio frequency is changed or generally never without application request. Since V4L2 specifies no event mechanism, panel applications intended to cooperate with other panel applications (be they built into a larger application, as a TV viewer) may need to regularly poll control values to update their user interface.[10]
Table 1.1. Control IDs
| ID | Type | Description |
|---|---|---|
V4L2_CID_BASE | First predefined ID, equal to
V4L2_CID_BRIGHTNESS. | |
V4L2_CID_USER_BASE | Synonym of V4L2_CID_BASE. | |
V4L2_CID_BRIGHTNESS | integer | Picture brightness, or more precisely, the black level. |
V4L2_CID_CONTRAST | integer | Picture contrast or luma gain. |
V4L2_CID_SATURATION | integer | Picture color saturation or chroma gain. |
V4L2_CID_HUE | integer | Hue or color balance. |
V4L2_CID_AUDIO_VOLUME | integer | Overall audio volume. Note some drivers also provide an OSS or ALSA mixer interface. |
V4L2_CID_AUDIO_BALANCE | integer | Audio stereo balance. Minimum corresponds to all the way left, maximum to right. |
V4L2_CID_AUDIO_BASS | integer | Audio bass adjustment. |
V4L2_CID_AUDIO_TREBLE | integer | Audio treble adjustment. |
V4L2_CID_AUDIO_MUTE | boolean | Mute audio, i. e. set the volume to zero, however
without affecting V4L2_CID_AUDIO_VOLUME. Like
ALSA drivers, V4L2 drivers must mute at load time to avoid excessive
noise. Actually the entire device should be reset to a low power
consumption state. |
V4L2_CID_AUDIO_LOUDNESS | boolean | Loudness mode (bass boost). |
V4L2_CID_BLACK_LEVEL | integer | Another name for brightness (not a synonym of
V4L2_CID_BRIGHTNESS). This control is deprecated
and should not be used in new drivers and applications. |
V4L2_CID_AUTO_WHITE_BALANCE | boolean | Automatic white balance (cameras). |
V4L2_CID_DO_WHITE_BALANCE | button | This is an action control. When set (the value is
ignored), the device will do a white balance and then hold the current
setting. Contrast this with the boolean
V4L2_CID_AUTO_WHITE_BALANCE, which, when
activated, keeps adjusting the white balance. |
V4L2_CID_RED_BALANCE | integer | Red chroma balance. |
V4L2_CID_BLUE_BALANCE | integer | Blue chroma balance. |
V4L2_CID_GAMMA | integer | Gamma adjust. |
V4L2_CID_WHITENESS | integer | Whiteness for grey-scale devices. This is a synonym
for V4L2_CID_GAMMA. This control is deprecated
and should not be used in new drivers and applications. |
V4L2_CID_EXPOSURE | integer | Exposure (cameras). [Unit?] |
V4L2_CID_AUTOGAIN | boolean | Automatic gain/exposure control. |
V4L2_CID_GAIN | integer | Gain control. |
V4L2_CID_HFLIP | boolean | Mirror the picture horizontally. |
V4L2_CID_VFLIP | boolean | Mirror the picture vertically. |
V4L2_CID_HCENTER_DEPRECATED (formerly V4L2_CID_HCENTER) | integer | Horizontal image centering. This control is
deprecated. New drivers and applications should use the Camera class controls
V4L2_CID_PAN_ABSOLUTE,
V4L2_CID_PAN_RELATIVE and
V4L2_CID_PAN_RESET instead. |
V4L2_CID_VCENTER_DEPRECATED
(formerly V4L2_CID_VCENTER) | integer | Vertical image centering. Centering is intended to
physically adjust cameras. For image cropping see
the section called “Image Cropping, Insertion and Scaling”, for clipping the section called “Video Overlay Interface”. This
control is deprecated. New drivers and applications should use the
Camera class controls
V4L2_CID_TILT_ABSOLUTE,
V4L2_CID_TILT_RELATIVE and
V4L2_CID_TILT_RESET instead. |
V4L2_CID_POWER_LINE_FREQUENCY | enum | Enables a power line frequency filter to avoid
flicker. Possible values for enum v4l2_power_line_frequency are:
V4L2_CID_POWER_LINE_FREQUENCY_DISABLED (0),
V4L2_CID_POWER_LINE_FREQUENCY_50HZ (1) and
V4L2_CID_POWER_LINE_FREQUENCY_60HZ (2). |
V4L2_CID_HUE_AUTO | boolean | Enables automatic hue control by the device. The
effect of setting V4L2_CID_HUE while automatic
hue control is enabled is undefined, drivers should ignore such
request. |
V4L2_CID_WHITE_BALANCE_TEMPERATURE | integer | This control specifies the white balance settings as a color temperature in Kelvin. A driver should have a minimum of 2800 (incandescent) to 6500 (daylight). For more information about color temperature see Wikipedia. |
V4L2_CID_SHARPNESS | integer | Adjusts the sharpness filters in a camera. The minimum value disables the filters, higher values give a sharper picture. |
V4L2_CID_BACKLIGHT_COMPENSATION | integer | Adjusts the backlight compensation in a camera. The minimum value disables backlight compensation. |
V4L2_CID_CHROMA_AGC | boolean | Chroma automatic gain control. |
V4L2_CID_COLOR_KILLER | boolean | Enable the color killer (i. e. force a black & white image in case of a weak video signal). |
V4L2_CID_COLORFX | enum | Selects a color effect. Possible values for
enum v4l2_colorfx are:
V4L2_COLORFX_NONE (0),
V4L2_COLORFX_BW (1) and
V4L2_COLORFX_SEPIA (2). |
V4L2_CID_LASTP1 | End of the predefined control IDs (currently
V4L2_CID_COLORFX + 1). | |
V4L2_CID_PRIVATE_BASE | ID of the first custom (driver specific) control. Applications depending on particular custom controls should check the driver name and version, see the section called “Querying Capabilities”. |
Applications can enumerate the available controls with the
VIDIOC_QUERYCTRL and VIDIOC_QUERYMENU ioctls, get and set a
control value with the VIDIOC_G_CTRL and VIDIOC_S_CTRL ioctls.
Drivers must implement VIDIOC_QUERYCTRL,
VIDIOC_G_CTRL and
VIDIOC_S_CTRL when the device has one or more
controls, VIDIOC_QUERYMENU when it has one or
more menu type controls.
Example 1.8. Enumerating all controls
struct v4l2_queryctrl queryctrl; struct v4l2_querymenu querymenu; static void enumerate_menu (void) { printf (" Menu items:\n"); memset (&querymenu, 0, sizeof (querymenu)); querymenu.id = queryctrl.id; for (querymenu.index = queryctrl.minimum; querymenu.index <= queryctrl.maximum; querymenu.index++) { if (0 == ioctl (fd,VIDIOC_QUERYMENU, &querymenu)) { printf (" %s\n", querymenu.name); } else { perror ("VIDIOC_QUERYMENU"); exit (EXIT_FAILURE); } } } memset (&queryctrl, 0, sizeof (queryctrl)); for (queryctrl.id = V4L2_CID_BASE; queryctrl.id < V4L2_CID_LASTP1; queryctrl.id++) { if (0 == ioctl (fd,VIDIOC_QUERYCTRL, &queryctrl)) { if (queryctrl.flags & V4L2_CTRL_FLAG_DISABLED) continue; printf ("Control %s\n", queryctrl.name); if (queryctrl.type == V4L2_CTRL_TYPE_MENU) enumerate_menu (); } else { if (errno == EINVAL) continue; perror ("VIDIOC_QUERYCTRL"); exit (EXIT_FAILURE); } } for (queryctrl.id = V4L2_CID_PRIVATE_BASE;; queryctrl.id++) { if (0 == ioctl (fd,VIDIOC_QUERYCTRL, &queryctrl)) { if (queryctrl.flags & V4L2_CTRL_FLAG_DISABLED) continue; printf ("Control %s\n", queryctrl.name); if (queryctrl.type == V4L2_CTRL_TYPE_MENU) enumerate_menu (); } else { if (errno == EINVAL) break; perror ("VIDIOC_QUERYCTRL"); exit (EXIT_FAILURE); } }
Example 1.9. Changing controls
struct v4l2_queryctrl queryctrl; struct v4l2_control control; memset (&queryctrl, 0, sizeof (queryctrl)); queryctrl.id = V4L2_CID_BRIGHTNESS; if (-1 == ioctl (fd,VIDIOC_QUERYCTRL, &queryctrl)) { if (errno != EINVAL) { perror ("VIDIOC_QUERYCTRL"); exit (EXIT_FAILURE); } else { printf ("V4L2_CID_BRIGHTNESS is not supported\n"); } } else if (queryctrl.flags & V4L2_CTRL_FLAG_DISABLED) { printf ("V4L2_CID_BRIGHTNESS is not supported\n"); } else { memset (&control, 0, sizeof (control)); control.id = V4L2_CID_BRIGHTNESS; control.value = queryctrl.default_value; if (-1 == ioctl (fd,VIDIOC_S_CTRL, &control)) { perror ("VIDIOC_S_CTRL"); exit (EXIT_FAILURE); } } memset (&control, 0, sizeof (control)); control.id = V4L2_CID_CONTRAST; if (0 == ioctl (fd,VIDIOC_G_CTRL, &control)) { control.value += 1; /* The driver may clamp the value or return ERANGE, ignored here */ if (-1 == ioctl (fd,VIDIOC_S_CTRL, &control) && errno != ERANGE) { perror ("VIDIOC_S_CTRL"); exit (EXIT_FAILURE); } /* Ignore if V4L2_CID_CONTRAST is unsupported */ } else if (errno != EINVAL) { perror ("VIDIOC_G_CTRL"); exit (EXIT_FAILURE); } control.id = V4L2_CID_AUDIO_MUTE; control.value = TRUE; /* silence */ /* Errors ignored */ ioctl (fd, VIDIOC_S_CTRL, &control);
The control mechanism as originally designed was meant to be used for user settings (brightness, saturation, etc). However, it turned out to be a very useful model for implementing more complicated driver APIs where each driver implements only a subset of a larger API.
The MPEG encoding API was the driving force behind designing and implementing this extended control mechanism: the MPEG standard is quite large and the currently supported hardware MPEG encoders each only implement a subset of this standard. Further more, many parameters relating to how the video is encoded into an MPEG stream are specific to the MPEG encoding chip since the MPEG standard only defines the format of the resulting MPEG stream, not how the video is actually encoded into that format.
Unfortunately, the original control API lacked some features needed for these new uses and so it was extended into the (not terribly originally named) extended control API.
Even though the MPEG encoding API was the first effort to use the Extended Control API, nowadays there are also other classes of Extended Controls, such as Camera Controls and FM Transmitter Controls. The Extended Controls API as well as all Extended Controls classes are described in the following text.
Three new ioctls are available: VIDIOC_G_EXT_CTRLS,
VIDIOC_S_EXT_CTRLS and VIDIOC_TRY_EXT_CTRLS. These ioctls act on
arrays of controls (as opposed to the VIDIOC_G_CTRL and
VIDIOC_S_CTRL ioctls that act on a single control). This is needed
since it is often required to atomically change several controls at
once.
Each of the new ioctls expects a pointer to a
struct v4l2_ext_controls. This structure contains a pointer to the control
array, a count of the number of controls in that array and a control
class. Control classes are used to group similar controls into a
single class. For example, control class
V4L2_CTRL_CLASS_USER contains all user controls
(i. e. all controls that can also be set using the old
VIDIOC_S_CTRL ioctl). Control class
V4L2_CTRL_CLASS_MPEG contains all controls
relating to MPEG encoding, etc.
All controls in the control array must belong to the specified control class. An error is returned if this is not the case.
It is also possible to use an empty control array (count == 0) to check whether the specified control class is supported.
The control array is a struct v4l2_ext_control array. The v4l2_ext_control structure is very similar to struct v4l2_control, except for the fact that it also allows for 64-bit values and pointers to be passed.
It is important to realize that due to the flexibility of
controls it is necessary to check whether the control you want to set
actually is supported in the driver and what the valid range of values
is. So use the VIDIOC_QUERYCTRL and VIDIOC_QUERYMENU ioctls to
check this. Also note that it is possible that some of the menu
indices in a control of type V4L2_CTRL_TYPE_MENU
may not be supported (VIDIOC_QUERYMENU will
return an error). A good example is the list of supported MPEG audio
bitrates. Some drivers only support one or two bitrates, others
support a wider range.
The recommended way to enumerate over the extended
controls is by using VIDIOC_QUERYCTRL in combination with the
V4L2_CTRL_FLAG_NEXT_CTRL flag:
struct v4l2_queryctrl qctrl;
qctrl.id = V4L2_CTRL_FLAG_NEXT_CTRL;
while (0 == ioctl (fd, VIDIOC_QUERYCTRL, &qctrl)) {
/* ... */
qctrl.id |= V4L2_CTRL_FLAG_NEXT_CTRL;
}
The initial control ID is set to 0 ORed with the
V4L2_CTRL_FLAG_NEXT_CTRL flag. The
VIDIOC_QUERYCTRL ioctl will return the first
control with a higher ID than the specified one. When no such controls
are found an error is returned.
If you want to get all controls within a specific control
class, then you can set the initial
qctrl.id value to the control class and add
an extra check to break out of the loop when a control of another
control class is found:
qctrl.id = V4L2_CTRL_CLASS_MPEG | V4L2_CTRL_FLAG_NEXT_CTRL;
while (0 == ioctl (fd, VIDIOC_QUERYCTRL, &qctrl)) {
if (V4L2_CTRL_ID2CLASS (qctrl.id) != V4L2_CTRL_CLASS_MPEG)
break;
/* ... */
qctrl.id |= V4L2_CTRL_FLAG_NEXT_CTRL;
}
The 32-bit qctrl.id value is
subdivided into three bit ranges: the top 4 bits are reserved for
flags (e. g. V4L2_CTRL_FLAG_NEXT_CTRL) and are not
actually part of the ID. The remaining 28 bits form the control ID, of
which the most significant 12 bits define the control class and the
least significant 16 bits identify the control within the control
class. It is guaranteed that these last 16 bits are always non-zero
for controls. The range of 0x1000 and up are reserved for
driver-specific controls. The macro
V4L2_CTRL_ID2CLASS(id) returns the control class
ID based on a control ID.
If the driver does not support extended controls, then
VIDIOC_QUERYCTRL will fail when used in
combination with V4L2_CTRL_FLAG_NEXT_CTRL. In
that case the old method of enumerating control should be used (see
1.8). But if it is supported, then it is guaranteed to enumerate over
all controls, including driver-private controls.
It is possible to create control panels for a graphical
user interface where the user can select the various controls.
Basically you will have to iterate over all controls using the method
described above. Each control class starts with a control of type
V4L2_CTRL_TYPE_CTRL_CLASS.
VIDIOC_QUERYCTRL will return the name of this
control class which can be used as the title of a tab page within a
control panel.
The flags field of struct v4l2_queryctrl also contains hints on
the behavior of the control. See the VIDIOC_QUERYCTRL documentation
for more details.
Below all controls within the MPEG control class are described. First the generic controls, then controls specific for certain hardware.
Table 1.2. MPEG Control IDs
| ID | Type | ||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Description | |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_CLASS | class | ||||||||||||||||||||||||||||||||||||||||
The MPEG class
descriptor. Calling VIDIOC_QUERYCTRL for this control will return a
description of this control class. This description can be used as the
caption of a Tab page in a GUI, for example. | |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_STREAM_TYPE | enum v4l2_mpeg_stream_type | ||||||||||||||||||||||||||||||||||||||||
| The MPEG-1, -2 or -4 output stream type. One cannot assume anything here. Each hardware MPEG encoder tends to support different subsets of the available MPEG stream types. The currently defined stream types are: | |||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_STREAM_PID_PMT | integer | ||||||||||||||||||||||||||||||||||||||||
| Program Map Table Packet ID for the MPEG transport stream (default 16) | |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_STREAM_PID_AUDIO | integer | ||||||||||||||||||||||||||||||||||||||||
| Audio Packet ID for the MPEG transport stream (default 256) | |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_STREAM_PID_VIDEO | integer | ||||||||||||||||||||||||||||||||||||||||
| Video Packet ID for the MPEG transport stream (default 260) | |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_STREAM_PID_PCR | integer | ||||||||||||||||||||||||||||||||||||||||
| Packet ID for the MPEG transport stream carrying PCR fields (default 259) | |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_STREAM_PES_ID_AUDIO | integer | ||||||||||||||||||||||||||||||||||||||||
| Audio ID for MPEG PES | |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_STREAM_PES_ID_VIDEO | integer | ||||||||||||||||||||||||||||||||||||||||
| Video ID for MPEG PES | |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_STREAM_VBI_FMT | enum v4l2_mpeg_stream_vbi_fmt | ||||||||||||||||||||||||||||||||||||||||
| Some cards can embed VBI data (e. g. Closed Caption, Teletext) into the MPEG stream. This control selects whether VBI data should be embedded, and if so, what embedding method should be used. The list of possible VBI formats depends on the driver. The currently defined VBI format types are: | |||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_AUDIO_SAMPLING_FREQ | enum v4l2_mpeg_audio_sampling_freq | ||||||||||||||||||||||||||||||||||||||||
| MPEG Audio sampling frequency. Possible values are: | |||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_AUDIO_ENCODING | enum v4l2_mpeg_audio_encoding | ||||||||||||||||||||||||||||||||||||||||
| MPEG Audio encoding. Possible values are: | |||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_AUDIO_L1_BITRATE | enum v4l2_mpeg_audio_l1_bitrate | ||||||||||||||||||||||||||||||||||||||||
| MPEG-1/2 Layer I bitrate. Possible values are: | |||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_AUDIO_L2_BITRATE | enum v4l2_mpeg_audio_l2_bitrate | ||||||||||||||||||||||||||||||||||||||||
| MPEG-1/2 Layer II bitrate. Possible values are: | |||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_AUDIO_L3_BITRATE | enum v4l2_mpeg_audio_l3_bitrate | ||||||||||||||||||||||||||||||||||||||||
| MPEG-1/2 Layer III bitrate. Possible values are: | |||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_AUDIO_AAC_BITRATE | integer | ||||||||||||||||||||||||||||||||||||||||
| AAC bitrate in bits per second. | |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_AUDIO_AC3_BITRATE | enum v4l2_mpeg_audio_ac3_bitrate | ||||||||||||||||||||||||||||||||||||||||
| AC-3 bitrate. Possible values are: | |||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_AUDIO_MODE | enum v4l2_mpeg_audio_mode | ||||||||||||||||||||||||||||||||||||||||
| MPEG Audio mode. Possible values are: | |||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_AUDIO_MODE_EXTENSION | enum v4l2_mpeg_audio_mode_extension | ||||||||||||||||||||||||||||||||||||||||
| Joint Stereo audio mode extension. In Layer I and II they indicate which subbands are in intensity stereo. All other subbands are coded in stereo. Layer III is not (yet) supported. Possible values are: | |||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_AUDIO_EMPHASIS | enum v4l2_mpeg_audio_emphasis | ||||||||||||||||||||||||||||||||||||||||
| Audio Emphasis. Possible values are: | |||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_AUDIO_CRC | enum v4l2_mpeg_audio_crc | ||||||||||||||||||||||||||||||||||||||||
| CRC method. Possible values are: | |||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_AUDIO_MUTE | boolean | ||||||||||||||||||||||||||||||||||||||||
| Mutes the audio when capturing. This is not done by muting audio hardware, which can still produce a slight hiss, but in the encoder itself, guaranteeing a fixed and reproducable audio bitstream. 0 = unmuted, 1 = muted. | |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_VIDEO_ENCODING | enum v4l2_mpeg_video_encoding | ||||||||||||||||||||||||||||||||||||||||
| MPEG Video encoding method. Possible values are: | |||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_VIDEO_ASPECT | enum v4l2_mpeg_video_aspect | ||||||||||||||||||||||||||||||||||||||||
| Video aspect. Possible values are: | |||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_VIDEO_B_FRAMES | integer | ||||||||||||||||||||||||||||||||||||||||
| Number of B-Frames (default 2) | |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_VIDEO_GOP_SIZE | integer | ||||||||||||||||||||||||||||||||||||||||
| GOP size (default 12) | |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_VIDEO_GOP_CLOSURE | boolean | ||||||||||||||||||||||||||||||||||||||||
| GOP closure (default 1) | |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_VIDEO_PULLDOWN | boolean | ||||||||||||||||||||||||||||||||||||||||
| Enable 3:2 pulldown (default 0) | |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_VIDEO_BITRATE_MODE | enum v4l2_mpeg_video_bitrate_mode | ||||||||||||||||||||||||||||||||||||||||
| Video bitrate mode. Possible values are: | |||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_VIDEO_BITRATE | integer | ||||||||||||||||||||||||||||||||||||||||
| Video bitrate in bits per second. | |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_VIDEO_BITRATE_PEAK | integer | ||||||||||||||||||||||||||||||||||||||||
| Peak video bitrate in bits per second. Must be larger or equal to the average video bitrate. It is ignored if the video bitrate mode is set to constant bitrate. | |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_VIDEO_TEMPORAL_DECIMATION | integer | ||||||||||||||||||||||||||||||||||||||||
| For every captured frame, skip this many subsequent frames (default 0). | |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_VIDEO_MUTE | boolean | ||||||||||||||||||||||||||||||||||||||||
| "Mutes" the video to a fixed color when capturing. This is useful for testing, to produce a fixed video bitstream. 0 = unmuted, 1 = muted. | |||||||||||||||||||||||||||||||||||||||||
V4L2_CID_MPEG_VIDEO_MUTE_YUV | integer | ||||||||||||||||||||||||||||||||||||||||
| Sets the "mute" color of the video. The supplied 32-bit integer is interpreted as follows (bit 0 = least significant bit): | |||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||
The following MPEG class controls deal with MPEG encoding settings that are specific to the Conexant CX23415 and CX23416 MPEG encoding chips.
Table 1.3. CX2341x Control IDs
The Camera class includes controls for mechanical (or equivalent digital) features of a device such as controllable lenses or sensors.
Table 1.4. Camera Control IDs
| ID | Type | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Description | |||||||||||
V4L2_CID_CAMERA_CLASS | class | ||||||||||
The Camera class
descriptor. Calling VIDIOC_QUERYCTRL for this control will return a
description of this control class. | |||||||||||
V4L2_CID_EXPOSURE_AUTO | enum v4l2_exposure_auto_type | ||||||||||
| Enables automatic adjustments of the exposure time and/or iris aperture. The effect of manual changes of the exposure time or iris aperture while these features are enabled is undefined, drivers should ignore such requests. Possible values are: | |||||||||||
| |||||||||||
V4L2_CID_EXPOSURE_ABSOLUTE | integer | ||||||||||
| Determines the exposure time of the camera sensor. The exposure time is limited by the frame interval. Drivers should interpret the values as 100 µs units, where the value 1 stands for 1/10000th of a second, 10000 for 1 second and 100000 for 10 seconds. | |||||||||||
V4L2_CID_EXPOSURE_AUTO_PRIORITY | boolean | ||||||||||
When
V4L2_CID_EXPOSURE_AUTO is set to
AUTO or APERTURE_PRIORITY,
this control determines if the device may dynamically vary the frame
rate. By default this feature is disabled (0) and the frame rate must
remain constant. | |||||||||||
V4L2_CID_PAN_RELATIVE | integer | ||||||||||
| This control turns the camera horizontally by the specified amount. The unit is undefined. A positive value moves the camera to the right (clockwise when viewed from above), a negative value to the left. A value of zero does not cause motion. This is a write-only control. | |||||||||||
V4L2_CID_TILT_RELATIVE | integer | ||||||||||
| This control turns the camera vertically by the specified amount. The unit is undefined. A positive value moves the camera up, a negative value down. A value of zero does not cause motion. This is a write-only control. | |||||||||||
V4L2_CID_PAN_RESET | button | ||||||||||
| When this control is set, the camera moves horizontally to the default position. | |||||||||||
V4L2_CID_TILT_RESET | button | ||||||||||
| When this control is set, the camera moves vertically to the default position. | |||||||||||
V4L2_CID_PAN_ABSOLUTE | integer | ||||||||||
| This control turns the camera horizontally to the specified position. Positive values move the camera to the right (clockwise when viewed from above), negative values to the left. Drivers should interpret the values as arc seconds, with valid values between -180 * 3600 and +180 * 3600 inclusive. | |||||||||||
V4L2_CID_TILT_ABSOLUTE | integer | ||||||||||
| This control turns the camera vertically to the specified position. Positive values move the camera up, negative values down. Drivers should interpret the values as arc seconds, with valid values between -180 * 3600 and +180 * 3600 inclusive. | |||||||||||
V4L2_CID_FOCUS_ABSOLUTE | integer | ||||||||||
| This control sets the focal point of the camera to the specified position. The unit is undefined. Positive values set the focus closer to the camera, negative values towards infinity. | |||||||||||
V4L2_CID_FOCUS_RELATIVE | integer | ||||||||||
| This control moves the focal point of the camera by the specified amount. The unit is undefined. Positive values move the focus closer to the camera, negative values towards infinity. This is a write-only control. | |||||||||||
V4L2_CID_FOCUS_AUTO | boolean | ||||||||||
| Enables automatic focus adjustments. The effect of manual focus adjustments while this feature is enabled is undefined, drivers should ignore such requests. | |||||||||||
V4L2_CID_ZOOM_ABSOLUTE | integer | ||||||||||
| Specify the objective lens focal length as an absolute value. The zoom unit is driver-specific and its value should be a positive integer. | |||||||||||
V4L2_CID_ZOOM_RELATIVE | integer | ||||||||||
| Specify the objective lens focal length relatively to the current value. Positive values move the zoom lens group towards the telephoto direction, negative values towards the wide-angle direction. The zoom unit is driver-specific. This is a write-only control. | |||||||||||
V4L2_CID_ZOOM_CONTINUOUS | integer | ||||||||||
| Move the objective lens group at the specified speed until it reaches physical device limits or until an explicit request to stop the movement. A positive value moves the zoom lens group towards the telephoto direction. A value of zero stops the zoom lens group movement. A negative value moves the zoom lens group towards the wide-angle direction. The zoom speed unit is driver-specific. | |||||||||||
V4L2_CID_PRIVACY | boolean | ||||||||||
Prevent video from being acquired
by the camera. When this control is set to TRUE (1), no
image can be captured by the camera. Common means to enforce privacy are
mechanical obturation of the sensor and firmware image processing, but the
device is not restricted to these methods. Devices that implement the privacy
control must support read access and may support write access. | |||||||||||
V4L2_CID_BAND_STOP_FILTER | integer | ||||||||||
| Switch the band-stop filter of a camera sensor on or off, or specify its strength. Such band-stop filters can be used, for example, to filter out the fluorescent light component. | |||||||||||
The FM Transmitter (FM_TX) class includes controls for common features of FM transmissions capable devices. Currently this class includes parameters for audio compression, pilot tone generation, audio deviation limiter, RDS transmission and tuning power features.
Table 1.5. FM_TX Control IDs
| ID | Type | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Description | |||||||||
V4L2_CID_FM_TX_CLASS | class | ||||||||
The FM_TX class
descriptor. Calling VIDIOC_QUERYCTRL for this control will return a
description of this control class. | |||||||||
V4L2_CID_RDS_TX_DEVIATION | integer | ||||||||
| Configures RDS signal frequency deviation level in Hz. The range and step are driver-specific. | |||||||||
V4L2_CID_RDS_TX_PI | integer | ||||||||
| Sets the RDS Programme Identification field for transmission. | |||||||||
V4L2_CID_RDS_TX_PTY | integer | ||||||||
| Sets the RDS Programme Type field for transmission. This encodes up to 31 pre-defined programme types. | |||||||||
V4L2_CID_RDS_TX_PS_NAME | string | ||||||||
| Sets the Programme Service name (PS_NAME) for transmission. It is intended for static display on a receiver. It is the primary aid to listeners in programme service identification and selection. In Annex E of [EN 50067], the RDS specification, there is a full description of the correct character encoding for Programme Service name strings. Also from RDS specification, PS is usually a single eight character text. However, it is also possible to find receivers which can scroll strings sized as 8 x N characters. So, this control must be configured with steps of 8 characters. The result is it must always contain a string with size multiple of 8. | |||||||||
V4L2_CID_RDS_TX_RADIO_TEXT | string | ||||||||
Sets the Radio Text info for transmission. It is a textual description of
what is being broadcasted. RDS Radio Text can be applied when broadcaster wishes to transmit longer PS names,
programme-related information or any other text. In these cases, RadioText should be used in addition to
V4L2_CID_RDS_TX_PS_NAME. The encoding for Radio Text strings is also fully described
in Annex E of [EN 50067]. The length of Radio Text strings depends on which RDS Block is being
used to transmit it, either 32 (2A block) or 64 (2B block). However, it is also possible
to find receivers which can scroll strings sized as 32 x N or 64 x N characters. So, this control must be configured
with steps of 32 or 64 characters. The result is it must always contain a string with size multiple of 32 or 64. | |||||||||
V4L2_CID_AUDIO_LIMITER_ENABLED | boolean | ||||||||
| Enables or disables the audio deviation limiter feature. The limiter is useful when trying to maximize the audio volume, minimize receiver-generated distortion and prevent overmodulation. | |||||||||
V4L2_CID_AUDIO_LIMITER_RELEASE_TIME | integer | ||||||||
| Sets the audio deviation limiter feature release time. Unit is in useconds. Step and range are driver-specific. | |||||||||
V4L2_CID_AUDIO_LIMITER_DEVIATION | integer | ||||||||
| Configures audio frequency deviation level in Hz. The range and step are driver-specific. | |||||||||
V4L2_CID_AUDIO_COMPRESSION_ENABLED | boolean | ||||||||
| Enables or disables the audio compression feature. This feature amplifies signals below the threshold by a fixed gain and compresses audio signals above the threshold by the ratio of Threshold/(Gain + Threshold). | |||||||||
V4L2_CID_AUDIO_COMPRESSION_GAIN | integer | ||||||||
| Sets the gain for audio compression feature. It is a dB value. The range and step are driver-specific. | |||||||||
V4L2_CID_AUDIO_COMPRESSION_THRESHOLD | integer | ||||||||
| Sets the threshold level for audio compression freature. It is a dB value. The range and step are driver-specific. | |||||||||
V4L2_CID_AUDIO_COMPRESSION_ATTACK_TIME | integer | ||||||||
| Sets the attack time for audio compression feature. It is a useconds value. The range and step are driver-specific. | |||||||||
V4L2_CID_AUDIO_COMPRESSION_RELEASE_TIME | integer | ||||||||
| Sets the release time for audio compression feature. It is a useconds value. The range and step are driver-specific. | |||||||||
V4L2_CID_PILOT_TONE_ENABLED | boolean | ||||||||
| Enables or disables the pilot tone generation feature. | |||||||||
V4L2_CID_PILOT_TONE_DEVIATION | integer | ||||||||
| Configures pilot tone frequency deviation level. Unit is in Hz. The range and step are driver-specific. | |||||||||
V4L2_CID_PILOT_TONE_FREQUENCY | integer | ||||||||
| Configures pilot tone frequency value. Unit is in Hz. The range and step are driver-specific. | |||||||||
V4L2_CID_TUNE_PREEMPHASIS | integer | ||||||||
| Configures the pre-emphasis value for broadcasting. A pre-emphasis filter is applied to the broadcast to accentuate the high audio frequencies. Depending on the region, a time constant of either 50 or 75 useconds is used. The enum v4l2_preemphasis defines possible values for pre-emphasis. Here they are: | |||||||||
| |||||||||
V4L2_CID_TUNE_POWER_LEVEL | integer | ||||||||
| Sets the output power level for signal transmission. Unit is in dBuV. Range and step are driver-specific. | |||||||||
V4L2_CID_TUNE_ANTENNA_CAPACITOR | integer | ||||||||
| This selects the value of antenna tuning capacitor manually or automatically if set to zero. Unit, range and step are driver-specific. | |||||||||
For more details about RDS specification, refer to [EN 50067] document, from CENELEC.
Different devices exchange different kinds of data with applications, for example video images, raw or sliced VBI data, RDS datagrams. Even within one kind many different formats are possible, in particular an abundance of image formats. Although drivers must provide a default and the selection persists across closing and reopening a device, applications should always negotiate a data format before engaging in data exchange. Negotiation means the application asks for a particular format and the driver selects and reports the best the hardware can do to satisfy the request. Of course applications can also just query the current selection.
A single mechanism exists to negotiate all data formats
using the aggregate struct v4l2_format and the VIDIOC_G_FMT and
VIDIOC_S_FMT ioctls. Additionally the VIDIOC_TRY_FMT ioctl can be
used to examine what the hardware could do,
without actually selecting a new data format. The data formats
supported by the V4L2 API are covered in the respective device section
in Chapter 4, Interfaces. For a closer look at image formats see
Chapter 2, Image Formats.
The VIDIOC_S_FMT ioctl is a major
turning-point in the initialization sequence. Prior to this point
multiple panel applications can access the same device concurrently to
select the current input, change controls or modify other properties.
The first VIDIOC_S_FMT assigns a logical stream
(video data, VBI data etc.) exclusively to one file descriptor.
Exclusive means no other application, more precisely no other file descriptor, can grab this stream or change device properties inconsistent with the negotiated parameters. A video standard change for example, when the new standard uses a different number of scan lines, can invalidate the selected image format. Therefore only the file descriptor owning the stream can make invalidating changes. Accordingly multiple file descriptors which grabbed different logical streams prevent each other from interfering with their settings. When for example video overlay is about to start or already in progress, simultaneous video capturing may be restricted to the same cropping and image size.
When applications omit the
VIDIOC_S_FMT ioctl its locking side effects are
implied by the next step, the selection of an I/O method with the
VIDIOC_REQBUFS ioctl or implicit with the first read() or
write() call.
Generally only one logical stream can be assigned to a
file descriptor, the exception being drivers permitting simultaneous
video capturing and overlay using the same file descriptor for
compatibility with V4L and earlier versions of V4L2. Switching the
logical stream or returning into "panel mode" is possible by closing
and reopening the device. Drivers may support a
switch using VIDIOC_S_FMT.
All drivers exchanging data with
applications must support the VIDIOC_G_FMT and
VIDIOC_S_FMT ioctl. Implementation of the
VIDIOC_TRY_FMT is highly recommended but
optional.
Apart of the generic format negotiation functions a special ioctl to enumerate all image formats supported by video capture, overlay or output devices is available.[11]
The VIDIOC_ENUM_FMT ioctl must be supported
by all drivers exchanging image data with applications.
Drivers are not supposed to convert image formats in kernel space. They must enumerate only formats directly supported by the hardware. If necessary driver writers should publish an example conversion routine or library for integration into applications.
Some video capture devices can sample a subsection of the picture and shrink or enlarge it to an image of arbitrary size. We call these abilities cropping and scaling. Some video output devices can scale an image up or down and insert it at an arbitrary scan line and horizontal offset into a video signal.
Applications can use the following API to select an area in
the video signal, query the default area and the hardware limits.
Despite their name, the VIDIOC_CROPCAP, VIDIOC_G_CROP
and VIDIOC_S_CROP ioctls apply to input as well as output
devices.
Scaling requires a source and a target. On a video capture
or overlay device the source is the video signal, and the cropping
ioctls determine the area actually sampled. The target are images
read by the application or overlaid onto the graphics screen. Their
size (and position for an overlay) is negotiated with the
VIDIOC_G_FMT and VIDIOC_S_FMT ioctls.
On a video output device the source are the images passed in
by the application, and their size is again negotiated with the
VIDIOC_G/S_FMT ioctls, or may be encoded in a
compressed video stream. The target is the video signal, and the
cropping ioctls determine the area where the images are
inserted.
Source and target rectangles are defined even if the device
does not support scaling or the VIDIOC_G/S_CROP
ioctls. Their size (and position where applicable) will be fixed in
this case. All capture and output device must support the
VIDIOC_CROPCAP ioctl such that applications can
determine if scaling takes place.
For capture devices the coordinates of the top left
corner, width and height of the area which can be sampled is given by
the bounds substructure of the
struct v4l2_cropcap returned by the VIDIOC_CROPCAP
ioctl. To support a wide range of hardware this specification does not
define an origin or units. However by convention drivers should
horizontally count unscaled samples relative to 0H (the leading edge
of the horizontal sync pulse, see Figure 4.1, “Line synchronization”).
Vertically ITU-R line
numbers of the first field (Figure 4.2, “ITU-R 525 line numbering (M/NTSC and M/PAL)”, Figure 4.3, “ITU-R 625 line numbering”), multiplied by two if the driver can capture both
fields.
The top left corner, width and height of the source
rectangle, that is the area actually sampled, is given by struct v4l2_crop
using the same coordinate system as struct v4l2_cropcap. Applications can
use the VIDIOC_G_CROP and
VIDIOC_S_CROP ioctls to get and set this
rectangle. It must lie completely within the capture boundaries and
the driver may further adjust the requested size and/or position
according to hardware limitations.
Each capture device has a default source rectangle, given
by the defrect substructure of
struct v4l2_cropcap. The center of this rectangle shall align with the
center of the active picture area of the video signal, and cover what
the driver writer considers the complete picture. Drivers shall reset
the source rectangle to the default when the driver is first loaded,
but not later.
For output devices these structures and ioctls are used accordingly, defining the target rectangle where the images will be inserted into the video signal.
Video hardware can have various cropping, insertion and
scaling limitations. It may only scale up or down, support only
discrete scaling factors, or have different scaling abilities in
horizontal and vertical direction. Also it may not support scaling at
all. At the same time the struct v4l2_crop rectangle may have to be
aligned, and both the source and target rectangles may have arbitrary
upper and lower size limits. In particular the maximum
width and height
in struct v4l2_crop may be smaller than the
struct v4l2_cropcap.bounds area. Therefore, as
usual, drivers are expected to adjust the requested parameters and
return the actual values selected.
Applications can change the source or the target rectangle
first, as they may prefer a particular image size or a certain area in
the video signal. If the driver has to adjust both to satisfy hardware
limitations, the last requested rectangle shall take priority, and the
driver should preferably adjust the opposite one. The VIDIOC_TRY_FMT
ioctl however shall not change the driver state and therefore only
adjust the requested rectangle.
Suppose scaling on a video capture device is restricted to
a factor 1:1 or 2:1 in either direction and the target image size must
be a multiple of 16 × 16 pixels. The source cropping
rectangle is set to defaults, which are also the upper limit in this
example, of 640 × 400 pixels at offset 0, 0. An
application requests an image size of 300 × 225
pixels, assuming video will be scaled down from the "full picture"
accordingly. The driver sets the image size to the closest possible
values 304 × 224, then chooses the cropping rectangle
closest to the requested size, that is 608 × 224
(224 × 2:1 would exceed the limit 400). The offset
0, 0 is still valid, thus unmodified. Given the default cropping
rectangle reported by VIDIOC_CROPCAP the
application can easily propose another offset to center the cropping
rectangle.
Now the application may insist on covering an area using a picture aspect ratio closer to the original request, so it asks for a cropping rectangle of 608 × 456 pixels. The present scaling factors limit cropping to 640 × 384, so the driver returns the cropping size 608 × 384 and adjusts the image size to closest possible 304 × 192.
Source and target rectangles shall remain unchanged across closing and reopening a device, such that piping data into or out of a device will work without special preparations. More advanced applications should ensure the parameters are suitable before starting I/O.
Example 1.10. Resetting the cropping parameters
(A video capture device is assumed; change
V4L2_BUF_TYPE_VIDEO_CAPTURE for other
devices.)
struct v4l2_cropcap cropcap; struct v4l2_crop crop; memset (&cropcap, 0, sizeof (cropcap)); cropcap.type = V4L2_BUF_TYPE_VIDEO_CAPTURE; if (-1 == ioctl (fd,VIDIOC_CROPCAP, &cropcap)) { perror ("VIDIOC_CROPCAP"); exit (EXIT_FAILURE); } memset (&crop, 0, sizeof (crop)); crop.type = V4L2_BUF_TYPE_VIDEO_CAPTURE; crop.c = cropcap.defrect; /* Ignore if cropping is not supported (EINVAL). */ if (-1 == ioctl (fd,VIDIOC_S_CROP, &crop) && errno != EINVAL) { perror ("VIDIOC_S_CROP"); exit (EXIT_FAILURE); }
Example 1.11. Simple downscaling
(A video capture device is assumed.)
struct v4l2_cropcap cropcap;
struct v4l2_format format;
reset_cropping_parameters ();
/* Scale down to 1/4 size of full picture. */
memset (&format, 0, sizeof (format)); /* defaults */
format.type = V4L2_BUF_TYPE_VIDEO_CAPTURE;
format.fmt.pix.width = cropcap.defrect.width >> 1;
format.fmt.pix.height = cropcap.defrect.height >> 1;
format.fmt.pix.pixelformat = V4L2_PIX_FMT_YUYV;
if (-1 == ioctl (fd, VIDIOC_S_FMT, &format)) {
perror ("VIDIOC_S_FORMAT");
exit (EXIT_FAILURE);
}
/* We could check the actual image size now, the actual scaling factor
or if the driver can scale at all. */
Example 1.12. Selecting an output area
struct v4l2_cropcap cropcap; struct v4l2_crop crop; memset (&cropcap, 0, sizeof (cropcap)); cropcap.type = V4L2_BUF_TYPE_VIDEO_OUTPUT; if (-1 == ioctl (fd, VIDIOC_CROPCAP;, &cropcap)) { perror ("VIDIOC_CROPCAP"); exit (EXIT_FAILURE); } memset (&crop, 0, sizeof (crop)); crop.type = V4L2_BUF_TYPE_VIDEO_OUTPUT; crop.c = cropcap.defrect; /* Scale the width and height to 50 % of their original size and center the output. */ crop.c.width /= 2; crop.c.height /= 2; crop.c.left += crop.c.width / 2; crop.c.top += crop.c.height / 2; /* Ignore if cropping is not supported (EINVAL). */ if (-1 == ioctl (fd, VIDIOC_S_CROP, &crop) && errno != EINVAL) { perror ("VIDIOC_S_CROP"); exit (EXIT_FAILURE); }
Example 1.13. Current scaling factor and pixel aspect
(A video capture device is assumed.)
struct v4l2_cropcap cropcap; struct v4l2_crop crop; struct v4l2_format format; double hscale, vscale; double aspect; int dwidth, dheight; memset (&cropcap, 0, sizeof (cropcap)); cropcap.type = V4L2_BUF_TYPE_VIDEO_CAPTURE; if (-1 == ioctl (fd,VIDIOC_CROPCAP, &cropcap)) { perror ("VIDIOC_CROPCAP"); exit (EXIT_FAILURE); } memset (&crop, 0, sizeof (crop)); crop.type = V4L2_BUF_TYPE_VIDEO_CAPTURE; if (-1 == ioctl (fd,VIDIOC_G_CROP, &crop)) { if (errno != EINVAL) { perror ("VIDIOC_G_CROP"); exit (EXIT_FAILURE); } /* Cropping not supported. */ crop.c = cropcap.defrect; } memset (&format, 0, sizeof (format)); format.fmt.type = V4L2_BUF_TYPE_VIDEO_CAPTURE; if (-1 == ioctl (fd,VIDIOC_G_FMT, &format)) { perror ("VIDIOC_G_FMT"); exit (EXIT_FAILURE); } /* The scaling applied by the driver. */ hscale = format.fmt.pix.width / (double) crop.c.width; vscale = format.fmt.pix.height / (double) crop.c.height; aspect = cropcap.pixelaspect.numerator / (double) cropcap.pixelaspect.denominator; aspect = aspect * hscale / vscale; /* Devices following ITU-R BT.601 do not capture square pixels. For playback on a computer monitor we should scale the images to this size. */ dwidth = format.fmt.pix.width / aspect; dheight = format.fmt.pix.height;
Streaming parameters are intended to optimize the video
capture process as well as I/O. Presently applications can request a
high quality capture mode with the VIDIOC_S_PARM ioctl.
The current video standard determines a nominal number of
frames per second. If less than this number of frames is to be
captured or output, applications can request frame skipping or
duplicating on the driver side. This is especially useful when using
the read() or write(), which are not augmented by timestamps
or sequence counters, and to avoid unneccessary data copying.
Finally these ioctls can be used to determine the number of
buffers used internally by a driver in read/write mode. For
implications see the section discussing the read()
function.
To get and set the streaming parameters applications call
the VIDIOC_G_PARM and VIDIOC_S_PARM ioctl, respectively. They take
a pointer to a struct v4l2_streamparm, which contains a union holding
separate parameters for input and output devices.
These ioctls are optional, drivers need not implement them. If so, they return the EINVAL error code.
[1] Access permissions are associated with character device special files, hence we must ensure device numbers cannot change with the module load order. To this end minor numbers are no longer automatically assigned by the "videodev" module as in V4L but requested by the driver. The defaults will suffice for most people unless two drivers compete for the same minor numbers.
[2] In earlier versions of the V4L2 API the module options where named after the device special file with a "unit_" prefix, expressing the minor number itself, not an offset. Rationale for this change is unknown. Lastly the naming and semantics are just a convention among driver writers, the point to note is that minor numbers are not supposed to be hardcoded into drivers.
[3] Given a device file name one cannot reliable find
related devices. For once names are arbitrary and in a system with
multiple devices, where only some support VBI capturing, a
/dev/video2 is not necessarily related to
/dev/vbi2. The V4L
VIDIOCGUNIT ioctl would require a search for a
device file with a particular major and minor number.
[4] Drivers could recognize the
O_EXCL open flag. Presently this is not required,
so applications cannot know if it really works.
[5] Actually struct v4l2_audio ought to have a
tuner field like struct v4l2_input, not only
making the API more consistent but also permitting radio devices with
multiple tuners.
[6] Some users are already confused by technical terms PAL, NTSC and SECAM. There is no point asking them to distinguish between B, G, D, or K when the software or hardware can do that automatically.
[7] An alternative to the current scheme is to use pointers
to indices as arguments of VIDIOC_G_STD and
VIDIOC_S_STD, the struct v4l2_input and
struct v4l2_output std field would be a set of
indices like audioset.
Indices are consistent with the rest of the API
and identify the standard unambiguously. In the present scheme of
things an enumerated standard is looked up by v4l2_std_id. Now the
standards supported by the inputs of a device can overlap. Just
assume the tuner and composite input in the example above both
exist on a device. An enumeration of "PAL-B/G", "PAL-H/I" suggests
a choice which does not exist. We cannot merge or omit sets, because
applications would be unable to find the standards reported by
VIDIOC_G_STD. That leaves separate enumerations
for each input. Also selecting a standard by v4l2_std_id can be
ambiguous. Advantage of this method is that applications need not
identify the standard indirectly, after enumerating.
So in summary, the lookup itself is unavoidable. The difference is only whether the lookup is necessary to find an enumerated standard or to switch to a standard by v4l2_std_id.
[8] See the section called “Buffers” for a rationale. Probably even USB cameras follow some well known video standard. It might have been better to explicitly indicate elsewhere if a device cannot live up to normal expectations, instead of this exception.
[9] It will be more convenient for applications if drivers
make use of the V4L2_CTRL_FLAG_DISABLED flag, but
that was never required.
[10] Applications could call an ioctl to request events.
After another process called VIDIOC_S_CTRL or another ioctl changing
shared properties the select() function would indicate
readability until any ioctl (querying the properties) is
called.
[11] Enumerating formats an application has no a-priori knowledge of (otherwise it could explicitly ask for them and need not enumerate) seems useless, but there are applications serving as proxy between drivers and the actual video applications for which this is useful.
Table of Contents
The V4L2 API was primarily designed for devices exchanging
image data with applications. The
v4l2_pix_format structure defines the format
and layout of an image in memory. Image formats are negotiated with
the VIDIOC_S_FMT ioctl. (The explanations here focus on video
capturing and output, for overlay frame buffer formats see also
VIDIOC_G_FBUF.)
Table 2.1. struct v4l2_pix_format
| __u32 | width | Image width in pixels. |
| __u32 | height | Image height in pixels. |
Applications set these fields to
request an image size, drivers return the closest possible values. In
case of planar formats the width and
height applies to the largest plane. To
avoid ambiguities drivers must return values rounded up to a multiple
of the scale factor of any smaller planes. For example when the image
format is YUV 4:2:0, width and
height must be multiples of two. | ||
| __u32 | pixelformat | The pixel format or type of compression, set by the application. This is a little endian four character code. V4L2 defines standard RGB formats in Table 2.4, “Packed RGB Image Formats”, YUV formats in the section called “YUV Formats”, and reserved codes in Table 2.8, “Reserved Image Formats” |
| enum v4l2_field | field | Video images are typically interlaced. Applications can request to capture or output only the top or bottom field, or both fields interlaced or sequentially stored in one buffer or alternating in separate buffers. Drivers return the actual field order selected. For details see the section called “Field Order”. |
| __u32 | bytesperline | Distance in bytes between the leftmost pixels in two adjacent lines. |
Both applications and drivers
can set this field to request padding bytes at the end of each line.
Drivers however may ignore the value requested by the application,
returning Video hardware may access padding bytes, therefore they must reside in accessible memory. Consider cases where padding bytes after the last line of an image cross a system page boundary. Input devices may write padding bytes, the value is undefined. Output devices ignore the contents of padding bytes. When the image format is planar the
| ||
| __u32 | sizeimage | Size in bytes of the buffer to hold a complete image,
set by the driver. Usually this is
bytesperline times
height. When the image consists of variable
length compressed data this is the maximum number of bytes required to
hold an image. |
| enum v4l2_colorspace | colorspace | This information supplements the
pixelformat and must be set by the driver,
see the section called “Colorspaces”. |
| __u32 | priv | Reserved for custom (driver defined) additional information about formats. When not used drivers and applications must set this field to zero. |
In order to exchange images between drivers and applications, it is necessary to have standard image data formats which both sides will interpret the same way. V4L2 includes several such formats, and this section is intended to be an unambiguous specification of the standard image data formats in V4L2.
V4L2 drivers are not limited to these formats, however. Driver-specific formats are possible. In that case the application may depend on a codec to convert images to one of the standard formats when needed. But the data can still be stored and retrieved in the proprietary format. For example, a device may support a proprietary compressed format. Applications can still capture and save the data in the compressed format, saving much disk space, and later use a codec to convert the images to the X Windows screen format when the video is to be displayed.
Even so, ultimately, some standard formats are needed, so the V4L2 specification would not be complete without well-defined standard formats.
The V4L2 standard formats are mainly uncompressed formats. The pixels are always arranged in memory from left to right, and from top to bottom. The first byte of data in the image buffer is always for the leftmost pixel of the topmost row. Following that is the pixel immediately to its right, and so on until the end of the top row of pixels. Following the rightmost pixel of the row there may be zero or more bytes of padding to guarantee that each row of pixel data has a certain alignment. Following the pad bytes, if any, is data for the leftmost pixel of the second row from the top, and so on. The last row has just as many pad bytes after it as the other rows.
In V4L2 each format has an identifier which looks like
PIX_FMT_XXX, defined in the videodev.h header file. These identifiers
represent four character codes
which are also listed below, however they are not the same as those
used in the Windows world.
[intro]
[to do]
E'R = f(R)
E'G = f(G)
E'B = f(B)
[to do]
E'Y = CoeffR E'R + CoeffG E'G + CoeffB E'B
(E'R - E'Y) = E'R - CoeffR E'R - CoeffG E'G - CoeffB E'B
(E'B - E'Y) = E'B - CoeffR E'R - CoeffG E'G - CoeffB E'B
The color-difference signals are scaled back to unity range [-0.5;+0.5]:
KB = 0.5 / (1 - CoeffB)
KR = 0.5 / (1 - CoeffR)
PB = KB (E'B - E'Y) = 0.5 (CoeffR / CoeffB) E'R + 0.5 (CoeffG / CoeffB) E'G + 0.5 E'B
PR = KR (E'R - E'Y) = 0.5 E'R + 0.5 (CoeffG / CoeffR) E'G + 0.5 (CoeffB / CoeffR) E'B
[to do]
Y' = (Lum. Levels - 1) · E'Y + Lum. Offset
CB = (Chrom. Levels - 1) · PB + Chrom. Offset
CR = (Chrom. Levels - 1) · PR + Chrom. Offset
Rounding to the nearest integer and clamping to the range [0;255] finally yields the digital color components Y'CbCr stored in YUV images.
Example 2.1. ITU-R Rec. BT.601 color conversion
Forward Transformation
int ER, EG, EB; /* gamma corrected RGB input [0;255] */
int Y1, Cb, Cr; /* output [0;255] */
double r, g, b; /* temporaries */
double y1, pb, pr;
int
clamp (double x)
{
int r = x; /* round to nearest */
if (r < 0) return 0;
else if (r > 255) return 255;
else return r;
}
r = ER / 255.0;
g = EG / 255.0;
b = EB / 255.0;
y1 = 0.299 * r + 0.587 * g + 0.114 * b;
pb = -0.169 * r - 0.331 * g + 0.5 * b;
pr = 0.5 * r - 0.419 * g - 0.081 * b;
Y1 = clamp (219 * y1 + 16);
Cb = clamp (224 * pb + 128);
Cr = clamp (224 * pr + 128);
/* or shorter */
y1 = 0.299 * ER + 0.587 * EG + 0.114 * EB;
Y1 = clamp ( (219 / 255.0) * y1 + 16);
Cb = clamp (((224 / 255.0) / (2 - 2 * 0.114)) * (EB - y1) + 128);
Cr = clamp (((224 / 255.0) / (2 - 2 * 0.299)) * (ER - y1) + 128);
Inverse Transformation
int Y1, Cb, Cr; /* gamma pre-corrected input [0;255] */
int ER, EG, EB; /* output [0;255] */
double r, g, b; /* temporaries */
double y1, pb, pr;
int
clamp (double x)
{
int r = x; /* round to nearest */
if (r < 0) return 0;
else if (r > 255) return 255;
else return r;
}
y1 = (255 / 219.0) * (Y1 - 16);
pb = (255 / 224.0) * (Cb - 128);
pr = (255 / 224.0) * (Cr - 128);
r = 1.0 * y1 + 0 * pb + 1.402 * pr;
g = 1.0 * y1 - 0.344 * pb - 0.714 * pr;
b = 1.0 * y1 + 1.772 * pb + 0 * pr;
ER = clamp (r * 255); /* [ok? one should prob. limit y1,pb,pr] */
EG = clamp (g * 255);
EB = clamp (b * 255);
Table 2.2. enum v4l2_colorspace
| Identifier | Value | Description | Chromaticities[a] | White Point | Gamma Correction | Luminance E'Y | Quantization | |||
|---|---|---|---|---|---|---|---|---|---|---|
| Red | Green | Blue | Y' | Cb, Cr | ||||||
V4L2_COLORSPACE_SMPTE170M | 1 | NTSC/PAL according to [SMPTE 170M], [ITU BT.601] | x = 0.630, y = 0.340 | x = 0.310, y = 0.595 | x = 0.155, y = 0.070 | x = 0.3127, y = 0.3290, Illuminant D65 | E' = 4.5 I for I ≤0.018, 1.099 I0.45 - 0.099 for 0.018 < I | 0.299 E'R + 0.587 E'G + 0.114 E'B | 219 E'Y + 16 | 224 PB,R + 128 |
V4L2_COLORSPACE_SMPTE240M | 2 | 1125-Line (US) HDTV, see [SMPTE 240M] | x = 0.630, y = 0.340 | x = 0.310, y = 0.595 | x = 0.155, y = 0.070 | x = 0.3127, y = 0.3290, Illuminant D65 | E' = 4 I for I ≤0.0228, 1.1115 I0.45 - 0.1115 for 0.0228 < I | 0.212 E'R + 0.701 E'G + 0.087 E'B | 219 E'Y + 16 | 224 PB,R + 128 |
V4L2_COLORSPACE_REC709 | 3 | HDTV and modern devices, see [ITU BT.709] | x = 0.640, y = 0.330 | x = 0.300, y = 0.600 | x = 0.150, y = 0.060 | x = 0.3127, y = 0.3290, Illuminant D65 | E' = 4.5 I for I ≤0.018, 1.099 I0.45 - 0.099 for 0.018 < I | 0.2125 E'R + 0.7154 E'G + 0.0721 E'B | 219 E'Y + 16 | 224 PB,R + 128 |
V4L2_COLORSPACE_BT878 | 4 | Broken Bt878 extents[b], [ITU BT.601] | ? | ? | ? | ? | ? | 0.299 E'R + 0.587 E'G + 0.114 E'B | 237 E'Y + 16 | 224 PB,R + 128 (probably) |
V4L2_COLORSPACE_470_SYSTEM_M | 5 | M/NTSC[c] according to [ITU BT.470], [ITU BT.601] | x = 0.67, y = 0.33 | x = 0.21, y = 0.71 | x = 0.14, y = 0.08 | x = 0.310, y = 0.316, Illuminant C | ? | 0.299 E'R + 0.587 E'G + 0.114 E'B | 219 E'Y + 16 | 224 PB,R + 128 |
V4L2_COLORSPACE_470_SYSTEM_BG | 6 | 625-line PAL and SECAM systems according to [ITU BT.470], [ITU BT.601] | x = 0.64, y = 0.33 | x = 0.29, y = 0.60 | x = 0.15, y = 0.06 | x = 0.313, y = 0.329, Illuminant D65 | ? | 0.299 E'R + 0.587 E'G + 0.114 E'B | 219 E'Y + 16 | 224 PB,R + 128 |
V4L2_COLORSPACE_JPEG | 7 | JPEG Y'CbCr, see [JFIF], [ITU BT.601] | ? | ? | ? | ? | ? | 0.299 E'R + 0.587 E'G + 0.114 E'B | 256 E'Y + 16[d] | 256 PB,R + 128 |
V4L2_COLORSPACE_SRGB | 8 | [?] | x = 0.640, y = 0.330 | x = 0.300, y = 0.600 | x = 0.150, y = 0.060 | x = 0.3127, y = 0.3290, Illuminant D65 | E' = 4.5 I for I ≤0.018, 1.099 I0.45 - 0.099 for 0.018 < I | n/a | ||
[a] The coordinates of the color primaries are given in the CIE system (1931) [b] The ubiquitous Bt878 video capture chip quantizes E'Y to 238 levels, yielding a range of Y' = 16 … 253, unlike Rec. 601 Y' = 16 … 235. This is not a typo in the Bt878 documentation, it has been implemented in silicon. The chroma extents are unclear. [c] No identifier exists for M/PAL which uses the chromaticities of M/NTSC, the remaining parameters are equal to B and G/PAL. [d] Note JFIF quantizes Y'PBPR in range [0;+1] and [-0.5;+0.5] to 257 levels, however Y'CbCr signals are still clamped to [0;255]. | ||||||||||
In this format each pixel is represented by an 8 bit index into a 256 entry ARGB palette. It is intended for Video Output Overlays only. There are no ioctls to access the palette, this must be done with ioctls of the Linux framebuffer API.
Packed RGB formats — Packed RGB formats
These formats are designed to match the pixel formats of typical PC graphics frame buffers. They occupy 8, 16, 24 or 32 bits per pixel. These are all packed-pixel formats, meaning all the data for a pixel lie next to each other in memory.
When one of these formats is used, drivers shall report the
colorspace V4L2_COLORSPACE_SRGB.
Table 2.4. Packed RGB Image Formats
| Identifier | Code | Byte 0 in memory | Byte 1 | Byte 2 | Byte 3 | ||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Bit | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 | |||||
V4L2_PIX_FMT_RGB332 | 'RGB1' | b1 | b0 | g2 | g1 | g0 | r2 | r1 | r0 | ||||||||||||||||||||||||||||
V4L2_PIX_FMT_RGB444 | 'R444' | g3 | g2 | g1 | g0 | b3 | b2 | b1 | b0 | a3 | a2 | a1 | a0 | r3 | r2 | r1 | r0 | ||||||||||||||||||||
V4L2_PIX_FMT_RGB555 | 'RGBO' | g2 | g1 | g0 | r4 | r3 | r2 | r1 | r0 | a | b4 | b3 | b2 | b1 | b0 | g4 | g3 | ||||||||||||||||||||
V4L2_PIX_FMT_RGB565 | 'RGBP' | g2 | g1 | g0 | r4 | r3 | r2 | r1 | r0 | b4 | b3 | b2 | b1 | b0 | g5 | g4 | g3 | ||||||||||||||||||||
V4L2_PIX_FMT_RGB555X | 'RGBQ' | a | b4 | b3 | b2 | b1 | b0 | g4 | g3 | g2 | g1 | g0 | r4 | r3 | r2 | r1 | r0 | ||||||||||||||||||||
V4L2_PIX_FMT_RGB565X | 'RGBR' | b4 | b3 | b2 | b1 | b0 | g5 | g4 | g3 | g2 | g1 | g0 | r4 | r3 | r2 | r1 | r0 | ||||||||||||||||||||
V4L2_PIX_FMT_BGR24 | 'BGR3' | b7 | b6 | b5 | b4 | b3 | b2 | b1 | b0 | g7 | g6 | g5 | g4 | g3 | g2 | g1 | g0 | r7 | r6 | r5 | r4 | r3 | r2 | r1 | r0 | ||||||||||||
V4L2_PIX_FMT_RGB24 | 'RGB3' | r7 | r6 | r5 | r4 | r3 | r2 | r1 | r0 | g7 | g6 | g5 | g4 | g3 | g2 | g1 | g0 | b7 | b6 | b5 | b4 | b3 | b2 | b1 | b0 | ||||||||||||
V4L2_PIX_FMT_BGR32 | 'BGR4' | b7 | b6 | b5 | b4 | b3 | b2 | b1 | b0 | g7 | g6 | g5 | g4 | g3 | g2 | g1 | g0 | r7 | r6 | r5 | r4 | r3 | r2 | r1 | r0 | a7 | a6 | a5 | a4 | a3 | a2 | a1 | a0 | ||||
V4L2_PIX_FMT_RGB32 | 'RGB4' | r7 | r6 | r5 | r4 | r3 | r2 | r1 | r0 | g7 | g6 | g5 | g4 | g3 | g2 | g1 | g0 | b7 | b6 | b5 | b4 | b3 | b2 | b1 | b0 | a7 | a6 | a5 | a4 | a3 | a2 | a1 | a0 | ||||
Bit 7 is the most significant bit. The value of a = alpha bits is undefined when reading from the driver, ignored when writing to the driver, except when alpha blending has been negotiated for a Video Overlay or Video Output Overlay.
Example 2.2. V4L2_PIX_FMT_BGR24 4 × 4 pixel
image
Byte Order. Each cell is one byte.
| start + 0: | B00 | G00 | R00 | B01 | G01 | R01 | B02 | G02 | R02 | B03 | G03 | R03 |
| start + 12: | B10 | G10 | R10 | B11 | G11 | R11 | B12 | G12 | R12 | B13 | G13 | R13 |
| start + 24: | B20 | G20 | R20 | B21 | G21 | R21 | B22 | G22 | R22 | B23 | G23 | R23 |
| start + 36: | B30 | G30 | R30 | B31 | G31 | R31 | B32 | G32 | R32 | B33 | G33 | R33 |
Drivers may interpret these formats differently.
Some RGB formats above are uncommon and were probably defined in error. Drivers may interpret them as in Table 2.5, “Packed RGB Image Formats (corrected)”.
Table 2.5. Packed RGB Image Formats (corrected)
| Identifier | Code | Byte 0 in memory | Byte 1 | Byte 2 | Byte 3 | ||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Bit | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 | |||||
V4L2_PIX_FMT_RGB332 | 'RGB1' | r2 | r1 | r0 | g2 | g1 | g0 | b1 | b0 | ||||||||||||||||||||||||||||
V4L2_PIX_FMT_RGB444 | 'R444' | g3 | g2 | g1 | g0 | b3 | b2 | b1 | b0 | a3 | a2 | a1 | a0 | r3 | r2 | r1 | r0 | ||||||||||||||||||||
V4L2_PIX_FMT_RGB555 | 'RGBO' | g2 | g1 | g0 | b4 | b3 | b2 | b1 | b0 | a | r4 | r3 | r2 | r1 | r0 | g4 | g3 | ||||||||||||||||||||
V4L2_PIX_FMT_RGB565 | 'RGBP' | g2 | g1 | g0 | b4 | b3 | b2 | b1 | b0 | r4 | r3 | r2 | r1 | r0 | g5 | g4 | g3 | ||||||||||||||||||||
V4L2_PIX_FMT_RGB555X | 'RGBQ' | a | r4 | r3 | r2 | r1 | r0 | g4 | g3 | g2 | g1 | g0 | b4 | b3 | b2 | b1 | b0 | ||||||||||||||||||||
V4L2_PIX_FMT_RGB565X | 'RGBR' | r4 | r3 | r2 | r1 | r0 | g5 | g4 | g3 | g2 | g1 | g0 | b4 | b3 | b2 | b1 | b0 | ||||||||||||||||||||
V4L2_PIX_FMT_BGR24 | 'BGR3' | b7 | b6 | b5 | b4 | b3 | b2 | b1 | b0 | g7 | g6 | g5 | g4 | g3 | g2 | g1 | g0 | r7 | r6 | r5 | r4 | r3 | r2 | r1 | r0 | ||||||||||||
V4L2_PIX_FMT_RGB24 | 'RGB3' | r7 | r6 | r5 | r4 | r3 | r2 | r1 | r0 | g7 | g6 | g5 | g4 | g3 | g2 | g1 | g0 | b7 | b6 | b5 | b4 | b3 | b2 | b1 | b0 | ||||||||||||
V4L2_PIX_FMT_BGR32 | 'BGR4' | b7 | b6 | b5 | b4 | b3 | b2 | b1 | b0 | g7 | g6 | g5 | g4 | g3 | g2 | g1 | g0 | r7 | r6 | r5 | r4 | r3 | r2 | r1 | r0 | a7 | a6 | a5 | a4 | a3 | a2 | a1 | a0 | ||||
V4L2_PIX_FMT_RGB32 | 'RGB4' | a7 | a6 | a5 | a4 | a3 | a2 | a1 | a0 | r7 | r6 | r5 | r4 | r3 | r2 | r1 | r0 | g7 | g6 | g5 | g4 | g3 | g2 | g1 | g0 | b7 | b6 | b5 | b4 | b3 | b2 | b1 | b0 | ||||
A test utility to determine which RGB formats a driver actually supports is available from the LinuxTV v4l-dvb repository. See http://linuxtv.org/repo/ for access instructions.
V4L2_PIX_FMT_SBGGR8 — Bayer RGB format
This is commonly the native format of digital cameras, reflecting the arrangement of sensors on the CCD device. Only one red, green or blue value is given for each pixel. Missing components must be interpolated from neighbouring pixels. From left to right the first row consists of a blue and green value, the second row of a green and red value. This scheme repeats to the right and down for every two columns and rows.
Example 2.3. V4L2_PIX_FMT_SBGGR8 4 × 4
pixel image
Byte Order. Each cell is one byte.
| start + 0: | B00 | G01 | B02 | G03 |
| start + 4: | G10 | R11 | G12 | R13 |
| start + 8: | B20 | G21 | B22 | G23 |
| start + 12: | G30 | R31 | G32 | R33 |
V4L2_PIX_FMT_SGBRG8 — Bayer RGB format
This is commonly the native format of digital cameras, reflecting the arrangement of sensors on the CCD device. Only one red, green or blue value is given for each pixel. Missing components must be interpolated from neighbouring pixels. From left to right the first row consists of a green and blue value, the second row of a red and green value. This scheme repeats to the right and down for every two columns and rows.
Example 2.4. V4L2_PIX_FMT_SGBRG8 4 × 4
pixel image
Byte Order. Each cell is one byte.
| start + 0: | G00 | B01 | G02 | B03 |
| start + 4: | R10 | G11 | R12 | G13 |
| start + 8: | G20 | B21 | G22 | B23 |
| start + 12: | R30 | G31 | R32 | G33 |
V4L2_PIX_FMT_SGRBG8 — Bayer RGB format
This is commonly the native format of digital cameras, reflecting the arrangement of sensors on the CCD device. Only one red, green or blue value is given for each pixel. Missing components must be interpolated from neighbouring pixels. From left to right the first row consists of a green and blue value, the second row of a red and green value. This scheme repeats to the right and down for every two columns and rows.
Example 2.5. V4L2_PIX_FMT_SGRBG8 4 ×
4 pixel image
Byte Order. Each cell is one byte.
| start + 0: | G00 | R01 | G02 | R03 |
| start + 4: | R10 | B11 | R12 | B13 |
| start + 8: | G20 | R21 | G22 | R23 |
| start + 12: | R30 | B31 | R32 | B33 |
V4L2_PIX_FMT_SBGGR16 — Bayer RGB format
This format is similar to
V4L2_PIX_FMT_SBGGR8, except each pixel has
a depth of 16 bits. The least significant byte is stored at lower
memory addresses (little-endian). Note the actual sampling precision
may be lower than 16 bits, for example 10 bits per pixel with values
in range 0 to 1023.
Example 2.6. V4L2_PIX_FMT_SBGGR16 4 × 4
pixel image
Byte Order. Each cell is one byte.
| start + 0: | B00low | B00high | G01low | G01high | B02low | B02high | G03low | G03high |
| start + 8: | G10low | G10high | R11low | R11high | G12low | G12high | R13low | R13high |
| start + 16: | B20low | B20high | G21low | G21high | B22low | B22high | G23low | G23high |
| start + 24: | G30low | G30high | R31low | R31high | G32low | G32high | R33low | R33high |
YUV is the format native to TV broadcast and composite video signals. It separates the brightness information (Y) from the color information (U and V or Cb and Cr). The color information consists of red and blue color difference signals, this way the green component can be reconstructed by subtracting from the brightness component. See the section called “Colorspaces” for conversion examples. YUV was chosen because early television would only transmit brightness information. To add color in a way compatible with existing receivers a new signal carrier was added to transmit the color difference signals. Secondary in the YUV format the U and V components usually have lower resolution than the Y component. This is an analog video compression technique taking advantage of a property of the human visual system, being more sensitive to brightness information.
Packed YUV formats — Packed YUV formats
Similar to the packed RGB formats these formats store the Y, Cb and Cr component of each pixel in one 16 or 32 bit word.
Table 2.6. Packed YUV Image Formats
| Identifier | Code | Byte 0 in memory | Byte 1 | Byte 2 | Byte 3 | ||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Bit | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 | |||||
V4L2_PIX_FMT_YUV444 | 'Y444' | Cb3 | Cb2 | Cb1 | Cb0 | Cr3 | Cr2 | Cr1 | Cr0 | a3 | a2 | a1 | a0 | Y'3 | Y'2 | Y'1 | Y'0 | ||||||||||||||||||||
V4L2_PIX_FMT_YUV555 | 'YUVO' | Cb2 | Cb1 | Cb0 | Cr4 | Cr3 | Cr2 | Cr1 | Cr0 | a | Y'4 | Y'3 | Y'2 | Y'1 | Y'0 | Cb4 | Cb3 | ||||||||||||||||||||
V4L2_PIX_FMT_YUV565 | 'YUVP' | Cb2 | Cb1 | Cb0 | Cr4 | Cr3 | Cr2 | Cr1 | Cr0 | Y'4 | Y'3 | Y'2 | Y'1 | Y'0 | Cb5 | Cb4 | Cb3 | ||||||||||||||||||||
V4L2_PIX_FMT_YUV32 | 'YUV4' | a7 | a6 | a5 | a4 | a3 | a2 | a1 | a0 | Y'7 | Y'6 | Y'5 | Y'4 | Y'3 | Y'2 | Y'1 | Y'0 | Cb7 | Cb6 | Cb5 | Cb4 | Cb3 | Cb2 | Cb1 | Cb0 | Cr7 | Cr6 | Cr5 | Cr4 | Cr3 | Cr2 | Cr1 | Cr0 | ||||
Bit 7 is the most significant bit. The value of a = alpha bits is undefined when reading from the driver, ignored when writing to the driver, except when alpha blending has been negotiated for a Video Overlay or Video Output Overlay.
V4L2_PIX_FMT_GREY — Grey-scale image
This is a grey-scale image. It is really a degenerate Y'CbCr format which simply contains no Cb or Cr data.
Example 2.7. V4L2_PIX_FMT_GREY 4 × 4
pixel image
Byte Order. Each cell is one byte.
| start + 0: | Y'00 | Y'01 | Y'02 | Y'03 |
| start + 4: | Y'10 | Y'11 | Y'12 | Y'13 |
| start + 8: | Y'20 | Y'21 | Y'22 | Y'23 |
| start + 12: | Y'30 | Y'31 | Y'32 | Y'33 |
V4L2_PIX_FMT_Y16 — Grey-scale image
This is a grey-scale image with a depth of 16 bits per pixel. The least significant byte is stored at lower memory addresses (little-endian). Note the actual sampling precision may be lower than 16 bits, for example 10 bits per pixel with values in range 0 to 1023.
Example 2.8. V4L2_PIX_FMT_Y16 4 × 4
pixel image
Byte Order. Each cell is one byte.
| start + 0: | Y'00low | Y'00high | Y'01low | Y'01high | Y'02low | Y'02high | Y'03low | Y'03high |
| start + 8: | Y'10low | Y'10high | Y'11low | Y'11high | Y'12low | Y'12high | Y'13low | Y'13high |
| start + 16: | Y'20low | Y'20high | Y'21low | Y'21high | Y'22low | Y'22high | Y'23low | Y'23high |
| start + 24: | Y'30low | Y'30high | Y'31low | Y'31high | Y'32low | Y'32high | Y'33low | Y'33high |
V4L2_PIX_FMT_YUYV — Packed format with ½ horizontal chroma
resolution, also known as YUV 4:2:2
In this format each four bytes is two pixels. Each four
bytes is two Y's, a Cb and a Cr. Each Y goes to one of the pixels, and
the Cb and Cr belong to both pixels. As you can see, the Cr and Cb
components have half the horizontal resolution of the Y component.
V4L2_PIX_FMT_YUYV is known in the Windows
environment as YUY2.
Example 2.9. V4L2_PIX_FMT_YUYV 4 × 4
pixel image
Byte Order. Each cell is one byte.
| start + 0: | Y'00 | Cb00 | Y'01 | Cr00 | Y'02 | Cb01 | Y'03 | Cr01 |
| start + 8: | Y'10 | Cb10 | Y'11 | Cr10 | Y'12 | Cb11 | Y'13 | Cr11 |
| start + 16: | Y'20 | Cb20 | Y'21 | Cr20 | Y'22 | Cb21 | Y'23 | Cr21 |
| start + 24: | Y'30 | Cb30 | Y'31 | Cr30 | Y'32 | Cb31 | Y'33 | Cr31 |
Color Sample Location.
| 0 | 1 | 2 | 3 | ||||
| 0 | Y | C | Y | Y | C | Y | |
| 1 | Y | C | Y | Y | C | Y | |
| 2 | Y | C | Y | Y | C | Y | |
| 3 | Y | C | Y | Y | C | Y |
V4L2_PIX_FMT_UYVY — Variation of
V4L2_PIX_FMT_YUYV with different order of samples
in memory
In this format each four bytes is two pixels. Each four bytes is two Y's, a Cb and a Cr. Each Y goes to one of the pixels, and the Cb and Cr belong to both pixels. As you can see, the Cr and Cb components have half the horizontal resolution of the Y component.
Example 2.10. V4L2_PIX_FMT_UYVY 4 × 4
pixel image
Byte Order. Each cell is one byte.
| start + 0: | Cb00 | Y'00 | Cr00 | Y'01 | Cb01 | Y'02 | Cr01 | Y'03 |
| start + 8: | Cb10 | Y'10 | Cr10 | Y'11 | Cb11 | Y'12 | Cr11 | Y'13 |
| start + 16: | Cb20 | Y'20 | Cr20 | Y'21 | Cb21 | Y'22 | Cr21 | Y'23 |
| start + 24: | Cb30 | Y'30 | Cr30 | Y'31 | Cb31 | Y'32 | Cr31 | Y'33 |
Color Sample Location.
| 0 | 1 | 2 | 3 | ||||
| 0 | Y | C | Y | Y | C | Y | |
| 1 | Y | C | Y | Y | C | Y | |
| 2 | Y | C | Y | Y | C | Y | |
| 3 | Y | C | Y | Y | C | Y |
V4L2_PIX_FMT_YVYU — Variation of
V4L2_PIX_FMT_YUYV with different order of samples
in memory
In this format each four bytes is two pixels. Each four bytes is two Y's, a Cb and a Cr. Each Y goes to one of the pixels, and the Cb and Cr belong to both pixels. As you can see, the Cr and Cb components have half the horizontal resolution of the Y component.
Example 2.11. V4L2_PIX_FMT_YVYU 4 × 4
pixel image
Byte Order. Each cell is one byte.
| start + 0: | Y'00 | Cr00 | Y'01 | Cb00 | Y'02 | Cr01 | Y'03 | Cb01 |
| start + 8: | Y'10 | Cr10 | Y'11 | Cb10 | Y'12 | Cr11 | Y'13 | Cb11 |
| start + 16: | Y'20 | Cr20 | Y'21 | Cb20 | Y'22 | Cr21 | Y'23 | Cb21 |
| start + 24: | Y'30 | Cr30 | Y'31 | Cb30 | Y'32 | Cr31 | Y'33 | Cb31 |
Color Sample Location.
| 0 | 1 | 2 | 3 | ||||
| 0 | Y | C | Y | Y | C | Y | |
| 1 | Y | C | Y | Y | C | Y | |
| 2 | Y | C | Y | Y | C | Y | |
| 3 | Y | C | Y | Y | C | Y |
V4L2_PIX_FMT_VYUY — Variation of
V4L2_PIX_FMT_YUYV with different order of samples
in memory
In this format each four bytes is two pixels. Each four bytes is two Y's, a Cb and a Cr. Each Y goes to one of the pixels, and the Cb and Cr belong to both pixels. As you can see, the Cr and Cb components have half the horizontal resolution of the Y component.
Example 2.12. V4L2_PIX_FMT_VYUY 4 × 4
pixel image
Byte Order. Each cell is one byte.
| start + 0: | Cr00 | Y'00 | Cb00 | Y'01 | Cr01 | Y'02 | Cb01 | Y'03 |
| start + 8: | Cr10 | Y'10 | Cb10 | Y'11 | Cr11 | Y'12 | Cb11 | Y'13 |
| start + 16: | Cr20 | Y'20 | Cb20 | Y'21 | Cr21 | Y'22 | Cb21 | Y'23 |
| start + 24: | Cr30 | Y'30 | Cb30 | Y'31 | Cr31 | Y'32 | Cb31 | Y'33 |
Color Sample Location.
| 0 | 1 | 2 | 3 | ||||
| 0 | Y | C | Y | Y | C | Y | |
| 1 | Y | C | Y | Y | C | Y | |
| 2 | Y | C | Y | Y | C | Y | |
| 3 | Y | C | Y | Y | C | Y |
V4L2_PIX_FMT_Y41P — Format with ¼ horizontal chroma
resolution, also known as YUV 4:1:1
In this format each 12 bytes is eight pixels. In the twelve bytes are two CbCr pairs and eight Y's. The first CbCr pair goes with the first four Y's, and the second CbCr pair goes with the other four Y's. The Cb and Cr components have one fourth the horizontal resolution of the Y component.
Do not confuse this format with V4L2_PIX_FMT_YUV411P.
Y41P is derived from "YUV 4:1:1 packed", while
YUV411P stands for "YUV 4:1:1 planar".
Example 2.13. V4L2_PIX_FMT_Y41P 8 × 4
pixel image
Byte Order. Each cell is one byte.
| start + 0: | Cb00 | Y'00 | Cr00 | Y'01 | Cb01 | Y'02 | Cr01 | Y'03 | Y'04 | Y'05 | Y'06 | Y'07 |
| start + 12: | Cb10 | Y'10 | Cr10 | Y'11 | Cb11 | Y'12 | Cr11 | Y'13 | Y'14 | Y'15 | Y'16 | Y'17 |
| start + 24: | Cb20 | Y'20 | Cr20 | Y'21 | Cb21 | Y'22 | Cr21 | Y'23 | Y'24 | Y'25 | Y'26 | Y'27 |
| start + 36: | Cb30 | Y'30 | Cr30 | Y'31 | Cb31 | Y'32 | Cr31 | Y'33 | Y'34 | Y'35 | Y'36 | Y'37 |
Color Sample Location.
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | ||||||||
| 0 | Y | Y | C | Y | Y | Y | Y | C | Y | Y | |||||
| 1 | Y | Y | C | Y | Y | Y | Y | C | Y | Y | |||||
| 2 | Y | Y | C | Y | Y | Y | Y | C | Y | Y | |||||
| 3 | Y | Y | C | Y | Y | Y | Y | C | Y | Y |
V4L2_PIX_FMT_YVU420, V4L2_PIX_FMT_YUV420 — Planar formats with ½ horizontal and
vertical chroma resolution, also known as YUV 4:2:0
These are planar formats, as opposed to a packed format.
The three components are separated into three sub- images or planes.
The Y plane is first. The Y plane has one byte per pixel. For
V4L2_PIX_FMT_YVU420, the Cr plane immediately
follows the Y plane in memory. The Cr plane is half the width and half
the height of the Y plane (and of the image). Each Cr belongs to four
pixels, a two-by-two square of the image. For example,
Cr0 belongs to Y'00,
Y'01, Y'10, and
Y'11. Following the Cr plane is the Cb plane,
just like the Cr plane. V4L2_PIX_FMT_YUV420 is
the same except the Cb plane comes first, then the Cr plane.
If the Y plane has pad bytes after each row, then the Cr and Cb planes have half as many pad bytes after their rows. In other words, two Cx rows (including padding) is exactly as long as one Y row (including padding).
Example 2.14. V4L2_PIX_FMT_YVU420 4 × 4
pixel image
Byte Order. Each cell is one byte.
| start + 0: | Y'00 | Y'01 | Y'02 | Y'03 |
| start + 4: | Y'10 | Y'11 | Y'12 | Y'13 |
| start + 8: | Y'20 | Y'21 | Y'22 | Y'23 |
| start + 12: | Y'30 | Y'31 | Y'32 | Y'33 |
| start + 16: | Cr00 | Cr01 | ||
| start + 18: | Cr10 | Cr11 | ||
| start + 20: | Cb00 | Cb01 | ||
| start + 22: | Cb10 | Cb11 |
Color Sample Location.
| 0 | 1 | 2 | 3 | ||||
| 0 | Y | Y | Y | Y | |||
| C | C | ||||||
| 1 | Y | Y | Y | Y | |||
| 2 | Y | Y | Y | Y | |||
| C | C | ||||||
| 3 | Y | Y | Y | Y |
V4L2_PIX_FMT_YVU410, V4L2_PIX_FMT_YUV410 — Planar formats with ¼ horizontal and
vertical chroma resolution, also known as YUV 4:1:0
These are planar formats, as opposed to a packed format.
The three components are separated into three sub-images or planes.
The Y plane is first. The Y plane has one byte per pixel. For
V4L2_PIX_FMT_YVU410, the Cr plane immediately
follows the Y plane in memory. The Cr plane is ¼ the width and
¼ the height of the Y plane (and of the image). Each Cr belongs
to 16 pixels, a four-by-four square of the image. Following the Cr
plane is the Cb plane, just like the Cr plane.
V4L2_PIX_FMT_YUV410 is the same, except the Cb
plane comes first, then the Cr plane.
If the Y plane has pad bytes after each row, then the Cr and Cb planes have ¼ as many pad bytes after their rows. In other words, four Cx rows (including padding) are exactly as long as one Y row (including padding).
Example 2.15. V4L2_PIX_FMT_YVU410 4 × 4
pixel image
Byte Order. Each cell is one byte.
| start + 0: | Y'00 | Y'01 | Y'02 | Y'03 |
| start + 4: | Y'10 | Y'11 | Y'12 | Y'13 |
| start + 8: | Y'20 | Y'21 | Y'22 | Y'23 |
| start + 12: | Y'30 | Y'31 | Y'32 | Y'33 |
| start + 16: | Cr00 | |||
| start + 17: | Cb00 |
Color Sample Location.
| 0 | 1 | 2 | 3 | ||||
| 0 | Y | Y | Y | Y | |||
| 1 | Y | Y | Y | Y | |||
| C | |||||||
| 2 | Y | Y | Y | Y | |||
| 3 | Y | Y | Y | Y |
V4L2_PIX_FMT_YUV422P — Format with ½ horizontal chroma resolution,
also known as YUV 4:2:2. Planar layout as opposed to
V4L2_PIX_FMT_YUYV
This format is not commonly used. This is a planar version of the YUYV format. The three components are separated into three sub-images or planes. The Y plane is first. The Y plane has one byte per pixel. The Cb plane immediately follows the Y plane in memory. The Cb plane is half the width of the Y plane (and of the image). Each Cb belongs to two pixels. For example, Cb0 belongs to Y'00, Y'01. Following the Cb plane is the Cr plane, just like the Cb plane.
If the Y plane has pad bytes after each row, then the Cr and Cb planes have half as many pad bytes after their rows. In other words, two Cx rows (including padding) is exactly as long as one Y row (including padding).
Example 2.16. V4L2_PIX_FMT_YUV422P 4 × 4
pixel image
Byte Order. Each cell is one byte.
| start + 0: | Y'00 | Y'01 | Y'02 | Y'03 |
| start + 4: | Y'10 | Y'11 | Y'12 | Y'13 |
| start + 8: | Y'20 | Y'21 | Y'22 | Y'23 |
| start + 12: | Y'30 | Y'31 | Y'32 | Y'33 |
| start + 16: | Cb00 | Cb01 | ||
| start + 18: | Cb10 | Cb11 | ||
| start + 20: | Cb20 | Cb21 | ||
| start + 22: | Cb30 | Cb31 | ||
| start + 24: | Cr00 | Cr01 | ||
| start + 26: | Cr10 | Cr11 | ||
| start + 28: | Cr20 | Cr21 | ||
| start + 30: | Cr30 | Cr31 |
Color Sample Location.
| 0 | 1 | 2 | 3 | ||||
| 0 | Y | C | Y | Y | C | Y | |
| 1 | Y | C | Y | Y | C | Y | |
| 2 | Y | C | Y | Y | C | Y | |
| 3 | Y | C | Y | Y | C | Y |
V4L2_PIX_FMT_YUV411P — Format with ¼ horizontal chroma resolution,
also known as YUV 4:1:1. Planar layout as opposed to
V4L2_PIX_FMT_Y41P
This format is not commonly used. This is a planar format similar to the 4:2:2 planar format except with half as many chroma. The three components are separated into three sub-images or planes. The Y plane is first. The Y plane has one byte per pixel. The Cb plane immediately follows the Y plane in memory. The Cb plane is ¼ the width of the Y plane (and of the image). Each Cb belongs to 4 pixels all on the same row. For example, Cb0 belongs to Y'00, Y'01, Y'02 and Y'03. Following the Cb plane is the Cr plane, just like the Cb plane.
If the Y plane has pad bytes after each row, then the Cr and Cb planes have ¼ as many pad bytes after their rows. In other words, four C x rows (including padding) is exactly as long as one Y row (including padding).
Example 2.17. V4L2_PIX_FMT_YUV411P 4 × 4
pixel image
Byte Order. Each cell is one byte.
| start + 0: | Y'00 | Y'01 | Y'02 | Y'03 |
| start + 4: | Y'10 | Y'11 | Y'12 | Y'13 |
| start + 8: | Y'20 | Y'21 | Y'22 | Y'23 |
| start + 12: | Y'30 | Y'31 | Y'32 | Y'33 |
| start + 16: | Cb00 | |||
| start + 17: | Cb10 | |||
| start + 18: | Cb20 | |||
| start + 19: | Cb30 | |||
| start + 20: | Cr00 | |||
| start + 21: | Cr10 | |||
| start + 22: | Cr20 | |||
| start + 23: | Cr30 |
Color Sample Location.
| 0 | 1 | 2 | 3 | ||||
| 0 | Y | Y | C | Y | Y | ||
| 1 | Y | Y | C | Y | Y | ||
| 2 | Y | Y | C | Y | Y | ||
| 3 | Y | Y | C | Y | Y |
V4L2_PIX_FMT_NV12, V4L2_PIX_FMT_NV21 — Formats with ½ horizontal and vertical
chroma resolution, also known as YUV 4:2:0. One luminance and one
chrominance plane with alternating chroma samples as opposed to
V4L2_PIX_FMT_YVU420
These are two-plane versions of the YUV 4:2:0 format.
The three components are separated into two sub-images or planes. The
Y plane is first. The Y plane has one byte per pixel. For
V4L2_PIX_FMT_NV12, a combined CbCr plane
immediately follows the Y plane in memory. The CbCr plane is the same
width, in bytes, as the Y plane (and of the image), but is half as
tall in pixels. Each CbCr pair belongs to four pixels. For example,
Cb0/Cr0 belongs to
Y'00, Y'01,
Y'10, Y'11.
V4L2_PIX_FMT_NV21 is the same except the Cb and
Cr bytes are swapped, the CrCb plane starts with a Cr byte.
If the Y plane has pad bytes after each row, then the CbCr plane has as many pad bytes after its rows.
Example 2.18. V4L2_PIX_FMT_NV12 4 × 4
pixel image
Byte Order. Each cell is one byte.
| start + 0: | Y'00 | Y'01 | Y'02 | Y'03 |
| start + 4: | Y'10 | Y'11 | Y'12 | Y'13 |
| start + 8: | Y'20 | Y'21 | Y'22 | Y'23 |
| start + 12: | Y'30 | Y'31 | Y'32 | Y'33 |
| start + 16: | Cb00 | Cr00 | Cb01 | Cr01 |
| start + 20: | Cb10 | Cr10 | Cb11 | Cr11 |
Color Sample Location.
| 0 | 1 | 2 | 3 | ||||
| 0 | Y | Y | Y | Y | |||
| C | C | ||||||
| 1 | Y | Y | Y | Y | |||
| 2 | Y | Y | Y | Y | |||
| C | C | ||||||
| 3 | Y | Y | Y | Y |
V4L2_PIX_FMT_NV16, V4L2_PIX_FMT_NV61 — Formats with ½ horizontal
chroma resolution, also known as YUV 4:2:2. One luminance and one
chrominance plane with alternating chroma samples as opposed to
V4L2_PIX_FMT_YVU420
These are two-plane versions of the YUV 4:2:2 format.
The three components are separated into two sub-images or planes. The
Y plane is first. The Y plane has one byte per pixel. For
V4L2_PIX_FMT_NV16, a combined CbCr plane
immediately follows the Y plane in memory. The CbCr plane is the same
width and height, in bytes, as the Y plane (and of the image).
Each CbCr pair belongs to two pixels. For example,
Cb0/Cr0 belongs to
Y'00, Y'01.
V4L2_PIX_FMT_NV61 is the same except the Cb and
Cr bytes are swapped, the CrCb plane starts with a Cr byte.
If the Y plane has pad bytes after each row, then the CbCr plane has as many pad bytes after its rows.
Example 2.19. V4L2_PIX_FMT_NV16 4 × 4
pixel image
Byte Order. Each cell is one byte.
| start + 0: | Y'00 | Y'01 | Y'02 | Y'03 |
| start + 4: | Y'10 | Y'11 | Y'12 | Y'13 |
| start + 8: | Y'20 | Y'21 | Y'22 | Y'23 |
| start + 12: | Y'30 | Y'31 | Y'32 | Y'33 |
| start + 16: | Cb00 | Cr00 | Cb01 | Cr01 |
| start + 20: | Cb10 | Cr10 | Cb11 | Cr11 |
| start + 24: | Cb20 | Cr20 | Cb21 | Cr21 |
| start + 28: | Cb30 | Cr30 | Cb31 | Cr31 |
Color Sample Location.
| 0 | 1 | 2 | 3 | ||||
| 0 | Y | Y | Y | Y | |||
| C | C | ||||||
| 1 | Y | Y | Y | Y | |||
| C | C | ||||||
| 2 | Y | Y | Y | Y | |||
| C | C | ||||||
| 3 | Y | Y | Y | Y | |||
| C | C |
Table 2.7. Compressed Image Formats
| Identifier | Code | Details |
|---|---|---|
V4L2_PIX_FMT_JPEG | 'JPEG' | TBD. See also VIDIOC_G_JPEGCOMP,
VIDIOC_S_JPEGCOMP. |
V4L2_PIX_FMT_MPEG | 'MPEG' | MPEG stream. The actual format is determined by
extended control V4L2_CID_MPEG_STREAM_TYPE, see
Table 1.2, “MPEG Control IDs”. |
These formats are not defined by this specification, they
are just listed for reference and to avoid naming conflicts. If you
want to register your own format, send an e-mail to the linux-media mailing
list http://www.linuxtv.org/lists.php for inclusion in the videodev2.h
file. If you want to share your format with other developers add a
link to your documentation and send a copy to the linux-media mailing list
for inclusion in this section. If you think your format should be listed
in a standard format section please make a proposal on the linux-media mailing
list.
Table 2.8. Reserved Image Formats
| Identifier | Code | Details |
|---|---|---|
V4L2_PIX_FMT_DV | 'dvsd' | unknown |
V4L2_PIX_FMT_ET61X251 | 'E625' | Compressed format of the ET61X251 driver. |
V4L2_PIX_FMT_HI240 | 'HI24' | 8 bit RGB format used by the BTTV driver. |
V4L2_PIX_FMT_HM12 | 'HM12' | YUV 4:2:0 format used by the IVTV driver, http://www.ivtvdriver.org/ The format is documented in the
kernel sources in the file |
V4L2_PIX_FMT_SPCA501 | 'S501' | YUYV per line used by the gspca driver. |
V4L2_PIX_FMT_SPCA505 | 'S505' | YYUV per line used by the gspca driver. |
V4L2_PIX_FMT_SPCA508 | 'S508' | YUVY per line used by the gspca driver. |
V4L2_PIX_FMT_SPCA561 | 'S561' | Compressed GBRG Bayer format used by the gspca driver. |
V4L2_PIX_FMT_SGRBG10 | 'DA10' | 10 bit raw Bayer, expanded to 16 bits. |
V4L2_PIX_FMT_SGRBG10DPCM8 | 'DB10' | 10 bit raw Bayer DPCM compressed to 8 bits. |
V4L2_PIX_FMT_PAC207 | 'P207' | Compressed BGGR Bayer format used by the gspca driver. |
V4L2_PIX_FMT_MR97310A | 'M310' | Compressed BGGR Bayer format used by the gspca driver. |
V4L2_PIX_FMT_OV511 | 'O511' | OV511 JPEG format used by the gspca driver. |
V4L2_PIX_FMT_OV518 | 'O518' | OV518 JPEG format used by the gspca driver. |
V4L2_PIX_FMT_PJPG | 'PJPG' | Pixart 73xx JPEG format used by the gspca driver. |
V4L2_PIX_FMT_SQ905C | '905C' | Compressed RGGB bayer format used by the gspca driver. |
V4L2_PIX_FMT_MJPEG | 'MJPG' | Compressed format used by the Zoran driver |
V4L2_PIX_FMT_PWC1 | 'PWC1' | Compressed format of the PWC driver. |
V4L2_PIX_FMT_PWC2 | 'PWC2' | Compressed format of the PWC driver. |
V4L2_PIX_FMT_SN9C10X | 'S910' | Compressed format of the SN9C102 driver. |
V4L2_PIX_FMT_SN9C20X_I420 | 'S920' | YUV 4:2:0 format of the gspca sn9c20x driver. |
V4L2_PIX_FMT_WNVA | 'WNVA' | Used by the Winnov Videum driver, http://www.thedirks.org/winnov/ |
V4L2_PIX_FMT_TM6000 | 'TM60' | Used by Trident tm6000 |
V4L2_PIX_FMT_YYUV | 'YYUV' | unknown |
Table of Contents
The V4L2 API defines several different methods to read from or write to a device. All drivers exchanging data with applications must support at least one of them.
The classic I/O method using the read()
and write() function is automatically selected
after opening a V4L2 device. When the driver does not support this
method attempts to read or write will fail at any time.
Other methods must be negotiated. To select the streaming I/O
method with memory mapped or user buffers applications call the
VIDIOC_REQBUFS ioctl. The asynchronous I/O method is not defined
yet.
Video overlay can be considered another I/O method, although
the application does not directly receive the image data. It is
selected by initiating video overlay with the VIDIOC_S_FMT ioctl.
For more information see the section called “Video Overlay Interface”.
Generally exactly one I/O method, including overlay, is associated with each file descriptor. The only exceptions are applications not exchanging data with a driver ("panel applications", see the section called “Opening and Closing Devices”) and drivers permitting simultaneous video capturing and overlay using the same file descriptor, for compatibility with V4L and earlier versions of V4L2.
VIDIOC_S_FMT and
VIDIOC_REQBUFS would permit this to some degree,
but for simplicity drivers need not support switching the I/O method
(after first switching away from read/write) other than by closing
and reopening the device.
The following sections describe the various I/O methods in more detail.
Input and output devices support the
read() and write() function,
respectively, when the V4L2_CAP_READWRITE flag in
the capabilities field of struct v4l2_capability
returned by the VIDIOC_QUERYCAP ioctl is set.
Drivers may need the CPU to copy the data, but they may also support DMA to or from user memory, so this I/O method is not necessarily less efficient than other methods merely exchanging buffer pointers. It is considered inferior though because no meta-information like frame counters or timestamps are passed. This information is necessary to recognize frame dropping and to synchronize with other data streams. However this is also the simplest I/O method, requiring little or no setup to exchange data. It permits command line stunts like this (the vidctrl tool is fictitious):
> vidctrl /dev/video --input=0 --format=YUYV --size=352x288 > dd if=/dev/video of=myimage.422 bs=202752 count=1
To read from the device applications use the
read() function, to write the write() function.
Drivers must implement one I/O method if they
exchange data with applications, but it need not be this.[12] When reading or writing is supported, the driver
must also support the select() and poll()
function.[13]
Input and output devices support this I/O method when the
V4L2_CAP_STREAMING flag in the
capabilities field of struct v4l2_capability
returned by the VIDIOC_QUERYCAP ioctl is set. There are two
streaming methods, to determine if the memory mapping flavor is
supported applications must call the VIDIOC_REQBUFS ioctl.
Streaming is an I/O method where only pointers to buffers are exchanged between application and driver, the data itself is not copied. Memory mapping is primarily intended to map buffers in device memory into the application's address space. Device memory can be for example the video memory on a graphics card with a video capture add-on. However, being the most efficient I/O method available for a long time, many other drivers support streaming as well, allocating buffers in DMA-able main memory.
A driver can support many sets of buffers. Each set is identified by a unique buffer type value. The sets are independent and each set can hold a different type of data. To access different sets at the same time different file descriptors must be used.[14]
To allocate device buffers applications call the
VIDIOC_REQBUFS ioctl with the desired number of buffers and buffer
type, for example V4L2_BUF_TYPE_VIDEO_CAPTURE.
This ioctl can also be used to change the number of buffers or to free
the allocated memory, provided none of the buffers are still
mapped.
Before applications can access the buffers they must map
them into their address space with the mmap() function. The
location of the buffers in device memory can be determined with the
VIDIOC_QUERYBUF ioctl. The m.offset and
length returned in a struct v4l2_buffer are
passed as sixth and second parameter to the
mmap() function. The offset and length values
must not be modified. Remember the buffers are allocated in physical
memory, as opposed to virtual memory which can be swapped out to disk.
Applications should free the buffers as soon as possible with the
munmap() function.
Example 3.1. Mapping buffers
struct v4l2_requestbuffers reqbuf; struct { void *start; size_t length; } *buffers; unsigned int i; memset (&reqbuf, 0, sizeof (reqbuf)); reqbuf.type = V4L2_BUF_TYPE_VIDEO_CAPTURE; reqbuf.memory = V4L2_MEMORY_MMAP; reqbuf.count = 20; if (-1 == ioctl (fd,VIDIOC_REQBUFS, &reqbuf)) { if (errno == EINVAL) printf ("Video capturing or mmap-streaming is not supported\n"); else perror ("VIDIOC_REQBUFS"); exit (EXIT_FAILURE); } /* We want at least five buffers. */ if (reqbuf.count < 5) { /* You may need to free the buffers here. */ printf ("Not enough buffer memory\n"); exit (EXIT_FAILURE); } buffers = calloc (reqbuf.count, sizeof (*buffers)); assert (buffers != NULL); for (i = 0; i < reqbuf.count; i++) { struct v4l2_buffer buffer; memset (&buffer, 0, sizeof (buffer)); buffer.type = reqbuf.type; buffer.memory = V4L2_MEMORY_MMAP; buffer.index = i; if (-1 == ioctl (fd,VIDIOC_QUERYBUF, &buffer)) { perror ("VIDIOC_QUERYBUF"); exit (EXIT_FAILURE); } buffers[i].length = buffer.length; /* remember for munmap() */ buffers[i].start = mmap (NULL, buffer.length, PROT_READ | PROT_WRITE, /* recommended */ MAP_SHARED, /* recommended */ fd, buffer.m.offset); if (MAP_FAILED == buffers[i].start) { /* If you do not exit here you should unmap() and free() the buffers mapped so far. */ perror ("mmap"); exit (EXIT_FAILURE); } } /* Cleanup. */ for (i = 0; i < reqbuf.count; i++) munmap (buffers[i].start, buffers[i].length);
Conceptually streaming drivers maintain two buffer queues, an incoming and an outgoing queue. They separate the synchronous capture or output operation locked to a video clock from the application which is subject to random disk or network delays and preemption by other processes, thereby reducing the probability of data loss. The queues are organized as FIFOs, buffers will be output in the order enqueued in the incoming FIFO, and were captured in the order dequeued from the outgoing FIFO.
The driver may require a minimum number of buffers enqueued
at all times to function, apart of this no limit exists on the number
of buffers applications can enqueue in advance, or dequeue and
process. They can also enqueue in a different order than buffers have
been dequeued, and the driver can fill enqueued
empty buffers in any order. [15] The index number of a buffer (struct v4l2_buffer
index) plays no role here, it only
identifies the buffer.
Initially all mapped buffers are in dequeued state,
inaccessible by the driver. For capturing applications it is customary
to first enqueue all mapped buffers, then to start capturing and enter
the read loop. Here the application waits until a filled buffer can be
dequeued, and re-enqueues the buffer when the data is no longer
needed. Output applications fill and enqueue buffers, when enough
buffers are stacked up the output is started with
VIDIOC_STREAMON. In the write loop, when
the application runs out of free buffers, it must wait until an empty
buffer can be dequeued and reused.
To enqueue and dequeue a buffer applications use the
VIDIOC_QBUF and VIDIOC_DQBUF ioctl. The status of a buffer being
mapped, enqueued, full or empty can be determined at any time using the
VIDIOC_QUERYBUF ioctl. Two methods exist to suspend execution of the
application until one or more buffers can be dequeued. By default
VIDIOC_DQBUF blocks when no buffer is in the
outgoing queue. When the O_NONBLOCK flag was
given to the open() function, VIDIOC_DQBUF
returns immediately with an EAGAIN error code when no buffer is available. The
select() or poll() function are always available.
To start and stop capturing or output applications call the
VIDIOC_STREAMON and VIDIOC_STREAMOFF ioctl. Note
VIDIOC_STREAMOFF removes all buffers from both
queues as a side effect. Since there is no notion of doing anything
"now" on a multitasking system, if an application needs to synchronize
with another event it should examine the struct v4l2_buffer
timestamp of captured buffers, or set the
field before enqueuing buffers for output.
Drivers implementing memory mapping I/O must
support the VIDIOC_REQBUFS,
VIDIOC_QUERYBUF,
VIDIOC_QBUF, VIDIOC_DQBUF,
VIDIOC_STREAMON and
VIDIOC_STREAMOFF ioctl, the
mmap(), munmap(),
select() and poll()
function.[16]
[capture example]
Input and output devices support this I/O method when the
V4L2_CAP_STREAMING flag in the
capabilities field of struct v4l2_capability
returned by the VIDIOC_QUERYCAP ioctl is set. If the particular user
pointer method (not only memory mapping) is supported must be
determined by calling the VIDIOC_REQBUFS ioctl.
This I/O method combines advantages of the read/write and
memory mapping methods. Buffers are allocated by the application
itself, and can reside for example in virtual or shared memory. Only
pointers to data are exchanged, these pointers and meta-information
are passed in struct v4l2_buffer. The driver must be switched
into user pointer I/O mode by calling the VIDIOC_REQBUFS with the
desired buffer type. No buffers are allocated beforehands,
consequently they are not indexed and cannot be queried like mapped
buffers with the VIDIOC_QUERYBUF ioctl.
Example 3.2. Initiating streaming I/O with user pointers
struct v4l2_requestbuffers reqbuf;
memset (&reqbuf, 0, sizeof (reqbuf));
reqbuf.type = V4L2_BUF_TYPE_VIDEO_CAPTURE;
reqbuf.memory = V4L2_MEMORY_USERPTR;
if (ioctl (fd, VIDIOC_REQBUFS, &reqbuf) == -1) {
if (errno == EINVAL)
printf ("Video capturing or user pointer streaming is not supported\n");
else
perror ("VIDIOC_REQBUFS");
exit (EXIT_FAILURE);
}
Buffer addresses and sizes are passed on the fly with the
VIDIOC_QBUF ioctl. Although buffers are commonly cycled,
applications can pass different addresses and sizes at each
VIDIOC_QBUF call. If required by the hardware the
driver swaps memory pages within physical memory to create a
continuous area of memory. This happens transparently to the
application in the virtual memory subsystem of the kernel. When buffer
pages have been swapped out to disk they are brought back and finally
locked in physical memory for DMA.[17]
Filled or displayed buffers are dequeued with the
VIDIOC_DQBUF ioctl. The driver can unlock the memory pages at any
time between the completion of the DMA and this ioctl. The memory is
also unlocked when VIDIOC_STREAMOFF is called, VIDIOC_REQBUFS, or
when the device is closed. Applications must take care not to free
buffers without dequeuing. For once, the buffers remain locked until
further, wasting physical memory. Second the driver will not be
notified when the memory is returned to the application's free list
and subsequently reused for other purposes, possibly completing the
requested DMA and overwriting valuable data.
For capturing applications it is customary to enqueue a
number of empty buffers, to start capturing and enter the read loop.
Here the application waits until a filled buffer can be dequeued, and
re-enqueues the buffer when the data is no longer needed. Output
applications fill and enqueue buffers, when enough buffers are stacked
up output is started. In the write loop, when the application
runs out of free buffers it must wait until an empty buffer can be
dequeued and reused. Two methods exist to suspend execution of the
application until one or more buffers can be dequeued. By default
VIDIOC_DQBUF blocks when no buffer is in the
outgoing queue. When the O_NONBLOCK flag was
given to the open() function, VIDIOC_DQBUF
returns immediately with an EAGAIN error code when no buffer is available. The
select() or poll() function are always available.
To start and stop capturing or output applications call the
VIDIOC_STREAMON and VIDIOC_STREAMOFF ioctl. Note
VIDIOC_STREAMOFF removes all buffers from both
queues and unlocks all buffers as a side effect. Since there is no
notion of doing anything "now" on a multitasking system, if an
application needs to synchronize with another event it should examine
the struct v4l2_buffer timestamp of captured
buffers, or set the field before enqueuing buffers for output.
Drivers implementing user pointer I/O must
support the VIDIOC_REQBUFS,
VIDIOC_QBUF, VIDIOC_DQBUF,
VIDIOC_STREAMON and
VIDIOC_STREAMOFF ioctl, the
select() and poll() function.[18]
A buffer contains data exchanged by application and
driver using one of the Streaming I/O methods. Only pointers to
buffers are exchanged, the data itself is not copied. These pointers,
together with meta-information like timestamps or field parity, are
stored in a struct v4l2_buffer, argument to
the VIDIOC_QUERYBUF, VIDIOC_QBUF and VIDIOC_DQBUF ioctl.
Nominally timestamps refer to the first data byte transmitted. In practice however the wide range of hardware covered by the V4L2 API limits timestamp accuracy. Often an interrupt routine will sample the system clock shortly after the field or frame was stored completely in memory. So applications must expect a constant difference up to one field or frame period plus a small (few scan lines) random error. The delay and error can be much larger due to compression or transmission over an external bus when the frames are not properly stamped by the sender. This is frequently the case with USB cameras. Here timestamps refer to the instant the field or frame was received by the driver, not the capture time. These devices identify by not enumerating any video standards, see the section called “Video Standards”.
Similar limitations apply to output timestamps. Typically the video hardware locks to a clock controlling the video timing, the horizontal and vertical synchronization pulses. At some point in the line sequence, possibly the vertical blanking, an interrupt routine samples the system clock, compares against the timestamp and programs the hardware to repeat the previous field or frame, or to display the buffer contents.
Apart of limitations of the video device and natural inaccuracies of all clocks, it should be noted system time itself is not perfectly stable. It can be affected by power saving cycles, warped to insert leap seconds, or even turned back or forth by the system administrator affecting long term measurements. [19]
Table 3.1. struct v4l2_buffer
| __u32 | index | Number of the buffer, set by the application. This
field is only used for memory mapping I/O
and can range from zero to the number of buffers allocated
with the VIDIOC_REQBUFS ioctl (struct v4l2_requestbuffers count) minus one. | |
| enum v4l2_buf_type | type | Type of the buffer, same as struct v4l2_format
type or struct v4l2_requestbuffers
type, set by the application. | |
| __u32 | bytesused | The number of bytes occupied by the data in the
buffer. It depends on the negotiated data format and may change with
each buffer for compressed variable size data like JPEG images.
Drivers must set this field when type
refers to an input stream, applications when an output stream. | |
| __u32 | flags | Flags set by the application or driver, see Table 3.3, “Buffer Flags”. | |
| enum v4l2_field | field | Indicates the field order of the image in the
buffer, see Table 3.8, “enum v4l2_field”. This field is not used when
the buffer contains VBI data. Drivers must set it when
type refers to an input stream,
applications when an output stream. | |
| struct timeval | timestamp | For input streams this is the
system time (as returned by the | |
| struct v4l2_timecode | timecode | When type is
V4L2_BUF_TYPE_VIDEO_CAPTURE and the
V4L2_BUF_FLAG_TIMECODE flag is set in
flags, this structure contains a frame
timecode. In V4L2_FIELD_ALTERNATE
mode the top and bottom field contain the same timecode.
Timecodes are intended to help video editing and are typically recorded on
video tapes, but also embedded in compressed formats like MPEG. This
field is independent of the timestamp and
sequence fields. | |
| __u32 | sequence | Set by the driver, counting the frames in the sequence. | |
In V4L2_FIELD_ALTERNATE mode the top and bottom field have the same sequence number. The count starts at zero and includes dropped or repeated frames. A dropped frame was received by an input device but could not be stored due to lack of free buffer space. A repeated frame was displayed again by an output device because the application did not pass new data in time. Note this may count the frames received e.g. over USB, without taking into account the frames dropped by the remote hardware due to limited compression throughput or bus bandwidth. These devices identify by not enumerating any video standards, see the section called “Video Standards”. | |||
| enum v4l2_memory | memory | This field must be set by applications and/or drivers in accordance with the selected I/O method. | |
| union | m | ||
| __u32 | offset | When memory is
V4L2_MEMORY_MMAP this is the offset of the buffer
from the start of the device memory. The value is returned by the
driver and apart of serving as parameter to the mmap() function
not useful for applications. See the section called “Streaming I/O (Memory Mapping)” for details. | |
| unsigned long | userptr | When memory is
V4L2_MEMORY_USERPTR this is a pointer to the
buffer (casted to unsigned long type) in virtual memory, set by the
application. See the section called “Streaming I/O (User Pointers)” for details. | |
| __u32 | length | Size of the buffer (not the payload) in bytes. | |
| __u32 | input | Some video capture drivers support rapid and
synchronous video input changes, a function useful for example in
video surveillance applications. For this purpose applications set the
V4L2_BUF_FLAG_INPUT flag, and this field to the
number of a video input as in struct v4l2_input field
index. | |
| __u32 | reserved | A place holder for future extensions and custom
(driver defined) buffer types
V4L2_BUF_TYPE_PRIVATE and higher. | |
Table 3.2. enum v4l2_buf_type
V4L2_BUF_TYPE_VIDEO_CAPTURE | 1 | Buffer of a video capture stream, see the section called “Video Capture Interface”. |
V4L2_BUF_TYPE_VIDEO_OUTPUT | 2 | Buffer of a video output stream, see the section called “Video Output Interface”. |
V4L2_BUF_TYPE_VIDEO_OVERLAY | 3 | Buffer for video overlay, see the section called “Video Overlay Interface”. |
V4L2_BUF_TYPE_VBI_CAPTURE | 4 | Buffer of a raw VBI capture stream, see the section called “Raw VBI Data Interface”. |
V4L2_BUF_TYPE_VBI_OUTPUT | 5 | Buffer of a raw VBI output stream, see the section called “Raw VBI Data Interface”. |
V4L2_BUF_TYPE_SLICED_VBI_CAPTURE | 6 | Buffer of a sliced VBI capture stream, see the section called “Sliced VBI Data Interface”. |
V4L2_BUF_TYPE_SLICED_VBI_OUTPUT | 7 | Buffer of a sliced VBI output stream, see the section called “Sliced VBI Data Interface”. |
V4L2_BUF_TYPE_VIDEO_OUTPUT_OVERLAY | 8 | Buffer for video output overlay (OSD), see the section called “Video Output Overlay Interface”. Status: Experimental. |
V4L2_BUF_TYPE_PRIVATE | 0x80 | This and higher values are reserved for custom (driver defined) buffer types. |
Table 3.3. Buffer Flags
V4L2_BUF_FLAG_MAPPED | 0x0001 | The buffer resides in device memory and has been mapped into the application's address space, see the section called “Streaming I/O (Memory Mapping)” for details. Drivers set or clear this flag when the VIDIOC_QUERYBUF, VIDIOC_QBUF or VIDIOC_DQBUF ioctl is called. Set by the driver. |
V4L2_BUF_FLAG_QUEUED | 0x0002 | Internally drivers maintain two buffer queues, an
incoming and outgoing queue. When this flag is set, the buffer is
currently on the incoming queue. It automatically moves to the
outgoing queue after the buffer has been filled (capture devices) or
displayed (output devices). Drivers set or clear this flag when the
VIDIOC_QUERYBUF ioctl is called. After
(successful) calling the VIDIOC_QBUF ioctl it is
always set and after VIDIOC_DQBUF always
cleared. |
V4L2_BUF_FLAG_DONE | 0x0004 | When this flag is set, the buffer is currently on
the outgoing queue, ready to be dequeued from the driver. Drivers set
or clear this flag when the VIDIOC_QUERYBUF ioctl
is called. After calling the VIDIOC_QBUF or
VIDIOC_DQBUF it is always cleared. Of course a
buffer cannot be on both queues at the same time, the
V4L2_BUF_FLAG_QUEUED and
V4L2_BUF_FLAG_DONE flag are mutually exclusive.
They can be both cleared however, then the buffer is in "dequeued"
state, in the application domain to say so. |
V4L2_BUF_FLAG_KEYFRAME | 0x0008 | Drivers set or clear this flag when calling the
VIDIOC_DQBUF ioctl. It may be set by video
capture devices when the buffer contains a compressed image which is a
key frame (or field), i. e. can be decompressed on its own. |
V4L2_BUF_FLAG_PFRAME | 0x0010 | Similar to V4L2_BUF_FLAG_KEYFRAME
this flags predicted frames or fields which contain only differences to a
previous key frame. |
V4L2_BUF_FLAG_BFRAME | 0x0020 | Similar to V4L2_BUF_FLAG_PFRAME
this is a bidirectional predicted frame or field. [ooc tbd] |
V4L2_BUF_FLAG_TIMECODE | 0x0100 | The timecode field is valid.
Drivers set or clear this flag when the VIDIOC_DQBUF
ioctl is called. |
V4L2_BUF_FLAG_INPUT | 0x0200 | The input field is valid.
Applications set or clear this flag before calling the
VIDIOC_QBUF ioctl. |
Table 3.4. enum v4l2_memory
V4L2_MEMORY_MMAP | 1 | The buffer is used for memory mapping I/O. |
V4L2_MEMORY_USERPTR | 2 | The buffer is used for user pointer I/O. |
V4L2_MEMORY_OVERLAY | 3 | [to do] |
The v4l2_timecode structure is
designed to hold a [SMPTE 12M] or similar timecode.
(struct timeval timestamps are stored in
struct v4l2_buffer field timestamp.)
Table 3.5. struct v4l2_timecode
| __u32 | type | Frame rate the timecodes are based on, see Table 3.6, “Timecode Types”. |
| __u32 | flags | Timecode flags, see Table 3.7, “Timecode Flags”. |
| __u8 | frames | Frame count, 0 ... 23/24/29/49/59, depending on the type of timecode. |
| __u8 | seconds | Seconds count, 0 ... 59. This is a binary, not BCD number. |
| __u8 | minutes | Minutes count, 0 ... 59. This is a binary, not BCD number. |
| __u8 | hours | Hours count, 0 ... 29. This is a binary, not BCD number. |
| __u8 | userbits[4] | The "user group" bits from the timecode. |
Table 3.6. Timecode Types
V4L2_TC_TYPE_24FPS | 1 | 24 frames per second, i. e. film. |
V4L2_TC_TYPE_25FPS | 2 | 25 frames per second, i. e. PAL or SECAM video. |
V4L2_TC_TYPE_30FPS | 3 | 30 frames per second, i. e. NTSC video. |
V4L2_TC_TYPE_50FPS | 4 | |
V4L2_TC_TYPE_60FPS | 5 |
Table 3.7. Timecode Flags
V4L2_TC_FLAG_DROPFRAME | 0x0001 | Indicates "drop frame" semantics for counting frames in 29.97 fps material. When set, frame numbers 0 and 1 at the start of each minute, except minutes 0, 10, 20, 30, 40, 50 are omitted from the count. |
V4L2_TC_FLAG_COLORFRAME | 0x0002 | The "color frame" flag. |
V4L2_TC_USERBITS_field | 0x000C | Field mask for the "binary group flags". |
V4L2_TC_USERBITS_USERDEFINED | 0x0000 | Unspecified format. |
V4L2_TC_USERBITS_8BITCHARS | 0x0008 | 8-bit ISO characters. |
We have to distinguish between progressive and interlaced video. Progressive video transmits all lines of a video image sequentially. Interlaced video divides an image into two fields, containing only the odd and even lines of the image, respectively. Alternating the so called odd and even field are transmitted, and due to a small delay between fields a cathode ray TV displays the lines interleaved, yielding the original frame. This curious technique was invented because at refresh rates similar to film the image would fade out too quickly. Transmitting fields reduces the flicker without the necessity of doubling the frame rate and with it the bandwidth required for each channel.
It is important to understand a video camera does not expose one frame at a time, merely transmitting the frames separated into fields. The fields are in fact captured at two different instances in time. An object on screen may well move between one field and the next. For applications analysing motion it is of paramount importance to recognize which field of a frame is older, the temporal order.
When the driver provides or accepts images field by field rather than interleaved, it is also important applications understand how the fields combine to frames. We distinguish between top and bottom fields, the spatial order: The first line of the top field is the first line of an interlaced frame, the first line of the bottom field is the second line of that frame.
However because fields were captured one after the other, arguing whether a frame commences with the top or bottom field is pointless. Any two successive top and bottom, or bottom and top fields yield a valid frame. Only when the source was progressive to begin with, e. g. when transferring film to video, two fields may come from the same frame, creating a natural order.
Counter to intuition the top field is not necessarily the older field. Whether the older field contains the top or bottom lines is a convention determined by the video standard. Hence the distinction between temporal and spatial order of fields. The diagrams below should make this clearer.
All video capture and output devices must report the current
field order. Some drivers may permit the selection of a different
order, to this end applications initialize the
field field of struct v4l2_pix_format before
calling the VIDIOC_S_FMT ioctl. If this is not desired it should
have the value V4L2_FIELD_ANY (0).
Table 3.8. enum v4l2_field
V4L2_FIELD_ANY | 0 | Applications request this field order when any
one of the V4L2_FIELD_NONE,
V4L2_FIELD_TOP,
V4L2_FIELD_BOTTOM, or
V4L2_FIELD_INTERLACED formats is acceptable.
Drivers choose depending on hardware capabilities or e. g. the
requested image size, and return the actual field order. struct v4l2_buffer
field can never be
V4L2_FIELD_ANY. |
V4L2_FIELD_NONE | 1 | Images are in progressive format, not interlaced.
The driver may also indicate this order when it cannot distinguish
between V4L2_FIELD_TOP and
V4L2_FIELD_BOTTOM. |
V4L2_FIELD_TOP | 2 | Images consist of the top field only. |
V4L2_FIELD_BOTTOM | 3 | Images consist of the bottom field only. Applications may wish to prevent a device from capturing interlaced images because they will have "comb" or "feathering" artefacts around moving objects. |
V4L2_FIELD_INTERLACED | 4 | Images contain both fields, interleaved line by line. The temporal order of the fields (whether the top or bottom field is first transmitted) depends on the current video standard. M/NTSC transmits the bottom field first, all other standards the top field first. |
V4L2_FIELD_SEQ_TB | 5 | Images contain both fields, the top field lines are stored first in memory, immediately followed by the bottom field lines. Fields are always stored in temporal order, the older one first in memory. Image sizes refer to the frame, not fields. |
V4L2_FIELD_SEQ_BT | 6 | Images contain both fields, the bottom field lines are stored first in memory, immediately followed by the top field lines. Fields are always stored in temporal order, the older one first in memory. Image sizes refer to the frame, not fields. |
V4L2_FIELD_ALTERNATE | 7 | The two fields of a frame are passed in separate
buffers, in temporal order, i. e. the older one first. To indicate the field
parity (whether the current field is a top or bottom field) the driver
or application, depending on data direction, must set struct v4l2_buffer
field to
V4L2_FIELD_TOP or
V4L2_FIELD_BOTTOM. Any two successive fields pair
to build a frame. If fields are successive, without any dropped fields
between them (fields can drop individually), can be determined from
the struct v4l2_buffer sequence field. Image
sizes refer to the frame, not fields. This format cannot be selected
when using the read/write I/O method. |
V4L2_FIELD_INTERLACED_TB | 8 | Images contain both fields, interleaved line by line, top field first. The top field is transmitted first. |
V4L2_FIELD_INTERLACED_BT | 9 | Images contain both fields, interleaved line by line, top field first. The bottom field is transmitted first. |
[12] It would be desirable if applications could depend on drivers supporting all I/O interfaces, but as much as the complex memory mapping I/O can be inadequate for some devices we have no reason to require this interface, which is most useful for simple applications capturing still images.
[13] At the driver level select() and
poll() are the same, and
select() is too important to be optional.
[14] One could use one file descriptor and set the buffer
type field accordingly when calling VIDIOC_QBUF etc., but it makes
the select() function ambiguous. We also like the
clean approach of one file descriptor per logical stream. Video
overlay for example is also a logical stream, although the CPU is not
needed for continuous operation.
[15] Random enqueue order permits applications processing images out of order (such as video codecs) to return buffers earlier, reducing the probability of data loss. Random fill order allows drivers to reuse buffers on a LIFO-basis, taking advantage of caches holding scatter-gather lists and the like.
[16] At the driver level select() and
poll() are the same, and
select() is too important to be optional. The
rest should be evident.
[17] We expect that frequently used buffers are typically not swapped out. Anyway, the process of swapping, locking or generating scatter-gather lists may be time consuming. The delay can be masked by the depth of the incoming buffer queue, and perhaps by maintaining caches assuming a buffer will be soon enqueued again. On the other hand, to optimize memory usage drivers can limit the number of buffers locked in advance and recycle the most recently used buffers first. Of course, the pages of empty buffers in the incoming queue need not be saved to disk. Output buffers must be saved on the incoming and outgoing queue because an application may share them with other processes.
[18] At the driver level select() and
poll() are the same, and
select() is too important to be optional. The
rest should be evident.
[19] Since no other Linux multimedia API supports unadjusted time it would be foolish to introduce here. We must use a universally supported clock to synchronize different media, hence time of day.
Table of Contents
Video capture devices sample an analog video signal and store the digitized images in memory. Today nearly all devices can capture at full 25 or 30 frames/second. With this interface applications can control the capture process and move images from the driver into user space.
Conventionally V4L2 video capture devices are accessed through
character device special files named /dev/video
and /dev/video0 to
/dev/video63 with major number 81 and minor
numbers 0 to 63. /dev/video is typically a
symbolic link to the preferred video device. Note the same device
files are used for video output devices.
Devices supporting the video capture interface set the
V4L2_CAP_VIDEO_CAPTURE flag in the
capabilities field of struct v4l2_capability
returned by the VIDIOC_QUERYCAP ioctl. As secondary device functions
they may also support the video overlay
(V4L2_CAP_VIDEO_OVERLAY) and the raw VBI capture
(V4L2_CAP_VBI_CAPTURE) interface. At least one of
the read/write or streaming I/O methods must be supported. Tuners and
audio inputs are optional.
Video capture devices shall support audio input, tuner, controls, cropping and scaling and streaming parameter ioctls as needed. The video input and video standard ioctls must be supported by all video capture devices.
The result of a capture operation is determined by cropping and image format parameters. The former select an area of the video picture to capture, the latter how images are stored in memory, i. e. in RGB or YUV format, the number of bits per pixel or width and height. Together they also define how images are scaled in the process.
As usual these parameters are not reset
at open() time to permit Unix tool chains, programming a device
and then reading from it as if it was a plain file. Well written V4L2
applications ensure they really get what they want, including cropping
and scaling.
Cropping initialization at minimum requires to reset the parameters to defaults. An example is given in the section called “Image Cropping, Insertion and Scaling”.
To query the current image format applications set the
type field of a struct v4l2_format to
V4L2_BUF_TYPE_VIDEO_CAPTURE and call the
VIDIOC_G_FMT ioctl with a pointer to this structure. Drivers fill
the struct v4l2_pix_format pix member of the
fmt union.
To request different parameters applications set the
type field of a struct v4l2_format as above and
initialize all fields of the struct v4l2_pix_format
vbi member of the
fmt union, or better just modify the
results of VIDIOC_G_FMT, and call the
VIDIOC_S_FMT ioctl with a pointer to this structure. Drivers may
adjust the parameters and finally return the actual parameters as
VIDIOC_G_FMT does.
Like VIDIOC_S_FMT the
VIDIOC_TRY_FMT ioctl can be used to learn about hardware limitations
without disabling I/O or possibly time consuming hardware
preparations.
The contents of struct v4l2_pix_format are discussed in Chapter 2, Image Formats. See also the specification of the
VIDIOC_G_FMT, VIDIOC_S_FMT
and VIDIOC_TRY_FMT ioctls for details. Video
capture devices must implement both the
VIDIOC_G_FMT and
VIDIOC_S_FMT ioctl, even if
VIDIOC_S_FMT ignores all requests and always
returns default parameters as VIDIOC_G_FMT does.
VIDIOC_TRY_FMT is optional.
A video capture device may support the read() function and/or streaming (memory mapping or user pointer) I/O. See Chapter 3, Input/Output for details.
Video overlay devices have the ability to genlock (TV-)video into the (VGA-)video signal of a graphics card, or to store captured images directly in video memory of a graphics card, typically with clipping. This can be considerable more efficient than capturing images and displaying them by other means. In the old days when only nuclear power plants needed cooling towers this used to be the only way to put live video into a window.
Video overlay devices are accessed through the same character
special files as video capture devices.
Note the default function of a /dev/video device
is video capturing. The overlay function is only available after
calling the VIDIOC_S_FMT ioctl.
The driver may support simultaneous overlay and capturing using the read/write and streaming I/O methods. If so, operation at the nominal frame rate of the video standard is not guaranteed. Frames may be directed away from overlay to capture, or one field may be used for overlay and the other for capture if the capture parameters permit this.
Applications should use different file descriptors for capturing and overlay. This must be supported by all drivers capable of simultaneous capturing and overlay. Optionally these drivers may also permit capturing and overlay with a single file descriptor for compatibility with V4L and earlier versions of V4L2.[20]
Devices supporting the video overlay interface set the
V4L2_CAP_VIDEO_OVERLAY flag in the
capabilities field of struct v4l2_capability
returned by the VIDIOC_QUERYCAP ioctl. The overlay I/O method specified
below must be supported. Tuners and audio inputs are optional.
Video overlay devices shall support audio input, tuner, controls, cropping and scaling and streaming parameter ioctls as needed. The video input and video standard ioctls must be supported by all video overlay devices.
Before overlay can commence applications must program the
driver with frame buffer parameters, namely the address and size of
the frame buffer and the image format, for example RGB 5:6:5. The
VIDIOC_G_FBUF and VIDIOC_S_FBUF ioctls are available to get
and set these parameters, respectively. The
VIDIOC_S_FBUF ioctl is privileged because it
allows to set up DMA into physical memory, bypassing the memory
protection mechanisms of the kernel. Only the superuser can change the
frame buffer address and size. Users are not supposed to run TV
applications as root or with SUID bit set. A small helper application
with suitable privileges should query the graphics system and program
the V4L2 driver at the appropriate time.
Some devices add the video overlay to the output signal
of the graphics card. In this case the frame buffer is not modified by
the video device, and the frame buffer address and pixel format are
not needed by the driver. The VIDIOC_S_FBUF ioctl
is not privileged. An application can check for this type of device by
calling the VIDIOC_G_FBUF ioctl.
A driver may support any (or none) of five clipping/blending methods:
Chroma-keying displays the overlaid image only where pixels in the primary graphics surface assume a certain color.
A bitmap can be specified where each bit corresponds to a pixel in the overlaid image. When the bit is set, the corresponding video pixel is displayed, otherwise a pixel of the graphics surface.
A list of clipping rectangles can be specified. In these regions no video is displayed, so the graphics surface can be seen here.
The framebuffer has an alpha channel that can be used to clip or blend the framebuffer with the video.
A global alpha value can be specified to blend the framebuffer contents with video images.
When simultaneous capturing and overlay is supported and
the hardware prohibits different image and frame buffer formats, the
format requested first takes precedence. The attempt to capture
(VIDIOC_S_FMT) or overlay (VIDIOC_S_FBUF) may fail with an
EBUSY error code or return accordingly modified parameters..
The overlaid image is determined by cropping and overlay window parameters. The former select an area of the video picture to capture, the latter how images are overlaid and clipped. Cropping initialization at minimum requires to reset the parameters to defaults. An example is given in the section called “Image Cropping, Insertion and Scaling”.
The overlay window is described by a struct v4l2_window. It
defines the size of the image, its position over the graphics surface
and the clipping to be applied. To get the current parameters
applications set the type field of a
struct v4l2_format to V4L2_BUF_TYPE_VIDEO_OVERLAY and
call the VIDIOC_G_FMT ioctl. The driver fills the
v4l2_window substructure named
win. It is not possible to retrieve a
previously programmed clipping list or bitmap.
To program the overlay window applications set the
type field of a struct v4l2_format to
V4L2_BUF_TYPE_VIDEO_OVERLAY, initialize the
win substructure and call the
VIDIOC_S_FMT ioctl. The driver adjusts the parameters against
hardware limits and returns the actual parameters as
VIDIOC_G_FMT does. Like
VIDIOC_S_FMT, the VIDIOC_TRY_FMT ioctl can be
used to learn about driver capabilities without actually changing
driver state. Unlike VIDIOC_S_FMT this also works
after the overlay has been enabled.
The scaling factor of the overlaid image is implied by the width and height given in struct v4l2_window and the size of the cropping rectangle. For more information see the section called “Image Cropping, Insertion and Scaling”.
When simultaneous capturing and overlay is supported and
the hardware prohibits different image and window sizes, the size
requested first takes precedence. The attempt to capture or overlay as
well (VIDIOC_S_FMT) may fail with an EBUSY error code or return accordingly
modified parameters.
Table 4.1. struct v4l2_window
| struct v4l2_rect | w | Size and position of the window relative to the
top, left corner of the frame buffer defined with VIDIOC_S_FBUF. The
window can extend the frame buffer width and height, the
x and y
coordinates can be negative, and it can lie completely outside the
frame buffer. The driver clips the window accordingly, or if that is
not possible, modifies its size and/or position. |
| enum v4l2_field | field | Applications set this field to determine which
video field shall be overlaid, typically one of
V4L2_FIELD_ANY (0),
V4L2_FIELD_TOP,
V4L2_FIELD_BOTTOM or
V4L2_FIELD_INTERLACED. Drivers may have to choose
a different field order and return the actual setting here. |
| __u32 | chromakey | When chroma-keying has been negotiated with
VIDIOC_S_FBUF applications set this field to the desired pixel value
for the chroma key. The format is the same as the pixel format of the
framebuffer (struct v4l2_framebuffer
fmt.pixelformat field), with bytes in host
order. E. g. for V4L2_PIX_FMT_BGR24
the value should be 0xRRGGBB on a little endian, 0xBBGGRR on a big
endian host. |
| struct v4l2_clip * | clips | When chroma-keying has not
been negotiated and VIDIOC_G_FBUF indicated this capability,
applications can set this field to point to an array of
clipping rectangles. |
Like the window coordinates
w, clipping rectangles are defined relative
to the top, left corner of the frame buffer. However clipping
rectangles must not extend the frame buffer width and height, and they
must not overlap. If possible applications should merge adjacent
rectangles. Whether this must create x-y or y-x bands, or the order of
rectangles, is not defined. When clip lists are not supported the
driver ignores this field. Its contents after calling VIDIOC_S_FMT
are undefined. | ||
| __u32 | clipcount | When the application set the
clips field, this field must contain the
number of clipping rectangles in the list. When clip lists are not
supported the driver ignores this field, its contents after calling
VIDIOC_S_FMT are undefined. When clip lists are
supported but no clipping is desired this field must be set to
zero. |
| void * | bitmap | When chroma-keying has
not been negotiated and VIDIOC_G_FBUF indicated
this capability, applications can set this field to point to a
clipping bit mask. |
It must be of the same size
as the window, ((__u8 *) where When a clipping
bit mask is not supported the driver ignores this field, its contents
after calling Applications need not create a clip list or bit mask. When they pass both, or despite negotiating chroma-keying, the results are undefined. Regardless of the chosen method, the clipping abilities of the hardware may be limited in quantity or quality. The results when these limits are exceeded are undefined.[b] | ||
| __u8 | global_alpha | The global alpha value used to blend the
framebuffer with video images, if global alpha blending has been
negotiated (V4L2_FBUF_FLAG_GLOBAL_ALPHA, see
VIDIOC_S_FBUF, Table A.45, “Frame Buffer Flags”). |
| Note this field was added in Linux 2.6.23, extending the structure. However the VIDIOC_G/S/TRY_FMT ioctls, which take a pointer to a v4l2_format parent structure with padding bytes at the end, are not affected. | ||
[a] Should we require
[b] When the image is written into frame buffer memory it will be undesirable if the driver clips out less pixels than expected, because the application and graphics system are not aware these regions need to be refreshed. The driver should clip out more pixels or not write the image at all. | ||
Table 4.2. struct v4l2_clip[21]
| struct v4l2_rect | c | Coordinates of the clipping rectangle, relative to the top, left corner of the frame buffer. Only window pixels outside all clipping rectangles are displayed. |
| struct v4l2_clip * | next | Pointer to the next clipping rectangle, NULL when this is the last rectangle. Drivers ignore this field, it cannot be used to pass a linked list of clipping rectangles. |
Table 4.3. struct v4l2_rect
| __s32 | left | Horizontal offset of the top, left corner of the rectangle, in pixels. |
| __s32 | top | Vertical offset of the top, left corner of the rectangle, in pixels. Offsets increase to the right and down. |
| __s32 | width | Width of the rectangle, in pixels. |
| __s32 | height | Height of the rectangle, in pixels. Width and height cannot be negative, the fields are signed for hysterical reasons. |
To start or stop the frame buffer overlay applications call
the VIDIOC_OVERLAY ioctl.
Video output devices encode stills or image sequences as analog video signal. With this interface applications can control the encoding process and move images from user space to the driver.
Conventionally V4L2 video output devices are accessed through
character device special files named /dev/video
and /dev/video0 to
/dev/video63 with major number 81 and minor
numbers 0 to 63. /dev/video is typically a
symbolic link to the preferred video device. Note the same device
files are used for video capture devices.
Devices supporting the video output interface set the
V4L2_CAP_VIDEO_OUTPUT flag in the
capabilities field of struct v4l2_capability
returned by the VIDIOC_QUERYCAP ioctl. As secondary device functions
they may also support the raw VBI
output (V4L2_CAP_VBI_OUTPUT) interface. At
least one of the read/write or streaming I/O methods must be
supported. Modulators and audio outputs are optional.
Video output devices shall support audio output, modulator, controls, cropping and scaling and streaming parameter ioctls as needed. The video output and video standard ioctls must be supported by all video output devices.
The output is determined by cropping and image format parameters. The former select an area of the video picture where the image will appear, the latter how images are stored in memory, i. e. in RGB or YUV format, the number of bits per pixel or width and height. Together they also define how images are scaled in the process.
As usual these parameters are not reset
at open() time to permit Unix tool chains, programming a device
and then writing to it as if it was a plain file. Well written V4L2
applications ensure they really get what they want, including cropping
and scaling.
Cropping initialization at minimum requires to reset the parameters to defaults. An example is given in the section called “Image Cropping, Insertion and Scaling”.
To query the current image format applications set the
type field of a struct v4l2_format to
V4L2_BUF_TYPE_VIDEO_OUTPUT and call the
VIDIOC_G_FMT ioctl with a pointer to this structure. Drivers fill
the struct v4l2_pix_format pix member of the
fmt union.
To request different parameters applications set the
type field of a struct v4l2_format as above and
initialize all fields of the struct v4l2_pix_format
vbi member of the
fmt union, or better just modify the
results of VIDIOC_G_FMT, and call the
VIDIOC_S_FMT ioctl with a pointer to this structure. Drivers may
adjust the parameters and finally return the actual parameters as
VIDIOC_G_FMT does.
Like VIDIOC_S_FMT the
VIDIOC_TRY_FMT ioctl can be used to learn about hardware limitations
without disabling I/O or possibly time consuming hardware
preparations.
The contents of struct v4l2_pix_format are discussed in Chapter 2, Image Formats. See also the specification of the
VIDIOC_G_FMT, VIDIOC_S_FMT
and VIDIOC_TRY_FMT ioctls for details. Video
output devices must implement both the
VIDIOC_G_FMT and
VIDIOC_S_FMT ioctl, even if
VIDIOC_S_FMT ignores all requests and always
returns default parameters as VIDIOC_G_FMT does.
VIDIOC_TRY_FMT is optional.
A video output device may support the write() function and/or streaming (memory mapping or user pointer) I/O. See Chapter 3, Input/Output for details.
This is an experimental interface and may change in the future.
Some video output devices can overlay a framebuffer image onto t