AV1(Redirected from AOMedia Video 1)
AOMedia Video 1 (AV1) is an open, royalty-free video coding format designed for video transmissions over the Internet. It is being developed by the Alliance for Open Media (AOMedia), a consortium of firms from the semiconductor industry, video on demand providers, and web browser developers, founded in 2015. The AV1 bitstream specification includes a reference video codec. It succeeds VP9. It can have 20% higher data compression than VP9 or HEVC/H.265 from the Moving Picture Experts Group and about 50% higher than the widely used AVC.
|Developed by||Alliance for Open Media|
|Initial release||28 March 2018|
|Type of format||Compressed video|
AV1 is the primary contender for standardization by the video standard working group NetVC of the Internet Engineering Task Force (IETF). The group has put together a list of criteria to be met by the new video standard.
The first official announcement of the project came with the press release on the formation of the Alliance on 1 September 2015. The increased usage of its predecessor VP9 is attributed to confidence in the Alliance and development of AV1 as well as the pricey and complicated licensing situation of HEVC (High Efficiency Video Coding).
The roots of the project precede the Alliance, however. Individual contributors started experimental technology platforms years before: Xiph's/Mozilla's Daala already published code in 2010, VP10 was announced on 12 September 2014, and Cisco's Thor was published on 11 August 2015. The first version 0.1.0 of the AV1 reference codec was published on 7 April 2016.
Soft feature freeze was at the end of October 2017, but a few significant features were decided to continue developing beyond this. The bitstream format was projected to be frozen in January 2018; however, this was delayed due to unresolved critical bugs as well as last changes to transformations, syntax, the prediction of motion vectors, and the completion of legal analysis. The Alliance announced the release of the AV1 bitstream specification on 28 March 2018, along with a reference, software-based encoder and decoder. On 25 June 2018, a validated version 1.0.0 of the specification was released. On 8 January 2019 a validated version 1.0.0 with Errata 1 of the specification was released.
Martin Smole from AOM member Bitmovin says that the computational efficiency of the reference encoder is the greatest remaining challenge after the bitstream format freeze. While still working on the format, the encoder was not targeted for productive use and didn't receive any speed optimizations. Therefore, it works orders of magnitude slower than e.g. existing HEVC encoders, and development is planned to shift its focus towards maturing the reference encoder after the freeze.
To fulfill the goal of being royalty free, the development process is such that no feature is adopted before it has been independently double checked that it does not infringe on patents of competing companies. This contrasts with its main competitor HEVC, for which a review of the intellectual property rights (IPR review) was not part of the standardization process. The latter reviewing practice is stipulated in the ITU-T's definition of an open standard.
The possible existence of yet unknown patents has been a recurring concern in the field of royalty-free multimedia formats; the concern has been raised regarding AV1, and previously VP9, Theora and IVC. The problem of unforeseen patents is not unique to royalty-free formats, but it uniquely threatens their status as royalty-free. In contrast, IPR avoidance has not traditionally been a priority in MPEG's business model for royalty-bearing formats (although the MPEG chairman argues it has to change).
|Patent licensing||AV1, VP9, Theora, etc||HEVC, AVC, etc||GIF, MP3, MPEG-2, etc|
|By known patent holders||Royalty-free||Royalty bearing||Expired|
|By unknown patent holders||Impossible to know until expiry|
Under patent rules adopted from the World Wide Web Consortium (W3C), technology contributors license their AV1-connected patents to anyone, anywhere, anytime based on reciprocity, i.e. as long as the user does not engage in patent litigation. As a defensive condition, anyone engaging in patent litigation loses the right to the patents of all patent holders.
The pursuit of royalty-free web standards has historical precedence and can have many reasons. In 2007, the proposal for HTML5 video specified Theora as mandatory to implement. The reason given was that public content should be encoded in freely implementable formats, if only as a “baseline format”, and that changing such a baseline format later would be hard because of network effects. The Alliance for Open Media is a continuation of Google's efforts with the WebM project, which renewed the royalty-free competition after Theora had been surpassed by AVC. AVC is hard for (among others) Mozilla to support, the problem being that a per-copy royalty easily is unsustainable for software that is distributed free of charge (see FRAND § Excluding costless distribution). HEVC is likewise – an exception for freely distributed software has not been made by all licensors (see HEVC § Provision for costless software).
The performance goals include "a step up from VP9 and HEVC" in efficiency for a low increase in complexity. NETVC's efficiency goal is 25% improvement over HEVC. The primary complexity concern is for software decoding, since hardware support will take time to reach users. However, for WebRTC, live encoding performance is also relevant, which is Cisco's agenda: Cisco is a manufacturer of videoconferencing equipment, and their Thor contributions aim at "reasonable compression at only moderate complexity".
Feature wise, it is specifically designed for real-time applications (especially WebRTC) and higher resolutions (wider color gamuts, higher frame rates, UHD) than typical usage scenarios of the current generation (H.264) of video formats where it is expected to achieve its biggest efficiency gains. It is therefore planned to support the color space from ITU-R Recommendation BT.2020 and up to 12 bits of precision per color component. AV1 is primarily intended for lossy encoding, although lossless compression is supported as well.
AV1 is a traditional block-based frequency transform format featuring new techniques, several of which were developed in experimental formats that have been testing technology for a next-generation format after HEVC and VP9. Based on Google's experimental VP9 evolution project VP10, AV1 incorporates additional techniques developed in Xiph's/Mozilla's Daala and Cisco's Thor.
|Developer(s)||Alliance for Open Media|
|Written in||C, assembly|
The Alliance published a reference implementation written in C and assembly language (
aomdec) as free software under the terms of the BSD 2-Clause License. Development happens in public and is open for contributions, regardless of AOM membership.
The development process is such that coding tools are added to the reference codebase as experiments, controlled by flags that enable or disable them at build time, for review by other group members as well as specialized teams that help with and ensure hardware friendliness and compliance with intellectual property rights (TAPAS). Once the feature gains some support in the community, the experiment can be enabled by default, and ultimately have its flag removed when all of the reviews are passed. Experiment names are lowercased in the configure script and uppercased in conditional compilation flags.
Frame content is separated into adjacent same-sized blocks referred to as superblocks. Similar to the concept of a macroblock, superblocks are square-shaped and can either be of size 128×128 or 64×64 pixels. Superblocks can be divided in smaller blocks according to different partitioning patterns. The four-way split pattern is the only pattern whose partitions can be recursively subdivided. This allows superblocks to be divided into partitions as small as 4×4 pixels.
"T-shaped" partitioning patterns are introduced, a feature developed for VP10, as well as horizontal or vertical splits into four stripes of 4:1 and 1:4 aspect ratio. The available partitioning patterns vary according to the block size, both 128×128 and 8×8 blocks can't use 4:1 and 1:4 splits. Moreover, 8×8 blocks can't use "T" shaped splits.
Two separate predictions can now be used on spatially different parts of a block using a smooth, oblique transition line (wedge-partitioned prediction). This enables more accurate separation of objects without the traditional staircase lines along the boundaries of square blocks.
AV1 performs internal processing in higher precision (10 or 12 bits per sample), which leads to compression improvement due to smaller rounding errors in reference imagery.
Predictions can be combined in more advanced ways (than a uniform average) in a block (compound prediction), including smooth and sharp transition gradients in different directions (wedge-partitioned prediction) as well as implicit masks that are based on the difference between the two predictors. This allows combination of either two inter predictions or an inter and an intra prediction to be used in the same block.
The Warped Motion (
warped_motion) and Global Motion (
global_motion) tools in AV1 aim to reduce redundant information in motion vectors by recognizing patterns arising from camera motion. They implement ideas that were tried to be exploited in preceding formats like e.g. MPEG-4 ASP, albeit with a novel approach that works in three dimensions. There can be a set of warping parameters for a whole frame offered in the bitstream, or blocks can use a set of implicit local parameters that get computed based on surrounding blocks.
Switch frames (S-frame) are a new inter-frame type that can be predicted using already decoded reference frames from a higher-resolution version of the same video to allow switching to a lower resolution without the need for a full keyframe at the beginning of a video segment in the adaptive bitrate streaming use case.
Intra prediction consists of predicting the pixels of a given blocks only using information available in the current frame. Most often, intra predictions are built from the neighboring pixels above and to the left of the predicted block. The DC predictor builds a prediction by averaging the pixels above and to the left of block.
Directional predictors extrapolate these neighboring pixels according to a specified angle. In AV1, 8 main directional modes can be chosen. These modes start at an angle of 45 degrees and increase by a step size of 22.5 degrees up until 203 degrees. Furthermore, for each directional mode, six offsets of 3 degree can be signalled, three above the main angle and three below it, resulting in a total of 56 angles (
The "TrueMotion" predictor got replaced with a Paeth predictor which looks at the difference from the known pixel in the above left corner to the pixel directly above and directly left of the new one and then chooses the one that lies in direction of the smaller gradient as predictor. A palette predictor is available for blocks with very few colors like in some computer screen content. Correlations between the luminosity and the color information can now be exploited with a predictor for chroma blocks that is based on samples from the luma plane (
cfl). In order to reduce discontinuities along borders of inter-predicted blocks, predictors can be overlapped and blended with those of neighbouring blocks (overlapped block motion compensation).
To transform the error remaining after prediction to the frequency domain, AV1 uses square, 2:1/1:2, and 4:1/1:4 rectangular DCTs (
rect_tx), as well as an asymmetric DST for blocks where the top and/or left edge is expected to have lower error thanks to prediction from nearby pixels.
AV1 has new optimized quantization matrices (
For the in-loop filtering step, the integration of Thor's constrained low-pass filter and Daala's directional deringing filter has been fruitful: The combined Constrained Directional Enhancement Filter (
cdef) exceeds the results of using the original filters separately or together.
It is an edge-directed conditional replacement filter that smoothes blocks with configurable (signaled) strength roughly along the direction of the dominant edge to eliminate ringing artifacts.
There is also the loop restoration filter (
loop_restoration) to remove blur artifacts due to block processing.
Film grain synthesis (
film_grain) improves coding of noisy signals using a parametric video coding approach.
Due to the randomness inherent to film grain noise, this signal component is traditionally either very expensive to code or prone to get damaged or lost, possibly leaving serious coding artefacts as residue. This tool circumvents these problems using analysis and synthesis, replacing parts of the signal with a visually similar synthetic texture, based solely on subjective visual impression instead of objective similarity. It removes the grain component from the signal, analyzes its non-random characteristics, and instead transmits only descriptive parameters to the decoder, which adds back a synthetic, pseudorandom noise signal that's shaped after the original component.
Daala's entropy coder (
daala_ec), a non-binary arithmetic coder, was selected for replacing VP9's binary entropy coder. The use of non-binary arithmetic coding helps evade patents, but also adds bit-level parallelism to an otherwise serial process, reducing clock rate demands on hardware implementations. This is to say that the effectiveness of modern binary arithmetic coding like CABAC is being approached using a greater alphabet than binary, hence greater speed, as in Huffman code (but not as simple and fast as Huffman code).
AV1 also gained the ability to adapt the symbol probabilities in the arithmetic coder per coded symbol instead of per frame (
Former experiments that have been fully integratedEdit
This list is no longer complete. It is being rewritten in prose.
|Historic build-time flag||Explanation|
||A new prediction mode suitable for smooth regions|
||Delta quantization step: Arbitrary adaptation of quantizers within a frame|
||Ability to choose between 4 horizontal and vertical interpolation filters for subpixel motion compensation (three 8-tap and one 12-tap)|
||Extended inter: Weighted compound prediction with variable weights per block|
||7-bit interpolation filters|
||Interpolate the reference samples before prediction to reduce the impact of quantization noise|
||Renamed from obmc. Overlapped Block Motion Compensation: Reduce discontinuities at block edges using different motion vectors|
||Code extra_bits using up to 5 non-adaptive symbols, starting from the LSB|
||Better methods for coding the motion vector predictors through implicit list of spatial and temporal neighbor MVs|
||Independent group of tiles. Inside this group, row of tiles could be independent or not|
||Recursive transform block partition and coding scheme|
Features that were droppedEdit
Daala Transforms were the major innovation behind the daala codec. They implement "lapped" discrete cosine and sine transforms that its authors describe as "better in every way" than the
txmg set of transforms that prevailed in AV1. Both the
daala_tx experiments have merged high and low bitdepth code paths (unlike VP9), but
daala_tx achieved full embedding of smaller transforms within larger, as well as using fewer multiplies, which could have further reduced the cost of hardware implementations. The Daala transforms were kept as optional in the experimental codebase until late January 2018, but changing hardware blocks at a late stage was a general concern for delaying hardware availability.
The encoding complexity of Daala's Perceptual Vector Quantization (PVQ) was too much within the already complex framework of AV1. The Rate Distortion
dist_8x8 heuristic aims to speed up the encoder by a sizable factor, PVQ or not, but PVQ was ultimately dropped.
Quality and efficiencyEdit
In April 2017, using the 8 enabled experimental features at the time (of 77 total), Bitmovin was able to demonstrate favorable objective metrics, as well as visual results, compared to HEVC on the Sintel and Tears of Steel animated films. A follow-up comparison by Jan Ozer of Streaming Media Magazine confirmed this, and concluded that "AV1 is at least as good as HEVC now".
Ozer noted that his and Bitmovin's results contradicted a comparison by Fraunhofer Institute for Telecommunications from late 2016 that had found AV1 38.4% less efficient than HEVC, underperforming even H.264/AVC, and justified this discrepancy by having used encoding parameters endorsed by each encoder vendor, as well as having more features in the newer AV1 encoder.
Tests from Netflix showed that, based on measurements with PSNR and VMAF at 720p, AV1 could be about 25% more efficient than VP9 (libvpx), at the expense of a 4–10 fold increase in encoding complexity. Similar conclusions with respect to quality were drawn from a test conducted by Moscow State University researchers, where VP9 was found to require 31% and HEVC 22% more bitrate than AV1 for the same level of quality. The researchers found that the used AV1 encoder was operating at a speed “2500–3500 times lower than competitors”, while admitting that it has not been optimized yet.
In a comparison of AV1 against H.264 (x264) and VP9 (libvpx), Facebook showed about 45–50% bitrate savings over H.264 and about 40% over VP9 when using a constant quality encoding mode.
AOMedia provides a list of test results on their website.
Profiles and levelsEdit
AV1 defines three profiles for decoders which are Main, High, and Professional. The Main profile allows for a bit depth of 8- or 10-bits per sample with 4:0:0 (greyscale) and 4:2:0 chroma sampling. The High profile further adds support for 4:4:4 chroma sampling. The Professional profile extends capabilities to full support for 4:0:0, 4:2:0, 4:2:2 and 4:4:4 chroma sub-sampling with 8, 10 and 12 bit color depths.
|Main (0)||High (1)||Professional (2)|
|8 or 10-bit||8 or 10-bit||8, 10 & 12 bit|
AV1 defines levels for decoders with maximum variables for levels ranging from 2.0 to 7.3. Example resolutions would be 426×240@30 fps for level 2.0, 854×480@30 fps for level 3.0, 1920×1080@30 fps for level 4.0, 3840×2160@60 fps for level 5.1, 3840×2160@120 fps for level 5.2 and 5.3,[further explanation needed] and 7680×4320@120 fps for level 6.2. Level 7 has not been defined yet.
|Min Comp Basis||Max Tiles||Max Tile Cols||Example|
Supported container formatsEdit
- ISO Base Media File Format: The ISOBMFF containerization spec by AOMedia was the first to be finalized and the first to gain adoption. This is the format used by YouTube.
- Standards with unclear promise of finalization
- Matroska: Version 1 of the Matroska containerization spec was published in September 2018. However, arguably breaking changes continued into October, and a new version, or finalization, is yet to be announced as of December.
- Not standardized
- WebM: As a matter of formality, AV1 has not been sanctioned into the subset of Matroska known as WebM as of late 2018.
- On2 IVF: This format was inherited from the first public release of VP8, where it served as a simple development container. Rav1e also supports this format.
- Pre-standard WebM: Libaom featured early support for WebM, before Matroska containerization was specified, but has been changed to conform.
YouTube has begun rolling out AV1, starting with its AV1 Beta Launch Playlist. According to the description, the videos are (to begin with) encoded at high bitrate to test decoding performance, and YouTube has “ambitious goals” for rolling out AV1.
Netflix “expects to be an early adopter of AV1”.
Following very positive own test results, Facebook said to gradually roll out AV1 as soon as browser support emerges, starting with their most popular videos.
- Libaom is the reference implementation. It includes an encoder (aomenc) and a decoder (aomdec). As the former research codec, it has the advantage of being made to justifiably demonstrate efficient use of every feature, but at the general cost of encoding speed. At feature freeze, the encoder had become problematically slow, but speed optimizations with negligible efficiency impact have continued to be made also after that.
- Rav1e is an encoder written in Rust and assembly. Rav1e takes the opposite developmental approach to Aomenc: start out as the simplest (therefore fastest) conforming encoder, and then improve efficiency over time while remaining fast.
- The SVT-AV1 Encoder is an open-source encoder released by Intel that is designed especially for usage on data center servers based on Intel Xeon processors.
- Dav1d is a decoder written in C99 and assembly focused on speed (via parallelism) and portability. The first official version (0.1 ) was released in December 2018. Firefox is planning to switch from Libaom to Dav1d as a default decoder.
- Firefox (since version 65.0, January 2019; enabled by default on Windows, but the
media.av1.enabledconfiguration flag must be set manually on other systems)
- Google Chrome (since version 70, October 2018)
- Opera (since version 57)
- VLC media player (since version 3.0)
- GStreamer (since version 1.14)
- FFmpeg (since version 4.0)
- mpv (since version 0.29.0)
- MKVToolNix (adoption of final av1-in-mkv spec since version 28)
- MediaInfo (since version 18.03)
- Bitmovin Encoding (since version 1.50.0, July 2018)
Operating system supportEdit
|Microsoft Windows||macOS||Linux||Android OS||iOS|
|Codec support||Beta||TBA||Beta||Coming soon||TBA|
|Container support||ISO base media file format (.mp4)
|TBA||ISO base media file format (.mp4)
|Notes||- Support introduced in Windows 10 October 2018 Update (1809) with AV1 Video Extension beta add-on.||Unsupported as of macOS Mojave.||Currently, no GPU supports hardware acceleration.||Support to be introduced in Android Q (Q3 2019).||Unsupported as of iOS 12.|
Several Alliance members demonstrated AV1 enabled products at IBC 2018, including Socionext's hardware accelerated encoder. According to Socionext, the encoding accelerator is FPGA based and can run on an Amazon EC2 F1 cloud instance, where it runs 10 times faster than existing software encoders.
According to Mukund Srinivasan, chief business officer of AOM member Ittiam, early hardware support will be dominated by software running on non-CPU hardware (such as GPGPU, DSP or shader programs, as is the case with some VP9 hardware implementations), as fixed-function hardware will take 12–18 months after bitstream freeze until chips are available, plus 6 months for products based on those chips to hit the market. The bitstream was finally frozen on 28 March 2018, meaning chips could be available sometime between March and August 2019. According to the above forecast, products based on chips could then be on the market at the end of 2019 or the beginning of 2020.
AV1 Image File Format (AVIF)Edit
The AV1 Image File Format (AVIF) is a specification for storing images or image sequences compressed with AV1 in the HEIF file format. AVIF files conform to the HEIC specification and support features like High Dynamic Range (HDR) and wide color gamut (WCG).
On December 14, 2018 Netflix published the first .avif sample images and support was added in VLC. Microsoft also announced support with the Windows 10 "19h1" preview release, including File Explorer, Paint and multiple API's, together with some sample images. Mozilla is also working on support for the new image format in Firefox.
- Zimmerman, Steven (15 May 2017). "Google's Royalty-Free Answer to HEVC: A Look at AV1 and the Future of Video Codecs". XDA Developers. Archived from the original on 14 June 2017. Retrieved 10 June 2017.
- Kevin Staunton-Lambert; Werner Robitza; Gyan Doshi; James Almer; Ewout ter Hoeven (2018-12-21). "libaom AV1 Encoding Guide". FFmpeg Wiki. Retrieved 2019-02-01.
- How to use AV1 with open source tools, Videolan's Jean-Baptiste Kempf on using AV1 with open source tools, 2018-12-01.
- Rick Merritt (EE Times), 30 June 2016: Video Compression Feels a Pinch
- Sebastian Grüner (19 July 2016). "Der nächste Videocodec soll 25 Prozent besser sein als H.265" (in German). golem.de. Retrieved 1 March 2017.
- Tsahi Levent-Levi (2 April 2018). "AV1 Specification Released: Can we kiss goodbye to HEVC and royalty bearing video codecs?". BlogGeek.me. Retrieved 19 December 2018.
AV1 for video coding is what Opus is for audio coding.
- "Alliance for Open Media established to deliver next-generation open media formats" (Press release). Alliance for Open Media. 1 September 2015. Retrieved 5 September 2015.[self-published source]
- Timothy B. Terriberry (18 January 2017). "Progress in the Alliance for Open Media" (video). linux.conf.au. Retrieved 1 March 2017.[self-published source]
- Timothy B. Terriberry (18 January 2017). "Progress in the Alliance for Open Media (slides)" (PDF). Retrieved 22 June 2017.[self-published source]
- Stephen Shankland (12 September 2014). "Google's Web-video ambitions bump into hard reality". CNET. Retrieved 13 September 2014.
- Krishnan, Jai (22 November 2017). "Jai Krishnan from Google and AOMedia giving us an update on AV1". YouTube. Retrieved 22 December 2017.[self-published source]
- Terriberry, Timothy B. (3 February 2018). "AV1 Codec Update". FOSDEM. Retrieved 8 February 2018.[self-published source]
- Alliance for Open Media (28 March 2018). "The Alliance for Open Media Kickstarts Video Innovation Era with "AV1" Release" (Press release). Wakefield, Mass.[self-published source]
- Shilov, Anton (2018-03-30). "Alliance for Open Media Releases Royalty-Free AV1 1.0 Codec Spec". AnandTech. Retrieved 2 April 2018.
- Larabel, Michael (25 June 2018). "AOMedia AV1 Codec v1.0.0 Appears Ready For Release". Phoronix. Retrieved 27 June 2018.
- "AV1 Bitstream and Decoding Process Specification". Alliance for Open Media. Retrieved 5 February 2018.
This version 1.0.0 with Errata 1 of the AV1 Bitstream Specification corresponds to the Git tag v1.0.0-errata1 in the AOMediaCodec/av1-spec project. Its content has been validated as consistent with the reference decoderprovided by libaom v1.0.0-errata1.[self-published source]
- Hunter, Philip (15 February 2018). "Race on to bring AV1 open source codec to market, as code freezes". Videonet. Mediatel Limited-GB. Retrieved 19 March 2018.
- Daede, Thomas (5 October 2017). "AV1 Update". YouTube. Retrieved 21 December 2017.[self-published source]
- Frost, Matt (31 July 2017). "VP9-AV1 Video Compression Update". Retrieved 21 November 2017.
The mission of the Alliance for Open Media remains the same as the mission of the WebM project that we launched back in 2010. (…) Obviously, if we have an open source codec, we need to take very strong steps, and be very diligent in making sure that we are in fact producing something that's royalty free. So we have an extensive IP diligence process which involves diligence on both the contributor level – so when Google proposes a tool, we are doing our in-house IP diligence, using our in-house patent assets and outside advisors – that is then forwarded to the group, and is then again reviewed by an outside counsel that is engaged by the alliance. So that's a step that actually slows down innovation, but is obviously necessary to produce something that is open source and royalty free.
- Jan Ozer (28 March 2018). "AV1 Is Finally Here, but Intellectual Property Questions Remain". Streaming Media Magazine. Retrieved 21 April 2018.
- Jan Ozer (June 2016). "VP9 Finally Comes of Age, But Is it Right for Everyone?". Retrieved 21 April 2018.
- Silvia Pfeiffer (December 2009). "Patents and their effect on Standards: Open video codecs for HTML5". Retrieved 21 April 2018.
- Leonardo Chiariglione (28 January 2018). "A crisis, the causes and a solution". Retrieved 21 April 2018.
two tracks in MPEG: one track producing royalty free standards (Option 1, in ISO language) and the other the traditional Fair Reasonable and Non Discriminatory (FRAND) standards (Option 2, in ISO language). (…) The Internet Video Coding (IVC) standard was a successful implementation of the idea (…). Unfortunately 3 companies made blank Option 2 statements (of the kind “I may have patents and I am willing to license them at FRAND terms”), a possibility that ISO allows. MPEG had no means to remove the claimed infringing technologies, if any, and IVC is practically dead.
- Leonardo Chiariglione (28 January 2018). "A crisis, the causes and a solution". Retrieved 21 April 2018.
How could MPEG achieve this? Thanks to its “business model” that can simply be described as: produce standards having the best performance as a goal, irrespective of the IPR involved.
- Neil McAllister, 1 September 2015: Web giants gang up to take on MPEG LA, HEVC Advance with royalty-free streaming codec – Joining forces for cheap, fast 4K video
- Wium Lie, Håkon (2007-03-29). "Proposal for the HTML 5 video element (Google TechTalks)". Google Video, later YouTube. Retrieved 2019-01-03.
Flash is today the baseline format on the web. The problem with Flash is that it's not an open standard. It's a proprietary format, it hasn't been documented, and it probably requires the payment of licenses if you are going to (…) write software for it (…) The web community has always been based on open standards. This has been what the web was founded on, where HTML started. That's why we developed the PNG image format – we wanted a freely implementable open standard to hold the content we are putting out there. Our content is too valuable to put into some locked format. This goes back all the way to SGML, in which the mantra was “own your data”. (…) If we look at open standards for video today (…), there is one which I believe is the right one, and that's called Ogg Theora.
- "An Invisible Tax on the Web: Video Codecs". 2018-07-11. Retrieved 4 January 2019.
Mozilla uses Cisco’s OpenH264 in Firefox. If not for Cisco’s generosity, Mozilla would be paying estimated licensing fees of $9.75 million a year.
- Steinar Midtskogen, Arild Fuldseth, Gisle Bjøntegaard, Thomas Davies (13 September 2017). "Integrating Thor tools into the emerging AV1 codec" (PDF). Retrieved 2 October 2017.
What can Thor add to VP9/AV1? Since Thor aims for reasonable compression at only moderate complexity, we considered features of Thor that could increase the compression efficiency of VP9 and/or reduce the computational complexity.CS1 maint: Uses authors parameter (link)
- Ozer, Jan (3 June 2016). "What is AV1?". Streaming Media Magazine. Information Today, Inc. Archived from the original on 26 November 2016. Retrieved 26 November 2016.
... Once available, YouTube expects to transition to AV1 as quickly as possible, particularly for video configurations such as UHD, HDR, and high frame rate videos ... Based upon its experience with implementing VP9, YouTube estimates that they could start shipping AV1 streams within six months after the bitstream is finalized. ...CS1 maint: BOT: original-url status unknown (link)
- "examples/lossless_encoder.c". Git at Google. Alliance for Open Media. Retrieved 29 October 2017.[self-published source]
- Shankland, Stephen (19 January 2018). "Photo format from Google and Mozilla could leave JPEG in the dust". CNET. CBS Interactive. Retrieved 28 January 2018.
- Romain Bouqueau (12 June 2016). "A view on VP9 and AV1 part 1: specifications". GPAC Project on Advanced Content. Retrieved 1 March 2017.
- Jan Ozer, 26 May 2016: What Is VP9?
- "LICENSE - aom - Git at Google". Aomedia.googlesource.com. Retrieved 26 September 2018.
- Ozer, Jan (30 August 2017). "AV1: A status update". Streaming Media Magazine. Retrieved 14 September 2017.
- Cho, Yushin (30 August 2017). "Delete daala_dist and cdef-dist experiments in configure". Retrieved 2 October 2017.
Since those two experiments have been merged into the dist-8x8 experiment[self-published source]
- Converse, Alex (16 November 2015). "New video coding techniques under consideration for VP10 – the successor to VP9". YouTube. Retrieved 3 December 2016.[self-published source]
- "Decoding the Buzz over AV1 Codec". 9 June 2017. Retrieved 22 June 2017.[self-published source]
- "Analysis of the emerging AOMedia AV1 video coding format for OTT use-cases" (PDF). Archived from the original (PDF) on 20 September 2017. Retrieved 19 September 2017.
- Mukherjee, Debargha; Su, Hui; Bankoski, Jim; Converse, Alex; Han, Jingning; Liu, Zoe; Xu (Google Inc.), Yaowu, "An overview of new video coding tools under consideration for VP10 – the successor to VP9", SPIE Optical Engineering+ Applications, International Society for Optics and Photonics, 9599, doi:10.1117/12.2191104
- Ian Trow (16 September 2018). Tech Talks: Codec wars (Recorded talk). IBC 2018 Conference. 28 minutes in. Retrieved 18 September 2018.
- Alaiwan, Sebastien (16 October 2017). "Remove compile guards for CONFIG_EXT_REFS". Retrieved 29 October 2017.
This experiment has been adopted[self-published source]
- Alaiwan, Sebastien (31 October 2017). "Remove experimental flag of WARPED_MOTION". Retrieved 23 November 2017.[self-published source]
- Alaiwan, Sebastien (30 October 2017). "Remove experimental flag of GLOBAL_MOTION". Retrieved 23 November 2017.[self-published source]
- Jan Ozer (2017-10-11). "Demuxed: A Video Engineer's Nirvana". Streaming Media Magazine. Retrieved 2019-02-10.
- Joshi, Urvang; Mukherjee, Debargha; Han, Jingning; Chen, Yue; Parker, Sarah; Su, Hui; Chiang, Angie; Xu, Yaowu; Liu, Zoe (19 September 2017). "Novel inter and intra prediction tools under consideration for the emerging AV1 video codec". Applications of Digital Image Processing XL, proceedings of SPIE Optical Engineering + Applications 2017. International Society for Optics and Photonics. 10396: 103960F. doi:10.1117/12.2274022.
- Mukherjee, Debargha (29 October 2017). "Remove experimental flag for rect-tx". Retrieved 23 November 2017.[self-published source]
- Mukherjee, Debargha (1 July 2016). "Rectangular transforms 4x8 & 8x4". Retrieved 14 September 2017.[self-published source]
- Han, Jingning; Saxena, Ankur; Melkote, Vinay; Rose, Kenneth (2011-09-29). "Jointly Optimized Spatial Prediction and Block Transform for Video and Image Coding" (PDF). IEEE Transactions on Image Processing. Retrieved 2019-02-12.
- "Mozilla shares how AV1, the new open source royalty-free video codec, works". 2018-11-12. Retrieved 21 December 2018.
- "Into the Depths:The Technical Details Behind AV1" (PDF). 2018-07-31. Retrieved 21 December 2018.
- Alaiwan, Sebastien (2 November 2017). "Remove experimental flag of EXT_TX". Retrieved 23 November 2017.[self-published source]
- Davies, Thomas (9 August 2017). "AOM_QM: enable by default". Retrieved 19 September 2017.[self-published source]
- Barbier, Frederic (10 November 2017). "Remove experimental flag of CDEF". Retrieved 23 October 2017.[self-published source]
- "Constrained Directional Enhancement Filter". 28 March 2017. Retrieved 15 September 2017.[self-published source]
- "Thor update". July 2017. Retrieved 2 October 2017.[self-published source]
- Egge, Nathan (25 May 2017). "This patch forces DAALA_EC on by default and removes the dkbool coder". Retrieved 14 September 2017.[self-published source]
- Egge, Nathan (14 February 2017). "Daala Entropy Coder in AV1" (PDF).[self-published source]
- Egge, Nathan (18 June 2017). "Remove the EC_ADAPT experimental flags". Retrieved 23 September 2017.[self-published source]
- Joshi, Urvang (1 June 2017). "Remove ALT_INTRA flag". Retrieved 19 September 2017.[self-published source]
- Mukherjee, Debargha (21 October 2017). "Remove CONFIG_CB4X4 config options". Retrieved 29 October 2017.[self-published source]
- Su, Hui (23 October 2017). "Remove experimental flag of chroma_sub8x8". Retrieved 29 October 2017.[self-published source]
- Mukherjee, Debargha (29 October 2017). "Remove compound_segment/wedge config flags". Retrieved 23 November 2017.[self-published source]
- Wang, Yunqing (12 December 2017). "Remove convolve_round/compound_round config flags". Retrieved 17 December 2017.[self-published source]
- Davies, Thomas (19 September 2017). "Remove delta_q experimental flag". Retrieved 2 October 2017.[self-published source]
- Alaiwan, Sebastien (2 October 2017). "Remove compile guards for CONFIG_EXT_INTER". Retrieved 29 October 2017.
This experiment has been adopted[self-published source]
- Davies, Thomas (19 September 2017). "Remove filter_7bit experimental flag". Retrieved 29 October 2017.[self-published source]
- Fuldseth, Arild (26 August 2017). "7-bit interpolation filters". Retrieved 29 October 2017.
Purpose: Reduce dynamic range of interpolation filter coefficients from 8 bits to 7 bits. Inner product for 8-bit input data can be stored in a 16-bit signed integer.[self-published source]
- Alaiwan, Sebastien (31 October 2017). "Remove experimental flag of MOTION_VAR". Retrieved 23 November 2017.[self-published source]
- Chen, Yue (13 October 2017). "Renamings for OBMC experiment". Retrieved 19 September 2017.[self-published source]
- Barbier, Frederic (15 November 2017). "Remove experimental flag of NEW_MULTISYMBOL". Retrieved 23 October 2017.[self-published source]
- "NEW_MULTISYMBOL: Code extra_bits using multi-symbols". Git at Google. Alliance for Open Media. Retrieved 25 May 2018.
- Liu, Zoe (7 November 2017). "Remove ONE_SIDED_COMPOUND experimental flag". Retrieved 23 November 2017.[self-published source]
- Barbier, Frederic (15 December 2017). "Remove experimental flag of PALETTE_DELTA_ENCODING". Retrieved 17 December 2017.[self-published source]
- Yoshi, Urvang (26 September 2017). "Remove rect_intra_pred experimental flag". Retrieved 2 October 2017.[self-published source]
- Alaiwan, Sebastien (27 April 2017). "Merge ref-mv into codebase". Retrieved 23 September 2017.[self-published source]
- Joshi, Urvang (9 November 2017). "Remove smooth_hv experiment flag". Retrieved 23 November 2017.[self-published source]
- Davies, Thomas (18 July 2017). "Remove the CONFIG_TILE_GROUPS experimental flag". Retrieved 19 September 2017.[self-published source]
- Alaiwan, Sebastien (24 October 2017). "Remove compile guards for VAR_TX experiment". Retrieved 29 October 2017.
This experiment has been adopted[self-published source]
- "Add support to recursive transform block coding". Git at Google. Alliance for Open Media. Retrieved 25 May 2018.
- "next generation video: Introducing Daala". people.xiph.org. 2013-05-30. Retrieved 2018-10-30.
- Chiang, Angie (31 July 2017). "Add txmg experiment". Retrieved 3 January 2018.
This experiment aims at merging lbd/hbd txfms[self-published source]
- "Daala-TX" (PDF). 22 August 2017. Retrieved 26 September 2017.
Replaces the existing AV1 TX with the lifting implementation from Daala. Daala TX is better in every way: ● Fewer multiplies ● Same shifts, quantizers for all transform sizes and depths ● Smaller intermediaries ● Low-bitdepth transforms wide enough for high-bitdepth ● Less hardware area ● Inherently lossless[self-published source]
- Egge, Nathan (27 October 2017). "Daala Transforms in AV1".[self-published source]
- Egge, Nathan (1 December 2017). "Daala Transforms Update".[self-published source]
- Egge, Nathan (15 December 2017). "Daala Transforms Evaluation".[self-published source]
- Egge, Nathan (21 December 2017). "Daala Transforms Informational Discussion".[self-published source]
- "The Future of Video Codecs: VP9, HEVC, AV1". 2 November 2017. Retrieved 30 January 2018.
- Sebastian Grüner (9 June 2016). "Freie Videocodecs teilweise besser als H.265" (in German). golem.de. Retrieved 1 March 2017.
- "Results of Elecard's latest benchmarks of AV1 compared to HEVC". 24 April 2017. Retrieved 14 June 2017.
The most intriguing result obtained after analysis of the data lies in the fact that the developed codec AV1 is currently equal in its performance with HEVC. The given streams are encoded with AV1 update of 2017.01.31
- "Bitmovin Supports AV1 Encoding for VoD and Live and Joins the Alliance for Open Media". 18 April 2017. Retrieved 20 May 2017.[self-published source]
- Ozer, Jan. "HEVC: Rating the contenders" (PDF). Streaming Learning Center. Retrieved 22 May 2017.
- D. Grois, T, Nguyen, and D. Marpe, "Coding efficiency comparison of AV1/VP9, H.265/MPEG-HEVC, and H.264/MPEG-AVC encoders", IEEE Picture Coding Symposium (PCS) 2016 http://iphome.hhi.de/marpe/download/Preprint-Performance-Comparison-AV1-HEVC-AVC-PCS2016.pdf
- "Netflix on AV1". Streaming Learning Center. 30 November 2017. Retrieved 8 December 2017.
- "MSU Codec Comparison 2017" (PDF). 17 January 2018. Retrieved 9 February 2018.
- Ozer, Jan (30 January 2018). "AV1 Beats VP9 and HEVC on Quality, if You've Got Time, says Moscow State". Streaming Media Magazine. Retrieved 9 February 2018.
- Baumgartner, Jeff (2019-02-08). "Facebook: Tests Show AV1 Streaming Performance Is Exceeding Expectations". Multichannel. Retrieved 2019-02-10.
- "AOMediaCodec/av1-spec". GitHub. Retrieved 2018-10-16.[self-published source]
- "AV1 Codec ISO Media File Format Binding". cdn.rawgit.com. Retrieved 14 September 2018.
- "AOM AV1 codec mapping in Matroska/WebM". 2018-12-03. Retrieved 19 December 2018.
- "Matroska AV1 support". 2018-09-12. Retrieved 19 December 2018.
- "WebM Container Guidelines". 2017-11-28. Retrieved 19 December 2018.
- "Simple Encoder". 2010-05-18. Retrieved 2019-01-17.
IVF files will not generally be used by your application.
- "The fastest and safest AV1 encoder". Retrieved 9 April 2018.
- "WebM output in libaom". 2018-11-01. Retrieved 19 December 2018.
- "Linux Conference Australia 2019: The AV1 Video Codec". 2010-02-24. Retrieved 2019-02-05.
We have been focusing on freezing the bitstream and getting the quality, not necessarily making things fast. This is a graph of the [encoding] speed of AV1 over its development process. You can se that as we near the end of that process, we started making things faster again, and it's now two orders of magnitude faster than it was at its slowest point. So that's going to improve. And this is a corresponding graph of the quality. (…) So you can see that even as it has continued to get much faster, the quality hasn't really gone down. (…) We wanted to approach this from the other end, so we started an encoder of our own, called rav1e, and the idea is that we would start out always being fast, and then try to make it better over time.
- Armasu, Lucian (2019-02-04). "Intel Releases Open Source Encoder for Next-Gen AV1 Codec". Tom's Hardware. Retrieved 2019-02-13.
- "Introducing dav1d: a new AV1 decoder". 1 October 2018. Retrieved 6 January 2019.
- Kempf, Jean-Baptiste (2018-12-11). "First release of dav1d, the AV1 decoder". personal website of Jean-Baptiste Kempf. Retrieved 2019-02-03.
- "Bug 1493397 (dav1d) – Switch AV1 implementation to dav1d". Mozilla bug tracking system. 2019-02-01. Retrieved 2019-02-03.
- Cimpanu, Catalin (2019-01-29). "Firefox 65 released with AV1 and WebP support". ZDNet. Retrieved 2019-02-13.
- "Chrome 70 Arrives With Option To Disable Linked Sign-Ins, PWAs On Windows, and AV1 Decoder". Slashdot. 2018-10-16. Retrieved 2019-02-13.
- "AV1 Decoder". Chrome Platform Status. Retrieved 2018-09-14.[self-published source]
- Tung, Liam (2018-02-12). "VideoLAN: VLC 3.0's huge update brings Chromecast support, 360-degree video". ZDNet. Retrieved 2019-02-13.
- Larabel, Michael (2018-03-20). "GStreamer 1.14.0 Released With WebRTC Support, AV1 Video & Better Rust Bindings". Phoronix. Retrieved 2019-02-13.
- "Time to Start Testing: FFmpeg Turns 4.0 and Adds AV1 Support". Streaming Media Magazine. 2018-09-27. Retrieved 2019-02-13.
- "mpv v0.29.0". github.com/mpv-player/mpv. 2018-07-22. Retrieved 2018-09-16.[self-published source]
- "MKVToolNix v28.0.0 release notes".[self-published source]
- Bunkus, Moritz (2018-04-01). "MKVToolNix v22.0.0 released". mosu's Matroska stuff. Retrieved 3 May 2018.[self-published source]
- Serea, Razvan (2018-03-20). "MediaInfo 18.03". Neowin. Retrieved 2018-05-03.
- "Encoding Encoder Release 1.50.0". Bitmovin Documentation. Retrieved 2018-11-09.[self-published source]
- "Microsoft Launches Free AV1 Video Codec For Windows 10". Slashdot. 2018-11-10. Retrieved 2019-02-13.
- Building Modern Web Media Experiences: Picture-in-Picture and AV1 (Chrome Dev Summit 2018) - YouTube
- "AOMedia Members Demo AV1 at IBC2018". 6 September 2018. Retrieved 6 January 2019.[self-published source]
- "Socionext Implements AV1 Encoder on FPGA over Cloud Service". 6 June 2018. Retrieved 6 January 2019.[self-published source]
- "AV1 Image File Format (AVIF)". aomediacodec.github.io. Retrieved 25 November 2018.
- "S3 Bucket Listing Generator". download.opencontent.netflix.com. Retrieved 2018-12-17.[self-published source]
- Abrams, Lawrence (2018-12-28). "Microsoft is Adding AVIF Image Support to Windows 10". BleepingComputer. Retrieved 2019-02-13.