Introduces "rotation: int" and "flip: bool" values to various video related interfaces in WebCodecs so that developers can work with frame sources that have orientation (E.g., Android cameras, certain media). The VideoFrame interface grows the ability to create VideoFrames with arbitrary rotation and flip as well as accessors for this information on the VideoFrame object. The VideoDecoderConfig object gains rotation and flip fields that are emitted on decoded VideoFrame objects automatically. The VideoEncoder class gains mechanisms for passing rotation and flip information from encode() to the VideoDecoderConfig emitted as part of EncodedVideoChunkMetadata. If encode() is called with frames with different orientations a non-fatal exception will be thrown. configure() may be used to reset the allowed orientation.
Low, feature designed in conjunction with Firefox and the Media WG.
No risk. Small incremental change to WebCodecs API.
No risk. Small incremental change to WebCodecs API. MDN documentation will be updated.
No risk. No new control surfaces. Video elements already have this functionality.
Does this intent deprecate or change behavior of existing APIs, such that it has potentially high risk for Android WebView-based applications?
Some cameras and media will immediately begin exposing orientation information. In uncommon cases this orientation may change. If orientation changes during encoding through VideoEncoder frames with different orientation than the first are currently encoded with incorrect size and orientation. However the operation completes without a hard error. During discussions in the media working group it was decided that these cases should instead throw a non-fatal exception during encode(). There is already precedent for exception handling during encode() so authors should already be handling this case.
Rotation information shows up in media configs emitted in media dev tools.
Mostly just exposes existing under the hood information used by video elements, so is already supported everywhere.
https://wpt.fyi/results/webcodecs?label=master&label=experimental&aligned&q=orientation
Shipping on desktop | 137 |
DevTrial on desktop | 136 |
Shipping on Android | 137 |
DevTrial on Android | 136 |
Shipping on WebView | 137 |
Open questions about a feature may be a source of future web compat or interop issues. Please list open issues (e.g. links to known github issues in the project for the feature specification) whose resolution may introduce web compat/interop risk (e.g., changing to naming or structure of the API in a non-backward-compatible way).
Spec changes have already been landed after review with Firefox and the Media Working Group: * https://github.com/w3c/webcodecs/pull/884 * https://github.com/w3c/webcodecs/pull/874 * https://github.com/w3c/webcodecs/pull/840 * https://github.com/w3c/webcodecs/pull/559Contact emails
dalec...@chromium.orgExplainer
NoneSpecification
https://www.w3.org/TR/webcodecsSummary
Introduces "rotation: int" and "flip: bool" values to various video related interfaces in WebCodecs so that developers can work with frame sources that have orientation (E.g., Android cameras, certain media). The VideoFrame interface grows the ability to create VideoFrames with arbitrary rotation and flip as well as accessors for this information on the VideoFrame object. The VideoDecoderConfig object gains rotation and flip fields that are emitted on decoded VideoFrame objects automatically. The VideoEncoder class gains mechanisms for passing rotation and flip information from encode() to the VideoDecoderConfig emitted as part of EncodedVideoChunkMetadata. If encode() is called with frames with different orientations a non-fatal exception will be thrown. configure() may be used to reset the allowed orientation.
--
You received this message because you are subscribed to the Google Groups "blink-dev" group.
To unsubscribe from this group and stop receiving emails from it, send an email to blink-dev+...@chromium.org.
To view this discussion visit https://groups.google.com/a/chromium.org/d/msgid/blink-dev/67e2d562.170a0220.e1a1e.07a6.GAE%40google.com.
On Tue, Mar 25, 2025 at 12:10 PM Chromestatus <ad...@cr-status.appspotmail.com> wrote:Contact emails
dalec...@chromium.orgExplainer
NoneSpecification
https://www.w3.org/TR/webcodecsSummary
Introduces "rotation: int" and "flip: bool" values to various video related interfaces in WebCodecs so that developers can work with frame sources that have orientation (E.g., Android cameras, certain media). The VideoFrame interface grows the ability to create VideoFrames with arbitrary rotation and flip as well as accessors for this information on the VideoFrame object. The VideoDecoderConfig object gains rotation and flip fields that are emitted on decoded VideoFrame objects automatically. The VideoEncoder class gains mechanisms for passing rotation and flip information from encode() to the VideoDecoderConfig emitted as part of EncodedVideoChunkMetadata. If encode() is called with frames with different orientations a non-fatal exception will be thrown. configure() may be used to reset the allowed orientation.
Back in the day I implemented the CSS image-orientation property, which is conceptually addressing the same kind of thing in the image case. There were privacy concerns about exposing image metadata content through the orientation property, leading to the requirement that cross-origin content always render using the image's EXIF orientation. This prevents pages from applying the image-orientation property, detecting a change in rendering, and hence inferring the metadata.As I understand the spec link and this intent, it seems like existing behavior is to always use the encoded video orientation, and this feature allows sites to modify the orientation. If that's so, could it expose information about users' cameras via video length/width, thus leading to additional fingerprinting surface? Did this come up in the spec discussions?
On Tue, Mar 25, 2025 at 9:44 AM Stephen Chenney <sche...@chromium.org> wrote:On Tue, Mar 25, 2025 at 12:10 PM Chromestatus <ad...@cr-status.appspotmail.com> wrote:Contact emails
dalec...@chromium.orgExplainer
NoneSpecification
https://www.w3.org/TR/webcodecsSummary
Introduces "rotation: int" and "flip: bool" values to various video related interfaces in WebCodecs so that developers can work with frame sources that have orientation (E.g., Android cameras, certain media). The VideoFrame interface grows the ability to create VideoFrames with arbitrary rotation and flip as well as accessors for this information on the VideoFrame object. The VideoDecoderConfig object gains rotation and flip fields that are emitted on decoded VideoFrame objects automatically. The VideoEncoder class gains mechanisms for passing rotation and flip information from encode() to the VideoDecoderConfig emitted as part of EncodedVideoChunkMetadata. If encode() is called with frames with different orientations a non-fatal exception will be thrown. configure() may be used to reset the allowed orientation.
Back in the day I implemented the CSS image-orientation property, which is conceptually addressing the same kind of thing in the image case. There were privacy concerns about exposing image metadata content through the orientation property, leading to the requirement that cross-origin content always render using the image's EXIF orientation. This prevents pages from applying the image-orientation property, detecting a change in rendering, and hence inferring the metadata.As I understand the spec link and this intent, it seems like existing behavior is to always use the encoded video orientation, and this feature allows sites to modify the orientation. If that's so, could it expose information about users' cameras via video length/width, thus leading to additional fingerprinting surface? Did this come up in the spec discussions?The existing behavior is to ignore orientation entirely, which leads to incorrect encoding and decoding. There's no new exposure of information in this case:
- VideoFrames can't be created from cross-origin objects, so this can't leak anything about <image> tags or other CanvasImageSource types.
- The camera width, height, and aspectRatio are already available through the MediaStreamTrack information (getUserMedia is permission gated).
- Orientation is already indirectly observable per-frame by drawing to canvas with and without image-orientation: none.
Some cameras and media will immediately begin exposing orientation information. In uncommon cases this orientation may change. If orientation changes during encoding through VideoEncoder frames with different orientation than the first are currently encoded with incorrect size and orientation. However the operation completes without a hard error. During discussions in the media working group it was decided that these cases should instead throw a non-fatal exception during encode(). There is already precedent for exception handling during encode() so authors should already be handling this case.
Some cameras and media will immediately begin exposing orientation information. In uncommon cases this orientation may change. If orientation changes during encoding through VideoEncoder frames with different orientation than the first are currently encoded with incorrect size and orientation. However the operation completes without a hard error. During discussions in the media working group it was decided that these cases should instead throw a non-fatal exception during encode(). There is already precedent for exception handling during encode() so authors should already be handling this case.
Can you clarify if "uncommon" here means that some cameras are known to change the orientation and those cameras are typically rare, or that most cameras will change the orientation if say the user changes the orientation, but that user behavior is rare? :)
This does seem to introduce a new exception reason that seems almost expected to happen in some cases. Can you comment on whether the other exceptions that the author should already be happening are likewise "common"? There's a bit of a concern that authors that don't handle these exceptions will start breaking on orientation changing cases where previously they did not.
Thanks!Vlad
On Tue, Apr 1, 2025 at 7:16 PM Vladimir Levin <vmp...@chromium.org> wrote:Some cameras and media will immediately begin exposing orientation information. In uncommon cases this orientation may change. If orientation changes during encoding through VideoEncoder frames with different orientation than the first are currently encoded with incorrect size and orientation. However the operation completes without a hard error. During discussions in the media working group it was decided that these cases should instead throw a non-fatal exception during encode(). There is already precedent for exception handling during encode() so authors should already be handling this case.
Can you clarify if "uncommon" here means that some cameras are known to change the orientation and those cameras are typically rare, or that most cameras will change the orientation if say the user changes the orientation, but that user behavior is rare? :)I meant most cases will have a single fixed orientation for the duration of the session. Users changing their orientation is uncommon. It will almost always happen with users on mobile / hand-held devices accidentally or intentionally reorienting the device itself.
This does seem to introduce a new exception reason that seems almost expected to happen in some cases. Can you comment on whether the other exceptions that the author should already be happening are likewise "common"? There's a bit of a concern that authors that don't handle these exceptions will start breaking on orientation changing cases where previously they did not.Anecdotally, I'd say exceptions are uncommon for encode(). Exceptions will be thrown if a frame is already closed or the codec has been closed due to an error or reclaimed by system pressure (an automatic process which is not uncommon, but also generates an error signal).We discussed this concern in the working group. Keep in mind, encoding is already broken in these cases. It's just that instead of an exception the frame is either squished (perpendicular orientation) or encoded in mirror. As such the working group preferred a clear error signal to the silent issues that exist today.Additionally, given low WebCodecs VideoEncoder usage on mobile the actual risk of any issues seems unlikely:
Thanks!Vlad
On Wednesday, April 2, 2025 at 12:05:53 AM UTC-4 Dale Curtis wrote:On Tue, Apr 1, 2025 at 7:16 PM Vladimir Levin <vmp...@chromium.org> wrote:Some cameras and media will immediately begin exposing orientation information. In uncommon cases this orientation may change. If orientation changes during encoding through VideoEncoder frames with different orientation than the first are currently encoded with incorrect size and orientation. However the operation completes without a hard error. During discussions in the media working group it was decided that these cases should instead throw a non-fatal exception during encode(). There is already precedent for exception handling during encode() so authors should already be handling this case.
Can you clarify if "uncommon" here means that some cameras are known to change the orientation and those cameras are typically rare, or that most cameras will change the orientation if say the user changes the orientation, but that user behavior is rare? :)I meant most cases will have a single fixed orientation for the duration of the session. Users changing their orientation is uncommon. It will almost always happen with users on mobile / hand-held devices accidentally or intentionally reorienting the device itself.I'm not sure how to reason about common-ness of the user device orientation changes, so I'm not convinced by this specific argument.
This does seem to introduce a new exception reason that seems almost expected to happen in some cases. Can you comment on whether the other exceptions that the author should already be happening are likewise "common"? There's a bit of a concern that authors that don't handle these exceptions will start breaking on orientation changing cases where previously they did not.Anecdotally, I'd say exceptions are uncommon for encode(). Exceptions will be thrown if a frame is already closed or the codec has been closed due to an error or reclaimed by system pressure (an automatic process which is not uncommon, but also generates an error signal).We discussed this concern in the working group. Keep in mind, encoding is already broken in these cases. It's just that instead of an exception the frame is either squished (perpendicular orientation) or encoded in mirror. As such the working group preferred a clear error signal to the silent issues that exist today.Additionally, given low WebCodecs VideoEncoder usage on mobile the actual risk of any issues seems unlikely:That seems a more compelling argument: the current state would be broken in a different way, and the usage is very low. So this makes me think the change is pretty safe. Nevertheless, can you comment on the type of website that could potentially be impacted? Ie what are the common usages of this API?
Thanks!VladThanks!Vlad
On Wed, Apr 2, 2025 at 8:37 AM Vladimir Levin <vmp...@chromium.org> wrote:On Wednesday, April 2, 2025 at 12:05:53 AM UTC-4 Dale Curtis wrote:On Tue, Apr 1, 2025 at 7:16 PM Vladimir Levin <vmp...@chromium.org> wrote:Some cameras and media will immediately begin exposing orientation information. In uncommon cases this orientation may change. If orientation changes during encoding through VideoEncoder frames with different orientation than the first are currently encoded with incorrect size and orientation. However the operation completes without a hard error. During discussions in the media working group it was decided that these cases should instead throw a non-fatal exception during encode(). There is already precedent for exception handling during encode() so authors should already be handling this case.
Can you clarify if "uncommon" here means that some cameras are known to change the orientation and those cameras are typically rare, or that most cameras will change the orientation if say the user changes the orientation, but that user behavior is rare? :)I meant most cases will have a single fixed orientation for the duration of the session. Users changing their orientation is uncommon. It will almost always happen with users on mobile / hand-held devices accidentally or intentionally reorienting the device itself.I'm not sure how to reason about common-ness of the user device orientation changes, so I'm not convinced by this specific argument.Sorry, I forgot to cite my source: https://uma.googleplex.com/p/chrome/timeline_v2?sid=9db1427b181520ab8c02f338ff6bb809 (internal only unfortunately), but ~99% of streams never see more than the initial transform.
This does seem to introduce a new exception reason that seems almost expected to happen in some cases. Can you comment on whether the other exceptions that the author should already be happening are likewise "common"? There's a bit of a concern that authors that don't handle these exceptions will start breaking on orientation changing cases where previously they did not.Anecdotally, I'd say exceptions are uncommon for encode(). Exceptions will be thrown if a frame is already closed or the codec has been closed due to an error or reclaimed by system pressure (an automatic process which is not uncommon, but also generates an error signal).We discussed this concern in the working group. Keep in mind, encoding is already broken in these cases. It's just that instead of an exception the frame is either squished (perpendicular orientation) or encoded in mirror. As such the working group preferred a clear error signal to the silent issues that exist today.Additionally, given low WebCodecs VideoEncoder usage on mobile the actual risk of any issues seems unlikely:That seems a more compelling argument: the current state would be broken in a different way, and the usage is very low. So this makes me think the change is pretty safe. Nevertheless, can you comment on the type of website that could potentially be impacted? Ie what are the common usages of this API?The most common of those affected would be some semi-RTC based site doing its own camera/capture work on Android. E.g., Zoom who we're working with on this.
Thanks!VladThanks!Vlad
--
You received this message because you are subscribed to the Google Groups "blink-dev" group.
To unsubscribe from this group and stop receiving emails from it, send an email to blink-dev+...@chromium.org.
To view this discussion visit https://groups.google.com/a/chromium.org/d/msgid/blink-dev/4f46eb2b-e01d-4103-be00-50ecc05589a0n%40chromium.org.