Intent to Ship: Add support for video frame orientation metadata to WebCodecs

118 views
Skip to first unread message

Chromestatus

unread,
Mar 25, 2025, 9:10:24 AM (12 days ago) Mar 25
to blin...@chromium.org, dalec...@chromium.org

Contact emails

dalec...@chromium.org

Explainer

None

Specification

https://www.w3.org/TR/webcodecs

Summary

Introduces "rotation: int" and "flip: bool" values to various video related interfaces in WebCodecs so that developers can work with frame sources that have orientation (E.g., Android cameras, certain media). The VideoFrame interface grows the ability to create VideoFrames with arbitrary rotation and flip as well as accessors for this information on the VideoFrame object. The VideoDecoderConfig object gains rotation and flip fields that are emitted on decoded VideoFrame objects automatically. The VideoEncoder class gains mechanisms for passing rotation and flip information from encode() to the VideoDecoderConfig emitted as part of EncodedVideoChunkMetadata. If encode() is called with frames with different orientations a non-fatal exception will be thrown. configure() may be used to reset the allowed orientation.



Blink component

Blink>Media>WebCodecs

Search tags

webcodecs, videoframe, orientation, videodecoder, videoencoder

TAG review

None

TAG review status

Not applicable

Risks



Interoperability and Compatibility

Low, feature designed in conjunction with Firefox and the Media WG.



Gecko: Positive (https://bugzilla.mozilla.org/show_bug.cgi?id=1955904) Spec co-editor, approved all spec changes, implementation planned.

WebKit: No signal (https://github.com/WebKit/standards-positions/issues/475) Participated in some of the standards discussions around this feature.

Web developers: Positive (https://github.com/w3c/webcodecs/issues/351) Active requests from Zoom and others developers (see issue) who are working with cameras with orientation metadata.

Other signals:

Ergonomics

No risk. Small incremental change to WebCodecs API.



Activation

No risk. Small incremental change to WebCodecs API. MDN documentation will be updated.



Security

No risk. No new control surfaces. Video elements already have this functionality.



WebView application risks

Does this intent deprecate or change behavior of existing APIs, such that it has potentially high risk for Android WebView-based applications?

Some cameras and media will immediately begin exposing orientation information. In uncommon cases this orientation may change. If orientation changes during encoding through VideoEncoder frames with different orientation than the first are currently encoded with incorrect size and orientation. However the operation completes without a hard error. During discussions in the media working group it was decided that these cases should instead throw a non-fatal exception during encode(). There is already precedent for exception handling during encode() so authors should already be handling this case.



Debuggability

Rotation information shows up in media configs emitted in media dev tools.



Will this feature be supported on all six Blink platforms (Windows, Mac, Linux, ChromeOS, Android, and Android WebView)?

Yes

Mostly just exposes existing under the hood information used by video elements, so is already supported everywhere.



Is this feature fully tested by web-platform-tests?

Yes

https://wpt.fyi/results/webcodecs?label=master&label=experimental&aligned&q=orientation



Flag name on about://flags

WebCodecsOrientation

Finch feature name

WebCodecsOrientation

Requires code in //chrome?

False

Tracking bug

https://crbug.com/40243431

Measurement

WebCodecs has use counters, but no specific use counter has been added for the rotation and flip usage.

Availability expectation

Available on all platforms except for iOS.

Adoption expectation

Should be adopted by sites consuming camera feeds with WebCodecs.

Estimated milestones

Shipping on desktop 137
DevTrial on desktop 136
Shipping on Android 137
DevTrial on Android 136
Shipping on WebView 137


Anticipated spec changes

Open questions about a feature may be a source of future web compat or interop issues. Please list open issues (e.g. links to known github issues in the project for the feature specification) whose resolution may introduce web compat/interop risk (e.g., changing to naming or structure of the API in a non-backward-compatible way).

Spec changes have already been landed after review with Firefox and the Media Working Group: * https://github.com/w3c/webcodecs/pull/884 * https://github.com/w3c/webcodecs/pull/874 * https://github.com/w3c/webcodecs/pull/840 * https://github.com/w3c/webcodecs/pull/559

Link to entry on the Chrome Platform Status

https://chromestatus.com/feature/5128855343595520?gate=5187346531876864

This intent message was generated by Chrome Platform Status.

Stephen Chenney

unread,
Mar 25, 2025, 9:44:37 AM (12 days ago) Mar 25
to Chromestatus, blin...@chromium.org, dalec...@chromium.org
On Tue, Mar 25, 2025 at 12:10 PM Chromestatus <ad...@cr-status.appspotmail.com> wrote:

Contact emails

dalec...@chromium.org

Explainer

None

Specification

https://www.w3.org/TR/webcodecs

Summary

Introduces "rotation: int" and "flip: bool" values to various video related interfaces in WebCodecs so that developers can work with frame sources that have orientation (E.g., Android cameras, certain media). The VideoFrame interface grows the ability to create VideoFrames with arbitrary rotation and flip as well as accessors for this information on the VideoFrame object. The VideoDecoderConfig object gains rotation and flip fields that are emitted on decoded VideoFrame objects automatically. The VideoEncoder class gains mechanisms for passing rotation and flip information from encode() to the VideoDecoderConfig emitted as part of EncodedVideoChunkMetadata. If encode() is called with frames with different orientations a non-fatal exception will be thrown. configure() may be used to reset the allowed orientation.

Back in the day I implemented the CSS image-orientation property, which is conceptually addressing the same kind of thing in the image case. There were privacy concerns about exposing image metadata content through the orientation property, leading to the requirement that cross-origin content always render using the image's EXIF orientation. This prevents pages from applying the image-orientation property, detecting a change in rendering, and hence inferring the metadata.

As I understand the spec link and this intent, it seems like existing behavior is to always use the encoded video orientation, and this feature allows sites to modify the orientation. If that's so, could it expose information about users' cameras via video length/width, thus leading to additional fingerprinting surface? Did this come up in the spec discussions?

Stephen.
 
--
You received this message because you are subscribed to the Google Groups "blink-dev" group.
To unsubscribe from this group and stop receiving emails from it, send an email to blink-dev+...@chromium.org.
To view this discussion visit https://groups.google.com/a/chromium.org/d/msgid/blink-dev/67e2d562.170a0220.e1a1e.07a6.GAE%40google.com.

Dale Curtis

unread,
Mar 25, 2025, 12:13:28 PM (12 days ago) Mar 25
to Stephen Chenney, Chromestatus, blin...@chromium.org
On Tue, Mar 25, 2025 at 9:44 AM Stephen Chenney <sche...@chromium.org> wrote:


On Tue, Mar 25, 2025 at 12:10 PM Chromestatus <ad...@cr-status.appspotmail.com> wrote:

Contact emails

dalec...@chromium.org

Explainer

None

Specification

https://www.w3.org/TR/webcodecs

Summary

Introduces "rotation: int" and "flip: bool" values to various video related interfaces in WebCodecs so that developers can work with frame sources that have orientation (E.g., Android cameras, certain media). The VideoFrame interface grows the ability to create VideoFrames with arbitrary rotation and flip as well as accessors for this information on the VideoFrame object. The VideoDecoderConfig object gains rotation and flip fields that are emitted on decoded VideoFrame objects automatically. The VideoEncoder class gains mechanisms for passing rotation and flip information from encode() to the VideoDecoderConfig emitted as part of EncodedVideoChunkMetadata. If encode() is called with frames with different orientations a non-fatal exception will be thrown. configure() may be used to reset the allowed orientation.

Back in the day I implemented the CSS image-orientation property, which is conceptually addressing the same kind of thing in the image case. There were privacy concerns about exposing image metadata content through the orientation property, leading to the requirement that cross-origin content always render using the image's EXIF orientation. This prevents pages from applying the image-orientation property, detecting a change in rendering, and hence inferring the metadata.

As I understand the spec link and this intent, it seems like existing behavior is to always use the encoded video orientation, and this feature allows sites to modify the orientation. If that's so, could it expose information about users' cameras via video length/width, thus leading to additional fingerprinting surface? Did this come up in the spec discussions?

The existing behavior is to ignore orientation entirely, which leads to incorrect encoding and decoding. There's no new exposure of information in this case:
  • VideoFrames can't be created from cross-origin objects, so this can't leak anything about <image> tags or other CanvasImageSource types.
  • The camera width, height, and aspectRatio are already available through the MediaStreamTrack information (getUserMedia is permission gated).
  • Orientation is already indirectly observable per-frame by drawing to canvas with and without image-orientation: none.

Stephen Chenney

unread,
Mar 25, 2025, 2:12:50 PM (12 days ago) Mar 25
to Dale Curtis, Chromestatus, blin...@chromium.org
On Tue, Mar 25, 2025 at 3:13 PM Dale Curtis <dalec...@chromium.org> wrote:
On Tue, Mar 25, 2025 at 9:44 AM Stephen Chenney <sche...@chromium.org> wrote:


On Tue, Mar 25, 2025 at 12:10 PM Chromestatus <ad...@cr-status.appspotmail.com> wrote:

Contact emails

dalec...@chromium.org

Explainer

None

Specification

https://www.w3.org/TR/webcodecs

Summary

Introduces "rotation: int" and "flip: bool" values to various video related interfaces in WebCodecs so that developers can work with frame sources that have orientation (E.g., Android cameras, certain media). The VideoFrame interface grows the ability to create VideoFrames with arbitrary rotation and flip as well as accessors for this information on the VideoFrame object. The VideoDecoderConfig object gains rotation and flip fields that are emitted on decoded VideoFrame objects automatically. The VideoEncoder class gains mechanisms for passing rotation and flip information from encode() to the VideoDecoderConfig emitted as part of EncodedVideoChunkMetadata. If encode() is called with frames with different orientations a non-fatal exception will be thrown. configure() may be used to reset the allowed orientation.

Back in the day I implemented the CSS image-orientation property, which is conceptually addressing the same kind of thing in the image case. There were privacy concerns about exposing image metadata content through the orientation property, leading to the requirement that cross-origin content always render using the image's EXIF orientation. This prevents pages from applying the image-orientation property, detecting a change in rendering, and hence inferring the metadata.

As I understand the spec link and this intent, it seems like existing behavior is to always use the encoded video orientation, and this feature allows sites to modify the orientation. If that's so, could it expose information about users' cameras via video length/width, thus leading to additional fingerprinting surface? Did this come up in the spec discussions?

The existing behavior is to ignore orientation entirely, which leads to incorrect encoding and decoding. There's no new exposure of information in this case:
  • VideoFrames can't be created from cross-origin objects, so this can't leak anything about <image> tags or other CanvasImageSource types.
  • The camera width, height, and aspectRatio are already available through the MediaStreamTrack information (getUserMedia is permission gated).
  • Orientation is already indirectly observable per-frame by drawing to canvas with and without image-orientation: none.
Thanks. That clarifies things and addresses any concerns I had.

Vladimir Levin

unread,
Apr 1, 2025, 7:16:52 PM (5 days ago) Apr 1
to blink-dev, Stephen Chenney, Chromestatus, blin...@chromium.org, Dale Curtis

Some cameras and media will immediately begin exposing orientation information. In uncommon cases this orientation may change. If orientation changes during encoding through VideoEncoder frames with different orientation than the first are currently encoded with incorrect size and orientation. However the operation completes without a hard error. During discussions in the media working group it was decided that these cases should instead throw a non-fatal exception during encode(). There is already precedent for exception handling during encode() so authors should already be handling this case.

Can you clarify if "uncommon" here means that some cameras are known to change the orientation and those cameras are typically rare, or that most cameras will change the orientation if say the user changes the orientation, but that user behavior is rare? :)

This does seem to introduce a new exception reason that seems almost expected to happen in some cases. Can you comment on whether the other exceptions that the author should already be happening are likewise "common"? There's a bit of a concern that authors that don't handle these exceptions will start breaking on orientation changing cases where previously they did not.

Thanks!
Vlad

Dale Curtis

unread,
Apr 1, 2025, 9:05:53 PM (5 days ago) Apr 1
to Vladimir Levin, blink-dev, Stephen Chenney, Chromestatus
On Tue, Apr 1, 2025 at 7:16 PM Vladimir Levin <vmp...@chromium.org> wrote:

Some cameras and media will immediately begin exposing orientation information. In uncommon cases this orientation may change. If orientation changes during encoding through VideoEncoder frames with different orientation than the first are currently encoded with incorrect size and orientation. However the operation completes without a hard error. During discussions in the media working group it was decided that these cases should instead throw a non-fatal exception during encode(). There is already precedent for exception handling during encode() so authors should already be handling this case.

Can you clarify if "uncommon" here means that some cameras are known to change the orientation and those cameras are typically rare, or that most cameras will change the orientation if say the user changes the orientation, but that user behavior is rare? :)

I meant most cases will have a single fixed orientation for the duration of the session. Users changing their orientation is uncommon. It will almost always happen with users on mobile / hand-held devices accidentally or intentionally reorienting the device itself.

This does seem to introduce a new exception reason that seems almost expected to happen in some cases. Can you comment on whether the other exceptions that the author should already be happening are likewise "common"? There's a bit of a concern that authors that don't handle these exceptions will start breaking on orientation changing cases where previously they did not.

Anecdotally, I'd say exceptions are uncommon for encode(). Exceptions will be thrown if a frame is already closed or the codec has been closed due to an error or reclaimed by system pressure (an automatic process which is not uncommon, but also generates an error signal).

We discussed this concern in the working group. Keep in mind, encoding is already broken in these cases. It's just that instead of an exception the frame is either squished (perpendicular orientation) or encoded in mirror. As such the working group preferred a clear error signal to the silent issues that exist today.

Additionally, given low WebCodecs VideoEncoder usage on mobile the actual risk of any issues seems unlikely:


Thanks!
Vlad

Vladimir Levin

unread,
Apr 2, 2025, 8:37:49 AM (4 days ago) Apr 2
to blink-dev, Dale Curtis, blink-dev, Stephen Chenney, Chromestatus, Vladimir Levin
On Wednesday, April 2, 2025 at 12:05:53 AM UTC-4 Dale Curtis wrote:
On Tue, Apr 1, 2025 at 7:16 PM Vladimir Levin <vmp...@chromium.org> wrote:

Some cameras and media will immediately begin exposing orientation information. In uncommon cases this orientation may change. If orientation changes during encoding through VideoEncoder frames with different orientation than the first are currently encoded with incorrect size and orientation. However the operation completes without a hard error. During discussions in the media working group it was decided that these cases should instead throw a non-fatal exception during encode(). There is already precedent for exception handling during encode() so authors should already be handling this case.

Can you clarify if "uncommon" here means that some cameras are known to change the orientation and those cameras are typically rare, or that most cameras will change the orientation if say the user changes the orientation, but that user behavior is rare? :)

I meant most cases will have a single fixed orientation for the duration of the session. Users changing their orientation is uncommon. It will almost always happen with users on mobile / hand-held devices accidentally or intentionally reorienting the device itself.

I'm not sure how to reason about common-ness of the user device orientation changes, so I'm not convinced by this specific argument. 

This does seem to introduce a new exception reason that seems almost expected to happen in some cases. Can you comment on whether the other exceptions that the author should already be happening are likewise "common"? There's a bit of a concern that authors that don't handle these exceptions will start breaking on orientation changing cases where previously they did not.

Anecdotally, I'd say exceptions are uncommon for encode(). Exceptions will be thrown if a frame is already closed or the codec has been closed due to an error or reclaimed by system pressure (an automatic process which is not uncommon, but also generates an error signal).

We discussed this concern in the working group. Keep in mind, encoding is already broken in these cases. It's just that instead of an exception the frame is either squished (perpendicular orientation) or encoded in mirror. As such the working group preferred a clear error signal to the silent issues that exist today.

Additionally, given low WebCodecs VideoEncoder usage on mobile the actual risk of any issues seems unlikely:

That seems a more compelling argument: the current state would be broken in a different way, and the usage is very low. So this makes me think the change is pretty safe. Nevertheless, can you comment on the type of website that could potentially be impacted? Ie what are the common usages of this API?

Thanks!
Vlad
 


Thanks!
Vlad

Dale Curtis

unread,
Apr 2, 2025, 9:04:23 AM (4 days ago) Apr 2
to Vladimir Levin, blink-dev, Stephen Chenney, Chromestatus
On Wed, Apr 2, 2025 at 8:37 AM Vladimir Levin <vmp...@chromium.org> wrote:


On Wednesday, April 2, 2025 at 12:05:53 AM UTC-4 Dale Curtis wrote:
On Tue, Apr 1, 2025 at 7:16 PM Vladimir Levin <vmp...@chromium.org> wrote:

Some cameras and media will immediately begin exposing orientation information. In uncommon cases this orientation may change. If orientation changes during encoding through VideoEncoder frames with different orientation than the first are currently encoded with incorrect size and orientation. However the operation completes without a hard error. During discussions in the media working group it was decided that these cases should instead throw a non-fatal exception during encode(). There is already precedent for exception handling during encode() so authors should already be handling this case.

Can you clarify if "uncommon" here means that some cameras are known to change the orientation and those cameras are typically rare, or that most cameras will change the orientation if say the user changes the orientation, but that user behavior is rare? :)

I meant most cases will have a single fixed orientation for the duration of the session. Users changing their orientation is uncommon. It will almost always happen with users on mobile / hand-held devices accidentally or intentionally reorienting the device itself.

I'm not sure how to reason about common-ness of the user device orientation changes, so I'm not convinced by this specific argument. 

Sorry, I forgot to cite my source: https://uma.googleplex.com/p/chrome/timeline_v2?sid=9db1427b181520ab8c02f338ff6bb809 (internal only unfortunately), but ~99% of streams never see more than the initial transform.
 

This does seem to introduce a new exception reason that seems almost expected to happen in some cases. Can you comment on whether the other exceptions that the author should already be happening are likewise "common"? There's a bit of a concern that authors that don't handle these exceptions will start breaking on orientation changing cases where previously they did not.

Anecdotally, I'd say exceptions are uncommon for encode(). Exceptions will be thrown if a frame is already closed or the codec has been closed due to an error or reclaimed by system pressure (an automatic process which is not uncommon, but also generates an error signal).

We discussed this concern in the working group. Keep in mind, encoding is already broken in these cases. It's just that instead of an exception the frame is either squished (perpendicular orientation) or encoded in mirror. As such the working group preferred a clear error signal to the silent issues that exist today.

Additionally, given low WebCodecs VideoEncoder usage on mobile the actual risk of any issues seems unlikely:

That seems a more compelling argument: the current state would be broken in a different way, and the usage is very low. So this makes me think the change is pretty safe. Nevertheless, can you comment on the type of website that could potentially be impacted? Ie what are the common usages of this API?

The most common of those affected would be some semi-RTC based site doing its own camera/capture work on Android. E.g., Zoom who we're working with on this.
 

Thanks!
Vlad
 


Thanks!
Vlad

Vladimir Levin

unread,
Apr 2, 2025, 10:36:53 AM (4 days ago) Apr 2
to blink-dev, Dale Curtis, blink-dev, Stephen Chenney, Chromestatus, Vladimir Levin
On Wednesday, April 2, 2025 at 12:04:23 PM UTC-4 Dale Curtis wrote:
On Wed, Apr 2, 2025 at 8:37 AM Vladimir Levin <vmp...@chromium.org> wrote:


On Wednesday, April 2, 2025 at 12:05:53 AM UTC-4 Dale Curtis wrote:
On Tue, Apr 1, 2025 at 7:16 PM Vladimir Levin <vmp...@chromium.org> wrote:

Some cameras and media will immediately begin exposing orientation information. In uncommon cases this orientation may change. If orientation changes during encoding through VideoEncoder frames with different orientation than the first are currently encoded with incorrect size and orientation. However the operation completes without a hard error. During discussions in the media working group it was decided that these cases should instead throw a non-fatal exception during encode(). There is already precedent for exception handling during encode() so authors should already be handling this case.

Can you clarify if "uncommon" here means that some cameras are known to change the orientation and those cameras are typically rare, or that most cameras will change the orientation if say the user changes the orientation, but that user behavior is rare? :)

I meant most cases will have a single fixed orientation for the duration of the session. Users changing their orientation is uncommon. It will almost always happen with users on mobile / hand-held devices accidentally or intentionally reorienting the device itself.

I'm not sure how to reason about common-ness of the user device orientation changes, so I'm not convinced by this specific argument. 

Sorry, I forgot to cite my source: https://uma.googleplex.com/p/chrome/timeline_v2?sid=9db1427b181520ab8c02f338ff6bb809 (internal only unfortunately), but ~99% of streams never see more than the initial transform.

Cool thanks for the source!
 
 

This does seem to introduce a new exception reason that seems almost expected to happen in some cases. Can you comment on whether the other exceptions that the author should already be happening are likewise "common"? There's a bit of a concern that authors that don't handle these exceptions will start breaking on orientation changing cases where previously they did not.

Anecdotally, I'd say exceptions are uncommon for encode(). Exceptions will be thrown if a frame is already closed or the codec has been closed due to an error or reclaimed by system pressure (an automatic process which is not uncommon, but also generates an error signal).

We discussed this concern in the working group. Keep in mind, encoding is already broken in these cases. It's just that instead of an exception the frame is either squished (perpendicular orientation) or encoded in mirror. As such the working group preferred a clear error signal to the silent issues that exist today.

Additionally, given low WebCodecs VideoEncoder usage on mobile the actual risk of any issues seems unlikely:

That seems a more compelling argument: the current state would be broken in a different way, and the usage is very low. So this makes me think the change is pretty safe. Nevertheless, can you comment on the type of website that could potentially be impacted? Ie what are the common usages of this API?

The most common of those affected would be some semi-RTC based site doing its own camera/capture work on Android. E.g., Zoom who we're working with on this.

Ok, that's good to know, thanks.

LGTM1
 
 

Thanks!
Vlad
 


Thanks!
Vlad

Yoav Weiss (@Shopify)

unread,
Apr 3, 2025, 1:33:16 AM (3 days ago) Apr 3
to Vladimir Levin, blink-dev, Dale Curtis, Stephen Chenney, Chromestatus
LGTM2

--
You received this message because you are subscribed to the Google Groups "blink-dev" group.
To unsubscribe from this group and stop receiving emails from it, send an email to blink-dev+...@chromium.org.
Reply all
Reply to author
Forward
0 new messages