Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Screen Capture Requested #3655

Open
wants to merge 6 commits into
base: main
Choose a base branch
from
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
369 changes: 369 additions & 0 deletions specs/ScreenCaptureRequested.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,369 @@
# Background

The HTML DOM's Screen Capture API `navigator.mediaDevices.getDisplayMedia` allows developers to
get a video stream of a user's tabs, windows, or desktop. This API is available in WebView2,
but the current default UI has some problems that we need to fix to make sure that hybrid apps
using WebView2 have a more seamless web/native experience. This includes removing the tab column
in the UI, replacing default strings and icons that do not match in WV2, and potentially having
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is talking about UI that is never shown in the spec, so it's not very helpful. But really, the second half of the paragraph sounds like inside baseball. Maybe shorten to

This API is available in WebView2 and displays a default picker which may not be appropriate for all apps.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Checking my understanding ... This API isn't about enabling the host to make the experience more seamless (replacing default strings and such), it's just about the ability to cancel?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Next time we'll separate out back story and future plans from rest of background.

the ability to customize the UI itself.

These apps also expect that the screen capture dialog has an event before the UI is shown to give
the host app an opportunity to block or allow UI from showing at all.

In this document we describe the updated API. We'd appreciate your feedback.

# Description

We propose introducing the `ScreenCaptureStarting` event. This event will be raised whenever
the WebView2 and/or iframe corresponding to the CoreWebView2Frame or any of its descendant iframes
requests permission to use the Screen Capture API before the UI is shown.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

From the name "Requested", I was expecting that this was asking the host to do something. E.g. BasicAuthenticationRequested is asking the host for a response. Really, I think this event is saying "I'm about to start the screen capture process, this is your chance to reject it"? That suggests a name more like ScreenCaptureStarting.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Also, should we more align the name with getDisplayMedia?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let's rename this to align with the other ..Starting events

  • ScreenCaptureStarting


For convenience of the end developer, by default we plan to raise
`ScreenCaptureStarting` on both `CoreWebView2Frame` and `CoreWebView2`. The
`CoreWebView2Frame` event handlers will be invoked first,
before the `CoreWebView2` event handlers. If `Handled` is set true as part of
the `CoreWebView2Frame` event handlers, then the `ScreenCaptureStarting` event
will not be raised on the `CoreWebView2`, and its event handlers will not be
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I assume that if I set args.Cancel=True for CoreWebView2Frame.ScreenCaptureRequested but don't mark it as handled, when CoreWebView2.ScreenCaptureRequested is raised, the args.Cancel property will return true?

If so, it would be worth explicitly stating so in the doc.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Explicitly note in docs: if you don't mark handled true then the same event args (with modifications) bubble up to the next event handler on the corewebview2

invoked.

If `Handled` is not set as true as part of the `CoreWebView2Frame` event
handlers, the same event args with modifications bubble up to the next
event handler on the `CoreWebView2`. In this case, the `Handled` property
will do nothing.

In the case of a nested iframe requesting permission, we will raise the event
off of the top level iframe.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nested frames are weird in the WebView2 API surface.

  • Sometimes actions related to nested frames result in the event being raised on the nested frame. (DOMContentLoaded, navigation events, WebMessageReceived.)
  • Sometimes actions related to nested frames result in the event being raised on the top level frame + the root document. (PermissionRequested, ScreenCaptureRequested.)

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Dave: Follow up with how we want events to behave wrt frames in the future. Plus public document on event patterns in WebView2.


# Examples
## C++: Registering Screen Started Handler on CoreWebView2
``` cpp
wil::com_ptr<ICoreWebView2> m_webview;
EventRegistrationToken m_ScreenCaptureStartingToken = {};
auto webview2_20 = m_webView.try_query<ICoreWebView2_20>();
if (webview2_20) {
webview2_20->add_ScreenCaptureStarting(
Callback<ICoreWebView2ScreenCaptureStartingEventHandler>(
[this](ICoreWebView2* sender, ICoreWebView2ScreenCaptureStartingEventArgs* args)
-> HRESULT
{
// Get Frame Info
wil::com_ptr<ICoreWebView2FrameInfo> frameInfo;
CHECK_FAILURE(args->get_OriginalSourceFrameInfo(&frameInfo));

// Frame Source
wil::unique_cotaskmem_string frameSource;
CHECK_FAILURE(frameInfo->get_Source(&frameSource));

// If the host app wants to cancel the request for a specific source
static const PCWSTR url_to_cancel = L"developer.microsoft.com";
wil::unique_bstr domain = GetDomainOfUri(frameSource.get());
const wchar_t *domains = domain.get();
if (wcscmp(url_to_cancel, domains) == 0) {
CHECK_FAILURE(args->put_Cancel(TRUE));
}

return S_OK;
})
.Get(),
&m_ScreenCaptureStartingToken);
}

```
## C++: Registering Screen Started Handler on CoreWebView2Frame
``` cpp
wil::com_ptr<ICoreWebView2> m_webview;
auto webview4 = m_webview.try_query<ICoreWebView2_4>();
if (webview4)
{
EventRegistrationToken m_frameCreatedToken = {};
EventRegistrationToken m_ScreenCaptureStartingToken = {};

CHECK_FAILURE(webview4->add_FrameCreated(
Callback<ICoreWebView2FrameCreatedEventHandler>(
[this](ICoreWebView2* sender, ICoreWebView2FrameCreatedEventArgs* args)
-> HRESULT
{
wil::com_ptr<ICoreWebView2Frame> webviewFrame;
CHECK_FAILURE(args->get_Frame(&webviewFrame));

auto webviewFrame3 = webviewFrame.try_query<ICoreWebView2Frame3>();
if (webviewFrame3)
{
CHECK_FAILURE(webviewFrame3->add_ScreenCaptureStarting(
Callback<ICoreWebView3FrameScreenCaptureStartingEventHandler>(
[this](ICoreWebView3Frame* sender,
ICoreWebView2ScreenCaptureStartingEventArgs* args) -> HRESULT
{

// Get Frame Info
wil::com_ptr<ICoreWebView2FrameInfo> frameInfo;
CHECK_FAILURE(args->get_OriginalSourceFrameInfo(&frameInfo));

// Frame Source
wil::unique_cotaskmem_string frameSource;
CHECK_FAILURE(frameInfo->get_Source(&frameSource));

// If the host app wants to cancel the request for a specific source
static const PCWSTR url_to_cancel = L"developer.microsoft.com";
wil::unique_bstr domain = GetDomainOfUri(frameSource.get());
const wchar_t *domains = domain.get();
if (wcscmp(url_to_cancel, domains) == 0) {
CHECK_FAILURE(args->put_Cancel(TRUE));
}

// Let CoreWebView2 handler know the event is already handled

// In the case of an iframe requesting permission to use Screen Capture, the default
// behavior is to first raise the ScreenCaptureStarting event off of the
// CoreWebView2Frame and invoke it's handlers, and then raise the event off the
// CoreWebView2 and invoke it's handlers. However, If we set Handled to true on the
// CoreWebView2Frame event handler, then we will not raise the
// ScreenCaptureStarting event off the CoreWebView2.

CHECK_FAILURE(args->put_Handled(true));
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

After we set Handled = true, what does it mean when Cancel is left as its default value of false?

Does it mean "Don't cancel, show default UI"?

Does it mean "Don't cancel, allow silent capture without any prompt"?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Does it mean "Don't cancel, show default UI"?

This is what happens

return S_OK;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What happens if I set Cancel = true but don't set Handled? Does the next handler observe Cancel = true? Or does each handler get a fresh Cancel = false?

What happens if nobody sets Handled = true, but somebody sets Cancel = true? Does that cancel? Or do you have to Handle the event in order to Cancel it?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Discussed above

})
.Get(),
&m_ScreenCaptureStartingToken));
}
return S_OK;
}).Get(),
&m_FrameCreatedToken));
}
```

## C#: Registering Screen Capture Started Handler
```c#
private WebView2 webView;
webView.CoreWebView2.ScreenCaptureStarting += (sender, screenCaptureArgs) =>
{
// Get Frame Info
CoreWebView2FrameInfo frameInfo;
frameInfo = screenCaptureArgs.OriginalSourceFrameInfo

// Frame Source
string frameSource;
frameSource = frameInfo.Source;

// If the host app wants to cancel the request from a specific frame
if (new Uri(frameSource).Host == "developer.microsoft.com")
{
screenCaptureArgs.Cancel = true;
}
}
```


## C#: Registering IFrame Screen Capture Started Handler
```c#
private WebView2 webView;

webView.CoreWebView2.FrameCreated += (sender, frameCreatedArgs) =>
{
// Checking for runtime support of CoreWebView2Frame.ScreenCaptureStarting
try
{
frameCreatedArgs.Frame.ScreenCaptureStarting += (frameSender, screenCaptureArgs) =>
{
// Get Frame Info
CoreWebView2FrameInfo frameInfo;
frameInfo = screenCaptureArgs.OriginalSourceFrameInfo

// Frame Source
string frameSource;
frameSource = frameInfo.Source;

// If the host app wants to cancel the request from a specific source
if (new Uri(frameSource).Host == "developer.microsoft.com")
{
screenCaptureArgs.Cancel = true;
}

// Let CoreWebView2 handler know the event is already handled

// In the case of an iframe requesting permission to use Screen Capture, the default
// behavior is to first raise the ScreenCaptureStarting event off of the
// CoreWebView2Frame and invoke it's handlers, and then raise the event off the
// CoreWebView2 and invoke it's handlers. However, If we set Handled to true on the
// CoreWebView2Frame event handler, then we will not raise the
// ScreenCaptureStarting event off the CoreWebView2.
//
// NotImplementedException could be thrown if underlying runtime did not
// implement Handled. However, we only run this code after checking if
// CoreWebView2Frame.ScreenCaptureStarting exists, and both exist together,
// so it would not be a problem.
args.Handled = true;
};
}
catch (NotImplementedException exception)
{
// If the runtime support is not there we probably want this
// to be a no-op.
}
};
```

# API Details
## C++
```
interface ICoreWebView2_20;
interface ICoreWebView2ScreenCaptureStartingEventArgs;
interface ICoreWebView2ScreenCaptureStartingEventHandler;

interface ICoreWebView2Frame3;
interface ICoreWebView2FrameScreenCaptureStartingEventHandler;
interface ICoreWebView2FrameScreenCaptureStartingEventArgs;

/// This interface is an extension of `ICoreWebView2` that supports the ScreenCaptureStarting event.
// MSOWNERS: stevenwei@microsoft.com
[uuid(accc0e97-fa2d-4a8d-ad61-cc9ae57a1825), object, pointer_default(unique)]
interface ICoreWebView2_20 : IUnknown {
/// Add an event handler for the `ScreenCaptureStarting` event.
/// `ScreenCaptureStarting` event is raised when the Screen Capture API is requested by the user using getDisplayMedia().
HRESULT add_ScreenCaptureStarting(
[in] ICoreWebView2ScreenCaptureStartingEventHandler* eventHandler,
[out] EventRegistrationToken* token);
/// Remove an event handler previously added with `add_ScreenCaptureStarting`.
// MSOWNERS: stevenwei@microsoft.com
HRESULT remove_ScreenCaptureStarting(
[in] EventRegistrationToken token);
}
/// Receives `ScreenCaptureStarting` events.
// MSOWNERS: stevenwei@microsoft.com
[uuid(9b5bbea1-4a58-4567-8b42-8781d3986cb4), object, pointer_default(unique)]
interface ICoreWebView2ScreenCaptureStartingEventHandler : IUnknown {
/// Called to provide the event args when a screen capture is requested on
/// a WebView element.
HRESULT Invoke(
[in] ICoreWebView2* sender,
[in] ICoreWebView2ScreenCaptureStartingEventArgs* args);
}
/// Event args for the `ScreenCaptureStarting` event.
// MSOWNERS: stevenwei@microsoft.com
[uuid(a1d309ee-c03f-11eb-8529-0242ac130003), object, pointer_default(unique)]
interface ICoreWebView2ScreenCaptureStartingEventArgs : IUnknown {
/// The associated frame information that requests the screen capture
/// permission. This can be used to grab the frame source, name, frameId,
/// and parent frame information.
[propget] HRESULT OriginalSourceFrameInfo([out, retval] ICoreWebView2FrameInfo**
frameInfo);
/// By default, both the `ScreenCaptureStarting` event handlers on the
/// `CoreWebView2Frame` and the `CoreWebView2` will be invoked, with the
/// `CoreWebView2Frame` event handlers invoked first. The host may
/// set this flag to `TRUE` within the `CoreWebView2Frame` event handlers
/// to prevent the remaining `CoreWebView2` event handlers from being
/// invoked. If the flag is set to `FALSE` within the `CoreWebView2Frame`
/// event handlers, downstream handlers can update the `Cancel` property.
///
/// If a deferral is taken on the event args, then you must synchronously
/// set `Handled` to TRUE prior to taking your deferral to prevent the
/// `CoreWebView2`s event handlers from being invoked.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do we have this pattern anywhere else? Generally when you take a deferral, you can set results later.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Leave this as is. We do have this pattern elsewhere in WebView2.

Dave: Include this in how event bubbling should work

[propget] HRESULT Handled([out, retval] BOOL* handled);
/// Sets the `Handled` property.
[propput] HRESULT Handled([in] BOOL handled);
/// The host may set this flag to cancel the screen capture. If canceled,
/// the screen capture UI is not displayed regardless of the
/// `Handled` property.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

(Although if you don't set Handled, someone downstream might clear your Cancel)

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes. Please add this to the documentation.

/// On the script side, it will return with a NotAllowedError as Permission denied.
[propget] HRESULT Cancel([out, retval] BOOL* cancel);
/// Sets the `Cancel` property.
[propput] HRESULT Cancel([in] BOOL cancel);
/// Returns an `ICoreWebView2Deferral` object. Use this deferral to
/// defer the decision to show the Screen Capture UI.
///
/// Returns an `ICoreWebView2Deferral` object.
HRESULT GetDeferral([out, retval] ICoreWebView2Deferral** deferral);
}
/// This is an extension of the ICoreWebView2Frame interface that supports ScreenCaptureStarting
// MSOWNERS: stevenwei@microsoft.com
[uuid(12885cda-9caa-4793-9c38-f15827dbab1f), object, pointer_default(unique)]
interface ICoreWebView2Frame3 : IUnknown {
/// Add an event handler for the `ScreenCaptureStarting` event.
/// `ScreenCaptureStarting is raised when content in an iframe or any of its
/// descendant iframes requests permission to use the Screen Capture
/// API from getDisplayMedia()
///
/// This relates to the `ScreenCaptureStarting` event on the
/// CoreWebView2`.
/// Both these events will be raised in the case of an iframe requesting
/// permission. The `CoreWebView2Frame`'s event handlers will be invoked
/// before the event handlers on the `CoreWebView2`. If the `Handled`
/// property of the `ScreenCaptureStartingEventArgs` is set to TRUE
/// within the`CoreWebView2Frame` event handler, then the event will not
/// be raised on the `CoreWebView2`, and its event handlers will not be
/// invoked.
///
HRESULT add_ScreenCaptureStarting(
[in] ICoreWebView2FrameScreenCaptureStartingEventHandler* handler,
[out] EventRegistrationToken* token);

/// Remove an event handler previously added with
/// `add_ScreenCaptureStarting`
HRESULT remove_ScreenCaptureStarting(
[in] EventRegistrationToken token);
}

/// Receives `ScreenCaptureStarting` events for iframes.
// MSOWNERS: stevenwei@microsoft.com
[uuid(c07ac75c-2105-4bb8-9c57-21b6ed8fb381), object, pointer_default(unique)]
interface ICoreWebView2FrameScreenCaptureStartingEventHandler : IUnknown {
/// Provides the event args for the corresponding event.
HRESULT Invoke(
[in] ICoreWebView2Frame* sender,
[in] ICoreWebView2ScreenCaptureStartingEventArgs * args);
}

```

## C#
```c#
namespace Microsoft.Web.WebView2.Core
{
runtimeclass CoreWebView2ScreenCaptureStartingEventArgs
{

[interface_name("Microsoft.Web.WebView2.Core.ICoreWebView2ScreenCaptureStartingEventArgs")]
{
CoreWebView2FrameInfo OriginalSourceFrameInfo { get; };
Boolean Cancel { get; set; };
Boolean Handled { get; set; };
}

}

runtimeclass CoreWebView2
{
// ...
event Windows.Foundation.TypedEventHandler<CoreWebView2,
CoreWebView2ScreenCaptureStartingEventArgs> ScreenCaptureStarting;
};

runtimeclass CoreWebView2Frame
{
// ...
// ICoreWebView2Frame3 members
event Windows.Foundation.TypedEventHandler<CoreWebView2Frame,
CoreWebView2ScreenCaptureStartingEventArgs> ScreenCaptureStarting;
}

}
```

# UI Changes

Having the column for specific tabs/WV2s doesn’t make sense in the vast majority of cases, so we
will remove that column entirely. Apps that want to offer the ability to select a specific Tab/WV2
will need to use the full API when we have it available to construct their own UI.


Next, the URL of the WV2 is used in a handful of locations that we should replace by default:

“Choose what to share with <url>” in the main dialog.

“<url> is sharing a window” in the sharing bar when sharing a window.

All of these should be replaced with “this app”.


When the sharing bar is open, the icon for it is a WV2 icon, not the host app’s icon. We should
use the host app’s icon (or no icon?)