You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
They are not the same, at least not with the latest version:
Left is the one opened by crawlee, right is my regular chrome browser.
Keep in mind that what crawlee (or better say fingerprint-suite) generates for UA tries to be as close to a real UA as possible, so seeing something very similar (or even the same) is rather expected, not a bug.
crawlee@3.1.4, playwright@1.27.1
v3.1.4 is pretty old, you should always upgrade before you report anything. There might be fixes not just to crawlee, but to its transitive dependencies.
I believe session pools also (should) handle useragents, but I am not 100% on that.
Yes, that's what we want in the long run (we plan on calling that user pool instead), but the refactoring will be more complex, we'd like to have that for v4 (sometimes around the end of 2023 probably).
This discussion was converted from issue #1953 on June 20, 2023 09:44.
Heading
Bold
Italic
Quote
Code
Link
Numbered list
Unordered list
Task list
Attach files
Mention
Reference
Menu
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
Which package is this bug report for? If unsure which one to select, leave blank
@crawlee/playwright (PlaywrightCrawler)
Issue description
Create a basic Playwright project and visit this website: https://www.whatismybrowser.com/detect/what-is-my-user-agent/
The useragent will be the same as if you check within your actual browser.
Code sample
Package version
crawlee@3.1.4, playwright@1.27.1
Node.js version
v19.0.0
Operating system
Ubuntu
Apify platform
I have tested this on the
next
releaseNo response
Other context
Crawlee should set a useragent header to avoid detection, as confirmed here:
https://crawlee.dev/api/browser-crawler/interface/BrowserLaunchContext#userAgent
I believe session pools also (should) handle useragents, but I am not 100% on that.
Beta Was this translation helpful? Give feedback.
All reactions