Request fingerprinting parameters
The request fingerprinter class of scrapy-zyte-api generates request fingerprints for Zyte API requests based on the following Zyte API parameters:
url (
canonicalized
).For URLs that include a URL fragment, like
https://example.com#foo
, URL canonicalization keeps the URL fragment if the request may be a browser request.Request attribute parameters (httpRequestBody, httpRequestText, httpRequestMethod), except headers.
Equivalent httpRequestBody and httpRequestText values generate the same signature.
Output parameters (browserHtml, httpResponseBody, httpResponseHeaders, responseCookies, screenshot, automatic extraction outputs like product, and customAttributes).
Same for networkCapture, although it is not a proper output parameter (it needs to be combined with another browser rendering parameter to work).
Rendering option parameters (actions, device, javascript, screenshotOptions, viewport, and automatic extraction options like productOptions or customAttributesOptions).
-
When using the session management API, session pool IDs are treated the same as sessionContext.
tags.
The following Zyte API parameters are not taken into account for request fingerprinting by default:
Request header parameters (customHttpRequestHeaders, requestHeaders).
Request cookie parameters (cookieManagement, requestCookies).
-
When using the session management API, session initialization parameters are treated the same as sessionContextParameters.
Experimental parameters (experimental.*).