Added Source-stripe-native #1546

Luishfs · 2024-05-02T18:50:15Z

Description:

Added Source-stripe-native.

Documentation links affected:

(list any documentation links that you created, or existing ones that you've identified as needing updates, along with a brief description)

Notes for reviewers:

Stop Date

This connector implements a feature called Stop Date. When backfilling, the connector will constantly check wether the given document is older than the Stop Date ( or the cutoff, by default ). If so, the connector will instantly stop backfilling that specific stream.

Issuing endpoints

This connector implements 3 streams that uses "Issuing" Endpoints: Authorizations, CardHolders, Transactions
Issuing Endpoint requires permission to be accessed, so we validate the permission before creating Issuing objects inside all_resources

This change is

Some streams models are not validating since the stripe docs was not up-to-date, will be fixing those in next commits Full tests still havents started, and this version is not stable

SetupAttempts streams to child streams

…cords and created new resources to better handle edge cases

Modified stop_date behaviour to only yield equal stop date values Modified stream names to new pattern Setting the PR ready to review

williamhbaker

I made some higher-level comments per @jonwihl's request. Many of these comments are general and are applicable to the code overall, as I have not commented on every single place where they apply. But overall this seems fairly reasonable, from what I can tell after a relatively cursory review.

williamhbaker · 2024-05-29T18:17:19Z

source-stripe-native/source_stripe_native/resources.py

+    # Checking if user has "Issuing" permissions
+    try:
+        #Using Authorizations stream for testing
+        url = f"https://api.stripe.com/v1/{Authorizations.SEARCH_NAME}"


Why would we do this, as opposed to requiring that the user has the required permissions? As far as I know we don't use this kind of adaptive discovery mechanism elsewhere, so I think we shouldn't here either unless there is some reason that I'm not aware of.

i believe this comment is more related to my bad writing than a concept itself.

"issuing" is a product inside Stripe, that is available on only a few countries and select accounts. As other services, you cannot allow/enable/disable usage throught a key like hubspot. Stripe is the one that allows your account to access such endpoints.
Which means that this checks if this feature is enabled by Stripe, and not if the user gave us permission to access it.
The text was changed here 0046721

For future reference, to test this, i created a US dev account, and was able to successfully create a sandbox enviroment.

williamhbaker · 2024-05-29T18:17:38Z

source-stripe-native/source_stripe_native/resources.py

+                 "Removing 'Issuing' streams")
+
+    return all_streams
+


Weird amount of extra whitespace here

Removed in 0046721

williamhbaker · 2024-05-29T18:29:53Z

source-stripe-native/source_stripe_native/models.py

+class EndpointConfig(BaseModel):
+    credentials: AccessToken = Field(
+        title="API Key"        )
+    stop_date: AwareDatetime = Field(


I think it's still correct to call this a start date. Stop date would imply that we'd get everything earlier than this date but nothing afterwards . The connector works backward chronologically through the API responses, but the end result is that we will get records on or after this date, which makes it a start date.

My idea to call it a stop_date and not start_date is to reference the real-time incremental sync method along with the backfilling.
Yes, we get records on and after the date, but not like other singer connectors. The run will start with incremental getting the newest data and backfilling getting the rest.
I know that conceptually that is basically the same thing and shouln't really matter, but it was a way i found to show that this connector behaves differently than a imported one, and that the user will get real-time data & data untill its stop date, and not the connector starting at the given date, untill it reaches "real-time" like others.
Else, it may seem that our native versions are simply other variations of the same connector

I also realize that users are accustomed to the start_date term, hence why i reference stop_date as a "similar to start_date" in its description

williamhbaker · 2024-05-29T18:30:57Z

source-stripe-native/source_stripe_native/models.py

+    data: List[Item]
+
+class BackfillResult(BaseModel, Generic[Item], extra="allow"):
+    # Set extra as allow since Refunds has one aditional field


Is Refunds the only one with an additional field? It would be better to accurately model the responses if possible, rather than allowing any extra fields through all the time, which could make things more difficult to debug.

in this specific case, what happens is that BackfillResult is a generic name for the defualt Stripe API response.
All data endpoints have the exact same response format shown. but Refunds has one aditional field called count if i remember correctly.

I figured some ways to fix this: Add the count field with a default 0 value to the BackfillResult model, create a new object, with a new fetch_backfill specific for Refunds or allow the model to be permissive with extra fields.

I went with allowing the model to be permissive since this extra response data is actually not used, and if stripe decides to add a new random field that specific backfilling would simply break. Else, since this data is not yielded, i believe the resulting model should not affect performance or schema generation that much.
But, if you still believe it is worth to change that, would adding the field with a default value be the best alternative?

williamhbaker · 2024-05-29T18:46:23Z