-
Notifications
You must be signed in to change notification settings - Fork 4.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
✨ Source Freshcaller : Migrate to Manifest-only #46472
base: master
Are you sure you want to change the base?
Conversation
…pe/freshcaller/migrate-manifest-only
The latest updates on your projects. Learn more about Vercel for Git ↗︎
|
/format-fix
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The CI says that incremental reads are broken and always return the same records. Probably filtering by updated at is broken.
…pe/freshcaller/migrate-manifest-only
…ub.com/airbytehq/airbyte into tope/freshcaller/migrate-manifest-only
@topefolorunso nope, that backfired. |
…pe/freshcaller/migrate-manifest-only
@natikgadzhi see your comment here #40618 (comment). Could something similar be happening here? |
…pe/freshcaller/migrate-manifest-only
EDIT: Originally thought the failure might be cased by the test stream only returning a single record. However, I ran regression tests and it does seem like incremental sync breaks with this update, as they show a difference of ~11000 to 30 records between target and control versions: https://github.com/airbytehq/airbyte/actions/runs/11412961965/job/31759820864 So my current guess is that something about incremental changed in the CDK and it's messing up the existing incremental configuration for this connector when we migrate to the new version. |
cleanup the conflicts, let's merge this |
…pe/freshcaller/migrate-manifest-only
/format-fix
|
@natikgadzhi Update: Time Travel in Incremental Sync The failing incremental test is due to time zone correction of the datetime parameter. The record pulled in the first sequential read has Temporary fix: Maintained the time zone received from the api when storing the state and cursor_value. Same time zone is used when making subsequent requests with the state. But what happens if there are different records returned with different time zones? More permanent fix would be to find a way to correct all incoming datetime from records to +0000 before storing them as state values |
Also worthy of note in this PR - I removed the incremental sync for calls stream because the endpoint is not configured properly for time filtering with same parameters as |
It's really inspiring to see such a deep dive into failure reasons. Thank you @topefolorunso! |
@natikgadzhi How do we proceed with this? |
Alright, so — everything related to calls should stay incremental if it was incremental before. The reason for this is the quality of the experience for our users — you'd assume they have a LOT of calls, and you don't want to increase sync time linearly with data volume growth. @brianjlai can you take a look and advice @topefolorunso on how to go about the cursor? If you have time for a loom, that would prob help us level up Tope so he can figure out situations like this in the future. |
…pe/freshcaller/migrate-manifest-only
/format-fix
|
Ideally cursor datetime timezone conversions should be automatic too =/ |
…pe/freshcaller/migrate-manifest-only
/format-fix
|
@natikgadzhi All green here - did a little improvisation with custom components. You should check it out and confirm it's a safe way to go. |
No description provided.