[FEATURE] Search Optimization for Index Transform #1134

Open
sarthakaggarwal97 opened this issue Mar 18, 2024 · 1 comment
Labels
enhancement New request

Comments

@sarthakaggarwal97
Contributor

Is your feature request related to a problem?
Currently, whenever a transform job is executed, the search phase runs first, before any compute or indexing work is started.

The job scheduler runs the job at fixed intervals and starts it again once each interval is over. If the search phase takes a long time, possibly longer than the interval itself, the search continues across every restart of the transform job until all the checkpoints / buckets / documents have been visited.

With time series data, where the transform job cannot keep up with indexing into the source index, the job keeps searching without ever computing and indexing into the target index.

Since the queried data is loaded into memory, the node can hit circuit breaker exceptions, which cause the job to fail. Without circuit breakers, the node can run out of memory (OOM) as well.

What solution would you like?
This proposes a change to how the transform job is currently executed. Instead of waiting for the search phase to complete, we should compute the aggregations on the data as it is retrieved and index the results into the target index. This would let us release already-computed buckets, freeing up memory from time to time.
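
To make the idea concrete, here is a rough sketch in Python of what interleaving search with compute and indexing could look like. It is not the plugin's actual code: `make_fake_search`, `compute_document`, and `run_transform_incrementally` are hypothetical names, the source index is simulated with an in-memory paginated fetcher, and the target index is a plain list. The only point is that each page of buckets is computed and indexed before the next page is fetched, so at most one page is held in memory at a time.

```python
from typing import Callable, Dict, List, Optional, Tuple

Bucket = Dict[str, int]
# Hypothetical fetcher type: takes a cursor, returns (buckets, next_cursor).
PageFetcher = Callable[[Optional[int]], Tuple[List[Bucket], Optional[int]]]


def make_fake_search(total_buckets: int, page_size: int) -> PageFetcher:
    """Simulated paginated search over the source index (assumption, not a real API)."""
    def fetch(after: Optional[int]) -> Tuple[List[Bucket], Optional[int]]:
        start = 0 if after is None else after
        end = min(start + page_size, total_buckets)
        buckets = [{"key": i, "doc_count": i + 1} for i in range(start, end)]
        # A cursor of None signals that there are no more pages.
        next_cursor = end if end < total_buckets else None
        return buckets, next_cursor
    return fetch


def compute_document(bucket: Bucket) -> Dict[str, int]:
    """Placeholder for the aggregation step that turns a bucket into a target document."""
    return {"id": bucket["key"], "doc_count": bucket["doc_count"]}


def run_transform_incrementally(fetch_page: PageFetcher,
                                target_index: List[Dict[str, int]]) -> int:
    """Search, compute, and index one page at a time.

    Returns the largest number of buckets held in memory at once.
    """
    cursor: Optional[int] = None
    peak_buckets_in_memory = 0
    while True:
        buckets, cursor = fetch_page(cursor)
        peak_buckets_in_memory = max(peak_buckets_in_memory, len(buckets))
        # Compute and index this page right away instead of accumulating it.
        target_index.extend(compute_document(b) for b in buckets)
        # Drop the reference so the page can be garbage collected.
        buckets = []
        if cursor is None:
            break
    return peak_buckets_in_memory
```

With 10 simulated buckets and a page size of 4, calling `run_transform_incrementally(make_fake_search(10, 4), target)` leaves 10 documents in `target` while never holding more than 4 buckets at once. In a real job the cursor would presumably also be persisted, so that a restart at the next scheduled interval resumes from the last indexed page rather than searching from the start again.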

@sarthakaggarwal97 sarthakaggarwal97 changed the title [FEATURE] Search Optimization for Transform Job [FEATURE] Search Optimization for Index Transform Mar 18, 2024
@dblock dblock removed the untriaged label Jun 17, 2024
@dblock
Member

dblock commented Jun 17, 2024

Catch All Triage - 1 2 3 4 5
