[Solved] Optimize nested sql query

How to optimize this SQL query?

In case you have your own slow SQL query, you can optimize it automatically here.

For the query above, the following recommendations will be helpful as part of the SQL tuning process.
You'll find 3 sections below:

  1. Description of the steps you can take to speed up the query.
  2. The optimal indexes for this query, which you can copy and create in your database.
  3. An automatically re-written query you can copy and execute in your database.
The optimization process and recommendations:
  1. Avoid Selecting Unnecessary Columns (query line: 2): Avoid selecting all columns with the '*' wildcard, unless you intend to use them all. Selecting redundant columns may result in unnecessary performance degradation.
  2. Avoid Subselect When Selecting MAX/MIN Per Group (query line: 7): Constant subquery results are usually not cached by the database, especially in non-recent database versions. Therefore, a constant subquery in a WHERE clause will be fully evaluated for every row the WHERE clause will examine, which can significantly impact query performance. Use the method mentioned in the example instead.
  3. Create Optimal Indexes (modified query below): The recommended indexes are an integral part of this optimization effort and should be created before testing the execution duration of the optimized query.
Optimal indexes for this query:
CREATE INDEX event_idx_id_state_sub_id ON "event" ("id","state","sub_id");
CREATE INDEX event_idx_sub_id_state ON "event" ("sub_id","state");
The optimized query:
SELECT
        * 
    FROM
        event AS event1 
    LEFT JOIN
        event AS event2 
            ON (
                event1.sub_id = event2.sub_id 
                AND event2.state = 'PENDING' 
                AND event2.il_flag = event2.true
            ) 
            AND (
                event1.id < event2.id
            ) 
    WHERE
        (
            1 = 1 
            AND NOT EXISTS (
                SELECT
                    se2.id 
                FROM
                    event se2 
                WHERE
                    se1.sub_id = se2.sub_id 
                    AND se2.state IN (
                        'ACCEPTED', 'FAILED', 'INPROCESS'
                    )
            )
        ) 
        AND (
            event2.id IS NULL
        )

Related Articles



* original question posted on StackOverflow here.