What makes a SQL statement sargable?
Asked Answered
D

4

310

By definition (at least from what I've seen) sargable means that a query is capable of having the query engine optimize the execution plan that the query uses. I've tried looking up the answers, but there doesn't seem to be a lot on the subject matter. So the question is, what does or doesn't make an SQL query sargable? Any documentation would be greatly appreciated.

For reference: Sargable

Donetta answered 28/4, 2009 at 19:59 Comment(5)
+1 for "sargable". That's my word of the day for today. :-pFlytrap
I might also add to Adam's answer, that the mountains of information are in most cases extremely particular to each DB engine.Ppi
SARG = Search ARGument. Funny thing is: "SARG" in German means "Coffin", so I always have to smile when folks talk about SARGABLE - able to be put in a coffin? :-)Musso
sargability depends on your environment. MySQL's is documented here: dev.mysql.com/doc/refman/5.0/en/mysql-indexes.htmlTzong
Having free-text fields instead of "lookup tables" also goes against the spirit of making a query sargable. Users misspell stuff when entering free-text (e.g. town name), whereas lookup-tables force users to choose a correctly spelled entry. Well worth the slight extra trouble, because this can be properly indexed instead of using LIKE '%...%' in the predicate.Embarrass
H
313

The most common thing that will make a query non-sargable is to include a field inside a function in the where clause:

SELECT ... FROM ...
WHERE Year(myDate) = 2008

The SQL optimizer can't use an index on myDate, even if one exists. It will literally have to evaluate this function for every row of the table. Much better to use:

WHERE myDate >= '01-01-2008' AND myDate < '01-01-2009'

Some other examples:

Bad: Select ... WHERE isNull(FullName,'Ed Jones') = 'Ed Jones'
Fixed: Select ... WHERE ((FullName = 'Ed Jones') OR (FullName IS NULL))

Bad: Select ... WHERE SUBSTRING(DealerName,4) = 'Ford'
Fixed: Select ... WHERE DealerName Like 'Ford%'

Bad: Select ... WHERE DateDiff(mm,OrderDate,GetDate()) >= 30
Fixed: Select ... WHERE OrderDate < DateAdd(mm,-30,GetDate()) 
Hopehopeful answered 28/4, 2009 at 20:9 Comment(9)
Will including a function inside of GROUP BY cause a query to become non-sargable?Edessa
@MikeBantegui Just including a field in a GROUP BY won't necessarily make it non-sargeable, no. The right indexes will definitely help a GROUP BY query.Hopehopeful
Some database engines (Oracle, PostgreSQL) support indexes on expressions, dontcha know?Acidulent
Would an even better version of WHERE ((FullName = 'Ed Jones') OR (FullName IS NULL)) be SELECT... FROM ... WHERE FullName = 'Ed Jones' UNION SELECT...FROM...WHERE FullName IS NULL? I was once told by an optimisation guy that using OR in the where clause can unsarg queries..?Boresome
The UNION query would create temporary tables, which in fact may decrease overall performance for greater setsFlavin
@HighPlainsGrifter @Flavin I would recommend using explain analyze or similar to actually look at the query plans. Optimizers are pretty good at handling queries like that, so it's hard to say which will be faster in practice. There's also an element of non-determinism in there, as the query planner will give up trying to find a plan after a certain amount of time, and also base its decision off of table statistics. Always profile it to see!Mekka
@HighPlainsGrifter you should use a UNION ALL on that query - union has an implicit distinct, which makes a query much more expensive than it needs to be when you have to mutually exclusive datasetsLankton
@Hopehopeful In MSSQL 2016, there is no execution plan difference between Select ... WHERE isNull(FullName,'Ed Jones') = 'Ed Jones' and Select ... WHERE ((FullName = 'Ed Jones') OR (FullName IS NULL)). They both use the index on FullName and do an index seek.Confection
also beware if you need to convert a SQL condition yourself to make it "sargable" in most cases you have to add parenthesis consider this YEAR(myDate) = 2008 AND any_columns = 1 you would need to use (myDate >= '2008-01-01' AND myDate < '2009-01-01') AND any_columns = 1 to keep the SQL meaning the same.. But i believe SQL Server 2016+ has a pretty good "sargable" optimisation rewrite step in the optimizer which handle have most cases..Kelcey
A
97

Don't do this:

WHERE Field LIKE '%blah%'

That causes a table/index scan, because the LIKE value begins with a wildcard character.

Don't do this:

WHERE FUNCTION(Field) = 'BLAH'

That causes a table/index scan.

The database server will have to evaluate FUNCTION() against every row in the table and then compare it to 'BLAH'.

If possible, do it in reverse:

WHERE Field = INVERSE_FUNCTION('BLAH')

This will run INVERSE_FUNCTION() against the parameter once and will still allow use of the index.

Alicea answered 28/4, 2009 at 20:5 Comment(2)
Your suggestion with flipping the function would really only work when the function round-trips data (meaning that f(f(n)) = n).Cleaner
True. I considered adding INVERSE_FUNCTION but didn't want to be confusing. I'll change it.Alicea
J
12

In this answer I assume the database has sufficient covering indexes. There are enough questions about this topic.

A lot of the times the sargability of a query is determined by the tipping point of the related indexes. The tipping point defines the difference between seeking and scanning an index while joining one table or result set onto another. One seek is of course much faster than scanning a whole table, but when you have to seek a lot of rows, a scan could make more sense.

So among other things a SQL statement is more sargable when the optimizer expects the number of resulting rows of one table to be less than the tipping point of a possible index on the next table.

You can find a detailed post and example here.

Jugurtha answered 28/4, 2009 at 21:14 Comment(0)
B
5

For an operation to be considered sargable, it is not sufficient for it to just be able to use an existing index. In the example above, adding a function call against an indexed column in the where clause, would still most likely take some advantage of the defined index. It will "scan" aka retrieve all values from that column (index) and then eliminate the ones that do not match to the filter value provided. It is still not efficient enough for tables with high number of rows. What really defines sargability is the query ability to traverse the b-tree index using the binary search method that relies on half-set elimination for the sorted items array. In SQL, it would be displayed on the execution plan as a "index seek".

Beirut answered 9/6, 2017 at 21:8 Comment(0)

© 2022 - 2024 — McMap. All rights reserved.