SQL Server IN vs = in query(for one value)

Asked 22/1, 2015 at 15:59 Answered 22/1, 2015 at 20:2

First of all, sorry for the vague topic name(I could not think of something better).

An SO answer got me to thinking on whether there really is any performance impact when I chuck = for an IN.

select 1 id, 'abc' name into #a union 
select 2, 'abcd' union 
select 3, 'abcde' 

select * from #a where id = 1 ------ Query 1
select * from #a where id in (1)------ Query 2

I checked the query plans for both the queries and found them to me same(Maybe the example was simplistic probably that's why).

But am I violating any performance best practice when using an IN? Sure there must be some overheads involved with that, as an IN can handle multiple items and even SELECTS. I know IN offers a lot more in terms of features, but for cases when there are multiple records for id = 1,

select * from #a where id = (SELECT id from #a where id = 1)------ Query 3
select * from #a where id in (SELECT id from #a where id = 1)------ Query 4

query3 would fail while query4 would work. Thus I would want to replace = with an IN in future in my code. But I want to know of any potential trade-offs.

EDIT I am just interested in learning how(differently?) the SQL engine treats these statements.

Natiha answered 22/1, 2015 at 15:59 Comment(4)

Usually you'd rewrite queries 3 and 4 with a join anyway, so you wouldn't need the IN or =. – Despoliation 22/1, 2015 at 16:14

Consider it just an example :) – Natiha 22/1, 2015 at 16:16

"=" operator would be faster than "IN". If you want to check use SET STATISTIC TIME IO ON – Sprinkling 22/1, 2015 at 16:48

Why is what I want to know. – Natiha 22/1, 2015 at 16:54

Ignoring the performance aspect of your question if I may, there is a design issue here. You should code to your intention so that when someone else reads your code they have a better chance of understanding what you were trying to achieve. Do you expect SELECT id from #a where id = 1 to return exactly one record? If so then use '='. If you expect it to return zero or more then '=' would be wrong.

Attenuate answered 22/1, 2015 at 16:14 Comment(2)

Agree. I had actually asked this question out of curiosity on whether the SQL Engine on a much lower level, actually treats an IN and a = differently. My question I must reiterate is purely to understand how the engine works. – Natiha 22/1, 2015 at 16:19

The performance side of this is much more complex. The '=' operator is the '=' operator, nice and predictable, but 'IN' could manifest itself in query plans in many different ways, depending on many different factors. – Attenuate 22/1, 2015 at 16:30

Index efficiency is the only consideration between "=" and "in" to my knowledge. So if you want to take advantage of a index, you would need to use an "=".

The "in" statement is nonsargable. "=" is sargable.

Query 3 and 4 would probably do additional table scans in the subquery, so you would lose efficiency compared to a join.

Guck answered 22/1, 2015 at 16:27 Comment(2)

I disagree. I created clustered index on id column and tested and saw that both the queries were using Index Seek operation. So yes, for query 4 too index IS used – Natiha 22/1, 2015 at 16:32

"in" and "=" are operators, it is incorrect to say that operators are sargable or not-sargable. Expressions are sargable or not, depending on many factors. – Attenuate 22/1, 2015 at 16:56

Generally IN will be broken up into multiple = predicates and those will be used to satisfy the query. When used on an index (assuming) like in your example the individual predicates will be used for the index seek operation. Try it yourself. Look at the seek predicates in the 'Index Seek' operation of your execution plan.

Nondisjunction answered 22/1, 2015 at 20:2 Comment(0)

Recommended topics

Hot tags