January 16, 2010 at 6:28 pm
Comments posted to this topic are about the item Caution with EXCEPT
January 17, 2010 at 11:58 pm
Very nice and informative article...
This is what we do in real life as well (Use of SELECT *). Even though we hear from everybody that to avoid SELECT * always.
So, if we make practice of writing SELECT col1, col2, col3,.... then we would not get that problem.
Enjoy.
January 18, 2010 at 4:48 am
Another issue with select * is that if you have identity keys and the tables are independently generated (as in deve and staging) and an identical data entry has separate "identities" on the different boxes select * will always return all the rows.
Instead of the Union I usually just
select * from A
except
select * from B
select * from B
except
select * from A
then you can it is easier to know where the "differences" originate.
<><
Livin' down on the cube farm. Left, left, then a right.
January 18, 2010 at 4:57 am
mohd.nizamuddin (1/17/2010)
Very nice and informative article...This is what we do in real life as well (Use of SELECT *). Even though we hear from everybody that to avoid SELECT * always.
So, if we make practice of writing SELECT col1, col2, col3,.... then we would not get that problem.
Enjoy.
Sorry for 2 responses. More thoughts occurred as the coffee kicked in.
I "always" π start to worry when someone used the phrase "always", there are always reasons why always is never appropriate. (Same goes with "never"). Sometimes it is just the thing to do.
When needing to "select col1, col2, etc." I use the Management Studio tactic of "Script table as->Select into->new query window" and then just hack out any columns I do not want. It "Always" works for me.
<><
Livin' down on the cube farm. Left, left, then a right.
January 18, 2010 at 5:25 am
Here's a better way:
SELECT 'IN A NOT B' location, a, b, c, d, ...
FROM tablea
EXCEPT
SELECT 'IN A NOT B' location, a, b, c, d, ...
FROM tableb
UNION ALL
SELECT 'IN B NOT A' location, a, b, c, d, ...
FROM tableb
EXCEPT
SELECT 'IN B NOT A' location, a, b, c, d, ...
FROM tablea
No parentheses this way.
January 18, 2010 at 6:12 am
Stephen Hirsch (1/18/2010)
Here's a better way:SELECT 'IN A NOT B' location, a, b, c, d, ...
FROM tablea
EXCEPT
SELECT 'IN A NOT B' location, a, b, c, d, ...
FROM tableb
UNION ALL
SELECT 'IN B NOT A' location, a, b, c, d, ...
FROM tableb
EXCEPT
SELECT 'IN B NOT A' location, a, b, c, d, ...
FROM tablea
No parentheses this way.
Wait a minute! If ever there was a case for parentheses, it's right here - what was the intent of the programmer? Parentheses make the intent self-documenting:
SELECT 'IN A NOT B' location, a, b, c, d, ...
FROM tablea
EXCEPT
(SELECT 'IN A NOT B' location, a, b, c, d, ...
FROM tableb
UNION ALL
SELECT 'IN B NOT A' location, a, b, c, d, ...
FROM tableb
EXCEPT
SELECT 'IN B NOT A' location, a, b, c, d, ...
FROM tablea)
Cheers
ChrisM
For fast, accurate and documented assistance in answering your questions, please read this article.
Understanding and using APPLY, (I) and (II) Paul White
Hidden RBAR: Triangular Joins / The "Numbers" or "Tally" Table: What it is and how it replaces a loop Jeff Moden
January 18, 2010 at 7:11 am
The programmer "intended" it to work the way the programmer thought they were programming it to work. π [this said in the universal sense]
<><
Livin' down on the cube farm. Left, left, then a right.
January 18, 2010 at 7:14 am
Tobar (1/18/2010)
The programmer "intended" it to work the way the programmer thought they were programming it to work. π [this said in the universal sense]
LOL you used to work there too!
For fast, accurate and documented assistance in answering your questions, please read this article.
Understanding and using APPLY, (I) and (II) Paul White
Hidden RBAR: Triangular Joins / The "Numbers" or "Tally" Table: What it is and how it replaces a loop Jeff Moden
January 18, 2010 at 7:39 am
We have seen poor performance with EXCEPT on large data sets.
January 18, 2010 at 8:05 am
Thanks for the article, I've never used EXCEPT because I didn't realize it existed.
On the point of poor performance with large datasets, how can we expect anything else? Probably too many columns to put in a covering index. SQL has to compare every column to every column, it's going to either table scan or bookmark lookup every row. Give your SQL Server a break! I suppose when I think about it, you could include a WHERE statement to break your large datasets in to smaller ones if appropriate to what you are looking for.
January 18, 2010 at 8:20 am
That's interesting
I benchmarked EXCEPT vs WHERE NOT IN (SELECT .....), and for my dataset, server, DB version etc. the performance results were equivalent, or biased towards EXCEPT.
Can you give more details about your scenario?
January 18, 2010 at 8:22 am
I find that an easier way to get the list of columns is to drag them from the object explorer in SSMS. If you expand the table object's columns list, you can then drag the "Columns" parent entry into a query window, to get the comma seperated list of columns.
This works for some other items too.
January 18, 2010 at 8:34 am
Bravo! What a great catch. You have my 5 star vote.
-Mike
January 18, 2010 at 9:53 am
jdurandt (1/18/2010)
That's interestingI benchmarked EXCEPT vs WHERE NOT IN (SELECT .....), and for my dataset, server, DB version etc. the performance results were equivalent, or biased towards EXCEPT.
Can you give more details about your scenario?
Can you post your benchmark code, please?
--Jeff Moden
Change is inevitable... Change for the better is not.
January 18, 2010 at 9:56 am
Nicely done, Stephen. Well written and straight to the point.
--Jeff Moden
Change is inevitable... Change for the better is not.
Viewing 15 posts - 1 through 15 (of 29 total)
You must be logged in to reply to this topic. Login to reply