October 11, 2004 at 10:46 am
I have a large table from which I need to delete a small set of rows every day. I cannot use TRUNCATE. Since the delete operation is logged, performance is poor, and it looks like I cannot disable logging. Is there a faster way to delete the rows?
Thanks.
October 11, 2004 at 11:48 am
Take a look at the indexes on the table. It is possible that SQL Server is scanning the entire table to find your small set of rows to delete, and that is what is causing the poor performance. You can use Query Analyzer to display the execution plan of your query to see what is really going on.
Aunt Kathi Data Platform MVP
Author of Expert T-SQL Window Functions
Simple-Talk Editor
October 11, 2004 at 1:31 pm
Assuming that the indexes are defined properly, is there a better way to delete the rows?
October 11, 2004 at 2:26 pm
How are you deleting them now? I think you are probably doing something like:
delete from myTable where myColumn = 'somevalue'
If there is an index on myColumn and SQL is using it, then I'm not sure you can get any better than that.
Aunt Kathi Data Platform MVP
Author of Expert T-SQL Window Functions
Simple-Talk Editor
October 11, 2004 at 4:14 pm
The only other thing I can come up with right now is to turn what might be a sequential process into a set-based process. If you are generating a few hundred individual DELETE statements, it may perform better to store the criteria somewhere, such as a temporary table, and then run a single DELETE where the values are IN the temp table. That, of course, is a lot easier if you have a single-column WHERE clause. If you have to match several key columns, you can use a join, although DELETEs with joins can get tricky. In any case, if you can make it a single set-based delete, it should require less index scanning.
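The staging-table idea above can be sketched as follows. This is a minimal illustration using Python's sqlite3 module rather than T-SQL (the thread's dialect), and the table and column names (`myTable`, `myColumn`, `toDelete`) are invented for the example:

```python
import sqlite3

# Hypothetical data: 1000 rows, myColumn cycling through ten values.
conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.execute("CREATE TABLE myTable (id INTEGER PRIMARY KEY, myColumn TEXT)")
cur.executemany("INSERT INTO myTable (id, myColumn) VALUES (?, ?)",
                [(i, f"value{i % 10}") for i in range(1000)])

# Stage the delete criteria in a temp table...
cur.execute("CREATE TEMP TABLE toDelete (myColumn TEXT)")
cur.executemany("INSERT INTO toDelete VALUES (?)", [("value3",), ("value7",)])

# ...then run ONE set-based DELETE instead of hundreds of single-row statements.
cur.execute("DELETE FROM myTable WHERE myColumn IN "
            "(SELECT myColumn FROM toDelete)")
conn.commit()

remaining = cur.execute("SELECT COUNT(*) FROM myTable").fetchone()[0]
print(remaining)  # 800: the two staged values matched 200 of the 1000 rows
```

In T-SQL the shape is the same: `DELETE FROM myTable WHERE myColumn IN (SELECT myColumn FROM #toDelete)`, giving the optimizer one statement to plan instead of many.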
Something else to consider is that every delete has to perform index maintenance: the more indexes, the more work.
Another consideration is fragmentation. If the table and/or indexes are highly fragmented (check with DBCC SHOWCONTIG), the delete will run less efficiently.
October 11, 2004 at 5:15 pm
Thanks Kathi and Aaron.
October 12, 2004 at 2:45 am
When it comes to WHERE clauses, I have found that the different comparison operators perform differently:
If you can avoid "not equal" operators, do so.
"MyDate BETWEEN @startDate AND @endDate" is more efficient than "WHERE MyDate >= @startDate AND MyDate <= @endDate".
LIKE is faster than WHERE clauses using LEFT, RIGHT, SUBSTRING, etc.
October 12, 2004 at 3:27 am
Some agreements and some disagreements
"Not equal" operators by their nature never use any indexes; they always force a table scan. That's why they should be avoided if possible.
There is no difference between "MyDate BETWEEN @startDate AND @endDate" and "WHERE MyDate >= @startDate AND MyDate <= @endDate".
BETWEEN is just shorthand for ">= AND <=", and the optimizer will convert WHERE col1 BETWEEN 'x' AND 'y' into WHERE col1 >= 'x' AND col1 <= 'y'.
LIKE is an operator; LEFT, RIGHT, SUBSTRING, etc. are functions. When you use a function on a column that has an index on it, the index can't be used - thus a function causes table scans instead of index seeks.
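The operator-versus-function distinction is easy to see in an execution plan. The sketch below uses SQLite's EXPLAIN QUERY PLAN as a stand-in for Query Analyzer (the thread is about SQL Server), with an invented table `t` and index `ix_name`; the same principle applies in T-SQL:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.execute("CREATE TABLE t (name TEXT)")
cur.execute("CREATE INDEX ix_name ON t (name)")
# SQLite-specific: lets the engine rewrite LIKE 'x%' as an index range seek.
cur.execute("PRAGMA case_sensitive_like = ON")

# LIKE with a literal prefix is sargable: the index on name can be used.
like_plan = cur.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM t WHERE name LIKE 'abc%'").fetchall()
# Wrapping the column in a function hides it from the optimizer: full scan.
fn_plan = cur.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM t "
    "WHERE substr(name, 1, 3) = 'abc'").fetchall()

print(like_plan[0][-1])  # e.g. a SEARCH ... USING ... INDEX step
print(fn_plan[0][-1])    # e.g. a SCAN step over the whole table
```

The T-SQL analogue is `WHERE name LIKE 'abc%'` (index seek) versus `WHERE LEFT(name, 3) = 'abc'` (scan).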
/Kenneth
October 12, 2004 at 3:35 am
Kenneth, try using BETWEEN on a column with a clustered index and then comparing the > and < method.
October 12, 2004 at 5:43 am
That's what I did. Both methods produce identical query plans, and both also produce the same search argument. Even though you write WHERE col1 BETWEEN 'x' AND 'y', the argument actually executed is WHERE col1 >= 'x' AND col1 <= 'y'.
AFAIK, there is no 'real' difference between the two but syntax - BETWEEN requires fewer keystrokes. Having said that, I'll admit that I've only done some very quick testing on this - there may indeed be 'odd' circumstances where the two produce different plans, but I don't know of any.
/Kenneth
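Kenneth's quick test is easy to reproduce. Here is an illustrative version using SQLite's EXPLAIN QUERY PLAN instead of Query Analyzer (the table `orders` and index `ix_date` are invented for the example):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.execute("CREATE TABLE orders (order_date TEXT)")
cur.execute("CREATE INDEX ix_date ON orders (order_date)")

# Same predicate written two ways, on an indexed column.
between_plan = cur.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM orders "
    "WHERE order_date BETWEEN '2004-01-01' AND '2004-01-31'").fetchall()
range_plan = cur.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM orders "
    "WHERE order_date >= '2004-01-01' AND order_date <= '2004-01-31'").fetchall()

print(between_plan == range_plan)  # True: the optimizer treats them identically
```

The optimizer normalizes BETWEEN into the >= / <= pair before planning, so the two forms cannot diverge here; whether some edge case exists in a given SQL Server version is, as Kenneth says, an open question.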
October 12, 2004 at 10:14 am
If you can fashion a delete that makes use of your clustered index, and run your deletes against small sets of data, perhaps on a single page, the impact to the rest of the table will be minimized. Running deletes against larger sets of dispersed records will have a significant impact on table access and performance.
I had an opportunity to implement a very tight clustered index (including CODE, day, month, and year fields). This reduced the affected record sets to a single page, and we were able to delete records with almost no impact on the table.
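The batching idea above can be sketched as a loop that deletes small, key-ordered slices until nothing matches. This is an illustration in Python/sqlite3 rather than T-SQL, and the table name, flag column, and batch size are all made up:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.execute("CREATE TABLE history (id INTEGER PRIMARY KEY, expired INTEGER)")
# Hypothetical data: the first 250 of 1000 rows are flagged for deletion.
cur.executemany("INSERT INTO history VALUES (?, ?)",
                [(i, 1 if i < 250 else 0) for i in range(1000)])

batch = 100  # keep each delete small and contiguous on the clustering key
deleted = 0
while True:
    cur.execute("DELETE FROM history WHERE id IN "
                "(SELECT id FROM history WHERE expired = 1 "
                " ORDER BY id LIMIT ?)", (batch,))
    if cur.rowcount == 0:
        break
    deleted += cur.rowcount
    conn.commit()  # commit between batches to release locks and cap log growth

print(deleted)  # 250: removed in batches of 100, 100, and 50
```

On SQL Server 2000 the equivalent loop is usually written with SET ROWCOUNT n before the DELETE and a WHILE @@ROWCOUNT > 0 test; the effect is the same, each pass touches only a narrow slice of the table.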
Good luck
dmb
October 12, 2004 at 1:22 pm
Thanks to all.