May 4, 2009 at 3:51 am
I did some testing myself using the test suite Jeff posted a few posts back. Thanks, it ran fine the first time... and on my machine (a pretty fast one) the timings indicate my I/O-less function is just as fast as the cross join method Jeff made.
In fact all methods are pretty close together; what it now comes down to is resource use. We should determine whether the table-less versions cap out on memory at some point. Again, I have plenty of memory, so it is hard for me to test as I can't restrict the amount of memory SQL Server can use on my setup; other people are working on it too. And we need to put each method into a set of different solutions to see how they affect query plans and overall efficiency. I think this is where the real differences will show up!
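For anyone who does have a box they can reconfigure, capping memory for a test like this can be done with the standard max server memory setting; the value below is only illustrative and should be reset afterwards.
exec sp_configure 'show advanced options', 1;
reconfigure;
exec sp_configure 'max server memory (MB)', 1024; -- illustrative cap, remember to reset it
reconfigure;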
Can someone else take a look at it as well?
And I am also interested to see how this function works on lesser equipment....calling Jeff here ;)...
May 4, 2009 at 8:29 am
Here is an update to my tally function; it now has slightly less overhead because two cross joins were eliminated. I did this by moving from a 16-row (4-bit) constant table to a 64-row (6-bit) one.
The result is a slight speed bump when using large numbers and less clutter when examining execution plans in code that uses the function!
-- Tally table function (24 bit, large enough for general purpose use)
--
create function dbo.tfnTally24bit( @max int ) returns table
as
return
(
with
nQ( N ) as
(
select 0 union all select 0 union all select 0 union all select 0 union all -- 4
select 0 union all select 0 union all select 0 union all select 0 union all -- 8
select 0 union all select 0 union all select 0 union all select 0 union all -- 12
select 0 union all select 0 union all select 0 union all select 0 union all -- 16
select 0 union all select 0 union all select 0 union all select 0 union all -- 20
select 0 union all select 0 union all select 0 union all select 0 union all -- 24
select 0 union all select 0 union all select 0 union all select 0 union all -- 28
select 0 union all select 0 union all select 0 union all select 0 union all -- 32
select 0 union all select 0 union all select 0 union all select 0 union all -- 36
select 0 union all select 0 union all select 0 union all select 0 union all -- 40
select 0 union all select 0 union all select 0 union all select 0 union all -- 44
select 0 union all select 0 union all select 0 union all select 0 union all -- 48
select 0 union all select 0 union all select 0 union all select 0 union all -- 52
select 0 union all select 0 union all select 0 union all select 0 union all -- 56
select 0 union all select 0 union all select 0 union all select 0 union all -- 60
select 0 union all select 0 union all select 0 union all select 0 -- 64
)
select top ( isnull( @max, 0 ) )
row_number() over ( order by anchor.constant ) as n
from
( select 0 ) as anchor( constant )
cross join nQ as n1 -- 64 ( 6 bit)
cross join nQ as n2 -- 4096 ( 12 bit)
cross join nQ as n3 -- 262144 ( 18 bit)
cross join nQ as n4 -- 16777216 ( 24 bit)
)
;
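A quick usage sketch (the 100 is just an example): the function returns the numbers 1 through the value you pass in.
select n
from dbo.tfnTally24bit( 100 );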
Enjoy
May 5, 2009 at 9:20 pm
Me again....
I modified the classic Tally solution for string splitting as found on the initial pages and made speed improvements.
This solution is unlikely to suffer from the predictable-data problem that the really fast solutions turned out to fall victim to.
This code is based on that of Jeff, which was then improved by Barry, and now it's my turn... grin.
I will explain in a bit, after you have inspected the code for yourself :).
create function dbo.fnSplitClassicTweak
(
@parameter varchar(Max)
, @Separator Varchar(64)
, @sectorsize int = 8 -- have this size closely match the expected separator-less blocks in the input
)
returns
@Items TABLE
(
ID INT identity(1,1) primary key clustered -- the element number
, item VARCHAR(8000) not null -- the split-out string element
, OffSet int not null -- the original offset (not entirely accurate if LEN(@Separator) > 1 because of the Replace())
)
as
begin
-- our separator character (convenient, doesn't affect performance)
declare @Sep char(1);
select @Sep = char( 10 );
--NOTE: we make the @Sep character LF so that we will automatically
-- parse out rogue LF-only line breaks.
--===== Add start and end separators to the parameter so we can handle
-- all the elements the same way.
-- Also replace the separator expression with our single separator
-- character so every separator has a width of 1.
select @Parameter = @Sep + Replace( @Parameter, @Separator, @Sep ) + @Sep;
insert into @Items( Offset, item )
select
(charpos.N + sector.N) + 1
, substring( @Parameter, (charpos.N + sector.N) + 1, charindex( @Sep, @Parameter, (charpos.N + sector.N) + 1) - (charpos.N + sector.N) - 1 )
from
(
select
@sectorsize * (s.n - 1) as N
from
dbo.tfnTally24bit( (datalength( @parameter ) + @sectorsize - 1) / @sectorsize ) as s
where
-- Notice how we identify sectors of interest
charindex( @Sep, substring( @parameter, 1 + (@sectorsize * (s.n - 1)), @sectorsize ) ) != 0
) as sector
cross join dbo.tfnTally24bit( @sectorsize ) as charpos
where
-- Notice how we find the separators within a sector of interest
(charpos.N + sector.N) < datalength( @Parameter ) and substring( @Parameter, (charpos.N + sector.N), 1 ) = @Sep
order by 1
;
return;
end
go
This is what I call an engineering solution... it addresses some of the drawbacks and adds a few minor tweaks, no radical changes.
Improvements
1. The tally function from my previous post (no I/O, and in all tests I have done so far it performs the best).
2. The result table now has an identity column, which makes it possible to dispose of the row_number() over construction, as an order by 1 on the character offset generates the IDs.
3. Added a clustered primary key in the result table.
4. There is now a configurable mechanism that splits the input string into evenly sized pieces (I call them sectors). You can configure the sector size so it best matches the typical distance between separators in your data.
Point 4 is by far the most significant improvement, because no longer is every character inspected by a costly substring operation. First all sectors are scanned using charindex; if a match is found, that sector becomes part of the derived table containing all sectors to be inspected in the classic way. So if your sector size is small enough that plenty of sectors contain no separator, yet still larger than 3 to 4 characters, you should save on string manipulation costs and thus gain speed.
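A quick usage sketch (the sample string and sector size are just an example, not from my tests):
select ID, item, OffSet
from dbo.fnSplitClassicTweak( 'alpha,beta,gamma,delta', ',', 8 );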
I hope to have given a new impulse and some new ideas for you guys to incorporate into your own solutions, be it string splitting or otherwise 🙂
Looking forward to some replies; it's getting lonely here, and this was a hard, sleepless night of work!
Cheers!
Special note to Jeff: I saved your tally.......it beats the cursor based function 🙂
May 5, 2009 at 9:32 pm
peter (5/4/2009)
Here is an update to my tally function, it has now slightly less overhead by eliminating two cross joins.
As you keenly observed before, all of these are now in the area of greased lightning and it comes down to which kind of resources you want to expend. Here's the same million row test on my "slow" box with your latest function...
[font="Courier New"]====================================================================================================
DBCC execution completed. If DBCC printed error messages, contact your system administrator.
DBCC execution completed. If DBCC printed error messages, contact your system administrator.
===== Matt Miller's Method =====
SQL Server Execution Times:
CPU time = 906 ms, elapsed time = 979 ms.
====================================================================================================
DBCC execution completed. If DBCC printed error messages, contact your system administrator.
DBCC execution completed. If DBCC printed error messages, contact your system administrator.
===== Itzek's Method =====
SQL Server Execution Times:
CPU time = 844 ms, elapsed time = 853 ms.
====================================================================================================
DBCC execution completed. If DBCC printed error messages, contact your system administrator.
DBCC execution completed. If DBCC printed error messages, contact your system administrator.
===== Jeff Moden's Method
SQL Server Execution Times:
CPU time = 0 ms, elapsed time = 1 ms.
Table 'syscolrdb'. Scan count 2, logical reads 98, physical reads 0, read-ahead reads 115, lob logical reads 0, lob physical reads 0, lob read-ahead reads 0.
Table 'syscolpars'. Scan count 1, logical reads 7, physical reads 1, read-ahead reads 16, lob logical reads 0, lob physical reads 0, lob read-ahead reads 0.
Table 'Worktable'. Scan count 0, logical reads 0, physical reads 0, read-ahead reads 0, lob logical reads 0, lob physical reads 0, lob read-ahead reads 0.
SQL Server Execution Times:
CPU time = 719 ms, elapsed time = 832 ms.
====================================================================================================
DBCC execution completed. If DBCC printed error messages, contact your system administrator.
DBCC execution completed. If DBCC printed error messages, contact your system administrator.
===== RBarryYoung's Method
Table 'syscolrdb'. Scan count 2, logical reads 22, physical reads 2, read-ahead reads 97, lob logical reads 0, lob physical reads 0, lob read-ahead reads 0.
Table 'syscolpars'. Scan count 2, logical reads 10, physical reads 1, read-ahead reads 47, lob logical reads 0, lob physical reads 0, lob read-ahead reads 0.
SQL Server Execution Times:
CPU time = 734 ms, elapsed time = 808 ms.
====================================================================================================
DBCC execution completed. If DBCC printed error messages, contact your system administrator.
DBCC execution completed. If DBCC printed error messages, contact your system administrator.
===== Combined Method
Table 'syscolrdb'. Scan count 2, logical reads 12, physical reads 0, read-ahead reads 92, lob logical reads 0, lob physical reads 0, lob read-ahead reads 0.
Table 'syscolpars'. Scan count 2, logical reads 14, physical reads 1, read-ahead reads 16, lob logical reads 0, lob physical reads 0, lob read-ahead reads 0.
SQL Server Execution Times:
CPU time = 703 ms, elapsed time = 865 ms.
====================================================================================================
DBCC execution completed. If DBCC printed error messages, contact your system administrator.
DBCC execution completed. If DBCC printed error messages, contact your system administrator.
[highlight="YELLOW"]===== Peter's Method
SQL Server Execution Times:
CPU time = 719 ms, elapsed time = 726 ms.[/highlight][/font]
--Jeff Moden
Change is inevitable... Change for the better is not.
May 5, 2009 at 9:41 pm
Thanks for the information Jeff...care to take a quick look at what your string splitter evolved into as well?
PS.
I still see room for improvement in the classic method... my strategy is to eliminate as many substring operations as possible, as that is the main killer!
In fact I already have a faster version ready, but am still trying to squeeze more out of it 🙂
First a slight tally update:
-- Tally table function (24 bit, large enough for general purpose use)
--
alter function dbo.tfnTally24bit( @max int ) returns table
as
return
(
with
nQ( N ) as
(
select 0 union all select 0 union all select 0 union all select 0 union all -- 4
select 0 union all select 0 union all select 0 union all select 0 union all -- 8
select 0 union all select 0 union all select 0 union all select 0 union all -- 12
select 0 union all select 0 union all select 0 union all select 0 union all -- 16
select 0 union all select 0 union all select 0 union all select 0 union all -- 20
select 0 union all select 0 union all select 0 union all select 0 union all -- 24
select 0 union all select 0 union all select 0 union all select 0 union all -- 28
select 0 union all select 0 union all select 0 union all select 0 union all -- 32
select 0 union all select 0 union all select 0 union all select 0 union all -- 36
select 0 union all select 0 union all select 0 union all select 0 union all -- 40
select 0 union all select 0 union all select 0 union all select 0 union all -- 44
select 0 union all select 0 union all select 0 union all select 0 union all -- 48
select 0 union all select 0 union all select 0 union all select 0 union all -- 52
select 0 union all select 0 union all select 0 union all select 0 union all -- 56
select 0 union all select 0 union all select 0 union all select 0 union all -- 60
select 0 union all select 0 union all select 0 union all select 0 -- 64
)
select top ( isnull( @max, 0 ) )
cast( row_number() over ( order by anchor.constant ) as int ) as n
, cast( row_number() over ( order by anchor.constant ) as tinyint ) as n8
, cast( row_number() over ( order by anchor.constant ) as smallint ) as n16
, row_number() over ( order by anchor.constant ) as n64
from
( select 0 ) as anchor( constant )
cross join nQ as n1 -- 64 ( 6 bit)
cross join nQ as n2 -- 4096 ( 12 bit)
cross join nQ as n3 -- 262144 ( 18 bit)
cross join nQ as n4 -- 16777216 ( 24 bit)
)
;
The cast to a 32 bit integer is slightly faster when done in the tally function instead of relying on numerous implicit casts in the consuming queries!
I also added a few columns of other integer sizes; only the ones you actually select will be generated. So far I have detected no overhead from this, and it makes selecting a certain integer type more convenient.
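A quick sketch of what I mean (the row count is just an example): reference only the column whose type you need and, at least in my tests so far, the other expressions add no overhead.
-- only the smallint column is referenced, so the consuming query needs no implicit casts
select t.n16
from dbo.tfnTally24bit( 10000 ) as t;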
May 9, 2009 at 6:03 pm
So, I'm interested to hear the conclusion of this matter. Peter seems to have picked up on a way to extract more out of the tally table method. Is this enough? And what about the scalability issues that existed with the CLR; have those been solved?
I'd like to hear Flo's final analysis on this or someone else who has been working this problem through.
I've got my popcorn ready.
May 9, 2009 at 7:57 pm
peter (5/5/2009)
Thanks for the information Jeff...care to take a quick look at what your string splitter evolved into as well?
Sorry Peter... I'm out of coffee and I'm not sure of what you speak. What is it that you'd like me to take a look at? Since this thread has gotten quite long, would you mind posting the splitter code you want me to look at?
Or, are you talking about the new Tally generator you just posted?
--Jeff Moden
Change is inevitable... Change for the better is not.
May 9, 2009 at 8:06 pm
Sorry, Peter... I got it. You were talking about the code in the previous post.
--Jeff Moden
Change is inevitable... Change for the better is not.
May 9, 2009 at 8:32 pm
Peter,
How do I determine what the optimal sector size in your splitter code should be?
--Jeff Moden
Change is inevitable... Change for the better is not.
May 9, 2009 at 9:31 pm
Phil Factor (5/1/2009)
Jeff,...but their developers seem unable to fix things so they work like they used to.
Ouch.
They're doing all they can to put in a better solution ASAP. The problem was that it became impossible to improve the browser-side 'prettifier' without a lot of effort (Javascript regex solutions can only do so much) and they are planning on using a vastly better server-side solution. They're working on it urgently at this very moment.
You can always use my Prettifier in the meantime!
Any word on this, Phil? Copying code from the code windows makes some really ugly stuff with extra line breaks, no indentation, and all those nasty things I posted on the test thread for the code windows.
--Jeff Moden
Change is inevitable... Change for the better is not.
May 9, 2009 at 10:16 pm
Peter,
Sorry man. I tested against 1000 lines of 800 comma-separated random numbers each in a VARCHAR(8000) environment... your code did the split into a table in a minute and 50 seconds. Many of the other solutions come in at less than 10 seconds for the same thing.
--Jeff Moden
Change is inevitable... Change for the better is not.
May 9, 2009 at 10:26 pm
Heh... oh my... :hehe: I believe I've found the secret as to why Cross Apply makes the Tally table solutions run so fast. It forces an unnatural ORDER BY, and the same thing can be accomplished in straight code either by using the correct ORDER BY in ROW_NUMBER or by a real ORDER BY in SQL Server 2000 instances.
Without the correct ORDER BY, either explicitly or by CROSS APPLY, it takes 1:37 to split 1000 rows of 800 highly randomized numbers into a result table.
With the correct ORDER BY, explicitly by either ROW_NUMBER or a real ORDER BY, or implicitly by Cross Apply, the same thing runs in about 10 seconds.
SQL Server makes some really bad execution plan decisions on Tally Table based splitters. In the example above, left to its own devices, it created some hidden RBAR to the tune of about 7.6 million internal rows. Adding one of the ORDER BY methods to force the order by the row number of the row in the source table and THEN by the Tally Table number removed all that hidden RBAR.
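For reference, here's a sketch (not the exact code I tested) of the Cross Apply shape I'm talking about; it assumes a Tally table and a source table with a RowNum key and a CSV column like the test table I build in the next post.
[font="Courier New"]DECLARE @delimiter CHAR(1)
SELECT @delimiter = ','
SELECT s.RowNum,
       ca.Item
  FROM dbo.JBMTest s
 CROSS APPLY
       (
        SELECT SUBSTRING(s.CSV, t.N +1, CHARINDEX(@delimiter, s.CSV, t.N +1) -t.N -1) AS Item
          FROM dbo.Tally t
         WHERE t.N < LEN(s.CSV)
           AND SUBSTRING(s.CSV, t.N, 1) = @delimiter
       ) ca[/font]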
I'll see if I can put a nice little test together so that folks can run it.
--Jeff Moden
Change is inevitable... Change for the better is not.
May 9, 2009 at 10:53 pm
Ok... first of all, the following code assumes that you have a Tally table with a count of at least 8000. I'm currently using a Tally table with 1 million rows in it. For those who want to play along, here's the code for that (everything is built in TempDB so no harm comes to any existing code)...
[font="Courier New"]--===== Do this in a nice safe place
USE TempDB
--===== Create and populate the Tally table on the fly
SELECT TOP 1000000
IDENTITY(INT,1,1) AS N
INTO dbo.Tally
FROM Master.dbo.SysColumns sc1,
Master.dbo.SysColumns sc2
--===== Add a Primary Key to maximize performance
ALTER TABLE dbo.Tally
ADD CONSTRAINT PK_Tally_N
PRIMARY KEY CLUSTERED (N) WITH FILLFACTOR = 100
--===== Allow the general public to use it
GRANT SELECT ON dbo.Tally TO PUBLIC[/font]
Here's the code I use to build the test cases. Read the comments in the header for how to use it. Currently, it's set to create 1000 rows of 800 randomized numbers for each row.
[font="Courier New"]--drop table jbmtest
/**************************************************************************************************
Purpose:
Generate highly randomized CSV rows of random numbers for CSV Splitter testing. It's virtually
impossible that this code will create any identical rows. It's not "Moby Dick",
but it does produce CSVs that are more likely to be seen in the real world.
I wrote this so that it's compatible with both SQL Server 2000 and 2005 so virtually anyone
can play along. The only thing that needs a manual change to run in 2k is the object of the TOP
statement in the actual table creation code below.
Notes:
1. The code is setup for VARCHAR(8000). To test for VARCHAR(MAX), do a Search'n'Replace to
change all VARCHAR(8000) to VARCHAR(MAX).
2. If you get the following error, you've selected too many elements for the size numbers and
have most likely exceeded the VARCHAR(8000). You simply need to reduce either the range of
numbers or the number of elements, or both. Of course, that probably won't happen with
VARCHAR(MAX) if you've changed to that.
Msg 8152, Level 16, State 14, Line 32
String or binary data would be truncated.
The statement has been terminated.
3. The settings in the code are currently set to make 800 elements with numbers ranging from
1 to 1,000,000,000 (1 to 10 characters) that pretty much fill up VARCHAR(8000).
4. Look for "Change the values here" in the code below.
-- Jeff Moden
**************************************************************************************************/
--===== Change to a safe place to "play"
USE TempDB
GO
--=================================================================================================
-- Conditionally drop the special objects that we'll need to create
-- (just to simplify reruns)
--=================================================================================================
IF OBJECT_ID('TempDB.dbo.RandomPositiveInt') IS NOT NULL
DROP VIEW dbo.RandomPositiveInt
IF OBJECT_ID('TempDB.dbo.RandomIntCSV') IS NOT NULL
DROP FUNCTION dbo.RandomIntCSV
GO
--=================================================================================================
--===== Create a view to return random positive integers starting at 0.
-- This is necessary because you cannot yet use NEWID() in a UDF even in 2k5.
--=================================================================================================
CREATE VIEW dbo.RandomPositiveInt AS
SELECT ABS(CHECKSUM(NEWID())) AS RandomPositiveInt
GO
--=================================================================================================
--===== Create a function to make predictable ranges of CSV'd numbers
--=================================================================================================
CREATE FUNCTION dbo.RandomIntCSV(@NumberOfElements INT, @Range INT, @Offset INT)
RETURNS VARCHAR(8000)
AS
BEGIN
DECLARE @ReturnCSV VARCHAR(8000)
SELECT @ReturnCSV = ISNULL(@ReturnCSV+',','')
+ CAST(rpi.RandomPositiveInt%@Range+@Offset AS VARCHAR(10))
FROM dbo.Tally t
CROSS JOIN dbo.RandomPositiveInt rpi
WHERE t.N <= @NumberOfElements
RETURN @ReturnCSV
END
GO
--=================================================================================================
-- This section of code creates the test table based on
--=================================================================================================
--===== Declare some control variables that we can change the values of to make the generation
-- of different size and width tables easy
DECLARE @NumRows INT, --Number of rows to create
@NumElements INT, --Per row
@ElementRange INT, --Number of numbers in the range
@ElementOffSet INT, --First number to appear in the range
@AddDelimiters CHAR(1) --If = 'Y', adds leading/trailing delimiters
--===== Change the values here
SELECT @NumRows = 1000,
@NumElements = 800,
@ElementRange = 1000000000,
@ElementOffSet = 1,
@AddDelimiters = 'Y'
--===== Conditionally drop the test table so we can rebuild it on reruns
IF OBJECT_ID('dbo.JBMTest') IS NOT NULL
DROP TABLE dbo.JBMTest
--===== Create and populate the test table on the fly
SELECT TOP (@NumRows) -- You will need to hard code the value here in 2k.
IDENTITY(INT,1,1) AS RowNum,
dbo.RandomIntCSV(@NumElements,@ElementRange,@ElementOffSet) AS CSV
INTO dbo.JBMTest
FROM dbo.Tally
--===== Conditionally add leading and trailing delimiters
IF @AddDelimiters = 'Y'
UPDATE dbo.JBMTest
SET CSV = ','+CSV+','
--===== Add a Primary Key
ALTER TABLE dbo.JBMTest
ADD CONSTRAINT PK_JBMTest_RowNum
PRIMARY KEY CLUSTERED (RowNum) WITH FILLFACTOR = 100
--===== Display the stats for the new test table just for convenience in testing
SELECT COUNT(*) AS MeasuredRowCount,
@NumElements AS DesignedNumElements,
CASE
WHEN @AddDelimiters = 'Y'
THEN MAX(LEN(CSV) - LEN(REPLACE(CSV,',','')) - 1)
ELSE MAX(LEN(CSV) - LEN(REPLACE(CSV,',','')) + 1)
END AS MeasuredNumElements,
@ElementOffSet AS DesignedElementOffset,
@ElementRange AS DesignedElementRange,
@AddDelimiters AS DelimitersAdded,
MIN(LEN(CSV)) AS MinCsvLength,
AVG(LEN(CSV)) AS AvgCsvLength,
MAX(LEN(CSV)) AS MaxCsvLength
FROM dbo.JBMTest
GO
[/font]
And here's the simple bit of experimenting I did with 2 different forms of explicit order by (we've already seen the Cross Apply run, so not included here)...
[font="Courier New"]--===== This takes a whopping big 1:37 on my box
DROP TABLE Result
DECLARE @delimiter CHAR(1)
SELECT @delimiter = ','
SELECT RowNum,
ROW_NUMBER() OVER (ORDER BY t.N) AS ID,
SUBSTRING(s.CSV, t.N +1, CHARINDEX(@delimiter, s.CSV, t.N +1) -t.N -1) AS Item
INTO dbo.Result
FROM dbo.Tally t
CROSS JOIN jbmtest s
WHERE t.N < LEN(s.CSV)
AND SUBSTRING(s.CSV, t.N, 1) = @delimiter
---------------------------------------------------------------------------------------
--===== This only takes 8 seconds
DROP TABLE Result
DECLARE @delimiter CHAR(1)
SELECT @delimiter = ','
SELECT RowNum,
ROW_NUMBER() OVER (ORDER BY s.RowNum,t.N) AS ID,
SUBSTRING(s.CSV, t.N +1, CHARINDEX(@delimiter, s.CSV, t.N +1) -t.N -1) AS Item
INTO dbo.Result
FROM dbo.Tally t
CROSS JOIN jbmtest s
WHERE t.N < LEN(s.CSV)
AND SUBSTRING(s.CSV, t.N, 1) = @delimiter
---------------------------------------------------------------------------------------
--===== This only takes 8 seconds, as well
DROP TABLE Result
DECLARE @delimiter CHAR(1)
SELECT @delimiter = ','
SELECT RowNum,
SUBSTRING(s.CSV, t.N +1, CHARINDEX(@delimiter, s.CSV, t.N +1) -t.N -1) AS Item
INTO dbo.Result
FROM dbo.Tally t
CROSS JOIN jbmtest s
WHERE t.N < LEN(s.CSV)
AND SUBSTRING(s.CSV, t.N, 1) = @delimiter
ORDER BY s.RowNum,t.n[/font]
By the way, I've done some testing against this same bit of highly randomized data and it seems that type of "real life" data really blows some of the other solutions out of the water. I won't post that testing here because I don't want to steal any of the thunder on Flo's article.
Heh... Long live the Tally table. 😛
--Jeff Moden
Change is inevitable... Change for the better is not.
May 9, 2009 at 11:55 pm
Hey Jeff,
Some initial comments:
The version that takes forever includes an index spool in the plan. The modifications you made to the other tally approaches remove that operator. The fastest versions run in a little over ten seconds on my laptop.
The CSV length of ~8000 characters precludes using the fastest CLR implementation (it has a limit of NVARCHAR(4000)).
The NVARCHAR(MAX) version of Adam's memory-friendly routine - which has to convert to and from Unicode as well as take the performance hit imposed by the MAX datatype - runs in just under seven seconds.
Long live the CLR solutions! 😀
Paul
May 10, 2009 at 8:20 am
I agree... the CLR solutions, especially that one, look pretty good.
--Jeff Moden
Change is inevitable... Change for the better is not.