sum(float) != sum(float)

Question

sum(float) != sum(float)

colin.Leversuch-Roberts

SSC Guru

Points: 52643
More actions
October 15, 2007 at 7:46 am

#73773

I've moved a databse from prod to dev and a calculation on a sum of a flaot column gives a different value. There are approx 16k rows and here are the results ( so you can see the difference )
-648365.80999999424
-648365.81000000483
The data is identical as I've checked it, assuming data compare can be trusted and the dev db is a restored backup from production.
I seem to remember something about the float datatype from way back when, which is why i don't personally use it, however this is a client database. collation etc. is the same - anyone any ideas?
[font="Comic Sans MS"]The GrumpyOldDBA[/font]
www.grumpyolddba.co.uk
http://sqlblogcasts.com/blogs/grumpyolddba/

Viewing 15 posts - 1 through 15 (of 18 total)

You must be logged in to reply to this topic. Login to reply

Matt Miller (4) SSC Guru Points: 124210 More actions · Answer 1

you have to take into account the number of "significant positions", meaning - the precision. The purpose of FLOAT is to store "approximate numbers", so some of the display digits might not be within the precision (that's actually usually a given).

Looks to me that you're summing Float(24) numbers (which have a 7-digit precision). Note that the precision has nothing to do with the decimal point - that's ALL digits).

Take a look here - BOL describes this a bit:

http://msdn2.microsoft.com/en-us/library/aa258876(SQL.80).aspx

----------------------------------------------------------------------------------
Your lack of planning does not constitute an emergency on my part...unless you're my manager...or a director and above...or a really loud-spoken end-user..All right - what was my emergency again?

Lowell SSC Guru Points: 323518 More actions · Answer 2

can you change the comparison to a money or decimal type ?

SELECT CONVERT(MONEY,-648365.80999999424)AS P1,

CONVERT(MONEY,-648365.81000000483) AS P2

SELECT CONVERT(DECIMAL(10,4),-648365.80999999424)AS P1,

CONVERT(DECIMAL(10,4),-648365.81000000483) AS P2

P1 P2

--------------------- ---------------------

-648365.8100 -648365.8100

Lowell

--help us help you! If you post a question, make sure you include a CREATE TABLE... statement and INSERT INTO... statement into that table to give the volunteers here representative data. with your description of the problem, we can provide a tested, verifiable solution to your question! asking the question the right way gets you a tested answer the fastest way possible!

colin.Leversuch-Roberts SSC Guru Points: 52643 More actions · Answer 3

I know all about this and as I say i wouldn't normally use a float datatype, the issue remains that the identical data gives different results ( consistantly ) on two different servers.

I seem to remember something about float accuracy relating to the cpu's but my searches couldn't unearth anything.

both sql server is 2000 ent sp4 but the cpu and o/s on one box is intel and x64 whilst the other is amd and x32

The "problem" is that you should be able to take a financial system ( database ) from one server to another without the figures changing - despite the small variance it's still not the same - my client is concerned!

[font="Comic Sans MS"]The GrumpyOldDBA[/font]
www.grumpyolddba.co.uk
http://sqlblogcasts.com/blogs/grumpyolddba/

SwePeso SSC-Dedicated Points: 39757 More actions · Answer 4

As other has said before me, FLOAT is just an approximation based on binary storage of the number.

See here http://www.sqlteam.com/forums/topic.asp?TOPIC_ID=81849

Can't you just change the datatype to DECIMAL(20, 12) ?

N 56°04'39.16"
E 12°55'05.25"

Kenneth Wilhelmsson SSC-Dedicated Points: 30043 More actions · Answer 5

The 'problem' with financial systems is that they all too often use floats.. And expect them to be 'reliable'. Wrong tool for the wrong purpose.

/Kenneth

Jeff Gray SSChampion Points: 10667 More actions · Answer 6

I've seen a financial system that used floats as surrogate primary keys. I'm not sure what was going on there, but I assume that the database was converted from some other SQL engine. At least I hope that's how it happened....

Colin, I would say that your sums are approximately equal. If you want to make sure that X = Y using the float datatype, you'll need to decide how close they have to be for them to be considered "equal".

I agree that it is counterintuitive that you would get different results from the same database on different hardware, but that is yet another reason not to use an imprecise data type for precise data.

Have you tried converting the floats to a decimal type and then performing the sums? Perhaps comparing the converted types row-by-row. It would be interesting to see where the error is showing up.

colin.Leversuch-Roberts SSC Guru Points: 52643 More actions · Answer 7

oh I do agree that float is a bad choice.

select sum( convert(numeric(38,28),xxxxxxx)) from dbo.yyyyyy

returns the same value from each server and if I physically convert the float column to numeric(38,28) the sum on each table returns the same result set.

-648365.8100000000199072000000000000

however it doesn't explain why moving a database to another server returns a different result - interesting but not the whole answer

[font="Comic Sans MS"]The GrumpyOldDBA[/font]
www.grumpyolddba.co.uk
http://sqlblogcasts.com/blogs/grumpyolddba/

Matt Miller (4) SSC Guru Points: 124210 More actions · Answer 8

I'm sorry to be hard-nosed about it - but it's a very clear-cut answer: because you're outside of the stated precision of the calculation/ data type. The only way floats/reals would be "accurate" would be if they could hold an unlimited amount of digits. Until they do - they're accurate to a certain precision, and the non-significant digits are NOT to be used.

The fact that you'd extend it that far past what the stated precision doesn't ultimately add anything to the conversation: those digits are non-significant and can't be used. Not SHOULDN'T, CAN'T. It's a common notation in science to include 1-2 non significant digits, so that you don't start introducing error into the significant digits; any more than that denotes a lack of understanding of the concept.

It's the same thing with datetime - it's precise within +/- 3.3 ms. I'm sure you could conjure up a calculation that needs ns, but it still doesn't change the fact that you wouldn't be able to accurately show it in a SQL datetime field. It's part of the definition, which is there for a reason.

If they need something other than that - they need to switch their data to money/decimal. I understand that their system design made some poor choices, but that doesn't change the fact that it will remain scientifically approximate until you switch the data type. No amount of display tricks will change that.

----------------------------------------------------------------------------------
Your lack of planning does not constitute an emergency on my part...unless you're my manager...or a director and above...or a really loud-spoken end-user..All right - what was my emergency again?

Jeff Gray SSChampion Points: 10667 More actions · Answer 9

Hi Colin,

Are there any differences in CPU, O/S, or SQL build between the two boxes?

Jeff Gray SSChampion Points: 10667 More actions · Answer 10

Colin,

One more question. Can you attempt the float sum again, but this time specify the ordering of the rows? I ask because an unconfirmed source (wikipedia) seems to indicate that addition is NOT associative.

Thus (a + b) + c = a + (b + c) is not always true for floating point numbers, if this article is true.

select ID,[some float column] from [your table]

order by ID

Compute sum([some float column])

Jeff Gray SSChampion Points: 10667 More actions · Answer 11

I just tried a little experiment myself:

create table foo (

ID int identity,

X float)

set nocount on

declare @ctr int

select @ctr = 0

while @ctr < 1000

BEGIN

insert foo (X)

select rand()

select @ctr= @ctr + 1

END

select X from foo

order by newid()

compute sum(X)

select X from foo

order by newid()

compute sum(X)

drop table foo

The compute clauses in the identical select statements return different results. This is apparently caused by the ordering of the data.

SwePeso SSC-Dedicated Points: 39757 More actions · Answer 12

Not for me.

I have tested on all these and your code gives exact same sum.

SQL Server Express 2005 9.0.3186

SQL Server Enterprise 2000 8.00.818

SQL Server Standard 2000 8.00.818

SQL Server Enterprise 2000 8.00.2039

N 56°04'39.16"
E 12°55'05.25"

Matt Miller (4) SSC Guru Points: 124210 More actions · Answer 13

Jeff - the differences you're seeing are past the precision scale (7th number), aren't they? I'll bet you even get a substantially more "precise" result if and when you declare it as a Float(53). Funny how float point arithmetic works EXACTLY like BOL says it would.....

----------------------------------------------------------------------------------
Your lack of planning does not constitute an emergency on my part...unless you're my manager...or a director and above...or a really loud-spoken end-user..All right - what was my emergency again?

Jeff Gray SSChampion Points: 10667 More actions · Answer 14

I'm not pointing out anything other than the fact that the order in which the rows are added changes the result, thus suggesting that addition of floats is not associative. I'm precision agnostic, just trying to help Colin explain the results he is getting.

I get different results every time I run the code.

sum(float) != sum(float)

Cookies on SQLServerCentral