Grouping by ten days period

Question

Grouping by ten days period

dfarina

SSC Veteran

Points: 238
More actions
April 16, 2012 at 6:09 pm

#254835

Hi
I was asked to do a query that performs a grouping by ten days period and i don't know how to begin

Viewing 12 posts - 1 through 11 (of 11 total)

You must be logged in to reply to this topic. Login to reply

Lynn Pettis SSC Guru Points: 442467 More actions · Answer 1

If you provide the DDL (CREATE TABLE statement) for the table(s) involved, provide sample data (as a series of INSERT INTO statements) for the table(s) involved, the expected resutls of the query I am sure we could provide you with something that meets your requirements.

dfarina SSC Veteran Points: 238 More actions · Answer 2

sorry, here is the data

CREATE TABLE sample(id int identity(1,1), s_date datetime, amount int)

insert into sample

select '20100101', 1

union all

select '20100102', 1

union all

select '20100103', 1

union all

select '20100104', 1

union all

select '20100111', 1

union all

select '20100112', 1

union all

select '20100113', 1

union all

select '20100121', 1

union all

select '20100122', 1

union all

select '20100123', 1

Expected output

20100101 to 20100110 4

20100111 to 20100120 3

20100121 to 20100130 3

Lynn Pettis SSC Guru Points: 442467 More actions · Answer 3

dfarina (4/16/2012)
sorry, here is the data
CREATE TABLE sample(id int identity(1,1), s_date datetime, amount int)
insert into sample
select '20100101', 1
union all
select '20100102', 1
union all
select '20100103', 1
union all
select '20100104', 1
union all
select '20100111', 1
union all
select '20100112', 1
union all
select '20100113', 1
union all
select '20100121', 1
union all
select '20100122', 1
union all
select '20100123', 1
Expected output
20100101 to 20100110 4
20100111 to 20100120 3
20100121 to 20100130 3

What sets the start date and end date for the query or is it based on the earliest and latest dates in the table?

dfarina SSC Veteran Points: 238 More actions · Answer 4

dfarina

SSC Veteran

Points: 238

April 16, 2012 at 7:23 pm

#1474490

Just for the year 2010

brad.mason5 Hall of Fame Points: 3299 More actions · Answer 5

A possible solution. I am assuming every month is 31 days but an idea to help you get started.

WITH TenDates

AS

(

SELECT

s_date

, DayAdd =

CASE WHEN DAY(s_date) <= Increment

THEN Increment

ELSE NULL

END

,amount

FROM sample

CROSS JOIN (VALUES(10),(20),(31)) AS Calendar(Increment)

),TenDatesFinal

AS

(

SELECT s_date

,FromDate = MIN(s_date)

,ToDate = DATEADD(dd, MIN(DayAdd)-1,DATEADD(MONTH, DATEDIFF(MONTH,'19000101',s_date), '19000101'))

,Amount = MIN(Amount)

FROM TenDates

GROUP BY s_date

)

SELECT FromDate= MIN(FromDate)

,ToDate = MAX(ToDate)

,Amount = SUM(Amount)

FROM TenDatesFinal

GROUP BY ToDate;

Dwain Camps SSC Guru Points: 86978 More actions · Answer 6

DDL=Data Definition Language

That and the INSERT to set up your sample data looks like this:

DECLARE @sample TABLE (id int identity(1,1), s_date datetime, amount int)

insert into @sample

select '2010-01-01', 1

union all select '2010-01-02', 1

union all select '2010-01-03', 1

union all select '2010-01-04', 1

union all select '2010-01-11', 1

union all select '2010-01-12', 1

union all select '2010-01-13', 1

union all select '2010-01-21', 1

union all select '2010-01-22', 1

union all select '2010-01-23', 1

--union all select '2010-02-05', 1

--union all select '2010-02-15', 1

union all select '2010-02-23', 1

Here's a solution that will create the 10 day groups for only the groups where you have data.

DECLARE @startdate DATETIME

SELECT @startdate = DATEADD(year, DATEDIFF(year, 0, '2010-05-31'), 0)

;WITH Dates AS (

SELECT DATEADD(day, 10*(DATEDIFF(day, @startdate, s_date)/10), @startdate) as s_date

,DATEADD(day, 9+10*(DATEDIFF(day, @startdate, s_date)/10), @startdate) as e_date

,amount

FROM @sample)

SELECT s_date, e_date, SUM(amount) As amount

FROM Dates

GROUP BY s_date, e_date

I've used a little trick I learned to set the starting date for the period (@startdate) using any day in that particular year. If you don't need to start on the year boundary (01-Jan) just set @startdate = '2010-05-31' or whatever your start date is.

However as Mr. Celko has pointed out, if you aren't satisfied when there are gaps in the output not covered by your input data, you'll need to use either a Calendar or a Tally table to generate the missing 10 day periods.

Notice how a couple of 10 day periods are missing with the additional data I added. You can fill those in by uncommenting the two SELECTs in the set up data.

My mantra: No loops! No CURSORs! No RBAR! Hoo-uh![/I]
My thought question: Have you ever been told that your query runs too fast?

My advice:
INDEXing a poor-performing query is like putting sugar on cat food. Yeah, it probably tastes better but are you sure you want to eat it?
The path of least resistance can be a slippery slope. Take care that fixing your fixes of fixes doesn't snowball and end up costing you more than fixing the root cause would have in the first place.

Need to UNPIVOT? Why not CROSS APPLY VALUES instead?[/url]
Since random numbers are too important to be left to chance, let's generate some![/url]
Learn to understand recursive CTEs by example.[/url]
[url url=http://www.sqlservercentral.com/articles/St

Vedran Kesegic SSCrazy Eights Points: 9302 More actions · Answer 7

Simplest form:

select Decade = convert(VARCHAR(9), s.s_date, 120), Amount = sum(s.amount)

from dbo.sample s

group by convert(VARCHAR(9), s.s_date, 120) -- 120 = yyyy-mm-dd hh:mi:ss

You could use variation, e.g. month + decade, fill empty gaps etc but the trick is to group by formula.

_____________________________________________________
Microsoft Certified Master: SQL Server 2008
XDetails Addin - for SQL Developers
blog.sqlxdetails.com - Transaction log myths

dfarina SSC Veteran Points: 238 More actions · Answer 8

dfarina

SSC Veteran

Points: 238

April 18, 2012 at 4:20 pm

#1475774

Thanks to all of you!!!

Jeff Moden SSC Guru Points: 1004439 More actions · Answer 9

Vedran Kesegic (4/18/2012)
Simplest form:
select Decade = convert(VARCHAR(9), s.s_date, 120), Amount = sum(s.amount)
from dbo.sample s
group by convert(VARCHAR(9), s.s_date, 120) -- 120 = yyyy-mm-dd hh:mi:ss
You could use variation, e.g. month + decade, fill empty gaps etc but the trick is to group by formula.

Oh, be careful now. Although it makes coding very simple, grouping by conversion to VARCHAR can be a whole lot slower than using the ol' >= and < method.

--Jeff Moden

RBAR is pronounced "ree-bar" and is a "Modenism" for Row-By-Agonizing-Row.
First step towards the paradigm shift of writing Set Based code:
________Stop thinking about what you want to do to a ROW... think, instead, of what you want to do to a COLUMN.

Change is inevitable... Change for the better is not.

Helpful Links:
How to post code problems
How to Post Performance Problems
Create a Tally Function (fnTally)

Andre Guerreiro SSCertifiable Points: 7319 More actions · Answer 10

dwain.c (4/18/2012)
Here's a solution that will create the 10 day groups for only the groups where you have data.
DECLARE @startdate DATETIME
SELECT @startdate = DATEADD(year, DATEDIFF(year, 0, '2010-05-31'), 0)
;WITH Dates AS (
SELECT DATEADD(day, 10*(DATEDIFF(day, @startdate, s_date)/10), @startdate) as s_date
,DATEADD(day, 9+10*(DATEDIFF(day, @startdate, s_date)/10), @startdate) as e_date
,amount
FROM @sample)
SELECT s_date, e_date, SUM(amount) As amount
FROM Dates
GROUP BY s_date, e_date

Excellent example! You are using the same table as its own tally table and I never thought about those neat calculations before? Have you thought about that using your own formula or is it based in an article from the past?

Learned something new. Thank you.

Best regards,

Andre Guerreiro Neto

Database Analyst
http://www.softplan.com.br
MCITPx1/MCTSx2/MCSE/MCSA

Dwain Camps SSC Guru Points: 86978 More actions · Answer 11

It's nice to hear when an original work is appreciated!

Actually, in most of my posts where I'm suggesting a solution, I usually put something together from scratch. There is the occasional snippet (e.g., the XML method to put together a delimited string) of course.

My mantra: No loops! No CURSORs! No RBAR! Hoo-uh![/I]
My thought question: Have you ever been told that your query runs too fast?

My advice:
INDEXing a poor-performing query is like putting sugar on cat food. Yeah, it probably tastes better but are you sure you want to eat it?
The path of least resistance can be a slippery slope. Take care that fixing your fixes of fixes doesn't snowball and end up costing you more than fixing the root cause would have in the first place.

Need to UNPIVOT? Why not CROSS APPLY VALUES instead?[/url]
Since random numbers are too important to be left to chance, let's generate some![/url]
Learn to understand recursive CTEs by example.[/url]
[url url=http://www.sqlservercentral.com/articles/St