Find double records within DateTime range

Question

Post reply

Find double records within DateTime range

sirokinl

Old Hand

Points: 335
More actions
April 13, 2015 at 1:44 am

#316630

Hi, thanks for reading my topic.
I have a query question.
Consider a table with the following structure:
RecordID (PK - int) - RecordDate (DateTime)
I need to find all records that fall within a 7 day period slot based on the first RecordDate of a specific slot.
Example, consider the following records:
RecordID - RecordDate
1 - 2015-04-01 14:00
2 - 2015-04-03 15:00
3 - 2015-04-03 16:05
4 - 2015-04-03 19:23
5 - 2015-04-06 09:15
6 - 2015-04-06 11:30
7 - 2015-04-07 12:00
8 - 2015-04-09 15:15
The result of the query I'd like should look something like this
1
2
5
7
8
So basically I'd like to leave record 3 and 4 out because they fall within 24 hours of record 2 and I'd like to leave record 6 out because it falls within 24 hours of record 5.
I'd tried working with a CTE and set a dateadd(d, 1, recorddate), join it on itself and use a between From / To filter on the join but that didn't work. I don't think NTILE will work with this?
I'd love to hear some suggestion on how to approach this problem.
Thanks and have a great day.

Viewing 12 posts - 1 through 11 (of 11 total)

You must be logged in to reply to this topic. Login to reply

Phil Parkin SSC Guru Points: 246990 More actions · Answer 1

Try this:

if object_id('tempdb..#dates', 'U') is not null

drop table #dates;

create table #dates

(

RecordId int primary key clustered

,RecordDate datetime

);

insert #dates

(RecordId, RecordDate)

values (1, '2015-04-01 14:00'),

(2, '2015-04-03 15:00'),

(3, '2015-04-03 16:05'),

(4, '2015-04-03 19:23'),

(5, '2015-04-06 09:15'),

(6, '2015-04-06 11:30'),

(7, '2015-04-07 12:00'),

(8, '2015-04-09 15:15');

with recs

as (select d.*

,date1 = lag(RecordDate, 1, '19000101') over (order by d.RecordId)

from #dates d

)

select recs.RecordId

from recs

where datediff(hour, recs.date1, recs.RecordDate) > 24

And please post consumable DDL next time.

The absence of evidence is not evidence of absence.
Martin Rees

You can lead a horse to water, but a pencil must be lead.
Stan Laurel

sirokinl Old Hand Points: 335 More actions · Answer 2

Thank you very much Phil for point out the LAG function.

I will post proper DDL next time.

Have a nice day.

Luis Cazares SSC Guru Points: 183706 More actions · Answer 3

I'm not sure if this more simple option would fit you.

What happens if a row falls into the following 24 hours but it's on a different date? This option would include row 9 but Phil's would exclude it.

if object_id('tempdb..#dates', 'U') is not null

drop table #dates;

create table #dates

(

RecordId int primary key clustered

,RecordDate datetime

);

insert #dates

(RecordId, RecordDate)

values (1, '2015-04-01 14:00'),

(2, '2015-04-03 15:00'),

(3, '2015-04-03 16:05'),

(4, '2015-04-03 19:23'),

(5, '2015-04-06 09:15'),

(6, '2015-04-06 11:30'),

(7, '2015-04-07 12:00'),

(8, '2015-04-09 15:15'),

(9, '2015-04-10 09:15');

SELECT MIN(RecordId) AS RecordId

FROM #dates

GROUP BY CAST( RecordDate AS date);

Luis C.
General Disclaimer:
Are you seriously taking the advice and code from someone from the internet without testing it? Do you at least understand it? Or can it easily kill your server?

How to post data/code on a forum to get the best help: Option 1 / Option 2

sirokinl Old Hand Points: 335 More actions · Answer 4

So I'd like to take this a little further, hopefully you can help me with this issue again. So basically I solved the problem of marking records with fall within 7*24 hours of the previous record. I've marked those records with 1 in the column IsDoubleLag.

Unfornately this solution is only based on the previous record. When you have a number of records that all fall within 7*24 hour of the previous records, all records would be marked as IsDoubleLag = 1.

Would it be possible somehow, without the use of a cursor, to have a 7*24 hour break mark.

Consider the following table:

CREATE TABLE #TEMP (ID int, DateCreated datetime, IsDoubleLag bit)

INSERT INTO #TEMP VALUES (1,'2015-01-01 08:11:40.490','0')

INSERT INTO #TEMP VALUES (2,'2015-01-04 02:29:47.777','1')

INSERT INTO #TEMP VALUES (3,'2015-01-04 16:07:12.887','1')

INSERT INTO #TEMP VALUES (4,'2015-01-06 07:26:52.377','1')

INSERT INTO #TEMP VALUES (5,'2015-01-11 00:46:37.117','1')

INSERT INTO #TEMP VALUES (6,'2015-01-11 09:58:24.640','1')

INSERT INTO #TEMP VALUES (7,'2015-01-12 15:43:24.280','1')

INSERT INTO #TEMP VALUES (8,'2015-01-12 19:41:46.213','1')

INSERT INTO #TEMP VALUES (9,'2015-01-15 17:31:49.297','1')

INSERT INTO #TEMP VALUES (10,'2015-01-19 01:20:35.487','1')

INSERT INTO #TEMP VALUES (11,'2015-01-21 15:30:31.100','1')

INSERT INTO #TEMP VALUES (12,'2015-01-26 15:27:54.880','1')

INSERT INTO #TEMP VALUES (13,'2015-03-21 23:52:09.707','0')

INSERT INTO #TEMP VALUES (14,'2015-03-23 13:07:29.353','1')

INSERT INTO #TEMP VALUES (15,'2015-04-01 07:28:20.613','0')

INSERT INTO #TEMP VALUES (16,'2015-04-05 04:56:28.927','1')

In this scenario it starts with RecordID 1 with a date of 2015-01-01 08:11 .. Then RecordID 2, 3 and 4 fall within 7x24 hours. Unfortunately RecordID 5 with a date of 2015-01-11 00:46:37 also falls within the range of the previous record. Which is logical, because this is how it's set up.

But I'd like to have RecordID 5 be marked as a new range record. So basically I'd like it to have the first marked IsDoubleLag record to be leading in marking the subsequent records. When a new break has been marked, it should restart the lag marking.

I know how to fix it through cursor and use a running variable, but I'd like to solve it set based.

Thanks!

Luis Cazares SSC Guru Points: 183706 More actions · Answer 5

Would you be comfortable using the Quirky Update? http://www.sqlservercentral.com/articles/T-SQL/68467/

Luis C.
General Disclaimer:
Are you seriously taking the advice and code from someone from the internet without testing it? Do you at least understand it? Or can it easily kill your server?

How to post data/code on a forum to get the best help: Option 1 / Option 2

Luis Cazares SSC Guru Points: 183706 More actions · Answer 6

This is a possibility using the Quirky Update.

CREATE TABLE #TEMP (ID int PRIMARY KEY, DateCreated datetime, IsDoubleLag bit)