Identifying consecutive values with constraints

thusi-886269

Old Hand

Points: 331
November 20, 2008 at 2:01 am

#215535

Hi All
I have a set of blood pressure measurements like:
MMID MESDATE SYS DIA
A006283 2005-11-14 148 80
A006283 2006-01-16 130 88
A006283 2006-10-18 130 80
A006283 2006-12-28 144 96
A006283 2007-01-03 120 80
A006283 2006-05-17 130 80
A006283 2007-02-28 140 80
A006283 2007-05-03 130 80
A006283 2008-01-18 150 80
A006283 2006-06-29 130 70
...
M009781 2006-10-24 110 70
M015182 2008-07-22 130 90
M020100 2006-04-20 130 70
I want to identify sequences of consecutive measurements where SYS>=130 and DIA>=80, for each patient. So for patient A006283, for example, I should get:
1) A006283 2005-11-14 148 80 2006-01-16 130 88 2006-05-17 130 80
2) A006283 2007-02-28 140 80 2007-05-03 130 80 2008-01-18 150 80
because these are the only runs of 3 consecutive measurements (by date) that meet the criteria (note that the raw data was not sorted!).
I'm thinking a CROSS APPLY will get part of the required results, but I think you need to PIVOT it as well to display the multiple rows (i.e. the 3 consecutive measurements) in a single row like above.
Ideally I'm after a generic method (UDF?) where I can run the query for 'n' consecutive measurements.
Thanks

Viewing 15 posts - 1 through 15 (of 24 total)


Jerry Hung SSChampion Points: 12968 · Answer 1

My first thoughts are either ROW_NUMBER or a Tally table.

Please provide a sample data set (as SQL code) so that we can assist you better.

To clarify, though: shouldn't there be more than just 2 results for SYS>=130 and DIA>=80?

Unless you only want the first and last?

MMID MESDATE SYS DIA
A006283 2005-11-14 148 80
A006283 2006-01-16 130 88
A006283 2006-10-18 130 80
A006283 2006-12-28 144 96
A006283 2007-01-03 120 80
A006283 2006-05-17 130 80
A006283 2007-02-28 140 80
A006283 2007-05-03 130 80
A006283 2008-01-18 150 80
A006283 2006-06-29 130 70

SQLServerNewbie
MCITP: Database Administrator SQL Server 2005

thusi-886269 Old Hand Points: 331 · Answer 2

Thanks for the response Jerry. I'll try to clarify the problem further with a slightly different data set:

Here's some raw data:

DECLARE @BPTable TABLE (
    MMID varchar(10),
    MESDATE date,
    SYSTOLIC int,
    DIA int
)

INSERT @BPTable (MMID, MESDATE, SYSTOLIC, DIA) VALUES ('A006283', '2005-11-14', 148, 80)
INSERT @BPTable (MMID, MESDATE, SYSTOLIC, DIA) VALUES ('A006283', '2006-01-16', 130, 88)
INSERT @BPTable (MMID, MESDATE, SYSTOLIC, DIA) VALUES ('A006283', '2006-10-18', 120, 80)
INSERT @BPTable (MMID, MESDATE, SYSTOLIC, DIA) VALUES ('A006283', '2006-12-28', 144, 96)
INSERT @BPTable (MMID, MESDATE, SYSTOLIC, DIA) VALUES ('A006283', '2007-01-03', 120, 80)
INSERT @BPTable (MMID, MESDATE, SYSTOLIC, DIA) VALUES ('A006283', '2006-05-17', 130, 80)
INSERT @BPTable (MMID, MESDATE, SYSTOLIC, DIA) VALUES ('A006283', '2007-02-28', 140, 80)
INSERT @BPTable (MMID, MESDATE, SYSTOLIC, DIA) VALUES ('A006283', '2007-05-03', 130, 80)
INSERT @BPTable (MMID, MESDATE, SYSTOLIC, DIA) VALUES ('A006283', '2008-01-18', 150, 80)
INSERT @BPTable (MMID, MESDATE, SYSTOLIC, DIA) VALUES ('A006283', '2006-06-29', 130, 90)
INSERT @BPTable (MMID, MESDATE, SYSTOLIC, DIA) VALUES ('M009781', '2006-10-24', 110, 70)
INSERT @BPTable (MMID, MESDATE, SYSTOLIC, DIA) VALUES ('M015182', '2008-07-22', 130, 90)
INSERT @BPTable (MMID, MESDATE, SYSTOLIC, DIA) VALUES ('M020100', '2006-04-20', 130, 70)

Note how the raw data is not ordered in any way. If you order it by ID and date, using a simple query:

SELECT *
FROM @BPTable
ORDER BY MMID, MESDATE

then you'll have the following ordered dataset:

MMID     MESDATE     SYSTOLIC  DIA
A006283  2005-11-14  148       80
A006283  2006-01-16  130       88
A006283  2006-05-17  130       80
A006283  2006-06-29  130       90
A006283  2006-10-18  120       80
A006283  2006-12-28  144       96
A006283  2007-01-03  120       80
A006283  2007-02-28  140       80
A006283  2007-05-03  130       80
A006283  2008-01-18  150       80
M009781  2006-10-24  110       70
M015182  2008-07-22  130       90
M020100  2006-04-20  130       70

Now if you look at 3 consecutive occurrences, they are the following:

1.
A006283  2005-11-14  148  80
A006283  2006-01-16  130  88
A006283  2006-05-17  130  80

2.
A006283  2006-01-16  130  88
A006283  2006-05-17  130  80
A006283  2006-06-29  130  90

3.
A006283  2007-02-28  140  80
A006283  2007-05-03  130  80
A006283  2008-01-18  150  80

Hope my problem is clearer now 🙂

Thanks

Jerry Hung SSChampion Points: 12968 · Answer 3

Since there's a missing row in the middle that you didn't show, you are looking for any X consecutive rows, right?

So in this case, only rows that fit the criteria would count?

A006283 2006-12-28 144 96


thusi-886269 Old Hand Points: 331 · Answer 4

Oh..sorry for the confusion. It wasn't really a missing row as such..but just a blank line I added to show the end of that particular patient who had a bunch of BP measurements.

Yes, I'm looking for 'x' consecutive rows (after ordering by date) that meet the required criteria. So if you run the magic query on the test data I've posted, you should get the outputs 1, 2 and 3 shown above as its results.
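
For what it's worth, the requirement as clarified here (order each patient's rows by date, then report every run of 'x' consecutive rows that all meet the criteria) can be sketched with ROW_NUMBER and a self-join. This is only a sketch, not the solution from this thread. It assumes SQL Server 2008 or later (for the date type, inline DECLARE initialisation and multi-row VALUES) and at most one measurement per patient per date. The names @n, Numbered, Starts, IsHigh and WindowNo are made up for illustration.

```sql
-- Sketch only. @n, Numbered, Starts, IsHigh and WindowNo are illustrative names.
DECLARE @n int = 3;  -- number of consecutive qualifying measurements wanted

DECLARE @BPTable TABLE (MMID varchar(10), MESDATE date, SYSTOLIC int, DIA int);

-- Sample rows copied from the test data posted in this thread
INSERT @BPTable (MMID, MESDATE, SYSTOLIC, DIA) VALUES
    ('A006283', '2005-11-14', 148, 80),
    ('A006283', '2006-01-16', 130, 88),
    ('A006283', '2006-10-18', 120, 80),
    ('A006283', '2006-12-28', 144, 96),
    ('A006283', '2007-01-03', 120, 80),
    ('A006283', '2006-05-17', 130, 80),
    ('A006283', '2007-02-28', 140, 80),
    ('A006283', '2007-05-03', 130, 80),
    ('A006283', '2008-01-18', 150, 80),
    ('A006283', '2006-06-29', 130, 90),
    ('M009781', '2006-10-24', 110, 70),
    ('M015182', '2008-07-22', 130, 90),
    ('M020100', '2006-04-20', 130, 70);

WITH Numbered AS (
    -- Number each patient's rows in date order and flag the ones meeting the criteria
    SELECT MMID, MESDATE, SYSTOLIC, DIA,
           ROW_NUMBER() OVER (PARTITION BY MMID ORDER BY MESDATE) AS rn,
           CASE WHEN SYSTOLIC >= 130 AND DIA >= 80 THEN 1 ELSE 0 END AS IsHigh
    FROM @BPTable
),
Starts AS (
    -- A row starts a window when the @n rows from it onwards all exist and all qualify
    SELECT s.MMID, s.rn
    FROM Numbered AS s
    JOIN Numbered AS w
      ON w.MMID = s.MMID
     AND w.rn BETWEEN s.rn AND s.rn + @n - 1
    GROUP BY s.MMID, s.rn
    HAVING COUNT(*) = @n AND MIN(w.IsHigh) = 1
)
-- List the rows of every window, one measurement per output row
SELECT DENSE_RANK() OVER (ORDER BY st.MMID, st.rn) AS WindowNo,
       w.MMID, w.MESDATE, w.SYSTOLIC, w.DIA
FROM Starts AS st
JOIN Numbered AS w
  ON w.MMID = st.MMID
 AND w.rn BETWEEN st.rn AND st.rn + @n - 1
ORDER BY WindowNo, w.MESDATE;
```

With @n = 3 on the sample data this should list the same three groups as outputs 1, 2 and 3 above, one measurement per row. Folding each group into a single row would still need a PIVOT or similar on top.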

Jerry Hung SSChampion Points: 12968 · Answer 5

I am sure there's a better way, with fewer #temp tables involved, but here's my first try. The idea is to give an identity to each row, find the streaks, and select based on the streak.
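
As an aside, the streak idea can also be pictured in a single query that uses two ROW_NUMBER calls instead of an identity column and temp tables. This is a different technique from the script that follows, shown only to illustrate what a "streak" is: number all of a patient's rows by date, number only the qualifying rows by date, and the difference between the two numbers stays constant within an unbroken run. It is a sketch that assumes SQL Server 2008 or later and unique dates per patient. @MinLength, Numbered, Streaks and StreakId are illustrative names, not from the thread.

```sql
-- Sketch only. @MinLength, Numbered, Streaks and StreakId are illustrative names.
DECLARE @MinLength int = 3;  -- shortest streak worth reporting

DECLARE @BPTable TABLE (MMID varchar(10), MESDATE date, SYSTOLIC int, DIA int);

-- Sample rows copied from the test data posted in this thread
INSERT @BPTable (MMID, MESDATE, SYSTOLIC, DIA) VALUES
    ('A006283', '2005-11-14', 148, 80),
    ('A006283', '2006-01-16', 130, 88),
    ('A006283', '2006-10-18', 120, 80),
    ('A006283', '2006-12-28', 144, 96),
    ('A006283', '2007-01-03', 120, 80),
    ('A006283', '2006-05-17', 130, 80),
    ('A006283', '2007-02-28', 140, 80),
    ('A006283', '2007-05-03', 130, 80),
    ('A006283', '2008-01-18', 150, 80),
    ('A006283', '2006-06-29', 130, 90),
    ('M009781', '2006-10-24', 110, 70),
    ('M015182', '2008-07-22', 130, 90),
    ('M020100', '2006-04-20', 130, 70);

WITH Numbered AS (
    -- Position of every row within its patient, in date order
    SELECT MMID, MESDATE, SYSTOLIC, DIA,
           ROW_NUMBER() OVER (PARTITION BY MMID ORDER BY MESDATE) AS rn
    FROM @BPTable
),
Streaks AS (
    -- Renumber only the qualifying rows; rn minus that number is constant
    -- for as long as no non-qualifying row interrupts the run
    SELECT MMID, MESDATE,
           rn - ROW_NUMBER() OVER (PARTITION BY MMID ORDER BY MESDATE) AS StreakId
    FROM Numbered
    WHERE SYSTOLIC >= 130 AND DIA >= 80
)
SELECT MMID,
       MIN(MESDATE) AS StreakStart,
       MAX(MESDATE) AS StreakEnd,
       COUNT(*)     AS StreakLength
FROM Streaks
GROUP BY MMID, StreakId
HAVING COUNT(*) >= @MinLength
ORDER BY MMID, StreakStart;
```

On the sample data with @MinLength = 3 this should report two streaks for A006283: 2005-11-14 to 2006-06-29 (4 measurements) and 2007-02-28 to 2008-01-18 (3 measurements). A streak of length 4 contains two overlapping runs of 3, which is why the expected output above shows three groups.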

DECLARE @BPTable TABLE (

MMID VARCHAR(10),

MESDATE DATE,

SYSTOLIC INT,

DIA INT