How To Flatten JSON Data?

Question

Post reply

How To Flatten JSON Data?

Chris Wooding

SSCarpal Tunnel

Points: 4556

July 1, 2022 at 9:52 am

Go to Answer

#4057474

I receive data in JSON files and for one particular type of data I am having problems getting it into a sensible SQL format. The data consists of an ID and a JSON array of bank accounts belonging to that ID. For the data below, I want one row for ID 4710592 with the data where field "pri" is Bert and another row where it is Bob (note: this field isn't necessarily populated in the actual data).

CREATE TABLE #temptable ( [id] nvarchar(4000), [entry] nvarchar(4000) )
INSERT INTO #temptable ([id], [entry])
VALUES
( N'4710592', N'[
{"pri":"Bert"}
,{"is_pens_pay":""}
,{"acc_nr":"12345678"}
,{"curry_id":55}
,{"limit":""}
,{"acc_name":"Mr. J. Smith"}
,{"acc_nick_name":"J. Smith--GBP"}
,{"init_deposit_amt":""}
,{"is_main":"true"}
,{"is_third_party":""}
,{"bank_name":"NATIONAL WESTMINSTER BANK PLC"}
,{"bank_addr":"NATIONAL WESTMINSTER BANK PLC Leicester"}
,{"sort_code_nr":"123456"}
,{"pri":"Bob"}
,{"is_pens_pay":"True"}
,{"acc_nr":"98765432"}
,{"curry_id":56}
,{"limit":"20000.00"}
,{"acc_name":"Mr. J. Smith No. 2 account"}
,{"acc_nick_name":"J. Smith--USD"}
,{"init_deposit_amt":"1204.53"}
,{"is_main":"False"}
,{"is_third_party":""}
,{"bank_name":"SANTANDER"}
,{"bank_addr":"SANTANDER Moorgate"}
,{"sort_code_nr":"654321"}
]'
)

When I run the query below, I get as many rows per ID value as there are columns and only one column is populated on each row. I've tried using PIVOT, but that only gives me a single row per ID with the MAX (or MIN or whatever other aggregate function I use) per column. I'm sure there must be a relatively easy way to do this, but I'm stumped. Any assistance would be gratefully received.

SELECT *
FROM #temptable t
CROSS APPLY OPENJSON(t.entry)
WITH (
    pri NVARCHAR(4000),
is_pens_pay NVARCHAR(4000),
acc_nr NVARCHAR(4000),
curry_id NVARCHAR(4000),
limit NVARCHAR(4000),
acc_name NVARCHAR(4000),
acc_nick_name NVARCHAR(4000),
init_deposit_amt NVARCHAR(4000),
is_main NVARCHAR(4000),
is_third_party NVARCHAR(4000),
bank_name NVARCHAR(4000),
bank_addr NVARCHAR(4000),
sort_code_nr NVARCHAR(4000)
 ) oj

PS: The only unique identifier for the entries in the array would be a combination of columns - for the sample data the sort_code_nr and acc_nr combination is unique.

Mark Cowne

One Orange Chip

Points: 26952

July 1, 2022 at 11:11 am

Answer

#4057498

Maybe this?

WITH CTE AS (
SELECT t.id,
       j2.[key] AS field,
   j2.value AS val,
   row_number() over(partition by j2.[key] order by cast(j.[key] as int)) as rn
FROM #temptable t
CROSS APPLY OPENJSON(t.entry) j
CROSS APPLY OPENJSON(j.value) j2
)
SELECT id,
       MAX(CASE WHEN field='pri' THEN val END) AS pri,
       MAX(CASE WHEN field='is_pens_pay' THEN val END) AS is_pens_pay,
       MAX(CASE WHEN field='acc_nr' THEN val END) AS acc_nr,
       MAX(CASE WHEN field='curry_id' THEN val END) AS curry_id,
       MAX(CASE WHEN field='limit' THEN val END) AS limit,
       MAX(CASE WHEN field='acc_name' THEN val END) AS acc_name,
       MAX(CASE WHEN field='acc_nick_name' THEN val END) AS acc_nick_name,
       MAX(CASE WHEN field='init_deposit_amt' THEN val END) AS init_deposit_amt,
       MAX(CASE WHEN field='is_main' THEN val END) AS is_main,
       MAX(CASE WHEN field='is_third_party' THEN val END) AS is_third_party,
       MAX(CASE WHEN field='bank_name' THEN val END) AS bank_name,
       MAX(CASE WHEN field='bank_addr' THEN val END) AS bank_addr,
       MAX(CASE WHEN field='sort_code_nr' THEN val END) AS sort_code_nr
FROM CTE
GROUP BY id,rn
ORDER BY id,rn;

____________________________________________________

Deja View - The strange feeling that somewhere, sometime you've optimised this query before

How to get the best help on a forum

http://www.sqlservercentral.com/articles/Best+Practices/61537

Viewing 4 posts - 1 through 3 (of 3 total)

You must be logged in to reply to this topic. Login to reply

Chris Wooding SSCarpal Tunnel Points: 4556 More actions · Answer 1

Chris Wooding

SSCarpal Tunnel

Points: 4556

July 1, 2022 at 11:21 am

#4057499

Thanks. That works perfectly.

Chris Wooding SSCarpal Tunnel Points: 4556 More actions · Answer 2

In case anyone else comes across this as the solution to a similar problem, I had to add ral.id to the partition in order to get it to work with my full data set (ie. the original sample data didn't cover all scenarios).