DELETE with SELECT and join in SQL SERVER

Question

Post reply

DELETE with SELECT and join in SQL SERVER

kdaniapam

Say Hey Kid

Points: 705
More actions
June 27, 2023 at 6:22 pm
Go to Answer
#4220893
I have a select statement below which gives the expected results of duplicate data. I want to convert this statement to a DELETE Statement to delete items leaving a single item but getting error. Below is my query. Sample data
SampleOne 2016-10-14 12:44:39.460
SampleOne 2016-10-14 12:44:39.460
SampleTwo 2016-10-14 12:44:39.460
SampleTwo 2016-10-14 12:44:39.460
```
SELECT t.*
FROM (
SELECT s.* , COUNT(*) OVER (PARTITION BY s.DocumentName, s.SubmitDateTime, s.DocumentType) AS DocCount
FROM Document s
LEFT OUTER JOIN
Staging b ON s.DocumentId = b.DocumentId
LEFT OUTER JOIN Rejections r ON  s.DocumentId =r.DocID 
WHERE b.DocumentId IS NULL
AND r.DocID IS NULL
AND s.SubmitDateTime IS NOT NULL
AND s.InsertDateTime IS NOT NULL
AND s.DocumentName IS NOT NULL
AND s.Description ='Employee Document'
) t
WHERE t.DocCount > 1
ORDER BY t.DocumentName, t.SubmitDateTime
```
- This topic was modified 1 year, 4 months ago by kdaniapam.
- This topic was modified 1 year, 4 months ago by kdaniapam.

kdaniapam

Say Hey Kid

Points: 705
More actions
June 28, 2023 at 2:37 am
Answer
#4221215

Hi Allan,
I tried your suggestion and I have it working. I will appreciate if anyone can look over my query below for me.
WITH CTE AS (
SELECT s.* , ROW_NUMBER() OVER (PARTITION BY s.DocumentName, s.SubmitDateTime, s.DocumentType order by s.DocumentName, s.SubmitDateTime, s.DocumentType) AS DocCount
FROM Document s
WHERE s.SubmitDateTime IS NOT NULL
AND s.InsertDateTime IS NOT NULL
AND s.DocumentName IS NOT NULL
AND s.Description ='Employee Document'
AND s.DocumentId NOT IN(SELECT v.DocumentId from Staging v)
AND s.DocumentId NOT IN(SELECT r.DocID FROM Rejections r )
)
DELETE FROM CTE
WHERE DocCount > 1

Viewing 10 posts - 1 through 9 (of 9 total)

You must be logged in to reply to this topic. Login to reply

rjaye03 Mr or Mrs. 500 Points: 505 More actions · Answer 1

Replace top line "SELECT t.*" with "DELETE t" and remove ORDER BY clause at the end

kdaniapam Say Hey Kid Points: 705 More actions · Answer 2

Hi

Thanks for your response.

Doing below as per your suggestion give me error

View or function 't' is not updatable because the modification affects multiple base tables.

DELETE t

FROM (

SELECT s.* , COUNT(*) OVER (PARTITION BY s.DocumentName, s.SubmitDateTime, s.DocumentType) AS DocCount

FROM Document s

LEFT OUTER JOIN

Staging b ON s.DocumentId = b.DocumentId

LEFT OUTER JOIN Rejections r ON s.DocumentId =r.DocID

WHERE b.DocumentId IS NULL

AND r.DocID IS NULL

AND s.SubmitDateTime IS NOT NULL

AND s.InsertDateTime IS NOT NULL

AND s.DocumentName IS NOT NULL

AND s.Description ='Employee Document'

) t

WHERE t.DocCount > 1

drew.allen SSC Guru Points: 76985 More actions · Answer 3

FIRST, this will delete ALL records rather than "leaving a single item".

I would use ROW_NUMBER() rather than COUNT(), because ROW_NUMBER() guarantees a unique way to identify each row, whereas COUNT() does not (unless you really do want to delete ALL records).

Replace your LEFT OUTER JOINs and NULL checks with NOT EXISTS.

Drew

J. Drew Allen
Business Intelligence Analyst
Philadelphia, PA

Jonathan AC Roberts SSCoach Points: 18354 More actions · Answer 4

I think you might want something like this:

WITH CTE AS
(
    SELECT s.DocumentId,
           ROW_NUMBER() OVER (PARTITION BY s.DocumentName, s.SubmitDateTime, s.DocumentType 
                                  ORDER BY s.DocumentName, s.SubmitDateTime) AS RowNum
      FROM Document s
     WHERE NOT EXISTS(SELECT 1
                        FROM Staging b 
                       WHERE b.DocumentId = s.DocumentId)
       AND NOT EXISTS(SELECT 1
                        FROM Rejections r 
                       WHERE r.DocID = s.DocumentId)
       AND s.SubmitDateTime IS NOT NULL
       AND s.InsertDateTime IS NOT NULL
       AND s.DocumentName IS NOT NULL
       AND s.Description ='Employee Document'
)
DELETE CTE
WHERE RowNum > 1
;

This reply was modified 1 year, 4 months ago by Jonathan AC Roberts. Reason: Edited SQL

kdaniapam Say Hey Kid Points: 705 More actions · Answer 5

kdaniapam

Say Hey Kid

Points: 705

June 28, 2023 at 2:34 am

#4221214

Hi Bob,

Your query gives me an empty results.

jcelko212 32090 SSCrazy Eights Points: 9303 More actions · Answer 6

>> I have a select statement below which gives the expected results of duplicate data. <<

I hate to tell you this, but in RDBMS "expected results of duplicate data" is a really bad thing. The whole goal RDBMS and therefore SQL, was to get rid of duplicates and redundancy. I also see you don't know the part about posting DDL is a convention on SQL forms for last 35 years. So we have no idea what the key is.

It also looks like you've given a single data element two different names same schema! This is a classic beginners error. Is DocumentId the same as DocID ?

If DocumentId really is an identifier, then how can you ever test for null because it will never be null. By the nature of identifiers they exist and have to be known. My guess is that this should not be a problem at all. Your DL should have constraints which will prevent duplication. Remembe we are writing SQL, not COBOL or some other traditional programming language file description..

>> I want to convert this statement to a DELETE Statement to delete items leaving a single item but getting error. Below is my query. Sample data <<

Then why did create redundant dups in the first place??

Here is a guess at what you might have meant, if you were writing SQL:

CREATE TABLE Foobar_Documnts

(document_id CHAR (15) NOT NULL PRIMARY KEY,

creation_timestamp DATETIME2(7) NOT NULL

DEFAULT CURRENT_TIMESTAMP,

PRIMARY KEY (document_id, creation_timestamp));

You don't seem to understand the concept of the key, which is the foundation of the relational model. Nor, do you seem to understand how to do DDL at all.

Please post DDL and follow ANSI/ISO standards when asking for help.

kdaniapam Say Hey Kid Points: 705 More actions · Answer 7

Hi

I have resolved the issue and doesn't need your comment or answer. Your comment is too late.

Jeff Moden SSC Guru Points: 1004324 More actions · Answer 8

kdaniapam wrote:

Hi
I have resolved the issue and doesn't need your comment or answer. Your comment is too late.

You just got done saying "I tried your suggestion and I have it working. I will appreciate if anyone can look over my query below for me."

You got what you asked for. 😉

--Jeff Moden

RBAR is pronounced "ree-bar" and is a "Modenism" for Row-By-Agonizing-Row.
First step towards the paradigm shift of writing Set Based code:
________Stop thinking about what you want to do to a ROW... think, instead, of what you want to do to a COLUMN.

Change is inevitable... Change for the better is not.

Helpful Links:
How to post code problems
How to Post Performance Problems
Create a Tally Function (fnTally)