programming within SELECT statement

Question

programming within SELECT statement

etl2016

Default port

Points: 1459
More actions
March 4, 2016 at 9:24 am

#325344

Hi,
Does SQL Server allow programming within a SELECT statement?
For instance, SELECT col1, col2 (some programming result) as Col3 FROM tables
Could you please share some thoughts.
thank you

Viewing 15 posts - 1 through 15 (of 17 total)

You must be logged in to reply to this topic. Login to reply

Phil Parkin SSC Guru Points: 247216 More actions · Answer 1

etl2016 (3/4/2016)
Hi,
Does SQL Server allow programming within a SELECT statement?
For instance, SELECT col1, col2 (some programming result) as Col3 FROM tables
Could you please share some thoughts.
thank you

Yes.

Select Col3 = Col1 + Col2

for example.

Luis Cazares SSC Guru Points: 183706 More actions · Answer 2

etl2016 (3/4/2016)
Hi,
Does SQL Server allow programming within a SELECT statement?
For instance, SELECT col1, col2 (some programming result) as Col3 FROM tables
Could you please share some thoughts.
thank you

What kind of programming?

Including expressions as Phil did? That won't change the value of the column in the table, it will only affect the result set.

Luis C.
General Disclaimer:
Are you seriously taking the advice and code from someone from the internet without testing it? Do you at least understand it? Or can it easily kill your server?

How to post data/code on a forum to get the best help: Option 1 / Option 2

etl2016 Default port Points: 1459 More actions · Answer 3

thanks.

But, its a bit more programming, involving local-scoped variables, some Queries, some aggregation and a final result.

Can we say something like, SELECT col1, col2, (DECLARE local_variable INT; SET @local_variable = SELECT (Query)) as Col3 etc..

Any T-SQL specific programming constructs to be taken care here, both performance wise and functionality wise .

thank you very much

Luis Cazares SSC Guru Points: 183706 More actions · Answer 4

Absolutely no variable declarations or multiple instructions. All variables should be defined before the query.

You can have queries inside queries. These are called subqueries and are a great tool. However, if you abuse them, they'll generate performance problems.

Also, you can have a SELECT to return a result set

SELECT col1,

col2,

col3

FROM TableA

Or you can have a SELECT to assign values to variables

SELECT @val1 = col1,

@val2 = col2,

@val3 = col3

FROM TableA

But you can't mix them.

If you're trying to do something specific, share it with us and we'll try to guide you. Read the article in my signature to know how to post better questions.

Luis C.
General Disclaimer:
Are you seriously taking the advice and code from someone from the internet without testing it? Do you at least understand it? Or can it easily kill your server?

How to post data/code on a forum to get the best help: Option 1 / Option 2

Jeff Moden SSC Guru Points: 1004701 More actions · Answer 5

etl2016 (3/4/2016)
thanks.
But, its a bit more programming, involving local-scoped variables, some Queries, some aggregation and a final result.
Can we say something like, SELECT col1, col2, (DECLARE local_variable INT; SET @local_variable = SELECT (Query)) as Col3 etc..
Any T-SQL specific programming constructs to be taken care here, both performance wise and functionality wise .
thank you very much

That smacks of RBAR (Row By Agonizing Row) and should be avoided even if you could do it.

--Jeff Moden

RBAR is pronounced "ree-bar" and is a "Modenism" for Row-By-Agonizing-Row.
First step towards the paradigm shift of writing Set Based code:
________Stop thinking about what you want to do to a ROW... think, instead, of what you want to do to a COLUMN.

Change is inevitable... Change for the better is not.

Helpful Links:
How to post code problems
How to Post Performance Problems
Create a Tally Function (fnTally)

Y.B. SSChampion Points: 11652 More actions · Answer 6

It's also worth noting that using SET vs SELECT when assigning a variable will work similarly but have important differences.

Here is some code to illustrate:

--Declare a simple table and add some test data

DECLARE @test TABLE (ID INT, Name VARCHAR(10))

INSERT INTO @test

VALUES (1, 'AAA'), (2, 'BBB'), (3, 'CCC'), (4, 'DDD')

--Declare our variables

DECLARE @set VARCHAR(10), @select VARCHAR(10)

--Let's see what the table looks like

SELECT * FROM @test

--This will fail because it will only accept a single (scalar) value

BEGIN TRY

SET @set = (SELECT Name FROM @test)

END TRY

BEGIN CATCH

SELECT 'ERROR: A SET with multiple values will not work!'

END CATCH

--This will work because it's only one value

SET @set = (SELECT Name FROM @test WHERE ID = 1)

--This will work but if you don't specify an exact value it will store the last returned value. In this case DDD.

SELECT @select = Name FROM @test

SELECT @set

SELECT @select

--------------------------------------------------------------------

--Now let's test what happens when no values are found.

--Set our variables to an initial value.

SET @set = 'INIT'

SET @select = 'INIT'

--This won't fail but notice how it assigns a NULL to the variable

SET @set = (SELECT Name FROM @test WHERE ID = 5)

--This won't fail either but the intial value is perserved.

SELECT @select = Name FROM @test WHERE ID = 5

SELECT @set

SELECT @select

SELECT quote FROM brain WHERE original = 1
0 rows returned

etl2016 Default port Points: 1459 More actions · Answer 7

thank you, I think in my case there is no RBAR

My query extracts from DW and manufactures a summarised aggregate table, which has 1 day information per record, so, max 31 records as the final result set.

Say, it has below structure:

A B C

a1

a2

a3

a4

In the above A,B, C are columns and a1, a2, a3.....are unique days in a month.

Now there is requirement to add a new column D which qualifies each aX, whether it is start of the month, if so what year and what month etc. The unique values a1, a2, a3 .... are not from 1 to 31 always. Instead, they represent sequentially incremented INTs, the smallest INT being dayID representing the inception of business, till today and into future.

There is a Static reference table which holds info on these INTs along with associated attributes.

Here is where the programming is needed when I say what I need is: SELECT c1, c2, (some programming with a result) as c3

Here "some programming" is JOIN related programming. Because, after JOIN-ing, there is some logic and computation to manufacture the last column D in the Aggregate table. There is single field in the ref static table, to derive straight forward.

Hope you agree there is no adversely influencing RBAR situation here (we are talking about 31 max records)

One option I am thinking is to generate a CTE on Static Ref table to reduce programming complexity within the above SELECT statement, and keep JOIN light weight between CTE and aggregate table, to generate final column D.

thank you.

Phil Parkin SSC Guru Points: 247216 More actions · Answer 8

etl2016 (3/4/2016)
thank you, I think in my case there is no RBAR
My query extracts from DW and manufactures a summarised aggregate table, which has 1 day information per record, so, max 31 records as the final result set.
Say, it has below structure:
A B C
a1
a2
a3
a4
In the above A,B, C are columns and a1, a2, a3.....are unique days in a month.
Now there is requirement to add a new column D which qualifies each aX, whether it is start of the month, if so what year and what month etc. The unique values a1, a2, a3 .... are not from 1 to 31 always. Instead, they represent sequentially incremented INTs, the smallest INT being dayID representing the inception of business, till today and into future.
There is a Static reference table which holds info on these INTs along with associated attributes.
Here is where the programming is needed when I say what I need is: SELECT c1, c2, (some programming with a result) as c3
Here "some programming" is JOIN related programming. Because, after JOIN-ing, there is some logic and computation to manufacture the last column D in the Aggregate table. There is single field in the ref static table, to derive straight forward.
Hope you agree there is no adversely influencing RBAR situation here (we are talking about 31 max records)
One option I am thinking is to generate a CTE on Static Ref table to reduce programming complexity within the above SELECT statement, and keep JOIN light weight between CTE and aggregate table, to generate final column D.
thank you.

We cannot really agree anything based on this description. It's just not detailed enough. If you want help with how to code your solution, you are going to have to provide the level of detail previously requested. That includes sample DDL, data and desired results in a format which can be cut & pasted straight into SSMS.

Alexander Suprun SSCertifiable Points: 6253 More actions · Answer 9

etl2016 (3/4/2016)
One option I am thinking is to generate a CTE on Static Ref table to reduce programming complexity within the above SELECT statement, and keep JOIN light weight between CTE and aggregate table, to generate final column D.

SQL Server will have to perform the same JOIN, doesn't matter if you wrap it in CTE or not. It will only simplify the coding.

Alex Suprun

Ed Wagner SSC Guru Points: 287026 More actions · Answer 10

Hope you agree there is no adversely influencing RBAR situation here (we are talking about 31 max records)

Like Phil said, with no code, it's impossible to say one way or another. Also, it sounds like you're talking about generating 31 rows. My guess is that the source of those 31 rows is bigger than 31 rows. There could be 10M rows that go into each one of them.

Tally Tables - Performance Personified
String Splitting with True Performance
Best practices on how to ask questions

Hugo Kornelis SSC Guru Points: 64790 More actions · Answer 11

etl2016 (3/4/2016)
Now there is requirement to add a new column D which qualifies each aX, whether it is start of the month, if so what year and what month etc. The unique values a1, a2, a3 .... are not from 1 to 31 always. Instead, they represent sequentially incremented INTs, the smallest INT being dayID representing the inception of business, till today and into future.

To first address your original question, a lot of "programming" can be done in a SELECT, as long as it is in a single expression. Expressions can be very complex though, includinig subqueries. They can even include user-defined functions but this generally performs so poor that my advice is to not go there unless you really know what you are doing.

The description above sounds like something that can easily be done using expressions.

Can you post the tables involved (as CREATE TABLE statements - please incluce all constraints, but please remove extra columns that are not relevant to the question), a few well-chose rows of sample data (as INSERT statements), the output you need to get from that sample data, plus an explanation of how this output logically derives from the sample data? That makes it a lot easier for us to help you!

Hugo Kornelis, SQL Server/Data Platform MVP (2006-2016)
Visit my SQL Server blog: https://sqlserverfast.com/blog/
SQL Server Execution Plan Reference: https://sqlserverfast.com/epr/

etl2016 Default port Points: 1459 More actions · Answer 12

thanks everyone for sharing your thoughts.

I implemented what I need.

Structurally it looks like a RBAR scenario, but, given the situation that, tables being joined T1 and T2 are slim, I believe RBAR is okay.

T1 is manufactured from DW and always contains 30-odd records only (1 day per month). And T2 contains Reference dates of the business, since its inception.

I derived my final column D as below:

(

SELECT

(

CONVERT ( varchar(30), ( SELECT MAX (some_date_field) from REF_TABLE T2 WHERE CLAUSE on T1 and T2), 103 )

+

' - '

+

CONVERT ( varchar(30), ( SELECT MAX (some_date_field) from REF_TABLE T2 WHERE CLAUSE on T1 and T2), 103 )

)

) AS Column-D

thank you

Sergiy SSC Guru Points: 110209 More actions · Answer 13

etl2016 (3/7/2016)
thanks everyone for sharing your thoughts.
I implemented what I need.
Structurally it looks like a RBAR scenario, but, given the situation that, tables being joined T1 and T2 are slim, I believe RBAR is okay.
T1 is manufactured from DW and always contains 30-odd records only (1 day per month). And T2 contains Reference dates of the business, since its inception.
I derived my final column D as below:
(
SELECT
(
CONVERT ( varchar(30), ( SELECT MAX (some_date_field) from REF_TABLE T2 WHERE CLAUSE on T1 and T2), 103 )
+
' - '
+
CONVERT ( varchar(30), ( SELECT MAX (some_date_field) from REF_TABLE T2 WHERE CLAUSE on T1 and T2), 103 )
)
) AS Column-D
thank you

Check CROSS APPLY.

You may find some examples here:

https://social.msdn.microsoft.com/Forums/sqlserver/en-US/796821b0-ba48-49b4-ba42-1783061d8c41/cross-apply-in-sqlserver?forum=transactsql

_____________
Code for TallyGenerator

MMartin1 One Orange Chip Points: 27867 More actions · Answer 14

Going deeper with the CROSS APPLY suggestion :

Consider something like this >

SELECT SUM(someNumberColumn) / (SELECT SUM(someNumberColumn) FROM Table) AS westCoastSalesAsA%OfTotal

FROM Table

WHERE someFilterColumn= 'west coast'

Could be done with something like ( I didnt totally check my syntax but the idea is )

SELECT SUM(t1.someNumberColumn) / (t2.totalSales) AS westCoastSalesAsA%OfTotal

FROM Table t1 CROSS APPLY

(SELECT SUM(someNumberColumn) as totalSales FROM Table) as t2

WHERE t1.someFilterColumn= 'west coast'

It appears cleaner.

----------------------------------------------------