Improving Performance In Spark Using Partitions


In this blog post we show how to optimize a Spark job by partitioning its data correctly. To demonstrate this we use the public College Scorecard dataset, which contains key data points for colleges across the United States. With this dataset we will compute the average student fees by state.
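Before digging into the partitioning details, here is a minimal sketch of the computation in Scala, assuming Spark 2.x or later. The file path is a placeholder, and the column names STABBR (state abbreviation) and TUITIONFEE_IN (in-state tuition and fees) are assumptions based on the public College Scorecard data dictionary, so substitute whichever fee column your download actually contains.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions.{avg, col}

object AverageFeesByState {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("AverageFeesByState")
      .getOrCreate()

    // Load the College Scorecard CSV; the path is a placeholder.
    val scorecard = spark.read
      .option("header", "true")
      .option("inferSchema", "true")
      .csv("/data/college-scorecard.csv")

    // Repartition on the grouping key so rows for the same state land in
    // the same partition; the groupBy below can then aggregate without a
    // second shuffle, since the data is already hash-partitioned on STABBR.
    val byState = scorecard.repartition(col("STABBR"))

    // TUITIONFEE_IN (in-state tuition and fees) stands in for "student fees".
    val avgFees = byState
      .groupBy("STABBR")
      .agg(avg("TUITIONFEE_IN").alias("avg_in_state_fee"))

    avgFees.show(51) // 50 states plus DC
    spark.stop()
  }
}
```

Because state is a low-cardinality key, it can also help to pass an explicit partition count, e.g. repartition(51, col("STABBR")), so Spark does not spread a handful of states across the default 200 shuffle partitions and leave most of them nearly empty.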


2019-04-12
