stepbystepdatascience
2.07K subscribers
9:52
Modern Financial Dashboard from Scratch in Tableau - Day 1
stepbystepdatascience
70 views • 1 month ago
9:02
#10- How catalyst optimizer chooses join strategy in Databricks?
stepbystepdatascience
30 views • 2 months ago
5:31
#9- What is Catalyst Optimizer in Databricks?| Physical & Logical Plan | Demo in Databricks
stepbystepdatascience
50 views • 2 months ago
5:11
#8 - Parquet vs Delta | Fix tiny file problem in Databricks
stepbystepdatascience
161 views • 2 months ago
4:43
#7 - What is tiny file problem in Databricks | Effects and Solution
stepbystepdatascience
90 views • 2 months ago
5:15
#6 - What is Lazy Evaluation and DAG with simple example in databricks?
stepbystepdatascience
21 views • 2 months ago
3:52
#5- Difference between Resilient Distributed Dataset(RDD) and Data frame with simple example
stepbystepdatascience
35 views • 2 months ago
1:39
#4 - What is Resilient Distributed Data Set (RDD) with simple example?
stepbystepdatascience
39 views • 2 months ago
4:29
#3- What is shuffling/Exchange and its inner mechanism in Databricks?
stepbystepdatascience
47 views • 2 months ago
14:58
#2 - How Apache Spark breaks up a single job into multiple stages| Practical Example in Databricks
stepbystepdatascience
83 views • 3 months ago
6:04
Data Alerts in Databricks| Nicely Conditionally Formatted table in Databricks
stepbystepdatascience
799 views • 3 months ago
11:56
#1 - What is Apache Spark and its key concept in simple terms?
stepbystepdatascience
40 views • 3 months ago
7:32
Top 7 techniques of writing better queries in PostgreSQL
stepbystepdatascience
151 views • 5 months ago
1:46
De-identifying PII in Databricks
stepbystepdatascience
75 views • 6 months ago
5:41
Handling PII in Databricks
stepbystepdatascience
286 views • 6 months ago
3:24
Data Warehouse Vs Data Lake Vs Delta Lake Vs Lakehouse in simple terms
stepbystepdatascience
298 views • 6 months ago
2:43
What is Databricks in simple terms?
stepbystepdatascience
348 views • 6 months ago
15:30
Handling Duplication- Case VII-XI
stepbystepdatascience
24 views • 6 months ago
12:34
Handling Duplication Based on Address- Case VI
stepbystepdatascience
13 views • 6 months ago
18:25
Handling Duplication in SQL Server - Case I - V
stepbystepdatascience
31 views • 6 months ago
15:47
11 Ways to Handle Duplication in SQL Server- Introduction
stepbystepdatascience
76 views • 7 months ago
35:59
Intro to EDA: Baby Step for Data Science
stepbystepdatascience
162 views • 2 years ago
27:09
Data Cleaning using Regex in Pandas Data Frame
stepbystepdatascience
5.9K views • 2 years ago
34:31
Manipulating text using Regular Expression in python
stepbystepdatascience
699 views • 2 years ago
16:19
Configure, Import, Create Pipeline to Auto-Ingest S3 data to Snowflake from Scratch-Day 2
stepbystepdatascience
254 views • 2 years ago
27:30
Data Warehouse| Why Snowflake| CSV file Import | S3 Access- Day 1
stepbystepdatascience
178 views • 2 years ago
15:13
Handling text in python
stepbystepdatascience
221 views • 2 years ago
21:10
Cohort Retention Rate Analysis in Python
stepbystepdatascience
1.9K views • 2 years ago
18:24
Filters, Annotations, Icons, Collapsible containers in Tableau
stepbystepdatascience
192 views • 2 years ago
36:21
Basic Numpy in Python (Difference between Array vs List vs Numpy)
stepbystepdatascience
235 views • 2 years ago