EduArn | Next-Gen LMS & Training Platform
Course Name: Bigdata PySpark and Kafka Training
Completion Progress: 0%
Section 2: Best practices
Section 4: PySpark
-
Introduction
-
PySpark Architecture
-
PySpark Iinstallation
-
PySpark Configuration
-
Connectivity to various sources
-
RDD Transformations & Actions
-
PYspark Performance
-
Spark SQL
-
Batch & Stream Processing
-
ETL data pipeline
-
Best practices
Section 5: Kafka
-
Overview
-
Kafka Architecture
-
Installation and Configuration
-
Consumer and Producer
-
Producing & Consuming Messages
-
Connector and Stream API’s
-
Kafka Streaming
-
Structured Streaming Pipelines
-
Best practices
Section 6: BigData
-
Introduction
-
3 V's of Bigdata
-
Hadoop
-
HDFS
-
YARN