RDD Shared Variables In Spark, when a function is passed to a transformation operation, it is executed on a remote cluster node.…
Category:
Apache Spark Tutorial
-
Spark Cartesian Function In Spark, the Cartesian function generates the Cartesian product of two datasets and returns all possible combinations of…
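To make the idea of a Cartesian product concrete, here is a minimal plain-Python sketch of the pairing behavior. It does not use Spark itself: the lists `left` and `right` are hypothetical sample data standing in for two datasets, and `itertools.product` stands in for the pairing step. In PySpark, the corresponding call would be `cartesian` on an RDD.

```python
from itertools import product

# Hypothetical sample data standing in for two small datasets.
left = [1, 2, 3]
right = ["a", "b"]

# Pair every element of `left` with every element of `right`.
# With PySpark RDDs this would be written as left_rdd.cartesian(right_rdd).
pairs = list(product(left, right))

print(len(pairs))  # number of pairs = len(left) * len(right)
print(pairs)
```

The result size grows as the product of the two input sizes, which is why this operation is usually reserved for small datasets.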
-
What is RDD? The RDD (Resilient Distributed Dataset) is Spark’s core abstraction. It is a collection of elements, partitioned across the…
-
Spark reduceByKey Function In Spark, the reduceByKey function is a frequently used transformation that aggregates data. It receives key-value…
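As a rough illustration of what this kind of per-key aggregation does, the following plain-Python sketch merges the values of each key with a two-argument function. It is a single-machine model of the semantics only, not Spark's implementation: the helper `reduce_by_key` and the sample `words` list are hypothetical names introduced for this example. In PySpark, the corresponding call would be `reduceByKey` on an RDD of key-value pairs.

```python
from operator import add


def reduce_by_key(pairs, func):
    """Merge the values of each key using func (hypothetical helper)."""
    merged = {}
    for key, value in pairs:
        # First value seen for a key is kept as-is; later ones are folded in.
        merged[key] = func(merged[key], value) if key in merged else value
    return merged


# Hypothetical sample data: (word, 1) pairs as in a word count.
words = [("a", 1), ("b", 1), ("a", 1), ("c", 1), ("a", 1)]

# With a PySpark RDD this would be written as words_rdd.reduceByKey(add).
counts = reduce_by_key(words, add)
print(counts)
```

The merge function should be associative and commutative, since a distributed engine may combine values for the same key in any order.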