OrderedRDDFunctions
OrderedRDDFunctions repartitionAndSortWithinPartitions Operator Caution FIXME sortByKey Operator ...
MapPartitionsRDD MapPartitionsRDD is an RDD that applies the provided function f to every partition of the parent ...
ParallelCollectionRDD ParallelCollectionRDD is an RDD of a collection of elements with numSlices partitions and op ...
TaskLocation TaskLocation is a location where a task should run. TaskLocation can either be a host alone or a (h ...
RDD Lineage — Logical Execution Plan
RDD RDD is a description of a distributed computation over a dataset of records of type T. RDD is identified by a ...
RDD — Resilient Distributed Dataset
Local Properties — Creating Logical Job Groups
SparkStatusTracker SparkStatusTracker is…FIXME SparkStatusTracker is created when SparkContext is c ...
ConsoleProgressBar ConsoleProgressBar shows the progress of active stages to standard error, i.e. stderr. It uses ...