HashPartitioner
HashPartitioner
is a Partitioner
that uses partitions
configurable number of partitions to shuffle data around.
Property | Description |
---|---|
|
Exactly |
|
|
|
|
|
Exactly |
Note
|
HashPartitioner is the default Partitioner for coalesce transformation with shuffle enabled, e.g. calling repartition.
|
It is possible to re-shuffle data despite all the records for the key k
being already on a single Spark executor (i.e. BlockManager to be precise). When HashPartitioner
‘s result for k1
is 3
the key k1
will go to the third executor.