关注 spark技术分享,
撸spark源码 玩spark最佳实践

SortExec

SortExec Unary Physical Operator

SortExec is a unary physical operator that is created when:

SortExec supports Java code generation (aka codegen).

When requested for the output attributes, SortExec simply gives whatever the child operator uses.

When requested for the output data partitioning requirements, SortExec simply gives whatever the child operator uses.

When requested for the required partition requirements, SortExec gives the OrderedDistribution (with the sorting order expressions for the ordering) when the global flag is enabled (true) or the UnspecifiedDistribution.

SortExec operator uses the spark.sql.sort.enableRadixSort internal configuration property (enabled by default) to control…​FIXME

Table 1. SortExec’s Performance Metrics
Key Name (in web UI) Description

peakMemory

peak memory

sortTime

sort time

spillSize

spill size

Generating Java Source Code for Produce Path in Whole-Stage Code Generation — doProduce Method

Note
doProduce is part of CodegenSupport Contract to generate the Java source code for produce path in Whole-Stage Code Generation.

doProduce…​FIXME

Creating SortExec Instance

SortExec takes the following when created:

createSorter Method

createSorter…​FIXME

Note
createSorter is used when…​FIXME
赞(0) 打赏
未经允许不得转载:spark技术分享 » SortExec
分享到: 更多 (0)

关注公众号:spark技术分享

联系我们联系我们

觉得文章有用就打赏一下文章作者

支付宝扫一扫打赏

微信扫一扫打赏