关注 spark技术分享,
撸spark源码 玩spark最佳实践

ExternalRDD

ExternalRDD

ExternalRDD is a leaf logical operator that is a logical representation of (the data from) an RDD in a logical query plan.

ExternalRDD is created when:

ExternalRDD is a MultiInstanceRelation and a ObjectProducer.

Note
ExternalRDD is resolved to ExternalRDDScanExec when BasicOperators execution planning strategy is executed.

newInstance Method

Note
newInstance is part of MultiInstanceRelation Contract to…​FIXME.

newInstance…​FIXME

Computing Statistics — computeStats Method

Note
computeStats is part of LeafNode Contract to compute statistics for cost-based optimizer.

computeStats…​FIXME

Creating ExternalRDD Instance

ExternalRDD takes the following when created:

Creating ExternalRDD — apply Factory Method

apply…​FIXME

Note
apply is used when SparkSession is requested to create a DataFrame from RDD of product types (e.g. Scala case classes, tuples) or Dataset from RDD of a given type.
赞(0) 打赏
未经允许不得转载:spark技术分享 » ExternalRDD
分享到: 更多 (0)

关注公众号:spark技术分享

联系我们联系我们

觉得文章有用就打赏一下文章作者

支付宝扫一扫打赏

微信扫一扫打赏