关注 spark技术分享,
撸spark源码 玩spark最佳实践

Project

Project Unary Logical Operator

Project is a unary logical operator that takes the following when created:

Project is created to represent the following:

  • Dataset operators, i.e. joinWith, select (incl. selectUntyped), unionByName

  • KeyValueGroupedDataset operators, i.e. keys, mapValues

  • CreateViewCommand logical command is executed (and aliasPlan)

  • SQL’s SELECT queries with named expressions

Project can also appear in a logical plan after analysis or optimization phases.

Note
Nondeterministic expressions are allowed in Project logical operator and enforced by CheckAnalysis.

The output schema of a Project is…​FIXME

maxRows…​FIXME

resolved…​FIXME

validConstraints…​FIXME

Tip

Use select operator from Catalyst DSL to create a Project logical operator, e.g. for testing or Spark SQL internals exploration.

赞(0) 打赏
未经允许不得转载:spark技术分享 » Project
分享到: 更多 (0)

关注公众号:spark技术分享

联系我们联系我们

觉得文章有用就打赏一下文章作者

支付宝扫一扫打赏

微信扫一扫打赏