AllStagesPage — Stages for All Jobs

Stages for All Jobs Page

AllStagesPage is a web page (section) registered with the Stages tab. It displays all stages in a Spark application (active, pending, completed, and failed) together with their counts.

Figure 1. Stages Tab in web UI for FAIR scheduling mode (with pools only)

In FAIR scheduling mode, the page also shows a table of the scheduler pools and the pool name for each stage.

Note
Pool names are calculated using SparkContext.getAllPools.

Internally, AllStagesPage is a WebUIPage with access to the parent Stages tab and, more importantly, to the JobProgressListener, which gives it the current state of the entire Spark application.

Rendering AllStagesPage (render method)

render generates an HTML page to display in a web browser.

It uses the parent’s JobProgressListener to know about:

  • active stages (as activeStages)

  • pending stages (as pendingStages)

  • completed stages (as completedStages)

  • failed stages (as failedStages)

  • the number of completed stages (as numCompletedStages)

  • the number of failed stages (as numFailedStages)
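The values in the list above can be pictured as a single snapshot that render reads once per request. The following is a minimal sketch in Python, not Spark's actual code: the ListenerState and StageInfo classes here are hypothetical stand-ins, and keeping the completed and failed counts apart from the lists is an assumption of the sketch (it allows a count to be larger than the number of stage objects still held).

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class StageInfo:
    """Hypothetical stand-in for Spark's StageInfo; only the fields used here."""
    stage_id: int
    name: str


@dataclass
class ListenerState:
    """Hypothetical snapshot of the values render reads from the listener."""
    active_stages: List[StageInfo] = field(default_factory=list)
    pending_stages: List[StageInfo] = field(default_factory=list)
    completed_stages: List[StageInfo] = field(default_factory=list)
    failed_stages: List[StageInfo] = field(default_factory=list)
    # Sketch assumption: counts are tracked separately from the lists,
    # so they may exceed the number of StageInfo objects kept above.
    num_completed_stages: int = 0
    num_failed_stages: int = 0


def stage_counts(state: ListenerState) -> Dict[str, int]:
    """Collect the per-state counts a page header could display."""
    return {
        "active": len(state.active_stages),
        "pending": len(state.pending_stages),
        "completed": state.num_completed_stages,
        "failed": state.num_failed_stages,
    }


state = ListenerState(
    active_stages=[StageInfo(3, "count at App.scala:12")],
    completed_stages=[StageInfo(2, "map at App.scala:8")],
    num_completed_stages=5,
)
print(stage_counts(state))
```

In this sketch the completed count comes from the counter rather than from the length of the list, which is why the two are modelled as separate fields.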

Note
Stage information is available as a StageInfo object.

There are four tables, one per stage state: active, pending, completed, and failed. Each table is displayed only when there are stages in that state.
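The rule that a table appears only when its state has stages can be sketched as follows. This is an illustrative Python model that emits plain-text lines instead of HTML; the function name render_stage_tables and the heading format are assumptions made for the example, not Spark's implementation.

```python
from typing import Dict, List

# Order in which the sketch lays out the per-state tables.
STATES = ("active", "pending", "completed", "failed")


def render_stage_tables(stages_by_state: Dict[str, List[str]]) -> List[str]:
    """Return the lines of a plain-text page with one table per non-empty state."""
    lines: List[str] = []
    for state in STATES:
        stages = stages_by_state.get(state, [])
        if not stages:
            # No stages in this state: skip the table and its heading entirely.
            continue
        lines.append(f"{state.capitalize()} Stages ({len(stages)})")
        lines.extend(f"  {name}" for name in stages)
    return lines


page = render_stage_tables({
    "active": ["count at App.scala:12"],
    "pending": [],
    "completed": ["map at App.scala:8", "collect at App.scala:9"],
})
print("\n".join(page))
```

With the input above, the pending and failed sections are omitted altogether, one because its list is empty and the other because the state is missing from the input.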

Figure 2. Stages Tab in web UI for FAIR scheduling mode (with pools and stages)

A stage that was retried is also marked with “retry”.
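One way such a marker could be produced is shown below. This is only a sketch under an assumption: that each stage carries an attempt number starting at 0 and that any later attempt counts as a retry. The stage_label function and its output format are hypothetical and are not taken from Spark.

```python
def stage_label(name: str, attempt_id: int) -> str:
    """Append a retry marker for any attempt after the first (attempt 0)."""
    if attempt_id > 0:
        return f"{name} (retry {attempt_id})"
    return name


print(stage_label("count at App.scala:12", 0))
print(stage_label("count at App.scala:12", 2))
```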

Caution
FIXME A screenshot