Exercise: Causing Stage to Fail
This exercise shows how Spark re-executes a stage when it fails, e.g. after losing an executor mid-job.
Recipe
Start a Spark cluster, e.g. a single-node Hadoop YARN cluster.
```bash
start-yarn.sh
```
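Optionally, confirm the cluster is up before moving on. A minimal check, assuming the Hadoop binaries are on your PATH:

```bash
# The daemons started by start-yarn.sh show up as JVM processes
jps -l | grep -iE 'resourcemanager|nodemanager'

# Ask YARN which NodeManagers have registered
yarn node -list
```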
Use at least 2 executors so that you can kill one and keep the application up and running on the remaining one.

```bash
YARN_CONF_DIR=hadoop-conf ./bin/spark-shell --master yarn \
  -c spark.shuffle.service.enabled=true \
  --num-executors 2
```

In the shell, run a 2-stage job:

```scala
// 2-stage job -- it _appears_ that a stage can be failed only when there is a shuffle
sc.parallelize(0 to 3e3.toInt, 2).map(n => (n % 2, n)).groupByKey.count
```
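The recipe stops short of the failure itself. One way to trigger it, sketched below assuming the executors run on a machine you have shell access to: while the job is running, find an executor JVM and kill it (on YARN, Spark executors run as CoarseGrainedExecutorBackend processes).

```bash
# Spark executors run as CoarseGrainedExecutorBackend JVMs
jps -l | grep CoarseGrainedExecutorBackend

# Kill one of them -- 12345 is a placeholder for a PID from the listing above
kill -9 12345
```

If the killed executor held shuffle output that a reduce task still needs, the fetch failure makes the DAGScheduler mark the stage as failed and resubmit it, which you should be able to observe in the driver logs and under the Stages tab of the web UI (http://localhost:4040 by default). Note that with the external shuffle service enabled, shuffle files are served by the NodeManager and may survive the executor, so a fetch failure, and hence a stage retry rather than a mere task retry, is not guaranteed; experiment with the timing of the kill.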