大数据spark,kafka,hbase,elasticsearch,机器学习AI,海量实践干货,面试宝典spark技术分享

关注 spark技术分享,
撸spark源码 玩spark最佳实践

精品课程系列

大数据-机器学习-成长训练营

一周热门排行

  • 暂无最多评论文章!
RDD Dependencies-spark技术分享
spark-core

RDD Dependencies

admin阅读(1523)赞(0)

RDD Dependencies Dependency class is the base (abstract) class to model a dependency relationship between two or m ...

Checkpointing-spark技术分享
spark-core

Checkpointing

admin阅读(1251)赞(0)

Checkpointing Checkpointing is a process of truncating RDD lineage graph and saving it to a reliable distributed ( ...

Shuffling-spark技术分享
spark-core

Shuffling

admin阅读(1493)赞(0)

RDD shuffling Tip Read the official documentation about the topic Shuffle operations. It is still better than ...

HashPartitioner-spark技术分享
spark-core

HashPartitioner

admin阅读(1514)赞(0)

HashPartitioner HashPartitioner is a Partitioner that uses partitions configurable number of partitions to shuffle ...

Partitioner-spark技术分享
spark-core

Partitioner

admin阅读(1480)赞(0)

Partitioner Caution FIXME Partitioner captures data distribution at the output. A scheduler can optimize ...

Partition-spark技术分享
spark-core

Partition

admin阅读(1470)赞(0)

Partition Partition is a contract of a partition index of a RDD. Note A partition is missing when it has no ...

StorageLevel-spark技术分享
spark-core

StorageLevel

admin阅读(1553)赞(0)

StorageLevel StorageLevel describes how an RDD is persisted (and addresses the following concerns): Does RDD u ...

关注公众号:spark技术分享

联系我们联系我们