关注 spark技术分享,
撸spark源码 玩spark最佳实践

标签:spark-core

第26页
RDD Dependencies-spark技术分享
spark-core

RDD Dependencies

admin阅读(1406)赞(0)

RDD Dependencies Dependency class is the base (abstract) class to model a dependency relationship between two or m ...

Checkpointing-spark技术分享
spark-core

Checkpointing

admin阅读(1132)赞(0)

Checkpointing Checkpointing is a process of truncating RDD lineage graph and saving it to a reliable distributed ( ...

Shuffling-spark技术分享
spark-core

Shuffling

admin阅读(1367)赞(0)

RDD shuffling Tip Read the official documentation about the topic Shuffle operations. It is still better than ...

HashPartitioner-spark技术分享
spark-core

HashPartitioner

admin阅读(1396)赞(0)

HashPartitioner HashPartitioner is a Partitioner that uses partitions configurable number of partitions to shuffle ...

Partitioner-spark技术分享
spark-core

Partitioner

admin阅读(1344)赞(0)

Partitioner Caution FIXME Partitioner captures data distribution at the output. A scheduler can optimize ...

Partition-spark技术分享
spark-core

Partition

admin阅读(1347)赞(0)

Partition Partition is a contract of a partition index of a RDD. Note A partition is missing when it has no ...

StorageLevel-spark技术分享
spark-core

StorageLevel

admin阅读(1418)赞(0)

StorageLevel StorageLevel describes how an RDD is persisted (and addresses the following concerns): Does RDD u ...

关注公众号:spark技术分享

联系我们联系我们