关注 spark技术分享,
撸spark源码 玩spark最佳实践

spark-core 第27页

Partitions and Partitioning-spark技术分享

Partitions and Partitioning

admin阅读(1349)赞(0)

Partitions and Partitioning Introduction Depending on how you look at Spark (programmer, devop, admin), an RDD is ...

StorageLevel-spark技术分享

StorageLevel

admin阅读(1418)赞(0)

StorageLevel StorageLevel describes how an RDD is persisted (and addresses the following concerns): Does RDD u ...

Actions-spark技术分享

Actions

admin阅读(1466)赞(0)

Actions Actions are RDD operations that produce non-RDD values. They materialize a value in a Spark program. In ot ...

PairRDDFunctions-spark技术分享

PairRDDFunctions

admin阅读(1268)赞(0)

PairRDDFunctions Tip Read up the scaladoc of PairRDDFunctions. PairRDDFunctions are available in RDDs of ...

Transformations-spark技术分享

Transformations

admin阅读(1217)赞(0)

Transformations Transformations are lazy operations on a RDD that create one or many new RDDs, e.g. map, filter, ...

Operators-spark技术分享

Operators

admin阅读(1089)赞(0)

Operators - Transformations and Actions RDDs have two types of operations: transformations and actions. Note ...

ShuffledRDD-spark技术分享

ShuffledRDD

admin阅读(1221)赞(0)

ShuffledRDD ShuffledRDD is an RDD of key-value pairs that represents the shuffle step in a RDD lineage. It uses cu ...

NewHadoopRDD-spark技术分享

NewHadoopRDD

admin阅读(1284)赞(0)

NewHadoopRDD NewHadoopRDD is an RDD of K keys and V values. NewHadoopRDD is created when: SparkContext.newAP ...

HadoopRDD-spark技术分享

HadoopRDD

admin阅读(1638)赞(0)

HadoopRDD HadoopRDD is an RDD that provides core functionality for reading data stored in HDFS, a local file syste ...

关注公众号:spark技术分享

联系我们联系我们