spark-core 第27页

Partitions and Partitioning

2014-07-11admin阅读(1505)赞(0)

Partitions and Partitioning Introduction Depending on how you look at Spark (programmer, devop, admin), an RDD is ...

2014-07-10admin阅读(1541)赞(0)

StorageLevel StorageLevel describes how an RDD is persisted (and addresses the following concerns): Does RDD u ...

2014-07-09admin阅读(2419)赞(0)

RDD Caching and Persistence cache和persist都是用于将一个RDD进行缓存的，这样在之后使用的过程中就不需要重新计算了， ...

2014-07-08admin阅读(1586)赞(0)

Actions Actions are RDD operations that produce non-RDD values. They materialize a value in a Spark program. In ot ...

2014-07-07admin阅读(1389)赞(0)

PairRDDFunctions Tip Read up the scaladoc of PairRDDFunctions. PairRDDFunctions are available in RDDs of ...

2014-07-06admin阅读(1349)赞(0)

Transformations Transformations are lazy operations on a RDD that create one or many new RDDs, e.g. map, filter, ...

2014-07-05admin阅读(1201)赞(0)

Operators - Transformations and Actions RDDs have two types of operations: transformations and actions. Note ...

2014-07-04admin阅读(1343)赞(0)

ShuffledRDD ShuffledRDD is an RDD of key-value pairs that represents the shuffle step in a RDD lineage. It uses cu ...

2014-07-03admin阅读(1401)赞(0)

NewHadoopRDD NewHadoopRDD is an RDD of K keys and V values. NewHadoopRDD is created when: SparkContext.newAP ...

2014-07-02admin阅读(1738)赞(0)

HadoopRDD HadoopRDD is an RDD that provides core functionality for reading data stored in HDFS, a local file syste ...