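The five main properties of an RDD. I. Brief version; II. Detailed version. I. Brief version: (1) A list of partitions: an RDD is made up of many partitions, and each partition corresponds to one task. (2) A function for computing each split: performing a computation on an RDD ......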
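The first two properties can be illustrated with a small sketch in plain Python. This is a toy model only, not Spark's API: the class `ToyRDD` and its methods `from_list`, `map`, and `collect` are hypothetical names chosen for illustration, and the "tasks" run sequentially in one process.

```python
from typing import Callable, Iterator, List


class ToyRDD:
    """Hypothetical toy model of an RDD; this is not Spark's RDD class."""

    def __init__(self, num_partitions: int, compute: Callable[[int], Iterator]):
        self.num_partitions = num_partitions  # property (1): a list of partitions
        self._compute = compute               # property (2): a function for computing each split

    @classmethod
    def from_list(cls, data: List, num_partitions: int) -> "ToyRDD":
        # Slice the data into contiguous chunks, one chunk per partition.
        items = list(data)
        n = len(items)
        bounds = [(i * n) // num_partitions for i in range(num_partitions + 1)]
        return cls(num_partitions, lambda i: iter(items[bounds[i]:bounds[i + 1]]))

    def map(self, f: Callable) -> "ToyRDD":
        # The child keeps the parent's partitioning and computes each split
        # lazily from the matching split of the parent.
        return ToyRDD(self.num_partitions, lambda i: (f(x) for x in self._compute(i)))

    def collect(self) -> List:
        # One "task" per partition; here the tasks simply run one after another.
        out = []
        for i in range(self.num_partitions):
            out.extend(self._compute(i))
        return out


rdd = ToyRDD.from_list(list(range(10)), num_partitions=3)
squares = rdd.map(lambda x: x * x)
tasks = squares.num_partitions  # three partitions, so three tasks
result = squares.collect()
```

In this sketch, calling `map` does no work by itself. Each split is computed only when `collect` runs the task for that partition, and the parent `rdd` is left unchanged.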
I. The official definition of an RDD: "A Resilient Distributed Dataset (RDD), the basic abstraction in Spark. Represents an immutable, partitioned collection of elements that can be operated on in parallel." ......
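The three adjectives in that definition (immutable, partitioned, operated on in parallel) can be sketched with the Python standard library alone. This is an analogy under stated assumptions, not Spark code: a tuple of tuples stands in for the partitioned collection, threads stand in for Spark executors, and `double_partition` is a hypothetical helper.

```python
from concurrent.futures import ThreadPoolExecutor

# An immutable, partitioned collection: a tuple of tuples (assumed layout).
partitions = ((1, 2, 3), (4, 5), (6, 7, 8, 9))


def double_partition(part):
    # Build a new tuple; the input partition is never modified.
    return tuple(2 * x for x in part)


# Submit one unit of work per partition; pool.map returns results in input order.
with ThreadPoolExecutor(max_workers=len(partitions)) as pool:
    doubled = tuple(pool.map(double_partition, partitions))
```

The transformation yields a new collection (`doubled`) with the same partitioning, while `partitions` itself is untouched, which mirrors how a transformation on an RDD produces a new RDD instead of changing the existing one.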