关注 spark技术分享,
撸spark源码 玩spark最佳实践

Spark SQL — Batch and Streaming Queries Over Structured Data on Massive Scale

Spark SQL — Batch and Streaming Queries Over Structured Data on Massive Scale

Like Apache Spark in general, Spark SQL in particular is all about distributed in-memory computations on massive scale.

The primary difference between Spark SQL’s and the “bare” Spark Core’s RDD computation models is the framework for loading, querying and persisting structured and semi-structured data using structured queries that can be expressed using good ol’ SQL, HiveQL and the custom high-level SQL-like, declarative, type-safe Dataset API called Structured Query DSL.

Tip
You can find more information about Spark SQL in my Mastering Spark SQL gitbook.
赞(0) 打赏
未经允许不得转载:spark技术分享 » Spark SQL — Batch and Streaming Queries Over Structured Data on Massive Scale
分享到: 更多 (0)

关注公众号:spark技术分享

联系我们联系我们

觉得文章有用就打赏一下文章作者

支付宝扫一扫打赏

微信扫一扫打赏