DevOps Articles

Curated articles, resources, tips and trends from the DevOps World.

Apache Spark: Resilient Distributed Datasets

6 years ago dzone.com
Apache Spark: Resilient Distributed Datasets

Summary: This is a summary of an article originally published by the source. Read the full original article here →

RDDs represent both the idea of how a large dataset is represented in Apache Spark and the abstraction for working with it. This section will cover the former, and the following sections will cover the latter.

Made with pure grit © 2026 Jetpack Labs Inc. All rights reserved. www.jetpacklabs.com