Posts

Apache Spark: How to Decide the Number of Executors & Memory per Executor?

Apache Spark: Client vs. Cluster Mode

Apache Spark: Accumulators & Broadcast Variables

What's new in Spark 3.0?

Apache Spark: Structured Streaming - Part I

Apache Spark: Handle Corrupt/Bad Records

Amazon EMR (Elastic MapReduce)

Big Data Evolution: Migrating an On-Premises Database to Hadoop