Blog - Dimajix

This is part 3 of a series on data engineering in a big data environment.…

Allgemein

This is part 2 of a series on data engineering in a big data environment.…

Allgemein

This is part 1 of a series on data engineering in a big data environment.…

most attendees of dimajix Spark workshops seem to like the hands-on approach I am offering…

Amazon Elastic MapReduce (EMR) is something wonderful if you need compute capacity on demand. I…

Traditionally HDFS was the primary storage for Hadoop (and therefore also for Apache Spark). Naturally…

Working with PySpark Currently Apache Spark with its bindings PySpark and SparkR is the processing…

So the other day I wanted to investigate into using Druid as a reporting backend…

dominik_adm1n23. März 2016