Big Data (pyspark)
Instructor: Anne Claire Fouilloux
This one-day Carpentry@UiO hands-on workshop will give a short introduction to big data analysis using pyspark.
The Spark Python API (PySpark) exposes the Spark programming model to Python. Apache® Spark™ is an open source and is one of the most popular Big Data frameworks for scaling up your tasks in a cluster. It was developed to utilize distributed, in-memory data structures to improve data processing speeds. A basic knowledge of python is recommended but you don't need to have any previous knowledge of big data analysis or Apache Spark.
More information about the course can be found at the Carpentry GitHub.
This workshop is a part of Research Bazaar 2017