Webinar: What is Hadoop?

 

23 February 2016
Online, 15.00 - 16.00

 

Have you heard of Hadoop but don’t know what it is or what it does? Or do you know that Hadoop is used to store very large datasets but you don’t know what it can do or why it might be relevant to you? If so, this webinar is for you. 

 

This webinar will provide an overview of Hadoop, including:

 

This webinar is intended for researchers with no in-depth knowledge of programming with data. The webinar will consist of a 30 minute presentation followed by 20 minutes for questions.

 

Resources: Booking

 

Webinar: What is Hive?


22 March 2016
Online, 15.00 - 16.00

 

Hive is a package that works with Hadoop that allows users to manipulate very large datasets. This webinar is intended as an overview of what Hive is and why you might want to learn more about it.

 

This webinar will provide an overview of:


This webinar is intended for researchers with no in-depth knowledge of programming with data. However, attendees are more likely to find this webinar of interest if they already have some experience of doing simple data manipulations (e.g. obtaining summary statistics or aggregating data in SPSS, Stata or R). The webinar will consist of a 30 minute presentation followed by 20 minutes for questions.

 

Resources: Booking

 

Webinar: What is Spark?

 

19 April 2016
Online, 15.00 - 16.00

 

Spark might be considered as a one-stop tool for big data processing, providing data manipulation facilities to slice and dice datasets as well as statistical functionality and visualisation capabilities to present your results. This webinar is intended as an overview of the Spark system and what you can use it for.

 

This webinar will provide:

 

This webinar is intended for researchers with no in-depth knowledge of programming with data. However, attendees are more likely to find this webinar of interest if they already have some experience of doing simple data manipulations and analyses (e.g. obtaining summary statistics and graphs in SPSS, Stata or R). The webinar will consist of a 30 minute presentation followed by 20 minutes for questions.

 

Resources: Booking