Fermer

Advanced data management & manipulation using R - in coll. with CUSO

22-23 October

In collaboration with CUSO doctoral program in Ecology and Evolution

Venue: University fo Bern

Flyer

Speaker

Objectives

Participants will be able to apply R as a powerful tool to manage, manipulate and analyse their own data sets. Particularly, there are going to learn:

  • the basic concepts of data structures & data management in R
  • the application of fast and efficient libraries specifically designed for the analysis large data sets
  • how to connect R to data bases and access them using SQL queries

Content

The analysis of large data sets (“big data”) is becoming increasingly important in science and elsewhere. In this course you will learn how to use R to manage and manipulate large data sets, i.e. to sort, merge, subset, aggregate and reshape data, including outlier detection and gap filling algorithms.

For advanced data manipulation, we are going to use novel developments such as dplyr (“A Grammar of Data Manipulation”), the pipe operator (%>%) for simpler R-coding and data.table for the fast aggregation of large data sets. Furthermore, we will have a closer look at R-data base connections, SQL queries and the creation of new data bases from R.

Depending on the course progress, there will be scope for individuals to work on small projects and/ or their own data sets.

 

Course outline:

  1. Data structures
  2. 
Data management (merge, sort, reshape,...)
  3. “The data.table way” (data.table)
  4. “The grammar of data manipulation” (dplyr)
  5. Tidying up messy data (tidyr, NAs & outliers)
  6. Databases (ODB)
  7. Reporting (knitr)

(teaching hours: 12 hours, home work: 4 hours)

Requirements for attending and completing the workshop

Familiarity with R before attending the workshop or previous attendance of an introductory course to R.

For information: An Introduction to R 4-7 June 2018, University of Lausanne

Bring your own laptop to the workshop with recent versions of R and R-Studio installed. Make sure that your laptop is properly connecting to the University of Bern or eduroam WLAN.

 

Course completion requirements:

  • Attendance – Presence and active participation is required during the entire course.
  • Home work - Participants are required to hand in a home work consisting of several exercises before November 2.

General information

Date: 22-23 October

Schedule: from 9:00 to 18:00 - more information on CUSO E&E web site

Venue: University of Bern, Uni Tobler, Raum B -181, Länggassstrasse 49, 3012 Bern

ECTS: 1.0 (Research tools) - only after completion of the homework

Evaluation: Full attendance, active participation and completion of an homework

Information: Please contact Dr Wunder, or the Doctoral Program Coordinator Sara Santi (administration), or see CUSO E&E web site

Registration fee: free

Meals: no meal reimbursement

Travel expenses: For participants of the Interuniversity doctoral program in organismal biology (DP-biol ) please see reimbursement conditions. Contact Sara Santi before sending the original train tickets.

Make sure to sign the presence list each and every day.

Registration

  • This course is free and open to all PhD students. However, until 24 September priority is given to PhD students enrolled into the CUSO Doctoral Program E&E and "Interuniversity doctoral program in organismal biology".
  • Post-docs are welcome as long as places are available.
  • Maximum number of participants: 16 (minimum 8 participants)

Registration through the web only: CUSO E&E website Closed. The course is full.