Introduction to R for data analytics

Description:

This course introduces researchers and scientists to machine learning techniques in R for the analysis of large data sets. Taught by data analytics experts, it consists of introductory lectures on machine learning techniques and covers basic concepts such as training and test sets, overfitting, bagging, boosting, and error rates. The course also introduces a range of model-based and algorithmic machine learning methods, including clustering, association rules, decision trees, Naive Bayes, and random forests. It also covers practical issues in machine learning, including programming in R, reading data into R, and accessing R packages. Examples of parallel R programming will be shown. Participants will use R for lab exercises on Cineca HPC facilities.

Topics:

Machine Learning techniques
R basics
R parallel packages

Target audience:

Students and researchers from different backgrounds who are looking for technologies and methods to analyze large amounts of data.

Pre-requisites:

Participants must have basic knowledge of statistics; some programming experience (in any language) is recommended. Participants should also be familiar with basic Linux commands, since some of them will be used in the course.

Area: Languages, Techniques, Data
Target: Companies, Research Institutions, Universities
Length: 2 days
Minimum number of attendees required: 6

Next courses

No editions of this course are currently scheduled.

Any questions?

For HPC and computer graphics courses, write to corsi.hpc@cineca.it

About CINECA

Cineca is a non-profit consortium made up of 70 Italian universities, 5 Italian research institutions, and the Italian Ministry of Education.

Today it is the largest Italian computing centre and one of the most important worldwide. With more than seven hundred employees, it operates in the technology transfer sector through high-performance scientific computing, the management and development of networks and web-based services, and the development of complex information systems for handling large amounts of data.

It develops advanced Information Technology applications and services, acting as a trait d'union between the academic world, the sphere of pure research, and the world of industry and Public Administration.

Visit the Cineca website