• LOGIN
  • No products in the cart.

Build powerful predictive models in R.


Course Description

R is a statistical programming language that provides impressive tools to analyze data and create high-level graphics. This video course will take you from very basics of R to creating insightful machine learning models with R. You will start with setting up the environment and then perform data ETL in R.

Data exploration examples are provided that demonstrate how powerful data visualization and machine learning is in discovering hidden relationship. You will then dive into important machine learning topics, including data classification, regression, clustering, association rule mining, and dimensionality reduction.


Learning Outcomes

  • Create and inspect the transaction dataset, performing association analysis with the Apriori algorithm
  • Visualize patterns and associations using a range of graphs and find frequent itemsets using the Eclat algorithm
  • Compare the differences between each regression method to discover how they solve problems
  • Predict possible churn users with the classification approach
  • Implement the clustering method to segment customer data
  • Compress images with the dimension reduction method
  • Incorporate R and Hadoop to solve machine learning problems on big data

Pre-requisites

  • Although programming with R is not a prerequisite, it would be helpful. A background in linear algebra and statistics is expected.
  • This easy-to-follow guide is full of hands-on examples of data analysis with R. Each topic is fully explained beginning with the core concepts, followed by step-by-step, practical examples and concluding with detailed explanations of each concept used.

Who is this course intended for?

  • This course is for anyone who wants to enter the world of machine learning and is looking for a guide that is easy to follow.

Course Curriculum

Getting Started with R
The Course Overview 00:00:00
Downloading and Installing R 00:00:00
Downloading and Installing RStudio 00:00:00
Installing and Loading Packages 00:00:00
Reading and Writing Data 00:00:00
Using R to Manipulate Data 00:00:00
Applying Basic Statistics 00:00:00
Visualizing Data 00:00:00
Getting a Dataset for Machine Learning 00:00:00
Data Exploration with RMS Titanic
Reading a Titanic Dataset from a CSV File 00:00:00
Converting Types on Character Variables 00:00:00
Detecting Missing Values 00:00:00
Imputing Missing Values 00:00:00
Exploring and Visualizing Data 00:00:00
Predicting Passenger Survival with a Decision Tree 00:00:00
Validating the Power of Prediction with a Confusion Matrix 00:00:00
Assessing performance with the ROC curve 00:00:00
R and Statistics
Understanding Data Sampling in R 00:00:00
Operating a Probability Distribution in R 00:00:00
Working with Univariate Descriptive Statistics in R 00:00:00
Performing Correlations and Multivariate Analysis 00:00:00
Operating Linear Regression and Multivariate Analysis 00:00:00
Conducting an Exact Binomial Test 00:00:00
Performing Student’s t-test 00:00:00
Performing the Kolmogorov-Smirnov Test 00:00:00
Understanding the Wilcoxon Rank Sum and Signed Rank Test 00:00:00
Working with Pearson’s Chi-Squared Test 00:00:00
Conducting a One-Way ANOVA 00:00:00
Performing a Two-Way ANOVA 00:00:00
Understanding Regression Analysis
Fitting a Linear Regression Model with lm 00:00:00
Summarizing Linear Model Fits 00:00:00
Using Linear Regression to Predict Unknown Values 00:00:00
Generating a Diagnostic Plot of a Fitted Model 00:00:00
Fitting a Polynomial Regression Model with lm 00:00:00
Fitting a Robust Linear Regression Model with rlm 00:00:00
Studying a case of linear regression on SLID data 00:00:00
Applying the Gaussian Model for Generalized Linear Regression 00:00:00
Applying the Poisson model for Generalized Linear Regression 00:00:00
Applying the Binomial Model for Generalized Linear Regression 00:00:00
Fitting a Generalized Additive Model to Data 00:00:00
Visualizing a Generalized Additive Model 00:00:00
Diagnosing a Generalized Additive Model 00:00:00
Classification – Tree, Lazy, and Probabilistic
Preparing the Training and Testing Datasets 00:00:00
Building a Classification Model with Recursive Partitioning Trees 00:00:00
Visualizing a Recursive Partitioning Tree 00:00:00
Measuring the Prediction Performance of a Recursive Partitioning Tree 00:00:00
Pruning a Recursive Partitioning Tree 00:00:00
Building a Classification Model with a Conditional Inference Tree 00:00:00
Visualizing a Conditional Inference Tree 00:00:00
Measuring the Prediction Performance of a Conditional Inference Tree 00:00:00
Classifying Data with the K-Nearest Neighbor Classifier 00:00:00
Classifying Data with Logistic Regression 00:00:00
Classifying data with the Naïve Bayes Classifier 00:00:00
Neural Network and SVM
Classifying Data with a Support Vector Machine 00:00:00
Choosing the Cost of an SVM 00:00:00
Visualizing an SVM Fit 00:00:00
Predicting Labels Based on a Model Trained by an SVM 00:00:00
Tuning an SVM 00:00:00
Training a Neural Network with neuralnet 00:00:00
Visualizing a Neural Network Trained by neuralnet 00:00:00
Predicting Labels based on a Model Trained by neuralnet 00:00:00
Training a Neural Network with nnet 00:00:00
Predicting labels based on a model trained by nnet 00:00:00
Model Evaluation
Estimating Model Performance with k-fold Cross Validation 00:00:00
Performing Cross Validation with the e1071 Package 00:00:00
Performing Cross Validation with the caret Package 00:00:00
Ranking the Variable Importance with the caret Package 00:00:00
Ranking the Variable Importance with the rminer Package 00:00:00
Finding Highly Correlated Features with the caret Package 00:00:00
Selecting Features Using the Caret Package 00:00:00
Measuring the Performance of the Regression Model 00:00:00
Measuring Prediction Performance with a Confusion Matrix 00:00:00
Measuring Prediction Performance Using ROCR 00:00:00
Comparing an ROC Curve Using the Caret Package 00:00:00
Measuring Performance Differences between Models with the caret Package 00:00:00
Ensemble Learning
Classifying Data with the Bagging Method 00:00:00
Performing Cross Validation with the Bagging Method 00:00:00
Classifying Data with the Boosting Method 00:00:00
Performing Cross Validation with the Boosting Method 00:00:00
Classifying Data with Gradient Boosting 00:00:00
Calculating the Margins of a Classifier 00:00:00
Calculating the Error Evolution of the Ensemble Method 00:00:00
Classifying Data with Random Forest 00:00:00
Estimating the Prediction Errors of Different Classifiers 00:00:00
Clustering
Clustering Data with Hierarchical Clustering 00:00:00
Cutting Trees into Clusters 00:00:00
Clustering Data with the k-Means Method 00:00:00
Drawing a Bivariate Cluster Plot 00:00:00
Comparing Clustering Methods 00:00:00
Extracting Silhouette Information from Clustering 00:00:00
Obtaining the Optimum Number of Clusters for k-Means 00:00:00
Clustering Data with the Density-Based Method 00:00:00
Clustering Data with the Model-Based Method 00:00:00
Visualizing a Dissimilarity Matrix 00:00:00
Validating Clusters Externally 00:00:00
Association Analysis and Sequence Mining
Transforming Data into Transactions 00:00:00
Displaying Transactions and Associations 00:00:00
Mining Associations with the Apriori Rule 00:00:00
Pruning Redundant Rules 00:00:00
Visualizing Association Rules 00:00:00
Mining Frequent Itemsets with Eclat 00:00:00
Creating Transactions with Temporal Information 00:00:00
Mining Frequent Sequential Patterns with cSPADE 00:00:00
Dimension Reduction
Performing Feature Selection with FSelector 00:00:00
Performing Dimension Reduction with PCA 00:00:00
Determining the Number of Principal Components Using the Scree Test 00:00:00
Determining the Number of Principal Components Using the Kaiser Method 00:00:00
Visualizing Multivariate Data Using biplot 00:00:00
Performing Dimension Reduction with MDS 00:00:00
Reducing Dimensions with SVD 00:00:00
Compressing Images with SVD 00:00:00
Performing Nonlinear Dimension Reduction with ISOMAP 00:00:00
Performing Nonlinear Dimension Reduction with Local Linear Embedding 00:00:00
Big Data Analysis with R and Hadoop
Preparing the RHadoop Environment 00:00:00
Installing rmr2 00:00:00
Installing rhdfs 00:00:00
Operating HDFS with rhdfs 00:00:00
Implementing a Word Count Problem with RHadoop 00:00:00
Comparing the Performance between an R MapReduce Program and a Standard R Progra 00:00:00
Testing and Debugging the rmr2 Program 00:00:00
Installing plyrmr 00:00:00
Manipulating Data with plyrmr 00:00:00
Conducting Machine Learning with RHadoop 00:00:00
Configuring RHadoop Clusters on Amazon EMR 00:00:00

Course Reviews

N.A

ratings
  • 5 stars0
  • 4 stars0
  • 3 stars0
  • 2 stars0
  • 1 stars0

No Reviews found for this course.

TAKE THIS COURSE
  • $99.00 $10.00
  • UNLIMITED ACCESS
  • Course Certificate
STUDENTS ENROLLED
    © Learnsector