Course Outline

Introduction to Programming Big Data with R (pbdR)

  • Configuring your environment for pbdR usage
  • Overview of scope and tools available in pbdR
  • Packages frequently utilized alongside pbdR for Big Data applications

Message Passing Interface (MPI)

  • Leveraging pbdR MPI
  • Parallel processing techniques
  • Point-to-point communication methods
  • Sending Matrices
  • Summing Matrices
  • Collective communication strategies
  • Summing Matrices using Reduce
  • Scatter / Gather operations
  • Other MPI communication patterns

Distributed Matrices

  • Creating a distributed diagonal matrix
  • Performing SVD on a distributed matrix
  • Constructing a distributed matrix in parallel

Statistics Applications

  • Monte Carlo Integration
  • Loading Datasets
  • Reading data across all processes
  • Broadcasting from a single process
  • Reading partitioned data
  • Distributed Regression
  • Distributed Bootstrap

Duration: 21 hours
