Program Card
Praxis logo

Certificate Program in Big Data Technologies

Program Highlights:
Praxis Certification, Placement Support

Certificate Program in Big Data Technologies

This Program on Big Data Technologies – designed by veterans in the Analytics industry; helps to establish a decent career in the growing Data and Analytics domain. This uniquely blended Program is brought to you by Praxis, a Top-ranked Analytics B-School in India.

Sneak Peak

INR 14,000

Program Summary

  • 8 credits
    Credits

    With this course, you are 5 credits short of an assured placement.

    Learn more
  • Duration 3 months
  • 35 Hours of projects/Assignments
  • 86 Hours of online sessions

Course Topics

  • 1

    Big Data 101

    • Big Data Characteristics

      • Volume
      • Variety
      • Velocity
      • Veracity
      • Valence
      • Value
    • Big Data and Business

    • Data Relationships and Data Model

      • One-to-one relationship
      • One-to-many relationship
      • Many-to-many relationship
      • Flat model
      • Hierarchical model
      • Network model
      • Relational model
      • Star schema model
      • Data vault model
    • Data Grouping

    • Clustering Algorithms

      • partitioning
      • hierarchical
      • grid based
      • density based
      • model based
    • Getting ready for Clustering Algorithms

    • Clustering Algorithms – UPGMA, single Link Clustering

    • KPIs, Businesses & Data Elements

    • Mapping for business outcomes

      • Define the pain point
      • Define the goal
      • Identify the actors
      • Identify the impacts
      • Identify the deliverables
      • Creating your impact map
    • Basic Query

    • Advanced Query – Embedding

    • Introduction to key mathematical concepts

      • eigenvalues and eigenvectors
    • Application of eigenvalues and eigenvectors

      • investigate prototypical problems of ranking big data
    • Application of the graph Laplacian

      • investigate prototypical problems of clustering big data
    • Application of PCA and SVD

      • investigate prototypical problems of big data compression
  • 2

    R Programming

    • R Programming

    • Introduction to R – I

    • Introduction to R – II

    • Common Data Structures in R

    • Conditional Operation and Loops

    • Looping in R using Apply Family Functions

    • Creating User Defined Functions in R

    • Graphics with R

    • Advanced Graphics with R

  • 3

    Hadoop

    • Introduction to Big Data and Hadoop

    • Introduction to DBMS systems using MySQL

    • Big Data and Hadoop EcoSystem

    • HDFS

    • Unix & HDFS Hands-on

    • Map-Reduce basics

    • Map Reduce Advanced Topics and Hands on

    • Pig introduction and Hands on

    • Pig Scripting

    • Hive Introduction, Metastore, Limitations of Hive

    • Comparison with Traditional Database and HIVE scripting

    • Hive Data Types, Partitioning and Bucketing

    • Hive Tables (Managed and External)

    • Hive Continued

    • Scoop Introduction and Hands-on

    • Introduction to NoSql and HBASE

    • HBASE architecture and Hands-on

  • 4

    Big Data with Spark and Python

Industry Collaboration

Program Mentors

Learn from the best in the industry

Browse Courses