Analytics – Nuances of big data – Value – Issues – Case for Big data – Big data options Team challenge – Big data sources – Acquisition – Nuts and Bolts of Big data. Features of Big Data - Security, Compliance, auditing and protection - Evolution of Big data – Best Practices for Big data Analytics - Big data characteristics - Volume, Veracity, Velocity, Variety – Data Appliance and Integration tools – Greenplum – Informatica
Evolution of analytic scalability – Convergence – parallel processing systems – Cloud computing – grid computing – map reduce – enterprise analytic sand box – analytic data sets – Analytic methods – analytic tools – Cognos – Microstrategy - Pentaho. Analysis approaches – Statistical significance – business approaches – Analytic innovation – Traditional approaches – Iterative
Introduction to Streams Concepts – Stream data model and architecture - Stream Computing, Sampling data in a stream – Filtering streams – Counting distinct elements in a stream – Estimating moments – Counting oneness in a window – Decaying window - Realtime Analytics Platform(RTAP) applications IBM Infosphere – Big data at rest – Infosphere streams – Data stage – Statistical analysis – Intelligent scheduler – Infosphere Streams
Predictive Analytics – Supervised – Unsupervised learning – Neural networks – Kohonen models – Normal – Deviations from normal patterns – Normal behaviours – Expert options – Variable entry - Mining Frequent itemsets - Market based model – Apriori Algorithm – Handling large data sets in Main memory – Limited Pass algorithm – Counting frequent itemsets in a stream – Clustering Techniques – Hierarchical – K- Means – Clustering high dimensional data Visualizations - Visual data analysis techniques, interaction techniques; Systems and applications
Basic of R, concepts before starting, Working of R - Creating, listing and deleting the objects in memory - The on-line help Data with R Objects, R data Frames and Matrices, Reading data in a file , Saving data, Generating data, Manipulating objects Graphics with R Managing graphics , Graphical functions - Low-level plotting commands, Graphical parameters, A practical example - The grid and lattice packages