Data Science
Institute Of Future Analytics
Data Science
Introduction
Flow Controllers
Modules
Functions
File Handling
Oops Concepts
Multi-Threading
GUI Programming
Component and events
N/w Programming
String Handling
Operators
Collections
Packages
List, Tuples And Dictionaries
Regular Expressions
Database Access
Introduction to RDBMS
Installation of MySQL Python Modules
Working with csv , xml and Json files
Introduction
Array indexing
Array math
Introduction to Pandas
Series object
Built in functions
Working With Text
String Methods
Working With Group
pd.concat()
Arrays
Data types
Useful functions in Numpy
Installations
Attributes of series
Introduction to Data Frames
Filtering Data Frames
Filtering with String
Handling missing values
Joins
Introduction to R
Installation and use of software
Data input/output
Data types and variables
Operators in R
Conditional executions
loops
Vectorization
Lists Operations
Matrices
Factors
DataFrames
Importing and exporting data to/from external sources
Data Manipulation with bindings
Functions
Statistical Concepts
Mean, Median Mode
Variance, Standard Deviation
Working with messy data
Data querying: SQL and R
Data Visualisation in R using GGPlot2
Box Plot,Histograms,Scatter Plotter,Line chart,Bar Chart
Introduction to DBMS
Introduction to SQL
DDL and DML Statements
Working with Constraints
Implementing Views
Working with Indexes
Implementing Triggers
Working with Queries (DQL)
Aggregate Functions
Joins and Set Operations
Implementation of Data integrity
Data Control language (DCL)
Working with Stored Procedures
Working With Functions
Introduction to NOSQL Databases
Installing MongoDB
Creating documents
Updating documents
Deleting documents
Selecting data
Using index
Types of index
Operators
Concept of replication
Connecting with mongoservers
Replication handon
Concept of sharding
Sharding hands on
History of machine learning
Concept of machine learning
application of ml
types of ml algorithm
Different types of Regression
Linear Regression
Logistic Regression
Decision tree Algorithms
Classification problems
KNN Classification
SVM Classification
Unsupervised learning
k-means clustering
Overfitting / Underfitting
performance matrix
introduction to data mining
Downloading Tableau Public
Understanding Tableau Interface
Connecting with Datasets
Plotting Simple Charts
Mapping in Tableau
Joining Dataset
Sorting Data
Histogramss
Applying Filters
Calculation in Tableau
Table Calculation
Using Inbuilt Functions
Creating Calculated Fields
Formatting
Creating Story
Creating Dashboards
Introduction to Big Data and Analytics
Structured and Unstructured
Big Data Characteristics
Evolution – Definition – Challenges with Big Data
Traditional approaches vs big data
Why Big Data
Introduction To Technology Landscape
NoSQL
Hadoop
Spark
Hadoop Overview
HDFS
MapReduce and yarn
Partitioners
Combiners
Introduction To HIVE and PIG
HQL
PigLatin
Sqooping with Hadoop and sql
Hbase
Introduction to Data Analysis with Spark
Who Uses Spark, and for What?
Spark Framework
Spark Architecure
RDDs (Resilient Distributed Datasets)
RDD Lineage
Implementing Triggers
Transformations
Actions
Introduction to Spark SQL
Using Spark SQL
Processing Live Data with spark streaming
Using netcat
DataFrames in spark