Data science is an interdisciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge and insights from structured and unstructured data. Data science is related to data mining and big data.
Data science is a “concept to unify statistics, data analysis, machine learning, and their related methods” in order to “understand and analyze actual phenomena” with data. It employs techniques and theories drawn from many fields within the context of mathematics, statistics, computer science, and information science.
Python for Data Science
- What is Data Science
- Artificial Intelligence in Data Science
- Data Analysis in Data Science
- Deep learning in Data Science
Data Preparation
- Selecting and Filtering Data
- How to remove duplicated from Dataset
- How to transform Dataset
- How to aggregate Dataset in Python
- How to fill missing values in Python
- How to concat dataframes in Python
- How to group in Dataset in Python
Data Visualization
- Types of Data Visualization
- Bar Chart
- Define elements of Plot
- How to create a label and annotation
- How to create statistical data graphics
- Line Chart
- Pie Chart
- How to format plot
- How to visualize time-series
Math and Statistics for Data Science
- Simple arithmetic for Data Science
- Generating Summary Statistics
- Parametric Correlation Analysis
- Transforming dataset distributions
- Multivariate analysis for outliers
- Basic Linear algebra for Data Science
- Summarizing Categorical Data
- Non-parametric Correlation Analysis
- Extreme value analysis for outliers