In data science we can mainly classify data into two main types: qualitative(categorical) and quantitative(numeric).
Describes an object or a matter of quality that can be labeled or named. It cannot be represented in numerical form. Examples include colors, places, etc.
Numerical data that can be measured mathematically. Examples include height, weight, number of students in a school, etc.
A dataset is a structured or organized collection of data, usually associated with a unique body of work. A database is an organized collection of data stored in multiple datasets or tables.
Structured data in tables with rows and columns
Various data models like document, key-value, wide-column, and graph
Databases are crucial for managing and storing large amounts of data. They were introduced to manage and store large amounts of data effectively.
Data Collection is the process of gathering information from relevant sources to find a solution to the given statistical inquiry.
Effective storage of data is essential for managing and analyzing large volumes of data.
Data visualization is graphical representation of data to get meaningful insight, trends, and patterns from data. The visual elements include charts, graphs, maps, figures, and dashboards.
Summary statistics provide information about the data in a sample. It helps understand the values better. It includes the total number of values, minimum value, and maximum value, along with the mean value and the standard deviation corresponding to a data collection.