Data Pre-processing and Post Processing Lab

Instructions:

  1. The course objectives and course outcomes are identical to that of (Data Pre-processing and Post Processing) as this is the practical component of the corresponding theory paper.
  2. The practical list shall be notified by the teacher in the first week of the class commencement under intimation to the office of the Head of Department / Institution in which the paper is being offered from the list of practicals below. 

List of Programs

  1. Download, install and explore the features of NumPy, SciPy, Jupyter, Stats models and Pandas packages.
  2. Working with NumPy arrays
  3. Working with Pandas data frames
  4. Reading data from text files, Excel and the web and exploring various commands for doing descriptive analytics on the Iris data set.
  5. Use the diabetes data set from UCI and Pima Indians Diabetes data set for performing the following: 
    • Univariate analysis: Frequency, Mean, Median, Mode, Variance, Standard Deviation, Skewness and Kurtosis. 
    • Bivariate analysis: Linear and logistic regression modelling 
    • Multiple Regression analysis 
    • Also compare the results of the above analysis for the two data sets.
  6. Perform following pre-processing techniques on loan prediction dataset
    • Feature Scaling
    • Feature Standardization
    • Label Encoding
    • One Hot Encoding 
  7. Apply and explore various plotting functions on UCI data sets. 
    • a. Normal curves 
    • b. Density and contour plots 
    • c. Correlation and scatter plots 
    • d. Histograms 
    • e. Three-dimensional plotting 
  8. Perform following visualizations using matplotlib: Bar Graph, Pie Chart, Box Plot, Histogram, Line Chart and Subplots, Scatter Plot.

No comments:

Post a Comment