- Install Pandas (if not already installed).
- Import Pandas and any other necessary libraries.
- Create a DataFrame from a dictionary and from a list of dictionaries.
- Load a dataset from a CSV file that is provide called
messed_dataset.csv
- Display the first and last few rows of the DataFrame.
- Get a summary of the DataFrame, including basic statistics and data types.
- Display the shape and column names of the DataFrame.
- Select a single column and multiple columns.
- Select rows by index and by label.
- Select specific rows and columns using loc and iloc.
- Identify missing values in the DataFrame.
- Drop rows with missing values.
- Fill missing values with a specified value.
- Add a new column to the DataFrame.
- Delete a column from the DataFrame.
- Rename columns in the DataFrame.
- Apply a function to a column.
- Group the DataFrame by a column and calculate summary statistics.
- Iterate over groups and display the group names and data.
- Merge two DataFrames on a common column.
- Join two DataFrames using their indices.
- Create a datetime index for the DataFrame.
- Convert a column to datetime and extract date components.
- Read data from a CSV file into a DataFrame.
- Write the DataFrame to a CSV file.
- Read data from an Excel file into a DataFrame. The data given is
SaleData.xlsl
. - Write the DataFrame to an Excel file.
- Create a simple plot using Matplotlib.
- Create a bar plot using Seaborn.
- Fork this repository.
- Complete the tasks in
assignment.ipynb
. - Commit and push your changes to your forked repository.
- Submit the link to your repository.
- Task 1: Load the provided dataset into a Pandas DataFrame.
- Task 2: Perform basic data cleaning.
- Task 3: Perform data analysis to answer the provided questions.
- Task 4: Visualize the results using Matplotlib/Seaborn.