This project contains comprehensive resources and code examples that cover various aspects of data analysis using Pandas, the popular Python library.
This repository is structured into several folders, each focusing on a different topic in Pandas. These topics range from basic DataFrame manipulation to advanced techniques like multi-indexing, merging, and visualizations.
- Python Crash Course : A quick Python refresher before diving into Pandas.
- Series : Operations on Pandas Series, a one-dimensional array for handling data.
- DataFrames I - Introduction : Basics of working with DataFrames – selecting, indexing, and modifying data.
- DataFrames II - Filtering Data : Intermediate DataFrame operations like filtering and sorting.
- DataFrames III - Data Extraction : Advanced DataFrame tasks, including applying functions and reshaping data.
- Working with Text Data : Processing and manipulating text data in Pandas.
- MultiIndex : Working with hierarchical (multi-level) indices in Pandas.
- The GroupBy Object : Grouping data and applying aggregate functions using
groupby. - Merging, Joining, and Concatenating DataFrames : Combining multiple datasets with
merge,join, andconcat. - Working with Dates and Times in Datasets : Handling date and time data, including parsing and time-based indexing.
- Input and Output in Pandas : Reading data from files (CSV, Excel) and saving outputs.
- Visualizations : Creating plots and charts to visualize data.
- Pandas Options and Settings : Configuring Pandas display settings like precision, max rows/columns, and more to control the output format.
To follow along with the examples in this repository, you will need the following Python libraries installed:
- Python 3.x
- Pandas
- NumPy
- Matplotlib