Data Duster is an intelligent web application developed by SMAASU Corporation to simplify data cleaning and processing. Whether youβre dealing with messy datasets in CSV or XLSX format, Data Duster ensures clean, structured, and optimized data in just a few clicks.
- π Upload Dataset β Supports CSV and XLSX files.
- π§Ή Smart Cleaning β Remove duplicates, handle null values, and fill missing data.
- π Selective Processing β Choose specific columns for refinement.
- π Data Visualization β Generate insightful graphs for better analysis.
- π Format Conversion β Convert files into different formats effortlessly.
- πΎ Download Processed Data β Get the refined dataset instantly.
To run Data Duster locally, follow these steps:
# Clone the repository
git clone https://github.com/smaasui/Data-Duster.git
# Navigate to the project directory
cd Data-Duster
# Install dependencies
pip install -r requirements.txt
# Run the Streamlit app
streamlit run app.py- Upload a dataset in CSV or XLSX format.
- Select cleaning options (remove duplicates, fill missing values, etc.).
- Choose specific columns for processing.
- Visualize data insights using charts.
- Download the refined dataset in the desired format.
- Check password security and get recommendations.
Contributions are welcome! To contribute:
- Fork the repository
- Create a new branch (
feature-branch) - Commit your changes (
git commit -m "Added a new feature") - Push to the branch (
git push origin feature-branch) - Open a pull request
This project is licensed under the MIT License.
For any queries or suggestions, reach out to us at smaasu01@gmail.com or visit SMAASU Corporation.
π Enjoy seamless data cleaning and thanks to SMAASU I ! π
