Dados limpos para a versão em português do Certificado Profissional de Análise de Dados do Google
-
Updated
Dec 3, 2022 - Python
Dados limpos para a versão em português do Certificado Profissional de Análise de Dados do Google
Databroom is a cross-language data cleaning tool with CLI, GUI, and API. Clean CSV, Excel, or JSON files and generate reproducible scripts in Python (pandas) or R (tidyverse). Now supports saving and loading cleaning pipelines as JSON for fully automated, shareable workflows.
Automated text preprocessing pipeline for large corpora. Features customizable filters for diacritics, stop words, punctuation, and regex.
Language model assisted data cleaning
This AI Agent collects data from sources like Database connection, API connection, Direct CSV/XSLX file upload; cleans & preprocess it; can query data using custom SQL commands through NLP chatbot; visualizations are provided; and a final Report is generated at the end. It acts as a accessory tool for a Data Analyst.
Add a description, image, and links to the data-cleaning-automation topic page so that developers can more easily learn about it.
To associate your repository with the data-cleaning-automation topic, visit your repo's landing page and select "manage topics."