This repository contains analysis of IMDB data from multiple sources and analysis of movies/cast/box office revenues, movie brands and franchises.
-
Updated
Jun 1, 2020 - TSQL
This repository contains analysis of IMDB data from multiple sources and analysis of movies/cast/box office revenues, movie brands and franchises.
Reedelk Runtime Platform Community Edition
Hospital Database Management System (DBMS) is a comprehensive SQL project designed to streamline and optimize the management of hospital operations. This project aims to provide an efficient and user-friendly solution for storing, retrieving, and manipulating various types of healthcare-related data.
Information Integration Architecture (IIA CSE656) course project at IIIT-Delhi: end-to-end ETL pipelines, global-schema mapping, federated SQL querying, and AI-driven analytics for restaurant & vendor data. Built with Python, React, and LLM-powered natural-language interfaces.
AIDevs project files
Integrating multimodal data through heterogeneous ensembles
Uses Rapid API to fetch IMDb data, filters, & uploads the data in different tables in a MySQL Database, in one click using Talend.
A project to enhance ontology matching accuracy using Large Language Models (LLMs) like S-BERT.
This repository for a project detailing the step by step approach of scraping data, integrating data from various sources, performing analysis on data from various sources for the purpose of analaysis. It also shows how APIs can be harnessed for data engr operations. In this project, the four square API was utilized for the location data.
Combining multimodality histopathology images for integrated cancer research
Describe the different entities that form a modern data ecosystem. Describe and differentiate between the role and responsibilities of Data Engineers, Data Scientists, Data Analysts, Business Analysts, and Business Intelligence Analysts. Explain what Data Engineering is. List the tasks that need to be performed in a typical data engineering life…
Farr, M. T., D. S. Green, K. E. Holekamp, and E. F. Zipkin. 2020. Integrating distance sampling and presence-only data to estimate species abundance. Ecology 00(00):e03204. 10.1002/ecy.3204
Project involves merging customer reviews from Fudgemart and FudgeFlix to create a unified data warehouse using Kimball's approach. Utilizing Power BI, it aims to extract actionable insights for Fudge Inc., guiding strategic decisions, product enhancements, and market expansion based on comprehensive business intelligence.
To integrate data from "Orderline.csv" and "Product.csv" using Talend, filtering based on price, and performing inner and left joins to extract insights and facilitate data warehousing integration with Microsoft SQL Server.
Proyecto para el Hackathon Innovation Challenge Microsoft, utilizando datos públicos para mejorar la gestión del conocimiento en salud global. Facilitamos la colaboración interinstitucional y decisiones basadas en evidencia entre agencias, empresas y organizaciones.
A lab for DataAnalytics | DataEngineering | AnalyticsEngineering | DataScience | DataVisualization | BusinessIntelligence
Building a modern data warehouse using PostgreSQL, covering the full data pipeline from raw data ingestion to analytics. This includes designing robust data models, implementing ETL processes, and organizing data into bronze, silver, and gold layers to support efficient analysis and reporting.
Hormone Therapy Decision Support System for Breast Cancer
The pragmatic technology journey for an Enterprise Data Model serving reporting, analytical, advanced data science and other digital use cases with integrated data from a variety of sources.
An implementation of the data integration process Extract, Transform, Load (ETL)
Add a description, image, and links to the dataintegration topic page so that developers can more easily learn about it.
To associate your repository with the dataintegration topic, visit your repo's landing page and select "manage topics."