Skip to content
View Shreyash-rsk's full-sized avatar
🌐
🌐

Block or report Shreyash-rsk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Shreyash-rsk/README.md

logo

#πŸ‘‹ Shreyash Khote [cite: 1]

πŸ“ Location: Ahmedabad, Gujarat, IN [cite: 2]
πŸ“ž Phone: +91 8805128129 [cite: 2]
πŸ“§ Email: shreyashkhote01@gmail.com

I am a Data Engineer with expertise in designing and deploying scalable ETL/ELT architectures and GenAI infrastructure across Information Technology domains. Specialized in the Azure ecosystem (Data Factory, AI Foundry) and Apache Airflow for orchestrating complex data workflows, I have a proven track record of optimizing data pipelines, reducing query latency, and engineering robust data ingestion layers for RAG systems and real-time analytics. πŸš€βš™οΈ


πŸ’Ό Experience

πŸ§‘β€πŸ’» Data Engineer

Relay Human Cloud | Nov 2024 - Present | Ahmedabad, IN

  • βš™οΈ Currently working on a recommendation system project, focusing on improving data pipeline reliability, security, and algorithm performance within a scalable data engineering architecture.
  • πŸ” Engineered a secure AWS authentication layer by implementing temporary session token logic within the core recommendation algorithm and data processing modules, replacing static credentials with a dynamic, auto-refreshing IAM integration.
  • πŸ” Resolved a high-priority "low-recommendation" bottleneck by performing full-stack root cause analysis, using systematic log injection and SQL data modeling to identify restrictive business logic constraints within CORE and CORE MAINLINE product tags.
  • 🀝 Partnered with senior engineers to code, review, and document scalable data workflows in Jira, establishing team-wide best practices for pipeline monitoring and error handling.

πŸ§‘β€πŸ’» Data Analyst Intern

S4S Technology Pvt. Ltd | Jan 2023 - Jul 2023 - πŸ“Š Organized, cleaned, and validated data for in-depth analysis.

  • πŸ” Conducted data analysis to identify key patterns and insights, supporting strategic decisions.
  • πŸ“ˆ Developed interactive dashboards to present complex data in a clear and concise manner.

πŸ›  Skills

  • Data Engineering Fundamentals: ETL/ELT, Data Warehousing, Data Lake, Delta Lake, Data Modeling, RAG, Vector Databases.
  • Cloud & Tools: Azure (AI Foundry, Azure Function, Blob Storage), AWS (S3), Snowflake, Airflow, Docker, Kafka, Git.
  • Languages & Frameworks: Python 🐍 (Pandas, NumPy), SQL, Spark.
  • Soft Skills: Teamwork, Collaboration, Communication, Time Management, Planning.

πŸš€ Projects

  • Real-Time Data Pipeline & Analytics Platform
    πŸ› οΈ Built an end-to-end pipeline ingesting data from Reddit APIs, CSV/JSON files, and a mock Kafka stream, storing raw data in local Parquet/Data Lake format organized by date and source. Designed a star schema data warehouse with fact and dimension tables, applied daily partitioning, and wrote SQL transformations. Orchestrated batch and streaming ETL workflows using Apache Airflow DAGs with data quality checks and robust logging.

  • Twitter Data Pipeline
    🐦 Architected a daily automated ingestion pipeline using Python to extract, clean, and process raw Twitter API data. Orchestrated the ELT workflow with Apache Airflow to ensure zero-touch daily execution and robust failure handling. Engineered a secure data storage layer in AWS S3 for a centralized, query-ready data repository.

  • Olist E-commerce Platform Analysis
    πŸ›’ Built an analysis platform to understand key metrics of e-commerce sales data, providing actionable insights for decision-makers.

  • Sentiment Analysis
    πŸ’¬ Implemented sentiment analysis using Natural Language Processing (NLP) to classify customer reviews into positive, neutral, or negative sentiments.


πŸŽ“ Education

  • M.Sc. Data Science | MGM University, Aurangabad | 2021-2023 | CGPA: 8.99/10
  • B.Sc. Computer Science | Deogiri College, Aurangabad | 2018-2021 | CGPA: 6.75/10

πŸ“œ Certifications & Awards

  • πŸ† Going Above and Beyond Award | Relay Human Cloud - March 2025
  • πŸ… Data Analyst Certification | ExcelR - Reg No: 18897/EXCELR/24012024
  • πŸŽ“ Data Analyst Professional | IBM
  • πŸ“š Data, Data, Everywhere & Relational Database and SQL | Coursera
  • πŸ”‘ Data Analyst Essentials | Cisco

πŸ“¬ Contact

Feel free to reach out to me via email: shreyashkhote01@gmail.com for any inquiries or opportunities. πŸ“§


πŸ”— Connect with Me

Pinned Loading

  1. Olist-E-commerce-Platform-Analysis- Olist-E-commerce-Platform-Analysis- Public

    Analyze Olist's e-commerce data with Tableau for actionable insights into sales trends and customer behaviors. Optimize strategies based on detailed transaction records.

  2. Prediction-Model Prediction-Model Public

    A Prediction Model is a statistical or machine learning model used to predict an outcome based on input data. These models are widely used in fields like finance, marketing, and healthcare to forec…

  3. Sentiment_Analysis Sentiment_Analysis Public

    A project focused on analyzing text data to determine the sentiment (positive, negative, neutral) using natural language processing (NLP) techniques in Python πŸ“‰.

  4. Superstore_Data_Analysis_Insights Superstore_Data_Analysis_Insights Public

    Explore the Superstore dataset with interactive visualizations. Gain insights into sales trends, customer behavior, and product performance to drive informed decision-making for your business.