lukaszkn/data-software-engineering-interview-questions

7523 Data and Software engineering interview questions

| Questions | Description |
| --- | --- |
| Amazon Bedrock | Amazon Bedrock |
| Amazon Neptune | A fast, fully managed database service powering graph use cases such as identity graphs, knowledge graphs, and fraud detection |
| Amazon Redshift | Amazon Redshift |
| Amazon SageMaker | Amazon SageMaker |
| Ansible | An open-source automation tool primarily used for configuration management, application deployment, and orchestration |
| Apache Airflow | Apache Airflow |
| Apache Beam | Apache Beam |
| Apache Flink | Apache Flink |
| Apache Flume | Apache Flume |
| Apache HBase | Apache HBase |
| Apache Hive | Apache Hive |
| Apache Iceberg | Apache Iceberg |
| Apache Kafka | Apache Kafka |
| Apache Spark | Apache Spark |
| Apache Spark optimizations | Apache Spark optimizations |
| Apache Superset | Apache Superset |
| AWS | AWS |
| AWS Glue | A serverless data integration service that makes it easy for analytics users to discover, prepare, move, and integrate data from multiple sources |
| AWS Lambda | AWS Lambda |
| AWS services | List of AWS services and their short descriptions |
| Azure | Azure |
| Azure Data Factory | Azure Data Factory |
| Azure Databricks | Azure Databricks |
| Azure DevOps | Azure DevOps |
| Azure HDInsight | Azure HDInsight |
| Azure Purview | A unified data governance solution that helps organizations discover, manage, and govern their data estate across on-premises, multi-cloud, and SaaS environments |
| Azure services | List of Azure services |
| Azure Synapse Analytics | Azure Synapse Analytics |
| Big Data Engineering | Big Data engineering concepts and tools |
| Data pipelines | Data pipelines basics |
| Data Preparation for Machine Learning | Data Preparation for Machine Learning |
| Data Vault architecture | Data Vault architecture |
| Data Warehouse Modeling | Data warehouse modeling |
| Data Warehousing | Data Warehousing Architecture |
| Databricks AutoML | Databricks AutoML |
| Databricks Data Modeling Strategies | Databricks Data Modeling Strategies |
| Databricks data platform and AI architecture roles | Databricks data platform and AI architecture roles |
| Databricks Data Warehousing | Databricks Data Warehousing |
| Databricks Generative AI Application Deployment and Monitoring | Databricks Generative AI Application Deployment and Monitoring |
| Databricks Generative AI Application Development | Databricks Generative AI Application Development |
| Databricks Machine Learning | Databricks Machine Learning |
| Databricks Mosaic AI | Databricks Mosaic AI |
| Databricks Performance Optimization | Databricks Performance Optimization |
| dbt | dbt |
| Delta Lake | An open-source storage layer that adds ACID transactions and schema enforcement to data lakes |
| DynamoDB | DynamoDB |
| Elasticsearch | A search engine based on Apache Lucene. It provides a distributed, multitenant-capable full-text search engine with an HTTP web interface and schema-free JSON documents |
| FastAPI | A high-performance web framework for building HTTP-based service APIs in Python |
| Fivetran | Fivetran |
| GCP services | Google Cloud Platform services |
| General | General programming concepts, design patterns |
| General Data Engineer interview | General, behavioral, communication, collaboration, and problem-solving questions from a data engineering perspective |
| Golang | Golang |
| Google BigQuery | Google BigQuery |
| Google Cloud Platform | Google Cloud Platform |
| Grafana | A multi-platform open-source analytics and interactive visualization web application |
| Hadoop | Hadoop |
| Haystack | Haystack |
| Jenkins | An open-source automation server that helps automate the parts of software development related to building, testing, and deploying |
| Jetpack Compose | Basics |
| Kotlin Basics | Basic syntax, functions, variables, classes, conditional expressions, loops, ranges, collections, nullable values |
| Kusto Query Language KQL | Kusto Query Language KQL |
| LangChain | LangChain |
| Machine learning | Basic concepts |
| Matillion | Matillion |
| Microsoft Fabric | Microsoft Fabric |
| MLflow | MLflow |
| MongoDB | MongoDB |
| Palantir Foundry | Palantir Foundry |
| Pandas | A software library written for Python for data manipulation and analysis |
| Polars | Polars |
| Power BI | A business analytics and data visualization tool |
| Power BI DAX | Power BI DAX |
| PySpark | PySpark |
| Python | The basics, interpreter, numbers, text, lists, sets, dictionaries, control flow, loops, functions |
| Python Advanced | Functions, annotations, coding style, reading and writing files, classes, iterators, standard library |
| Python How-To | How-to's |
| RxSwift | Basics of RxSwift |
| Scala | Scala for data engineering |
| Scala Essential | Essential Scala programming concepts |
| Snowflake | A cloud data platform that at its core features a columnar-stored data warehouse |
| Spark Structured Streaming | Spark Structured Streaming |
| SQL | SQL |
| SQL How to | SQL tips & tricks |
| Streamlit | Streamlit |
| Swift Advanced | Properties, subscripts, concurrency, type casting, nested types, extensions, protocols, generics, Combine framework |
| Swift Basics | The basics, strings and characters, collection types, control flow, functions, closures, enumerations, structures and classes, properties, methods |
| Swift UI Advanced | Advanced topics and how-to's |
| Swift UI Basics | Walk through the building blocks of SwiftUI |
| Tableau | Tableau |
| Terraform | An infrastructure as code tool that lets you build, change, and version infrastructure safely and efficiently |

All questions
