7523 Data and Software engineering interview questions

Questions	Description
Amazon Bedrock	Amazon Bedrock
Amazon Neptune	A fast, fully managed database service powering graph use cases such as identity graphs, knowledge graphs, and fraud detection.
Amazon Redshift	Amazon Redshift
Amazon SageMaker	Amazon SageMaker
Ansible	An open-source automation tool primarily used for configuration management, application deployment and orchestration
Apache Airflow	Apache Airflow
Apache Beam	Apache Beam
Apache Flink	Apache Flink
Apache Flume	Apache Flume
Apache HBase	Apache HBase
Apache Hive	Apache Hive
Apache Iceberg	Apache Iceberg
Apache Kafka	Apache Kafka
Apache Spark	Apache Spark
Apache Spark optimizations	Apache Spark optimizations
Apache Superset	Apache Superset
AWS	AWS
AWS Glue	A serverless data integration service that makes it easy for analytics users to discover, prepare, move, and integrate data from multiple sources
AWS Lambda	AWS Lambda
AWS services	List of AWS services and their short descriptions
Azure	Azure
Azure Data Factory	Azure Data Factory
Azure Databricks	Azure Databricks
Azure DevOps	Azure DevOps
Azure HDInsight	Azure HDInsight
Azure Purview	A unified data governance solution that helps organizations discover, manage, and govern their data estate across on-premises, multi-cloud, and SaaS environments
Azure services	List of Azure services
Azure Synapse Analytics	Azure Synapse Analytics
Big Data Engineering	Big Data engineering concepts and tools.
Data pipelines	Data pipelines basics
Data Preparation for Machine Learning	Data Preparation for Machine Learning
Data Vault architecture	Data Vault architecture
Data Warehouse Modeling	data warehouse modeling
Data Warehousing	Data Warehousing Architecture
Databricks AutoML	Databricks AutoML
Databricks Data Modeling Strategies	Databricks Data Modeling Strategies
Databricks data platform and AI architecture roles	Databricks data platform and AI architecture roles
Databricks Data Warehousing	Databricks Data Warehousing
Databricks Generative AI Application Deployment and Monitoring	Databricks Generative AI Application Deployment and Monitoring
Databricks Generative AI Application Development	Databricks Generative AI Application Development
Databricks Machine Learning	Databricks Machine Learning
Databricks Mosaic AI	Databricks Mosaic AI
Databricks Performance Optimization	Databricks Performance Optimization
dbt	dbt
Delta Lake	A flexible storage pattern that is typically used for storing massive amounts of raw data in its native format
DynamoDB	DynamoDB
Elasticsearch	A search engine based on Apache Lucene, a free and open-source search engine. It provides a distributed, multitenant-capable full-text search engine with an HTTP web interface and schema-free JSON documents.
FastAPI	A high-performance web framework for building HTTP-based service APIs in Python
Fivetran	Fivetran
GCP services	Google Cloud Platform services
General	General programming concepts, design patterns
General Data Engineer interview	General, behavioral, communication, collaboration, problem solving from data engineering perspective
Golang	Golang
Google BigQuery	Google BigQuery
Google Cloud Platform	Google Cloud Platform
Grafana	A multi-platform open source analytics and interactive visualization web application.
Hadoop	Hadoop
Haystack	Haystack
Jenkins	An open source automation server. It helps automate the parts of software development related to building, testing, and deploying
Jetpack Compose	Basics
Kotlin Basics	Basic syntax, functions, variables, classes, conditional expressions, loops, ranges, collections, nullable values
Kusto Query Language KQL	Kusto Query Language KQL
LangChain	LangChain
Machine learning	Basic concepts
Matillion	Matillion
Microsoft Fabric	Microsoft Fabric
MLflow	MLflow
MongoDB	MongoDB
Palantir Foundry	Palantir Foundry
Pandas	A software library written for the Python for data manipulation and analysis
Polars	Polars
Power BI	A business analytics and data visualization tool
Power BI DAX	Power BI DAX
PySpark	PySpark
Python	The basics, interpreter, numbers, text, lists, sets, dictionaries, control flow, loops, functions
Python Advanced	Functions, annotations, coding style, reading and writing files, classes, iterators, standard library
Python How-To	How-to's
RxSwift	Basics of RxSwift
Scala	Scala for data engineering
Scala Essential	Essential Scala programming concepts
Snowflake	A cloud data platform that at it's core features a columnar-stored data warehouse
Spark Structured Streaming	Spark Structured Streaming
SQL	SQL
SQL How to	SQL tips & tricks
Streamlit	Streamlit
Swift Advanced	Properties, subscripts, concurrency, type casting, nested types, extensions, protocols, generics, Combine framework
Swift Basics	The basics, string and characters, collection types, control flow, functions, closures, enumerations, structures and classes, properties, methods
Swift UI Advanced	Advanced topics and how-to's
Swift UI Basics	Walk through the building blocks of a SwiftUI
Tableau	Tableau
Terraform	An infrastructure as code tool that lets you build, change, and version infrastructure safely and efficiently

All questions

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
content		content
.gitignore		.gitignore
Interview sample 001.md		Interview sample 001.md
Interview sample 002.md		Interview sample 002.md
Interview sample 003.md		Interview sample 003.md
Interview sample 004.md		Interview sample 004.md
Interview sample 005.md		Interview sample 005.md
Interview sample 006.md		Interview sample 006.md
Interview sample 007.md		Interview sample 007.md
Kafka interview.md		Kafka interview.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Repository files navigation

7523 Data and Software engineering interview questions

About

Uh oh!

Uh oh!

Uh oh!

lukaszkn/data-software-engineering-interview-questions

Folders and files

Latest commit

History

Repository files navigation

7523 Data and Software engineering interview questions

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks