| Questions | Description |
|---|---|
| Amazon Bedrock | Amazon Bedrock |
| Amazon Neptune | A fast, fully managed database service powering graph use cases such as identity graphs, knowledge graphs, and fraud detection. |
| Amazon Redshift | Amazon Redshift |
| Amazon SageMaker | Amazon SageMaker |
| Ansible | An open-source automation tool primarily used for configuration management, application deployment and orchestration |
| Apache Airflow | Apache Airflow |
| Apache Beam | Apache Beam |
| Apache Flink | Apache Flink |
| Apache Flume | Apache Flume |
| Apache HBase | Apache HBase |
| Apache Hive | Apache Hive |
| Apache Iceberg | Apache Iceberg |
| Apache Kafka | Apache Kafka |
| Apache Spark | Apache Spark |
| Apache Spark optimizations | Apache Spark optimizations |
| Apache Superset | Apache Superset |
| AWS | AWS |
| AWS Glue | A serverless data integration service that makes it easy for analytics users to discover, prepare, move, and integrate data from multiple sources |
| AWS Lambda | AWS Lambda |
| AWS services | List of AWS services and their short descriptions |
| Azure | Azure |
| Azure Data Factory | Azure Data Factory |
| Azure Databricks | Azure Databricks |
| Azure DevOps | Azure DevOps |
| Azure HDInsight | Azure HDInsight |
| Azure Purview | A unified data governance solution that helps organizations discover, manage, and govern their data estate across on-premises, multi-cloud, and SaaS environments |
| Azure services | List of Azure services |
| Azure Synapse Analytics | Azure Synapse Analytics |
| Big Data Engineering | Big Data engineering concepts and tools. |
| Data pipelines | Data pipelines basics |
| Data Preparation for Machine Learning | Data Preparation for Machine Learning |
| Data Vault architecture | Data Vault architecture |
| Data Warehouse Modeling | Data warehouse modeling |
| Data Warehousing | Data Warehousing Architecture |
| Databricks AutoML | Databricks AutoML |
| Databricks Data Modeling Strategies | Databricks Data Modeling Strategies |
| Databricks data platform and AI architecture roles | Databricks data platform and AI architecture roles |
| Databricks Data Warehousing | Databricks Data Warehousing |
| Databricks Generative AI Application Deployment and Monitoring | Databricks Generative AI Application Deployment and Monitoring |
| Databricks Generative AI Application Development | Databricks Generative AI Application Development |
| Databricks Machine Learning | Databricks Machine Learning |
| Databricks Mosaic AI | Databricks Mosaic AI |
| Databricks Performance Optimization | Databricks Performance Optimization |
| dbt | dbt |
| Delta Lake | An open-source storage layer that adds ACID transactions and table semantics to data stored in a data lake |
| DynamoDB | DynamoDB |
| Elasticsearch | A search engine based on Apache Lucene, a free and open-source search library. It provides a distributed, multitenant-capable full-text search engine with an HTTP web interface and schema-free JSON documents. |
| FastAPI | A high-performance web framework for building HTTP-based service APIs in Python |
| Fivetran | Fivetran |
| GCP services | Google Cloud Platform services |
| General | General programming concepts, design patterns |
| General Data Engineer interview | General, behavioral, communication, collaboration, problem solving from data engineering perspective |
| Golang | Golang |
| Google BigQuery | Google BigQuery |
| Google Cloud Platform | Google Cloud Platform |
| Grafana | A multi-platform open source analytics and interactive visualization web application. |
| Hadoop | Hadoop |
| Haystack | Haystack |
| Jenkins | An open source automation server. It helps automate the parts of software development related to building, testing, and deploying |
| Jetpack Compose | Basics |
| Kotlin Basics | Basic syntax, functions, variables, classes, conditional expressions, loops, ranges, collections, nullable values |
| Kusto Query Language KQL | Kusto Query Language KQL |
| LangChain | LangChain |
| Machine learning | Basic concepts |
| Matillion | Matillion |
| Microsoft Fabric | Microsoft Fabric |
| MLflow | MLflow |
| MongoDB | MongoDB |
| Palantir Foundry | Palantir Foundry |
| Pandas | A software library written for Python for data manipulation and analysis |
| Polars | Polars |
| Power BI | A business analytics and data visualization tool |
| Power BI DAX | Power BI DAX |
| PySpark | PySpark |
| Python | The basics, interpreter, numbers, text, lists, sets, dictionaries, control flow, loops, functions |
| Python Advanced | Functions, annotations, coding style, reading and writing files, classes, iterators, standard library |
| Python How-To | How-to's |
| RxSwift | Basics of RxSwift |
| Scala | Scala for data engineering |
| Scala Essential | Essential Scala programming concepts |
| Snowflake | A cloud data platform that at its core features a columnar data warehouse |
| Spark Structured Streaming | Spark Structured Streaming |
| SQL | SQL |
| SQL How to | SQL tips & tricks |
| Streamlit | Streamlit |
| Swift Advanced | Properties, subscripts, concurrency, type casting, nested types, extensions, protocols, generics, Combine framework |
| Swift Basics | The basics, strings and characters, collection types, control flow, functions, closures, enumerations, structures and classes, properties, methods |
| Swift UI Advanced | Advanced topics and how-to's |
| Swift UI Basics | A walkthrough of the building blocks of SwiftUI |
| Tableau | Tableau |
| Terraform | An infrastructure as code tool that lets you build, change, and version infrastructure safely and efficiently |
lukaszkn/data-software-engineering-interview-questions
Data and Software engineering interview questions