Skip to content

zeotap/spark-property-tests

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

spark-property-tests

Join the chat at https://gitter.im/zeotap/spark-property-tests

Write property based tests easily on spark dataframes

Why

While writing tests for Spark code, we tend to write a lot of boilerplate just to create a test spark dataframe initialised with some test data. Not only were these test sets not readable, but they also do not adhere to Property-based testing standards.

We needed a utility that would have

🥇 less boilerplate code

🥇 easily extensible interface for your custom use-cases

🥇 easily build out-of-box support for most common attributes in your data/project

🥇 promote usage of Property-based tests

This utility is based on the spark-testing-base library by Holden Karau

Usage

Please go through the Wiki to understand the usage of the library.

We have made use of Typeclasses in Scala and Generators in scalacheck to present some simple interfaces to write easy property-based-tests in spark.

Additionally, we have provided examples of how you can leverage the library for your own organization under package com.zeotap.utility.spark.example

Dependency Management

List of available versions are available at this Maven Repo Link

Choose from one of the versions available - generally we have chosen the spark version as the version for our repo

sbt

"com.zeotap" %% "spark-property-tests" % "2.4.5" % "test"

maven

<dependency>
    <groupId>com.zeotap</groupId>
    <artifactId>spark-property-tests_2.11</artifactId>
    <version>2.4.5</version>
    <scope>test</scope>
</dependency>

Build

Project is build using sbt

About

Write property based tests easily on spark dataframes

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages