Feature/scala code/ch02 biman (#8) #9

deepakmca05 · 2022-01-07T09:40:54Z

added kmers for FASTA and FASTQ formats
DNA Based count in scala
Indentation fix
improved documentation
ch02
Feature/scala code/ch01 (Feature/scala code/ch01 #5)
Scala Code for data algorithms with spark-ch01
Refactored the package name
With spark contextclose
Feature/scala code/ch01 biman (Feature/scala code/ch01 biman #3)
fold by key
added comments for fold by key
added Average by Group By Key
added Average by ReduceByKey
dataframe action describe
dataframe add column
dataframe drop column
dataframe filter rows
dataframe join
dataframe join inner
dataframe join left
dataframe join right
dataframe sql
rdd map transformations
rdd filter transformations
rdd flatMap transformations
rdd transformations groupByKey
rdd transformations Join
rdd transformations Cartesian
rdd transformations Combine By Key
rdd transformations MapPartitions
rdd transformations Sort By
rdd action take ordered
Feature/scala code/ch01 deepak (Feature/scala code/ch01 deepak #4)
fold by key
added comments for fold by key
added Average by Group By Key
added Average by ReduceByKey
dataframe action describe
dataframe add column
dataframe drop column
dataframe filter rows
dataframe join
dataframe join inner
dataframe join left
Refactroed code + RDD Transformation
INDENTATION CORRECT
ReduceByKey
ReduceByKey1

Co-authored-by: bimanmandal biman29@gmail.com

added gradle shorthands
Feature/generate sh files (Feature/generate sh files #6)
added gradle shorthands
added the shell script
added the shell script
updated the shell commands

Co-authored-by: Biman Mandal 43241698+bimanmandal@users.noreply.github.com
Co-authored-by: bimanmandal biman29@gmail.com

updated README.md
updated README.md
ch02-changes
ch02-changes
Feature/scala code/ch01 missing class gradle (Feature/scala code/ch01 missing class gradle #7)
gradle wrapper fix
gradle wrapper fix
gradle wrapper fix
gradle wrapper fix
added bonus chapter correlation
added bonus chapter correlation
updated docs
updated docs
updated docs
updated docs
updated docs
updated docs
updated docs
updated docs
updated docs
updated docs
improved documentation
improved documentation
improved documentation
improved documentation
improved documentation
improved documentation
improved documentation
improved documentation
improved documentation
improved documentation
improved documentation
improved documentation
improved documentation
improved documentation
DNABaseCountFastq
added the code changes for chapter 2
added the run_spark_applications_scripts
added the conditions for 1GB data
added the readme file

Co-authored-by: Mahmoud Parsian mahmoud.parsian@yahoo.com
Co-authored-by: deepakmca05 deepakmca05@gmail.com

* added kmers for FASTA and FASTQ formats * DNA Based count in scala * Indentation fix * improved documentation * ch02 * Feature/scala code/ch01 (#5) * Scala Code for data algorithms with spark-ch01 * Refactored the package name * With spark contextclose * Feature/scala code/ch01 biman (#3) * fold by key * added comments for fold by key * added Average by Group By Key * added Average by ReduceByKey * dataframe action describe * dataframe add column * dataframe drop column * dataframe filter rows * dataframe join * dataframe join inner * dataframe join left * dataframe join right * dataframe sql * rdd map transformations * rdd filter transformations * rdd flatMap transformations * rdd transformations groupByKey * rdd transformations Join * rdd transformations Cartesian * rdd transformations Combine By Key * rdd transformations MapPartitions * rdd transformations Sort By * rdd action take ordered * Feature/scala code/ch01 deepak (#4) * fold by key * added comments for fold by key * added Average by Group By Key * added Average by ReduceByKey * dataframe action describe * dataframe add column * dataframe drop column * dataframe filter rows * dataframe join * dataframe join inner * dataframe join left * Refactroed code + RDD Transformation * INDENTATION CORRECT * ReduceByKey * ReduceByKey1 Co-authored-by: bimanmandal <biman29@gmail.com> * added gradle shorthands * Feature/generate sh files (#6) * added gradle shorthands * added the shell script * added the shell script * updated the shell commands Co-authored-by: Biman Mandal <43241698+bimanmandal@users.noreply.github.com> Co-authored-by: bimanmandal <biman29@gmail.com> * updated README.md * updated README.md * ch02-changes * ch02-changes * Feature/scala code/ch01 missing class gradle (#7) * gradle wrapper fix * gradle wrapper fix * gradle wrapper fix * gradle wrapper fix * added bonus chapter correlation * added bonus chapter correlation * updated docs * updated docs * updated docs * updated docs * updated docs * updated docs * updated docs * updated docs * updated docs * updated docs * improved documentation * improved documentation * improved documentation * improved documentation * improved documentation * improved documentation * improved documentation * improved documentation * improved documentation * improved documentation * improved documentation * improved documentation * improved documentation * improved documentation * DNABaseCountFastq * added the code changes for chapter 2 * added the run_spark_applications_scripts * added the conditions for 1GB data * added the readme file Co-authored-by: Mahmoud Parsian <mahmoud.parsian@yahoo.com> Co-authored-by: deepakmca05 <deepakmca05@gmail.com>

bimanmandal and others added 2 commits January 7, 2022 10:47

Resolved merge conflicts

17fbcaf

deepakmca05 merged commit 5b951fe into master Jan 7, 2022

deepakmca05 deleted the feature/scala-code/ch02 branch January 7, 2022 10:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feature/scala code/ch02 biman (#8) #9

Feature/scala code/ch02 biman (#8) #9

Uh oh!

deepakmca05 commented Jan 7, 2022

Uh oh!

Uh oh!

Feature/scala code/ch02 biman (#8) #9

Feature/scala code/ch02 biman (#8) #9

Uh oh!

Conversation

deepakmca05 commented Jan 7, 2022

Uh oh!

Uh oh!