Skip to content

Feature/scala code/ch02 biman (#8) #9

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 2 commits into from
Jan 7, 2022
Merged

Conversation

deepakmca05
Copy link
Collaborator

  • added kmers for FASTA and FASTQ formats

  • DNA Based count in scala

  • Indentation fix

  • improved documentation

  • ch02

  • Feature/scala code/ch01 (Feature/scala code/ch01 #5)

  • Scala Code for data algorithms with spark-ch01

  • Refactored the package name

  • With spark contextclose

  • Feature/scala code/ch01 biman (Feature/scala code/ch01 biman #3)

  • fold by key

  • added comments for fold by key

  • added Average by Group By Key

  • added Average by ReduceByKey

  • dataframe action describe

  • dataframe add column

  • dataframe drop column

  • dataframe filter rows

  • dataframe join

  • dataframe join inner

  • dataframe join left

  • dataframe join right

  • dataframe sql

  • rdd map transformations

  • rdd filter transformations

  • rdd flatMap transformations

  • rdd transformations groupByKey

  • rdd transformations Join

  • rdd transformations Cartesian

  • rdd transformations Combine By Key

  • rdd transformations MapPartitions

  • rdd transformations Sort By

  • rdd action take ordered

  • Feature/scala code/ch01 deepak (Feature/scala code/ch01 deepak #4)

  • fold by key

  • added comments for fold by key

  • added Average by Group By Key

  • added Average by ReduceByKey

  • dataframe action describe

  • dataframe add column

  • dataframe drop column

  • dataframe filter rows

  • dataframe join

  • dataframe join inner

  • dataframe join left

  • Refactroed code + RDD Transformation

  • INDENTATION CORRECT

  • ReduceByKey

  • ReduceByKey1

Co-authored-by: bimanmandal biman29@gmail.com

  • added gradle shorthands

  • Feature/generate sh files (Feature/generate sh files #6)

  • added gradle shorthands

  • added the shell script

  • added the shell script

  • updated the shell commands

Co-authored-by: Biman Mandal 43241698+bimanmandal@users.noreply.github.com
Co-authored-by: bimanmandal biman29@gmail.com

  • updated README.md

  • updated README.md

  • ch02-changes

  • ch02-changes

  • Feature/scala code/ch01 missing class gradle (Feature/scala code/ch01 missing class gradle #7)

  • gradle wrapper fix

  • gradle wrapper fix

  • gradle wrapper fix

  • gradle wrapper fix

  • added bonus chapter correlation

  • added bonus chapter correlation

  • updated docs

  • updated docs

  • updated docs

  • updated docs

  • updated docs

  • updated docs

  • updated docs

  • updated docs

  • updated docs

  • updated docs

  • improved documentation

  • improved documentation

  • improved documentation

  • improved documentation

  • improved documentation

  • improved documentation

  • improved documentation

  • improved documentation

  • improved documentation

  • improved documentation

  • improved documentation

  • improved documentation

  • improved documentation

  • improved documentation

  • DNABaseCountFastq

  • added the code changes for chapter 2

  • added the run_spark_applications_scripts

  • added the conditions for 1GB data

  • added the readme file

Co-authored-by: Mahmoud Parsian mahmoud.parsian@yahoo.com
Co-authored-by: deepakmca05 deepakmca05@gmail.com

bimanmandal and others added 2 commits January 7, 2022 10:47
* added kmers for FASTA and FASTQ formats

* DNA Based count in scala

* Indentation fix

* improved documentation

* ch02

* Feature/scala code/ch01 (#5)

* Scala Code for data algorithms with spark-ch01

* Refactored the package name

* With spark contextclose

* Feature/scala code/ch01 biman (#3)

* fold by key

* added comments for fold by key

* added Average by Group By Key

* added Average by ReduceByKey

* dataframe action describe

* dataframe add column

* dataframe drop column

* dataframe filter rows

* dataframe join

* dataframe join inner

* dataframe join left

* dataframe join right

* dataframe sql

* rdd map transformations

* rdd filter transformations

* rdd flatMap transformations

* rdd transformations groupByKey

* rdd transformations Join

* rdd transformations Cartesian

* rdd transformations Combine By Key

* rdd transformations MapPartitions

* rdd transformations Sort By

* rdd action take ordered

* Feature/scala code/ch01 deepak (#4)

* fold by key

* added comments for fold by key

* added Average by Group By Key

* added Average by ReduceByKey

* dataframe action describe

* dataframe add column

* dataframe drop column

* dataframe filter rows

* dataframe join

* dataframe join inner

* dataframe join left

* Refactroed code + RDD Transformation

* INDENTATION CORRECT

* ReduceByKey

* ReduceByKey1

Co-authored-by: bimanmandal <biman29@gmail.com>

* added gradle shorthands

* Feature/generate sh files (#6)

* added gradle shorthands

* added the shell script

* added the shell script

* updated the shell commands

Co-authored-by: Biman Mandal <43241698+bimanmandal@users.noreply.github.com>
Co-authored-by: bimanmandal <biman29@gmail.com>

* updated README.md

* updated README.md

* ch02-changes

* ch02-changes

* Feature/scala code/ch01 missing class gradle (#7)

* gradle wrapper fix

* gradle wrapper fix

* gradle wrapper fix

* gradle wrapper fix

* added bonus chapter correlation

* added bonus chapter correlation

* updated docs

* updated docs

* updated docs

* updated docs

* updated docs

* updated docs

* updated docs

* updated docs

* updated docs

* updated docs

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* DNABaseCountFastq

* added the code changes for chapter 2

* added the run_spark_applications_scripts

* added the conditions for 1GB data

* added the readme file

Co-authored-by: Mahmoud Parsian <mahmoud.parsian@yahoo.com>
Co-authored-by: deepakmca05 <deepakmca05@gmail.com>
@deepakmca05 deepakmca05 merged commit 5b951fe into master Jan 7, 2022
@deepakmca05 deepakmca05 deleted the feature/scala-code/ch02 branch January 7, 2022 10:01
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants