Skip to content

Conversation

@mavroudo
Copy link

Query now works on a spark cluster. The RDD framework has been swapped to Datasets. Additionally, the DeltaController has been integrated with S3Connector. Using try-catch can now read both formats. Cluster mode has been tested for both Delta and Parquet.

mavroudo and others added 25 commits March 2, 2025 08:46
…orExploration to work with Dataframes. Up until now, metadata and stats query works correctly in cluster
…tor. The idea is that Connectors that work with spark will provide a connection to the Dataset for each Table. Then SparkDatabaseRepository will be responsible to utilize them in order to implement the various methods required by the DatabaseRepository. In this case we put trust in spark that it will run the query in an optimized manner. Also, this makes extremely easier to add new database as it wil only require 4-5 methods.
…issue was that the events were read from SingleTable and handled as EventBoth. Since EventBoth extends EventTs it was handled as a timestamp event and the position information was missed.
Fixed issue with Kleene* when working in PatternDetectionSingle.
…vent type. The issue was that since there was no et-pairs, causing issue with the detection queries.
@mavroudo
Copy link
Author

mavroudo commented Apr 4, 2025

Fixed #6

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants