Rework query handling #184

frensing · 2022-10-31T13:56:30Z

Rework of the query handling

Each Worker gets their own QueryHandler.

The updated config will have, for example, the following structure:

...
workers:
  - className: "WorkerName"
    queries:
      location: "path/to/file"
      format: "one-per-line"
      caching: true
      order: linear
      pattern:
        endpoint: "http://localhost:3030/sparql"
        outputFolder: "queryCache"
        limit: 10
      lang: "lang.SPARQL"
...

QueryHandler

Each QueryHandler has:

location of the query file or folder containing the query files
QuerySet, containing all the queries from the file (or folder)
QuerySelector, which generates the index of the next query
langProcessor to generate TripleStats
pattern to generate queries from pattern queries
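
As a rough illustration of how these parts interact, the following sketch wires a selector to a list of queries. It is not the actual Iguana class: `QueryHandlerSketch` and `getNextQuery` are hypothetical names, the list stands in for the QuerySet, the `IntSupplier` stands in for the QuerySelector, and the langProcessor and pattern parts are left out entirely.

```java
import java.util.List;
import java.util.function.IntSupplier;

// Hypothetical sketch, not the Iguana implementation.
final class QueryHandlerSketch {
    private final List<String> queries;  // stands in for the QuerySet
    private final IntSupplier selector;  // stands in for the QuerySelector

    QueryHandlerSketch(List<String> queries, IntSupplier selector) {
        this.queries = queries;
        this.selector = selector;
    }

    /** Asks the selector for the next index and returns the query stored at that index. */
    String getNextQuery() {
        return queries.get(selector.getAsInt());
    }
}
```

Since each Worker gets its own QueryHandler, every worker would hold its own instance of such an object and therefore its own selector state.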

QuerySet

The QuerySet is either in-memory or file-based.

InMemoryQuerySet loads all the queries into Strings in memory when initializing.
FileBasedQuerySet retrieves a query directly from the file when it is requested.

The config option caching can be set to true for in-memory or false for file-based.

Each QuerySet has a QuerySource from which the queries are read.
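
The difference between the two variants can be sketched as follows. This is a simplified illustration under stated assumptions, not the code from this PR: the interface shapes are guesses, `LineFileSource` is a hypothetical stand-in for a one-query-per-line source, and only the class names `InMemoryQuerySet`, `FileBasedQuerySet`, and the method name `getQueryAtPos` are taken from this conversation.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;
import java.util.stream.Stream;

// Assumed minimal shape of a query source; the real interface may differ.
interface QuerySource {
    List<String> getAllQueries() throws IOException;
    String getQuery(int index) throws IOException;
}

// Hypothetical source that treats every line of a file as one query.
final class LineFileSource implements QuerySource {
    private final Path file;

    LineFileSource(Path file) {
        this.file = file;
    }

    @Override
    public List<String> getAllQueries() throws IOException {
        return Files.readAllLines(file);
    }

    @Override
    public String getQuery(int index) throws IOException {
        // Streams the file and stops at the requested line instead of keeping all lines.
        try (Stream<String> lines = Files.lines(file)) {
            return lines.skip(index).findFirst()
                    .orElseThrow(() -> new IndexOutOfBoundsException("no query at index " + index));
        }
    }
}

interface QuerySet {
    String getQueryAtPos(int pos) throws IOException;
}

// caching: true -> all queries are read once when the set is created.
final class InMemoryQuerySet implements QuerySet {
    private final List<String> queries;

    InMemoryQuerySet(QuerySource source) throws IOException {
        this.queries = source.getAllQueries();
    }

    @Override
    public String getQueryAtPos(int pos) {
        return queries.get(pos);
    }
}

// caching: false -> every request goes back to the source (and thus to the file).
final class FileBasedQuerySet implements QuerySet {
    private final QuerySource source;

    FileBasedQuerySet(QuerySource source) {
        this.source = source;
    }

    @Override
    public String getQueryAtPos(int pos) throws IOException {
        return source.getQuery(pos);
    }
}
```

The trade-off in this sketch is the usual one: the in-memory variant pays the memory cost up front and answers from the list, while the file-based variant keeps memory use low at the price of file I/O on every request.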

QuerySource

A QuerySource wraps the handling of the query files.
Three different QuerySources are implemented:

FileLineQuerySource expects a query file with one query per line
FileSeparatorQuerySource expects a file with (multi-line) queries separated by a separator line. The default separator line is "###".
FolderQuerySource expects a directory with query files that each contain one (multi-line) query
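
To make the two file formats concrete, here is a small sketch of how lines could be split into queries. `QuerySplitter` and its methods are hypothetical helpers, not part of Iguana, and they work on lines that have already been read. Skipping blank lines and trimming whitespace around each query are assumptions of this sketch; the actual sources may treat these cases differently.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Iguana.
final class QuerySplitter {

    /** One query per line; blank lines are skipped (an assumption of this sketch). */
    static List<String> onePerLine(List<String> lines) {
        List<String> queries = new ArrayList<>();
        for (String line : lines) {
            if (!line.trim().isEmpty()) {
                queries.add(line);
            }
        }
        return queries;
    }

    /** Multi-line queries, split at every line that equals the separator, e.g. "###". */
    static List<String> bySeparator(List<String> lines, String separator) {
        List<String> queries = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (String line : lines) {
            if (line.trim().equals(separator)) {
                flush(current, queries);
            } else {
                if (current.length() > 0) {
                    current.append('\n');
                }
                current.append(line);
            }
        }
        flush(current, queries); // the last query has no trailing separator
        return queries;
    }

    /** Adds the collected query, if it is not empty, and resets the buffer. */
    private static void flush(StringBuilder current, List<String> queries) {
        String query = current.toString().trim();
        if (!query.isEmpty()) {
            queries.add(query);
        }
        current.setLength(0);
    }
}
```

A folder-based source would not need any splitting, since each file in the directory holds exactly one query.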

QuerySelector

A QuerySelector is a number generator that returns the index of the next query to load.
Two QuerySelectors are implemented:

LinearQuerySelector returns each index in ascending order, restarting at 0 after reaching the last one.
RandomQuerySelector uses java.util.Random to generate the next index. The seed is either provided in the config or, if none is given, the workerID is used.
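
The two strategies described above can be sketched as follows. The class names follow this description, but the interface, the method name `getNextIndex`, and the constructor signatures are assumptions of this sketch and may not match the merged code.

```java
import java.util.Random;

// Assumed interface shape; the method name is hypothetical.
interface QuerySelector {
    /** Returns the index of the next query to execute. */
    int getNextIndex();
}

/** Yields 0, 1, ..., size - 1 and then starts again at 0. */
final class LinearQuerySelector implements QuerySelector {
    private final int size;
    private int next = 0;

    LinearQuerySelector(int size) {
        if (size <= 0) {
            throw new IllegalArgumentException("size must be positive");
        }
        this.size = size;
    }

    @Override
    public int getNextIndex() {
        int current = next;
        next = (next + 1) % size; // wrap around after the last index
        return current;
    }
}

/** Draws indices from [0, size) using a seeded java.util.Random. */
final class RandomQuerySelector implements QuerySelector {
    private final int size;
    private final Random random;

    RandomQuerySelector(int size, long seed) {
        if (size <= 0) {
            throw new IllegalArgumentException("size must be positive");
        }
        this.size = size;
        this.random = new Random(seed);
    }

    @Override
    public int getNextIndex() {
        return random.nextInt(size);
    }
}
```

Because `java.util.Random` is deterministic for a given seed, two selectors created with the same seed and size produce the same index sequence. Falling back to the workerID as seed would therefore give each worker a different but reproducible order.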

TODO

update documentation
update javadoc
update IGUANA schema

frensing · 2022-11-04T19:15:23Z

All prior functionality is now in the new QueryHandler.
All prior test cases have been updated where it made sense and run successfully.

The next step is to update the documentation.

bigerl

Thank you for the PR. Looks very good. I've pointed out only some minor things and added some questions here and there.

As you wrote, todos left are:

JavaDoc
update the Documentation

iguana.corecontroller/src/main/java/org/aksw/iguana/cc/config/elements/Task.java

iguana.corecontroller/src/main/java/org/aksw/iguana/cc/query/handler/QueryHandler.java

iguana.corecontroller/src/main/java/org/aksw/iguana/cc/query/set/QuerySet.java

iguana.corecontroller/src/main/java/org/aksw/iguana/cc/query/source/impl/FolderQuerySource.java

iguana.corecontroller/src/main/java/org/aksw/iguana/cc/query/source/AbstractQuerySource.java

iguana.corecontroller/src/main/java/org/aksw/iguana/cc/tasks/impl/Stresstest.java

iguana.corecontroller/src/main/java/org/aksw/iguana/cc/utils/FileUtils.java

nck-mlcnv · 2023-03-22T09:38:41Z

Things left to do:

update development section of documentation
update the README.md file

bigerl

The documentation looks very good, and so does the code around query handling. In general, there still seems to be a long way to go.

- [ ] Besides the comments, please review whether we really need interfaces plus abstract classes for the various abstractions. It seems to me that the interfaces add no real value and could be merged into the abstract classes.

iguana.corecontroller/src/main/java/org/aksw/iguana/cc/config/IguanaConfig.java

iguana.corecontroller/src/main/java/org/aksw/iguana/cc/query/handler/QueryHandler.java

iguana.corecontroller/src/main/java/org/aksw/iguana/cc/query/pattern/PatternHandler.java

iguana.corecontroller/src/main/java/org/aksw/iguana/cc/tasks/impl/Stresstest.java

iguana.corecontroller/src/main/java/org/aksw/iguana/cc/worker/AbstractWorker.java

.gitignore

iguana.corecontroller/src/main/java/org/aksw/iguana/cc/worker/impl/SPARQLWorker.java

bigerl

@nck-mlcnv

Was SPARQLWorker removed from the Documentation?
Please double check that the removed abstractions are not referenced in the documentation.
Update the documentation to reflect the renaming of QuerySet to QueryList and of getQueryAtPos to getQuery.

Please check the boxes here when done.

frensing added 14 commits October 24, 2022 13:21

QuerySource

3c5291c

QuerySet

3134afb

QuerySelector

2b9266f

FileSeparatorQuerySource

c534cd3

FolderQuerySource

f829e41

FileLineQuerySourceTest

f533fea

remove getContent from QuerySet

7ee76ec

QuerySelector and LinearQuerySelectorTest

746c2ec

QueryHandler

315df06

QueryHandler add folder test

884cd81

add hashcode and triplestats generation

aecae1a

gitignore

b0e23f2

cleanup

f2a5699

fix order evaluation

e6de3d6

frensing requested a review from bigerl October 31, 2022 13:56

frensing added 7 commits October 31, 2022 18:27

refactor httpworkers to use new query handling

75fece4

refactor cliworkers to use new query handling

0d17a69

refactor stresstest

f8d53b0

PatternHandler and remove of old QueryHandler

6b5a7be

PatternHandler and remove of old QueryHandler

c23cff3

update QueryHandler to init pattern

2228f4f

add QueryHandlerTest with PatternHandler

708a3f7

frensing marked this pull request as ready for review November 4, 2022 19:12

bigerl requested changes Nov 7, 2022

frensing added 5 commits November 7, 2022 12:00

add requested changes

4f03a62

rm tripleStats todo

39fa000

rm unused method

3c1b9f0

override hashCode method

b145eb3

documentation and version update

35a8d42

bigerl linked an issue Mar 8, 2023 that may be closed by this pull request

UPDATEWorker does not initialize #174

Closed

nck-mlcnv added 6 commits March 8, 2023 11:51

reformat schema-file

f6281fb

update configuration file schema

b4ce0f1

added endpoint as requirement for pattern key in the schema file

e91b181

add missing dependency for PatternHandler

3d1400a

add javadocs for PatternHandler

6a6e9e8

fix the tutorial page in the documentation

1897018

bigerl linked an issue Mar 22, 2023 that may be closed by this pull request

Support for non-random query chooser #183

Closed

nck-mlcnv added 2 commits March 22, 2023 11:45

update README.md

09ad332

update docs

7e79bb4

nck-mlcnv mentioned this pull request Mar 23, 2023

QueryHandler isn't easily extendable anymore #192

Closed

nck-mlcnv mentioned this pull request Mar 31, 2023

Workers from the same worker-configuration should only have one QuerySet #194

Closed

nck-mlcnv added 2 commits March 31, 2023 15:45

ignore cli tests

8950776

update ci badge

f7db189

nck-mlcnv self-assigned this Apr 3, 2023

bigerl requested changes Apr 4, 2023

nck-mlcnv added 8 commits April 5, 2023 16:27

remove parenthesis

06ba515

fix spelling in javadocs

5f6f6ce

change .gitignore

b28283d

remove SPARQLWorker

6b07d04

fix javadocs and rename the method "initPattern"

99d40de

remove unnecessary abstraction

3228e0c

rename abstract classes

470b938

rename QuerySet to QueryList and the method getQueryAtPos to getQuery

9bf549d

bigerl requested changes Apr 6, 2023

update the query handling development doc page

92498c8

bigerl approved these changes Apr 11, 2023

bigerl merged commit efd02d5 into develop Apr 11, 2023

bigerl deleted the feature/rework-query-handling branch April 11, 2023 10:52
