Ddlog update #376

feiranwang · 2015-09-04T08:00:39Z

Fix delta deriver for incremental function call, revise syntax.
Add support for globally auto-set parallelism for extractors.
Add deepdive initdb TABLE command for initializing a single table.

chrismre · 2015-09-10T17:35:54Z

@feiranwang Is this waiting on @netj's approval? He should be assigned :)

netj · 2015-09-13T16:35:13Z

shell/deepdive-run

@@ -91,6 +91,15 @@ fullConfig=$run_dir/deepdive.conf
    ddlog compile "${ddlogFiles[@]}"
    export PIPELINE=  # XXX ddlog shouldn't emit this
    : ${Pipeline:=endtoend}
+
+    # set PARALLELISM env var, use max parallelism if the variable is not set
+    if [[ $(uname) = 'Linux' ]]; then


Checking if nproc or sysctl is available makes more sense than relying on uname. You could chain the options with something like:

    : ${PARALLELISM:=$({
        # Linux typically has coreutils which includes nproc
        nproc ||
        # OS X
        sysctl -n hw.ncpu ||
        # fall back to 1
        echo 1
    } 2>/dev/null)}
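
For illustration only, the same idea can be written as a small function that probes for each tool before calling it and validates what comes back. This is a sketch, not the code that was merged: the name detect_parallelism is hypothetical, and treating an empty, zero, or non-numeric result as 1 is an assumption added here.

```shell
# Hypothetical helper (not part of this PR): print a usable core count.
detect_parallelism() {
    local n=
    if command -v nproc >/dev/null 2>&1; then
        # GNU coreutils, typically present on Linux
        n=$(nproc 2>/dev/null)
    elif command -v sysctl >/dev/null 2>&1; then
        # OS X
        n=$(sysctl -n hw.ncpu 2>/dev/null)
    fi
    # Assumption: anything empty, zero, or non-numeric falls back to 1.
    case $n in
        ''|*[!0-9]*|0) n=1 ;;
    esac
    echo "$n"
}

# Keep a value the user already set; otherwise use the detected one.
: "${PARALLELISM:=$(detect_parallelism)}"
export PARALLELISM
```

Because of the := expansion, running with PARALLELISM=4 already in the environment leaves that value untouched, which matches the "use max parallelism if the variable is not set" comment in the diff.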

netj · 2015-09-13T16:36:18Z

Doesn't deepdive initdb TABLE still drop and create the whole database before creating the given table, affecting others? Here's what I think users expect from the initdb command:

- When there are arguments, DD should drop/create/load just those specified tables. Assuming it's a DDlog app, DD should drop/create the tables from the DDlog schema, then optionally load data to the new tables from an assumed path under input/ by some naming convention. It's an error if it's not a DDlog app.
- When no argument is given, DD should drop/create all known tables. If it's a DDlog app, all tables defined in the schema should be created then loaded as if the names were all given manually. If it's not DDlog, it should rely on schema.sql and input/init.sh to initialize the database. For this last non-DDlog case, DD should perhaps do a dropdb to be backward compatible.

In any case, DD should first make sure the database is created.
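
The behavior described above could be laid out roughly as the dispatch below. This is a dry-run sketch under assumptions: plan_initdb and every step name it prints are hypothetical and only describe what would happen, nothing touches a database, and a DDlog app is detected by the presence of app.ddlog, as in shell/deepdive-initdb.

```shell
# Hypothetical sketch: print the steps initdb would take, without running them.
# Usage: plan_initdb APP_DIR [TABLE...]
plan_initdb() {
    local app_dir=$1; shift
    local t

    # In every case, the database itself must exist first.
    echo "ensure-database-exists"

    if [ $# -gt 0 ]; then
        # Arguments given: touch only the named tables, DDlog apps only.
        if [ ! -e "$app_dir/app.ddlog" ]; then
            echo "error: initializing individual tables requires a DDlog app" >&2
            return 1
        fi
        for t in "$@"; do
            echo "drop-create-table $t"
            echo "load-table-if-input-exists $t"
        done
    elif [ -e "$app_dir/app.ddlog" ]; then
        # No arguments, DDlog app: same as naming every table in the schema.
        echo "drop-create-load-all-ddlog-tables"
    else
        # No arguments, non-DDlog app: legacy path, kept backward compatible.
        echo "dropdb-and-createdb"
        echo "run schema.sql"
        echo "run input/init.sh"
    fi
}
```

Keeping the argument check at the top level, rather than nested inside the DDlog branch, makes the two modes of the command easy to tell apart.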

netj · 2015-09-13T16:37:00Z

shell/deepdive-initdb

@@ -16,8 +16,19 @@ db-init "$@"

 # make sure the necessary tables are all created
 if [[ -e app.ddlog ]]; then
-    # TODO export schema.sql from ddlog instead of running initdb pipeline
-    deepdive-run initdb
+    if [[ $# -gt 0 ]]; then


Rather than having these argument count checks buried deep inside, I think it's much clearer to define initdb's behavior entirely differently when arguments are specified. Please see my comment on the PR for reorganizing.

netj · 2015-09-13T16:40:58Z

Nice updates. Please see my comments.

feiranwang · 2015-09-16T00:35:22Z

Thanks! Will update accordingly.

feiranwang · 2015-09-20T09:53:48Z

@netj Updated. Thanks!

netj · 2015-09-24T04:32:04Z

Looks good, merging.

feiranwang added 3 commits August 31, 2015 22:27

DDlog update - incremental function call

e5958de

Add support for autoset parallelism for extractors in ddlog

0b387b3

Add support for command: deepdive initdb TABLE

70e8342

feiranwang assigned netj Sep 10, 2015

netj reviewed Sep 13, 2015

netj mentioned this pull request Sep 13, 2015

schema.sql file generated from ddlog? #357

Closed

Refactor setting PARALLELISM

05852e4

feiranwang mentioned this pull request Sep 20, 2015

fix DISTINCT issue in grounding #382

Merged

feiranwang force-pushed the ddlog_update branch 2 times, most recently from 30fda2e to 2bec633 Compare September 20, 2015 10:40

Refactor deepdive-initdb

2bec633

feiranwang mentioned this pull request Sep 24, 2015

Revise incremental documentation and interface #393

Open

4 tasks

netj added a commit that referenced this pull request Sep 24, 2015

Merge pull request #376 from HazyResearch/ddlog_update

3f5c360

Ddlog update

netj merged commit 3f5c360 into master Sep 24, 2015

netj deleted the ddlog_update branch September 24, 2015 04:32
