Skip to content

Restructure the abstract domains for data frame shapes #1760

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Draft
wants to merge 20 commits into
base: main
Choose a base branch
from

Conversation

ydq23
Copy link
Collaborator

@ydq23 ydq23 commented Jun 17, 2025

No description provided.

EagleoutIce and others added 19 commits March 14, 2025 15:03
* feat: basic concept for data frame create hook

* feat: data frame hook for string based column access

* feat: map variables to data frame domain at assignment

* feat: data frame semantics for assignment and expression list

* refactor: structure abstract interpretation hooks

* feat: basic tests for data frame abstract domain

* feat: test framework for inferred data frame properties

* test: add tests for data frame domain

* refactor: restructure processor decorators and function mapping

* lint-fix: linting errors

---------

Co-authored-by: Oliver <oliver.gerstl@uni-ulm.de>
* test: add support for multiple criteria for data frame tests

* test: test label support and clearer test entry type
* test: parse code only once and check criteria on result

* test: support parser selection for data frame assert tests
* feat: basic fold over CFG for forward traversal

* feat: add simplified forward-connected control flow graph

* feat-fix: correct label for CFG exit point edges

* test: unit tests for simple control flow graph

* feat: add CFG visitor for fixpoint iteration for data frames

* feat-fix: add missing negation in has-changed check

* feat-fix: subtract top from colnames should have no effect
* feat: basic fold over CFG for forward traversal

* feat: add simplified forward-connected control flow graph

* feat-fix: correct label for CFG exit point edges

* test: unit tests for simple control flow graph

* feat: add CFG visitor for fixpoint iteration for data frames

* feat-fix: add missing negation in has-changed check

* feat: setup for data frame processors and semantics

* feat: directly evaluate arguments when processing function

* feat: add semantics mapper for column/row access and assignment

* feat: implement data frame processors for basic nodes

* feat: return resulting data frame domain

* feat: support different types of inferred constraints

* test: add tests for data frame state domain

* feat: store abstract state in nodes

* feat: support control flow constructs in abstract interpretation

* test: add tests for control flow support

* feat: support colnames assignment, cbind and rbind
* feat: basic fold over CFG for forward traversal

* feat: add simplified forward-connected control flow graph

* feat-fix: correct label for CFG exit point edges

* test: unit tests for simple control flow graph

* feat: add CFG visitor for fixpoint iteration for data frames

* feat-fix: add missing negation in has-changed check

* feat: setup for data frame processors and semantics

* feat: directly evaluate arguments when processing function

* feat: add semantics mapper for column/row access and assignment

* feat: implement data frame processors for basic nodes

* feat: return resulting data frame domain

* feat: support different types of inferred constraints

* test: add tests for data frame state domain

* feat: store abstract state in nodes

* feat: support control flow constructs in abstract interpretation

* test: add tests for control flow support

* feat: support colnames assignment, cbind and rbind

* feat: support data frame head and tail function

* feat: support data frame subsetting via access operator

* feat: support data frame subset, filter, and select

* feat: support column and row assignment and magrittr pipe

* feat: support mutate, left_join and group_by

* test-fix: try to fix CI data frame test errors

* test-fix: skip tests with library to test CI

* test-fix: check evaluation of R code with library call in CI

* test-fix: unload library after test

* test-fix: restart R shell after library test

* refactor: restructure semantics and processors

* feat: support summarize and limit col names

* feat: basic delayed widening for fixpoint iteration

* test: add test for multiple else-if branches

* feat: use dataflow graph to resolve origin processor
* feat(benchmark): benchmark and summarizer for abstract interpretation

* test(benchmark): test for abstract interpretation benchmark

* feat-fix: adapt new absint info format

* perf: improve performance and fix max call stack exceeded

* perf: optimize performance of abstract domain operators

* feat-fix: summarize data frame operations

* feat: optional slicing criterion argument for df-shape query

* feat-fix: only map data frame operation if arg is data frame

* feat-fix: only infer colnames at creation if args are vectors

* feat: add number of files containing data frames to benchmark

* feat: add number of files without data frames to benchmark

* perf: improve performance of queue pop

* feat-fix: small issues in function mapping

* feat-fix: fix function mapping of replacement functions
* feat-fix: issues after merging new CFG

* feat-fix: fix constant propagation of vectors

* feat: use new CFG visitor for abstract interpretation

* feat: adapt semantic mappers and support named arguments

* feat-fix: restore auto-trimmed trailing whitespace

* test-fix: adapt config amendment
@ydq23 ydq23 linked an issue Jun 19, 2025 that may be closed by this pull request
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
2 participants