[0. Depends on issue #146.]
1. Add a constant cache manager that allocates and manages the buffers for folded constant tensors.
2. Add a runtime pipeline that prepares the correct arguments and calls the `fold` and `entry` functions.
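
A rough sketch of how the two pieces could interact is given here for discussion only. It assumes that `fold` writes its results into buffers handed out by the cache manager, and that `entry` then reads those buffers together with the runtime inputs. The names `ConstCacheManager`, `get_or_alloc`, `run_pipeline`, and the `folded_specs` mapping are hypothetical and are not the API introduced by this PR; plain Python lists stand in for real tensor buffers.

```python
from typing import Callable, Dict, List, Set

Buffer = List[float]


class ConstCacheManager:
    """Hypothetical manager that owns the buffers of folded constant tensors."""

    def __init__(self) -> None:
        self._buffers: Dict[int, Buffer] = {}
        self._folded: Set[int] = set()

    def get_or_alloc(self, tensor_id: int, size: int) -> Buffer:
        # Allocate on first request; later requests reuse the same buffer.
        if tensor_id not in self._buffers:
            self._buffers[tensor_id] = [0.0] * size
        return self._buffers[tensor_id]

    def is_folded(self, tensor_id: int) -> bool:
        return tensor_id in self._folded

    def mark_folded(self, tensor_id: int) -> None:
        self._folded.add(tensor_id)


def run_pipeline(
    cache: ConstCacheManager,
    fold: Callable[[List[Buffer], List[Buffer]], None],
    entry: Callable[[List[Buffer], List[Buffer]], Buffer],
    inputs: List[Buffer],
    weights: List[Buffer],
    folded_specs: Dict[int, int],  # assumed layout: tensor id -> element count
) -> Buffer:
    # Prepare the folded-constant arguments from the cache.
    folded = [cache.get_or_alloc(tid, size) for tid, size in folded_specs.items()]
    # Call `fold` only if some folded tensor has not been computed yet.
    if not all(cache.is_folded(tid) for tid in folded_specs):
        fold(weights, folded)
        for tid in folded_specs:
            cache.mark_folded(tid)
    # `entry` consumes the runtime inputs plus the cached folded constants.
    return entry(inputs, folded)


# Toy stand-ins for the compiled `fold` and `entry` functions.
fold_calls: List[int] = []


def demo_fold(weights: List[Buffer], folded: List[Buffer]) -> None:
    fold_calls.append(1)
    folded[0][:] = [2.0 * w for w in weights[0]]


def demo_entry(inputs: List[Buffer], folded: List[Buffer]) -> Buffer:
    return [x + c for x, c in zip(inputs[0], folded[0])]


cache = ConstCacheManager()
first = run_pipeline(cache, demo_fold, demo_entry, [[1.0, 2.0]], [[10.0, 20.0]], {0: 2})
second = run_pipeline(cache, demo_fold, demo_entry, [[3.0, 4.0]], [[10.0, 20.0]], {0: 2})
```

In this sketch the second call reuses the cached buffer and skips `fold`, which is the behavior a constant cache is meant to provide. Buffer lifetime, eviction, and thread safety are left out on purpose.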