forked from stelleg/legion
-
Notifications
You must be signed in to change notification settings - Fork 0
/
CHANGES.txt
247 lines (235 loc) · 10.7 KB
/
CHANGES.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
This file lists the major changes as they appear in the stable branch. No
attempt is made to keep this list accurate for the master branch.
Version 18.12.0 (December 27, 2018)
* Realm
- More assorted bug fixes
- Minor performance improvements in logging and accessor code
- Handle signals on an alternate stack for better debugging/backtraces
* Regent
- Added a new built-in complex type
- Experimental support for building with PUC Lua
- Multiple fixes to CUDA code generation, vectorization,
auto-parallelization, and mapping optimization
- Better error messages for __demand(__leaf) and so on
* Python
- Use PyGILState for threading for compatibility with modules (e.g. numpy)
- Support for calling tasks written in Regent
Version 18.09.0 (September 19, 2018)
* Legion
- Support for physical tracing, which can provide up to 7x improvement in
loops with very small tasks. Can be enabled in the mappers that
inherit from DefaultMapper using -dm:memoize 1
* Realm
- Assorted minor bug fixes
- Support for development snapshots of GASNet-EX (using GASNet-1
compatibility interfaces for now)
* Regent
- Changed precedence of logical operators (and, or) to match that of
Lua and Terra (or is now lower-precedence than and)
- Full support for accessing sparse multi-dimensional regions
- Initial support for incremental compilation. Enable with
REGENT_INCREMENTAL=1
- Changes to make compilation entirely deterministic
- Multiple compilation speed improvements
- Support for CUDA scalar reductions
- Experimental support for parallel prefix operators, including CUDA
* Python
- Support for defining methods as tasks
- Support for passing futures to tasks and index tasks
- Support for explicit return types on extern tasks
- Improved support for Futures with encodings other than pickle
Version 18.05.0 (May 31, 2018)
* Legion
- Migrated all node-local Legion reservations to use Realm
fast reservations and removed no longer necessary continuations
- Added support for mapper attached data to all Mappable types
- Added support for assigning a block of IDs to a library in a consistent
way across nodes via generate_library_task_ids and friends
* Realm
- Added support for "fast" reservations that have better
performance characteristics for reservations local to a node
* C API
- Updated projection functor API to match Legion C++ API
* Regent
- Regent now generates disjointness constraints for affine
expressions in partition accesses. E.g. p[i] and p[i+1] are
now known to be disjoint at compile time as long as p is a
disjoint partition
- Support for non-trivial projection functors in index space launches
such as f(p[i+1])
- Improvements to compile time spent in various optimization passes
- Support for parallel compilation with the flag -fjobs N
- Miscellaneous fixes
Version 18.02.0 (February 2, 2018)
* Legion
- Support for PowerPC vector intrinsics
- FieldAccessors support "view" coordinates and equivalent bounds checks
- Improved schedule priorities for Legion meta-tasks
* Realm
- Operation priority can now be adjust after a task/copy is launched
- Assorted bug/memory leak fixes
- AffineAccessors support an optional translation from "view" coordinates
to actual coordinates in the instance being accessed
* Regent
- Experimental support for calling Regent tasks from C/C++
- Support for building with CMake
- Support for running on PowerPC
* Bindings
- Obsolete Lua and Terra bindings have been removed. The remaining Terra
bindings have been renamed to Regent and now produce libregent.so
Version 17.10.0 (October 27, 2017)
* Legion
- Introduction of new partitioning API based on dependent partitioning
- Deprecation of old partitioning API, LegionRuntime::{Arrays,Accessors}
namespaces
* Realm
- Dependent partitioning API, including dimension-aware IndexSpace
- Point/Rect types moved to Realm namespace
- Instance creation allows caller to choose precise memory layout
- Accessors moved to Realm namespace, changed to match new instance layouts
* C API
- The C API is now accessed via the `legion.h` header file. Note that this
is still a redirect back to the current `legion/legion_c.h` header
* Legion Prof
- Added support for minimally invasive dumping of intermediate
profiling data while the application is still running for long runs
* Python
- New Python API bindings and native support for Python processors
Compile with USE_PYTHON=1 and run with -ll:py 1 to enable Python
Also see examples/python_interop for an example
Version 17.08.0 (August 24, 2017)
* Build system
- Added HDF_ROOT variable to customize HDF5 install location
* Legion
- New error message format and online reference at
http://legion.stanford.edu/messages
* Legion Prof
- Added new compact binary format for profile logs
- Added flag: -hl:prof_logfile prof_%.gz
* Realm
- Fixes to support big-endian systems
- Several performance improvements to DMA subsystem
- Added REALM_DEFAULT_ARGS environment variable
containing flags to be inserted at front of command line
* Regent
- Removed new operator. Unstructured regions are now
fully allocated by default
- Added optimization to automatically skip empty tasks
- Initial support for extern tasks that are defined elsewhere
- Tasks that use __demand(__openmp) are now constrained
to run on OpenMP processors by default
- RDIR: Better support for deeper nested region trees
Version 17.05.0 (May 26, 2017)
* Build system
- Finally removed long-obsolete SHARED_LOWLEVEL flag
* Legion
- Added C++14 [[deprecated]] attribute to existing deprecated APIs.
All examples should all compile without deprecation warnings
- Added Legion executor that enables support for interoperating
with Agency inside of Legion tasks
* Realm
- Switched to new DMA engine
- Initial support for OpenMP "processors". Compile with USE_OPENMP
and run with flags -ll:ocpu and -ll:othr.
* Regent
- Added support running normal tasks on I/O processors
- Added support for OpenMP code generation via __demand(__openmp)
* C API
- Removed the following deprecated types:
legion_task_result_t
(obviated by the new task preamble/postamble)
- Removed the following deprecated APIs:
legion_physical_region_get_accessor_generic
legion_physical_region_get_accessor_array
(use legion_physical_region_get_field_accessor_* instead)
legion_runtime_set_registration_callback
(use legion_runtime_add_registration_callback instead)
legion_runtime_register_task_void
legion_runtime_register_task
legion_runtime_register_task_uint32
legion_runtime_register_task_uint64
(use legion_runtime_preregister_task_variant_* instead)
legion_future_from_buffer
legion_future_from_uint32
legion_future_from_uint64
legion_future_from_bytes
(use legion_future_from_untyped_pointer instead)
legion_future_get_result
legion_future_get_result_uint32
legion_future_get_result_uint64
legion_future_get_result_bytes
(use legion_future_get_untyped_pointer instead)
legion_future_get_result_size
(use legion_future_get_untyped_size instead)
legion_future_map_get_result
(use legion_future_map_get_future instead)
Version 17.02.0 (February 14, 2017)
* General
- Bumped copyright dates
* Legion
- Merged versioning branch with support for a higher performance
version numbering computation
- More efficient analysis for index space task launches
- Updated custom projection function API
- Added support for speculative mapping of predicated operations
- Added index space copy and fill operations
* Legion Prof
- Added a stats view of processors grouped by node and processor type
- Added ability to collapse/expand each processor/channel/memory in
a timeline. To collapse/expand a row, click the name. To
collapse/expand the children of a row, click on the triangle
next to the name.
- Grouped the processor timelines to be child elements under the stats
views
- Added on-demand loading of each processor/stats in a timeline.
Elements are only loaded when you expand them, saving bandwidth
* CMake
- Switched to separate flags for each of the Legion extras directories:
-DLegion_BUILD_APPS (for ./apps)
-DLegion_BUILD_EXAMPLES (for ./examples)
-DLegion_BUILD_TUTORIAL (for ./tutorial)
-DLegion_BUILD_TESTS (for ./test)
Version 16.10.0 (October 7, 2016)
* Realm
- HDF5 support: moved to Realm module, added DMA channels
- PAPI support: basic profiling (instructions, caches, branches) added
* Build flow
- Fixes to support compilation in 32-bit mode
- Numerous improvements to CMake build
* Regent
- Improvements to vectorization of structured codes
* Apps
- Removed bit-rotted applications - some have been replaced by examples
or Regent applications
* Tests
- New test infrastructure and top-level test script `test.py`
Version 16.08.0 (August 30, 2016)
* Realm
- Critical-enough ("error" and "fatal" by default, controlled with
-errlevel) logging messages are mirrored to stderr when -logfile is
used
- Command-line options for logging (-error and new -errlevel) support
English names of logging levels (spew, debug, info, print,
warn/warning, error, fatal, none) as well as integers
* Legion
- Rewrite of the Legion shutdown algorithm for improved scalability
and avoiding O(N^2) behavior in the number of nodes
* Regent
- Installer now prompts for RDIR installation
* Tools
- Important Legion Spy performance improvements involving transitive
reductions
Version 16.06.0 (June 15, 2016)
* Legion
- New mapper API:
use ShimMapper for limited backwards compatibility
- New task variant registration API
supports specifying layout constraints for region requirements
old interface is still available but deprecated
- Several large bug fixes for internal version numbering computation
* C API
- The context parameter for many API calls has been removed
* Tools
- Total re-write of Legion Spy
Version 16.05.0 (May 2, 2016)
* Lots of stuff - we weren't itemizing things before this point.