Skip to content

Commit b5dc2f4

Browse files
committed
Update on "[XNNPACK][Weights Cache] Initial Weights Cache Design with NamedDataMap"
XNNWeightsCache Design with NamedDataMap. The intent of the weights cache is for tensors to be loaded (via name) through the named data map. APIs to be used by XNNCompiler: - load_unpacked_data - Takes in a string name (tensor name). The weights cache loads the data for this string from the named data map and returns the pointer. It also creates a mapping of this pointer to the name which is later used by the XNNPACK's internal weight cache implementation - free_unpacked_data - Frees all the unpacked data loaded from NamedDataMap. This is only safe to call after xnn_create_runtime has been called. This is because create_runtime takes unpacked data pointers and packs them into a separate buffer. - a couple getter methods - get_packed_data_names - get_unpacked_data_names - get_num_packed_data - get() (get's the xnn_weights_cache object) Internal APIs used by XNNPACK Library - look_up - takes a cache key (weight and bias pointers) and looks up the offset to the packed weight if it exists - look_up_or_insert - takes a cache key and pointer to packed weights and looks_up the offset if it exists, or inserts a new packed weight into the cache and returns that offset - offset_to_addr - gets offset and returns address to packed pointer - reserve_space - returns memory address with appropriate sie for XNNPACK to populate with packed weights ( I want to use the runtime_allocator for this but i don't think we have the right sizes, so for now we are just using a string buffer and resizing it) - is_finalized - since this cache doesn't necessarily need to care about a finalized state we always return true. - delete_cache - deletes cache Differential Revision: [D70885917](https://our.internmc.facebook.com/intern/diff/D70885917/) [ghstack-poisoned]
2 parents a3e1d71 + c05fd47 commit b5dc2f4

File tree

1 file changed

+1
-2
lines changed

1 file changed

+1
-2
lines changed

backends/xnnpack/test/runtime/test_xnn_weights_cache.cpp

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -281,7 +281,6 @@ TEST_F(XNNWeightsCacheTest, ReusePackedWeights) {
281281
// Packed Data Still exists because it has a ref count of 2
282282
ASSERT_EQ(packed_data_names.size(), 1);
283283
weight_cache.delete_packed_data(weight_cache.get_packed_data_names());
284-
packed_data_names =
285-
weight_cache.get_packed_data_names();
284+
packed_data_names = weight_cache.get_packed_data_names();
286285
ASSERT_EQ(packed_data_names.size(), 0);
287286
}

0 commit comments

Comments
 (0)