Fix typo in the DSA1 implementation #67

Since we reworked Hypercane to use '.halg' formatted files as part of the IIPC 2021 Grant work, the DSA1 algorithm implementation has been wrong. We execute the time-slice step twice; the second call should be the DBSCAN step: https://github.com/oduwsdl/hypercane/blob/b9656621b5859a872d3cc6ffef1c0fd028181df5/hypercane/packaged_algorithms/dsa1.halg#L79-L93

It needs to follow AlNoamany's algorithm again, as it did when I was working on my dissertation.

	# prevent extra work if we already have it from previous runs
	if [ ! -e ${TIME_SLICE_FILE} ]; then
		echo "clustering mementos from remainder by time"
		hc cluster time-slice -i mementos -a ${ONLY_ENGLISH_FILE} -o ${TIME_SLICE_FILE} -l ${TIME_SLICE_LOG}
	fi

	# apply DBSCAN to cluster by Simhash distance
	DBSCAN_FILE=${WORKING_DIRECTORY}/dsa1-dbscan.tsv
	DBSCAN_LOG=${WORKING_DIRECTORY}/dsa1-cluster-dbscan.log

	# prevent extra work if we already have it from previous runs
	if [ ! -e ${DBSCAN_FILE} ]; then
		echo "clustering mementos from remainder by Simhash"
		hc cluster time-slice -i mementos -a ${TIME_SLICE_FILE} -o ${DBSCAN_FILE} -l ${DBSCAN_LOG}
	fi
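
A possible shape for the corrected second step is sketched below. It is a sketch under assumptions, not the final patch: it assumes the clustering action is invoked as `hc cluster dbscan` and that it accepts the same `-i`, `-a`, `-o`, and `-l` options as `hc cluster time-slice`. Any feature or distance options DBSCAN may need are left out because they are not shown in this issue. The function name `dsa1_dbscan_step` is hypothetical and exists only so the step can be exercised in isolation.

```shell
# Hypothetical wrapper around the second clustering step of DSA1.
# Assumption: `hc cluster dbscan` takes the same -i/-a/-o/-l options
# as `hc cluster time-slice`; verify against the Hypercane CLI help.
dsa1_dbscan_step() {
    # $1: working directory, $2: output file of the time-slice step
    WORKING_DIRECTORY="$1"
    TIME_SLICE_FILE="$2"

    # apply DBSCAN to cluster by Simhash distance
    DBSCAN_FILE="${WORKING_DIRECTORY}/dsa1-dbscan.tsv"
    DBSCAN_LOG="${WORKING_DIRECTORY}/dsa1-cluster-dbscan.log"

    # prevent extra work if we already have it from previous runs
    if [ ! -e "${DBSCAN_FILE}" ]; then
        echo "clustering mementos from remainder by Simhash"
        # the only change from the current code: dbscan instead of time-slice
        hc cluster dbscan -i mementos -a "${TIME_SLICE_FILE}" -o "${DBSCAN_FILE}" -l "${DBSCAN_LOG}"
    fi
}
```

The input to this step stays `${TIME_SLICE_FILE}`, so each time slice is still sub-clustered by Simhash distance; only the subcommand changes.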
