Add `ST_Azimuth()` #596

yutannihilation · 2025-06-01T01:51:12Z

This pull request implements ST_Azimuth().

I'm not sure if this is very common function considering there's only one feature request in Discussions, but I found I need this.

SELECT degrees(ST_Azimuth(ST_Point(0, 0), ST_Point(0, 1)));
----
90.0

(Maybe I should retry this after #588 gets merged...?)

Maxxen

Looks good! Thanks a lot for picking this up!
I've left some small comments, when you address them you might want to switch the target branch to v1.3.0 too!

Maxxen · 2025-06-01T10:58:31Z

src/spatial/modules/main/spatial_functions_scalar.cpp

+		auto left_x = FlatVector::GetData<double>(*left_entries[0]);
+		auto left_y = FlatVector::GetData<double>(*left_entries[1]);
+		auto right_x = FlatVector::GetData<double>(*right_entries[0]);
+		auto right_y = FlatVector::GetData<double>(*right_entries[1]);
+
+		auto &result_mask = FlatVector::Validity(result);


You can't assume that the input data is a FlatVector (unless you call .Flatten() on the DataChunk first). You can either do the vector access manually by setting up UnifiedVectorFormats but it's much easier to use the Binary/Unary/GenericExector helper classes in simple cases. For STRUCT inputs, you can use the GenericExectuor. As an example, have a look at e.g. ST_Distance_Spheroid:

//------------------------------------------------------------------------------------------------------------------ // POINT_2D //------------------------------------------------------------------------------------------------------------------ static void ExecutePoint(DataChunk &args, ExpressionState &state, Vector &result) { D_ASSERT(args.data.size() == 2); auto &left = args.data[0]; auto &right = args.data[1]; auto count = args.size(); using POINT_TYPE = StructTypeBinary<double, double>; using DISTANCE_TYPE = PrimitiveType<double>; GenericExecutor::ExecuteBinary<POINT_TYPE, POINT_TYPE, DISTANCE_TYPE>( left, right, result, count, [&](POINT_TYPE left, POINT_TYPE right) { return sgl::math::haversine_distance(left.a_val, left.b_val, right.a_val, right.b_val); }); }

Maxxen · 2025-06-01T10:59:14Z

src/spatial/modules/main/spatial_functions_scalar.cpp

+		}
+	}
+
+	static double calcAngle(double x1, double y1, double x2, double y2) {


nit: we use PascalCase for functions - but its not super important.

Ah, thanks for catching!

yutannihilation · 2025-06-01T11:23:14Z

Thanks for the review! I'm still not very familiar with DuckDB internals, so that information is really helpful.

yutannihilation · 2025-06-01T12:43:00Z

src/spatial/modules/main/spatial_functions_scalar.cpp

+				// If the points are the same, return NULL
+				if (left.a_val == right.a_val && left.b_val == right.b_val) {
+					// TODO
+					// result_mask.SetInvalid(i);
+					return 0.0;
+				}


Unlike BinaryExecutor, it seems there's no variant of *WithNulls, so, if I understand correctly, it's not possible to return NULL.

Personally, I feel this should be NaN for this case instead of NULL, so using NaN might be an option. However, since PostGIS returns NULL, it might be confusing.

Alright, maybe it's easiest to just call .Flatten() on args, then you can assume that all input vectors are FlatVector.

Just to illustrate, this is how it would look if you wanted to handle it the most general way possible, the following will handle all types of input vector layouts:

static void ExecutePoint(DataChunk &args, ExpressionState &state, Vector &result) { D_ASSERT(args.data.size() == 2); const auto count = args.size(); auto &lhs = args.data[0]; auto &rhs = args.data[1]; UnifiedVectorFormat lhs_format; UnifiedVectorFormat rhs_format; lhs.ToUnifiedFormat(count, lhs_format); rhs.ToUnifiedFormat(count, rhs_format); auto &lhs_coords = StructVector::GetEntries(lhs); auto &rhs_coords = StructVector::GetEntries(rhs); UnifiedVectorFormat lhs_x_format; UnifiedVectorFormat lhs_y_format; lhs_coords[0]->ToUnifiedFormat(count, lhs_x_format); lhs_coords[1]->ToUnifiedFormat(count, lhs_y_format); UnifiedVectorFormat rhs_x_format; UnifiedVectorFormat rhs_y_format; rhs_coords[0]->ToUnifiedFormat(count, rhs_x_format); rhs_coords[1]->ToUnifiedFormat(count, rhs_y_format); auto &validity = FlatVector::Validity(result); for (idx_t out_idx = 0; out_idx < args.size(); out_idx++) { const auto lhs_idx = lhs_format.sel->get_index(out_idx); const auto rhs_idx = rhs_format.sel->get_index(out_idx); // We assume that the coordinate vectors dont contain any nulls, so we just check if the struct itself is // null, dont bother checking nested validity. if (!lhs_format.validity.RowIsValid(lhs_idx) || !rhs_format.validity.RowIsValid(rhs_idx)) { validity.SetInvalid(out_idx); continue; } const auto lhs_x_idx = lhs_x_format.sel->get_index(out_idx); const auto lhs_y_idx = lhs_y_format.sel->get_index(out_idx); const auto rhs_x_idx = rhs_x_format.sel->get_index(out_idx); const auto rhs_y_idx = rhs_y_format.sel->get_index(out_idx); const auto &lhs_x_val = UnifiedVectorFormat::GetData<double>(lhs_x_format)[lhs_x_idx]; const auto &lhs_y_val = UnifiedVectorFormat::GetData<double>(lhs_y_format)[lhs_y_idx]; const auto &rhs_x_val = UnifiedVectorFormat::GetData<double>(rhs_x_format)[rhs_x_idx]; const auto &rhs_y_val = UnifiedVectorFormat::GetData<double>(rhs_y_format)[rhs_y_idx]; auto res = /* operation, involving lhs_x_val, lhs_y_val, rhs_x_val, rhs_y_val */ 0.0; FlatVector::GetData<double>(result)[out_idx] = res; } if (args.AllConstant()) { result.SetVectorType(VectorType::CONSTANT_VECTOR); } }

But if you really want to maximize performance you switch on the vector.GetVectorType() and specialize cases when all inputs are Flat or Constant - thats kind of what the *Executor helpers do under-the-hood. But again, in this case I would just call .Flatten() to turn everything into flat vectors and call it a day.

yutannihilation · 2025-06-03T00:16:55Z

Thanks for explaining such details! I agree .Flatten() is handy here. Considering the POINT2D case would be probably less frequent than GEOMETRY, we don't need to worry much about the performance. Hopefully, DuckDB will eventually get GenericExecutor::ExecuteBinaryWithNulls so that I can switch the implementation.

Maxxen · 2025-06-03T13:10:51Z

Thanks!

yutannihilation · 2025-06-03T13:17:30Z

Thanks for reviewing!

yutannihilation added 6 commits June 1, 2025 07:43

Implement ST_Azimuth

ac349f7

Accept GEOMETRY

0ed52df

Tweak

77d4c0e

Tweak

cdfe9cf

Add test

680b5dc

Add doc to functions.md

85974ad

Maxxen requested changes Jun 1, 2025

View reviewed changes

yutannihilation added 2 commits June 1, 2025 20:39

Merge remote-tracking branch 'upstream/v1.3.0' into feat/st_azimuth

ce075be

Address comment

60ee1c5

yutannihilation changed the base branch from main to v1.3.0 June 1, 2025 12:38

yutannihilation commented Jun 1, 2025

View reviewed changes

yutannihilation marked this pull request as draft June 1, 2025 12:43

yutannihilation added 2 commits June 3, 2025 08:56

Merge remote-tracking branch 'upstream/v1.3.0' into feat/st_azimuth

e8e6155

Use .Flatten()

f830338

yutannihilation marked this pull request as ready for review June 3, 2025 00:16

Use the PI constant exported from duckdb

e5b1cd4

Maxxen merged commit c7f5bd4 into duckdb:v1.3.0 Jun 3, 2025
22 checks passed

yutannihilation deleted the feat/st_azimuth branch June 3, 2025 13:17

yutannihilation mentioned this pull request Jun 11, 2025

Add GenericExecutor::Execute*WithNulls duckdb/duckdb#17874

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add `ST_Azimuth()` #596

Add `ST_Azimuth()` #596

Uh oh!

yutannihilation commented Jun 1, 2025

Uh oh!

Maxxen left a comment

Uh oh!

Maxxen Jun 1, 2025

Uh oh!

Maxxen Jun 1, 2025

Uh oh!

yutannihilation Jun 1, 2025

Uh oh!

yutannihilation commented Jun 1, 2025

Uh oh!

yutannihilation Jun 1, 2025

Uh oh!

Maxxen Jun 2, 2025

Uh oh!

Maxxen Jun 2, 2025 •

edited

Loading

Uh oh!

yutannihilation commented Jun 3, 2025

Uh oh!

Maxxen commented Jun 3, 2025

Uh oh!

Uh oh!

yutannihilation commented Jun 3, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add ST_Azimuth() #596

Add ST_Azimuth() #596

Uh oh!

Conversation

yutannihilation commented Jun 1, 2025

Uh oh!

Maxxen left a comment

Choose a reason for hiding this comment

Uh oh!

Maxxen Jun 1, 2025

Choose a reason for hiding this comment

Uh oh!

Maxxen Jun 1, 2025

Choose a reason for hiding this comment

Uh oh!

yutannihilation Jun 1, 2025

Choose a reason for hiding this comment

Uh oh!

yutannihilation commented Jun 1, 2025

Uh oh!

yutannihilation Jun 1, 2025

Choose a reason for hiding this comment

Uh oh!

Maxxen Jun 2, 2025

Choose a reason for hiding this comment

Uh oh!

Maxxen Jun 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

yutannihilation commented Jun 3, 2025

Uh oh!

Maxxen commented Jun 3, 2025

Uh oh!

Uh oh!

yutannihilation commented Jun 3, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add `ST_Azimuth()` #596

Add `ST_Azimuth()` #596

Maxxen Jun 2, 2025 •

edited

Loading