Implemented some goodness-of-fit tests #60

JoseKling · 2025-07-21T08:01:01Z

Statistics

Implemented two versions of the Kolmogorov-Smirnov distance as a test statistic.

Time-rescaled event times against a uniform distribution
Time-rescaled interevent times against a unit exponential distribution

Tests

Bootstrap and non-bootstrap based tests

Bootstrap
1. Estimate process
2. Calculate test statistic
3. Simulate data with estimated process
4. Estimate processes from each simulation
5. Calculate test statistic for each simulation
6. Compare test statistic from data and from simulations to get the p value
Non-bootstrap
Same as above, but skip step 4, so use estimation in step 1 to calculate test statistic

Issues

Types and methods names
Perhaps the names can be improved. If you have any suggestions...
Type piracy
Tried implementing a fit method for the Dirac distribution (there isn't one in Distributions.jl), so the fit method works for the UnivariatePoissonProcess, but tests complain about type piracy. Decided to implement a fit method specifically for unmarked Poisson processes instead.

- Bootstrap and non-bootstrap based tests - KSDistance statistic for hypothesis testing - Time re-scaled events againt uniform - Time re-scaled inter events against exponential

- Tests take types instead of instances of point processes - `NoBootstrapTest` may take an instances TODO: - Add Tests

- Statistics now work for general types - Added a `AbstractRNG` to the tests to control simulations - `BootstrapTest` and `NoBootstrapTest` are subtypes of `HypothesisTest` - Fields have fixed types, as done in `HypothesisTests.jl` - Fixed some mistakes TODO: Add tests

codecov · 2025-08-04T08:01:42Z

Codecov Report

❌ Patch coverage is 98.79518% with 1 line in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
src/HypothesisTests/point_process_tests.jl	90.00%	1 Missing ⚠️

📢 Thoughts on this report? Let us know!

ISSUES: - Type piracy: needed to implement `fit` for the `Dirac` distribution - Names: type and method names could maybe be improved - PPTest: Fields in `BootstrapTest` and `NoBootstrapTest` make sense only for simulation based tests. Anything more general?

`Distributions.jl` does not provide a `fit` method, so a separate `fit` method for unmarked Poisson processes was added.

src/HypothesisTests/PPTests/BootstrapTest.jl

src/HypothesisTests/PPTests/NoBootstrapTest.jl

src/HypothesisTests/Statistics/KSDistance.jl

rsenne · 2025-12-06T16:23:05Z

src/HypothesisTests/pp_test.jl

+on the event times according to the selected distribution.
+=#
+function transform(::Type{<:Uniform}, pp::AbstractPointProcess, h)
+    (length(h.times) < 1) && return 1.0 # No events ⇒ maximum distance


I think this line could be a problem. The first is that this creates a type instability. In the normal branching logic we return a tuple (inter_transf, Uniform(1)) but in this other logic we return just 1.0. Would it make more sense to throw an error? 1.0 doesn'thave a clear statistical meaning to me as we can't even really do any type of GOF here? E.g., if i witnessed a single observation how can i really say whether or not it fits my model without strong prior info? We could also throw a warning and do something like:

if length(h.times) < 2 @warn "No inter-event times: length(h.times) = $(length(h.times))" return Float64[], Uniform(1) end

Yes, this is a reminiscent of some old code, it does not make sense.
In principle, we could just send the result of the time_transform as it is, no matter if it is empty or not.
Now, perhaps performing a test on an empty history would not make sense, so it might make sense to throw an error if the argument h in NoBootstrapTest or BootstrapTest is an empty history. What would make sense is if one of the simulated histories inside the loop are empty.
The thing is how to define the empirical cumulative distribution when the number of samples is 0. To me, it makes sense to define it to be $\hat{F}(t) = 0$ (which is not really a probability distribution, but this is a degenerate case), therefore, the KS distance between this and any other non-degenerate distribution (like Uniform(0, 0)) would be 1.

rsenne · 2025-12-06T16:23:34Z

src/HypothesisTests/pp_test.jl

+end
+
+function transform(::Type{<:Exponential}, pp::AbstractPointProcess, h)
+    (length(h.times) < 2) && return 1.0 # If `h` has only 2 elements, than there are no interevent times


Same comment as in the uniform case.

src/HypothesisTests/pp_test.jl

src/HypothesisTests/PPTests/BootstrapTest.jl

src/HypothesisTests/PPTests/NoBootstrapTest.jl

- Fixed tests for empty histories - Added `@testset`s for clarity

JoseKling · 2025-12-11T11:16:19Z

I think everything was resolved.
The only issue is that I get error in the tests here, but not locally. I will have a look at it later.

Two remarks:

Not sure about all the names, especially NoBootstrapTest and PPTest. Of course, this is just a small detail, but if you have something better...

And I kept the value of the test statistic equal to 1 when the simulated event history is empty, but I raise an error when history to be tested is empty. The exception is when the NoBootstrapTest is called with an empty history and a instantiated process (as opposed to called with a type).
For example, if the history is empty and the process is a Poisson(10). then the p value would return the percentage of simulations that generated an empty history, which is actually how probable it is for the process to produce an empty history.

rsenne · 2025-12-11T15:41:02Z

src/HypothesisTests/PPTests/NoBootstrapTest.jl

+- `pp::Union{AbstractPointProcess, Type{<:AbstractPointProcess}}`: the null hypothesis model family
+- `h::History`: the observed event history
+- `n_sims::Int=1000`: number of simulations to perform for the test
+- `rng::AbstractRNG=default_rng()test statistics from simulated data`: Random number generator


Suggested change

- `rng::AbstractRNG=default_rng()test statistics from simulated data`: Random number generator

- `rng::AbstractRNG=default_rng()`: Random number generator

rsenne · 2025-12-11T15:41:40Z

My thoughts on naming:

PPTest i think should be PointProcessTest its more verbose but i think Julia style reccs generally prefer verbosity to non-canonical acronyms.
NoBootstrapTest would better be described as MonteCarloTest i think its better to describe what it does and not what it doesn't do.
BootstrapTest is good to me. If you want to be more specific ParametricBootstrapTest is an alternative.

rsenne · 2025-12-11T15:42:55Z

src/HypothesisTests/PPTests/bootstrap_test.jl

+function StatsAPI.pvalue(bt::BootstrapTest)
+    (count(>=(bt.stat), bt.sim_stats) + 1) / (bt.n_sims + 1)
+end


Suggested change

function StatsAPI.pvalue(bt::BootstrapTest)

(count(>=(bt.stat), bt.sim_stats) + 1) / (bt.n_sims + 1)

end

function StatsAPI.pvalue(bt::BootstrapTest)

return (count(>=(bt.stat), bt.sim_stats) + 1) / (bt.n_sims + 1)

end

rsenne · 2025-12-11T15:43:44Z

src/HypothesisTests/pp_test.jl

+Calculate the p-value of a goodness-of-fit test on a process.
+
+# Arguments
+- `::PPTest`: the bootstrap test result object


Suggested change

- `::PPTest`: the bootstrap test result object

- `::PPTest`: the test result object

rsenne · 2025-12-11T15:47:23Z

Left my last final comments as well!

JoseKling · 2025-12-12T08:58:13Z

Incorporated the minor suggestions, but, more importantly, refactored the multi-threaded part of the tests.

I was getting an error in test (lts) saying that the vector rngs = [Xoshiro(rand(rng, UInt)) for _ in 1:Threads.nthreads] was being accessed at nthreads + 1. I found out that the thread ids inside the loop were not from 1 to nthreads anymore, but from 2 to nthreads + 1.
I looked more into it and found this Julia Language Blog post and this Discord discussion. Good to keep in mind.

rsenne · 2025-12-13T00:34:42Z

Looks good to me. Just approved the merge

JoseKling · 2025-12-16T09:34:12Z

Added a lock for accessing the master rng in the tests. Although the default_rng() is thread safe and would not cause problems without the lock, this works for any rng.

JoseKling added 3 commits July 16, 2025 08:31

Implemented basic goodness-of-fit tests

90cd7aa

- Bootstrap and non-bootstrap based tests - KSDistance statistic for hypothesis testing - Time re-scaled events againt uniform - Time re-scaled inter events against exponential

- Added docstrings

34cb55a

- Tests take types instead of instances of point processes - `NoBootstrapTest` may take an instances TODO: - Add Tests

JoseKling mentioned this pull request Oct 28, 2025

Standard Hawkes Process #59

Merged

JoseKling added 2 commits November 10, 2025 09:49

Merge branch 'main' into GoFTest

01752a1

Improved types, added documentation

50b8f79

JoseKling mentioned this pull request Nov 11, 2025

HomogeneousPoissonProcess type #63

Closed

JoseKling added 2 commits November 15, 2025 11:51

Merge branch 'main' into GoFTest

42f0351

JoseKling force-pushed the GoFTest branch from cd360af to fcb91c6 Compare November 17, 2025 10:51

JoseKling marked this pull request as ready for review November 17, 2025 10:54

Tests and solved type piracy

fedc286

`Distributions.jl` does not provide a `fit` method, so a separate `fit` method for unmarked Poisson processes was added.

JoseKling requested a review from rsenne December 4, 2025 10:56