Implement confidence intervals for predictions #104

andreasnoack · 2025-10-30T20:24:17Z

This implements the confidence interval as described in Cleveland and Grosse 1991 except for the statistical approximation of the deltas described in section 4 of the paper. They don't seem to share the coefficients of their fit, so it is not easy to implement that part. In addition, computers have much more memory these days, so I don't think the big matrix is a problem in most cases. I'd be interested in anybody knows about more recent approaches to calculating the deltas without the need for the big matrix.

I tried to mimic the interface for predict in GLM but decided to use a struct to return the expensive helper quantities together with the confidence bounds. I'm not a big fan of using Symbols for finite options like here but that is what GLM currently does. Maybe we should change it, but that is a separate concern.

With this, you can construct this plot

The same plot with ggplot2 which uses Cleveland and Grosses code for the computations is

Closes #29

the derivates

andreasnoack · 2025-10-30T20:55:04Z

The new implementation stores the rows of the hat matrix (and a bit more) for each vertex when constructing the Loess fit. That requires much more memory than the old implementation and this causes the benchmark runs to run out of memory because it tests relatively large problems, see

Loess.jl/benchmark/benchmarks.jl

Lines 7 to 12 in 5ecfefa

    
           for i in 2:6 
        
               n = 10^i 
        
               x = rand(MersenneTwister(42), n) 
        
               y = sqrt.(x) 
        
               SUITE["random"][string(n)] = @benchmarkable loess($x, $y) 
        
           end

. It looks like R might be avoiding these rows in the loess function and only constructs when when you call predict on the loess object but you pay the price of essentially recomputing all the local fits in predict. In my opinion, the main use of Loess these days is for visualization with uncertainty bounds, so I'm leaning towards just accepting that the implementation can't handle as large problems.

andreasnoack requested a review from palday October 30, 2025 20:24

andreasnoack force-pushed the an/uncertain branch from 535f401 to 3479ce7 Compare October 30, 2025 20:29

andreasnoack added 3 commits October 30, 2025 21:30

Center predictors to simplify the calculation of predictions and

68fb266

the derivates

Compute the hat matrix

1ea668c

Implement confidence intervals for predictions

32fcc1f

andreasnoack force-pushed the an/uncertain branch from 3479ce7 to 32fcc1f Compare October 30, 2025 20:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implement confidence intervals for predictions #104

Implement confidence intervals for predictions #104

Uh oh!

andreasnoack commented Oct 30, 2025 •

edited

Loading

Uh oh!

andreasnoack commented Oct 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Implement confidence intervals for predictions #104

Are you sure you want to change the base?

Implement confidence intervals for predictions #104

Uh oh!

Conversation

andreasnoack commented Oct 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

andreasnoack commented Oct 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

andreasnoack commented Oct 30, 2025 •

edited

Loading