to Log or not to Log transform biomass data #310
-
Hi all, |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 2 replies
-
With a Tweedie distribution, you'd typically use a log link with the raw biomass data as the response. The Tweedie is for 0's and positive continuous values. With a low mean, this can create lots of zeros. I imagine you'd only log transform the response data (and add some small arbitrary value first because of the zeros) if you wanted to try to model this with a Gaussian family. Other alternatives would be |
Beta Was this translation helpful? Give feedback.
-
How exactly are you getting from the first histogram to the second? The log of 0 is -Inf and the log of anything less than 1 will be negative. But, yes, the Tweedie can handle zeros just fine and the linear predictor will be in log space with a log link. After fitting you could try the |
Beta Was this translation helpful? Give feedback.
With a Tweedie distribution, you'd typically use a log link with the raw biomass data as the response. The Tweedie is for 0's and positive continuous values. With a low mean, this can create lots of zeros. I imagine you'd only log transform the response data (and add some small arbitrary value first because of the zeros) if you wanted to try to model this with a Gaussian family. Other alternatives would be
delta_gamma()
ordelta_lognormal()
within sdmTMB, which are also known as hurdle models, to model the biomass data. There's a section in the latest preprint version on these 'delta' families within sdmTMB. https://doi.org/10.1101/2022.03.24.485545