Skip to content

Getting predicted distributions for y ~ Normal(mean=0, std=x) #329

Closed Answered by joseortiz3
joseortiz3 asked this question in Q&A
Discussion options

You must be logged in to vote

@alejandroschuler I see now that the class Normal is parameterized by log(sigma) (aka log(scale) / log(std_dev)) and mean (aka loc). So when the linear base models are fit, they output distribution params that are linear, and these become linear log(sigma) and linear mean; in math, ax + b = log(sigma) -> sigma = exp(ax + b). The standard deviation ends up exponential while the mean has the correct linear dependence.

So here, with a linear base model, features would need to contain at least two columns - log(x) and x - to hope to capture y ~ N(0, std=x).

When I do this with:

# Need one feature for the linear log(std_dev) dependence
# because log(std_dev) is the NGBoost Normal distribution'…

Replies: 3 comments 2 replies

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
1 reply
@alejandroschuler
Comment options

Answer selected by joseortiz3
Comment options

You must be logged in to vote
1 reply
@joseortiz3
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants