
linear interpolation? #14

@dribnet

Hi - I've been doing a lot of work lately with interpolation in latent space, and I think linear interpolation might not be the best interpolation operator for high-dimensional spaces. Admittedly this is common practice, but this seemed as good a place as any to discuss it, since the dcgan code seems to do exactly that here:

noiseL = torch.FloatTensor(opt.nz):uniform(-1, 1)
noiseR = torch.FloatTensor(opt.nz):uniform(-1, 1)
if opt.noisemode == 'line' then
    -- do a linear interpolation in Z space between point A and point B
    -- each sample in the mini-batch is a point on the line
    line = torch.linspace(0, 1, opt.batchSize)
    for i = 1, opt.batchSize do
        noise:select(1, i):copy(noiseL * line[i] + noiseR * (1 - line[i]))
    end
end

I'm starting with the assumption that torch.FloatTensor(opt.nz):uniform(-1, 1) is a valid way to uniformly sample from the prior in the latent space. In the examples below, I'll leave the nz dimension at the default of 100. Let's do an experiment and see what the expected lengths of these vectors are.
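Here is a rough sketch of that experiment in Torch (the trial count is my choice; the sampling call is the same one used above):

require 'torch'

local nz, trials = 100, 10000
local norms = torch.Tensor(trials)
for i = 1, trials do
    -- same sampling call as in the dcgan snippet above
    norms[i] = torch.FloatTensor(nz):uniform(-1, 1):norm()
end
-- mean and standard deviation of the observed lengths
print(norms:mean(), norms:std())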

[figure: histogram of lengths of uniform(-1, 1) samples in 100 dimensions]

I see a gaussian with mean about 5.76 and standard deviation about 0.25. I believe this means that >99% of vectors would be expected to have a length between 4.8 and 6.8 (4 standard deviations out). This result should not be a big surprise if we think about taking 100 independent random numbers and then running them through the distance formula.
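A quick back-of-the-envelope check agrees: a uniform(-1, 1) coordinate has E[z_i^2] = 1/3, so E[||z||^2] = 100 * 1/3 ≈ 33.3, and lengths concentrate near sqrt(100/3) ≈ 5.77 - right where the histogram puts them.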

But now let's think about the effect of linear interpolation between these random vectors. The extreme case is the midpoint halfway between any two of them - let's see what the expected lengths of those midpoints are.
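The same measurement, now on midpoints of pairs of independent samples (again a rough sketch; the trial count is mine):

require 'torch'

local nz, trials = 100, 10000
local mids = torch.Tensor(trials)
for i = 1, trials do
    local a = torch.FloatTensor(nz):uniform(-1, 1)
    local b = torch.FloatTensor(nz):uniform(-1, 1)
    -- the t = 0.5 point on the line between a and b
    mids[i] = ((a + b) * 0.5):norm()
end
print(mids:mean(), mids:std())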

[figure: histogram of lengths of midpoints between pairs of uniform(-1, 1) samples]

So now we have a gaussian with mean length about 4.06 and standard deviation 0.24. Needless to say, these are not the same distribution; in fact they are effectively disjoint - the probability of a sample from the second distribution having a length typical of the first is vanishingly small. In other words, the points on the linearly interpolated path are many standard deviations away from the points expected under the prior distribution.
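The shift is also predicted by the same variance arithmetic: each midpoint coordinate (a_i + b_i)/2 has variance (1/3 + 1/3)/4 = 1/6, so E[||m||^2] = 100/6 ≈ 16.7 and lengths concentrate near sqrt(100/6) ≈ 4.08, in line with the observed 4.06.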

If my premise is correct that torch.FloatTensor(opt.nz):uniform(-1, 1) performs a uniform sampling across the latent space (a big if, and I'd like to verify this!), then the prior mass is concentrated in a thin shell - shaped more like a hypersphere than a solid ball. In that case, spherical interpolation makes a lot more sense, and in my own experiments I've had good qualitative results with this approach (a sketch is below). Curious what others think. Also note that this reasoning extends beyond interpolation, since it affects other interpretable operations as well - such as finding the average of a labeled subset (e.g. the average man or woman in a face dataset).
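For concreteness, here is a minimal slerp sketch in the same Torch style (the function name, the near-parallel fallback, and the 'sphere' mode name below are mine, not from this repo). It is the standard formula slerp(a, b; t) = sin((1-t)*omega)/sin(omega) * a + sin(t*omega)/sin(omega) * b, where omega is the angle between a and b:

-- spherical interpolation between two latent vectors a and b, t in [0, 1]
local function slerp(a, b, t)
    local d = torch.dot(a, b) / (a:norm() * b:norm())
    d = math.max(-1, math.min(1, d))  -- guard against floating point drift
    local omega = math.acos(d)
    local so = math.sin(omega)
    if so < 1e-6 then
        -- nearly parallel vectors: fall back to linear interpolation
        return a * (1 - t) + b * t
    end
    return a * (math.sin((1 - t) * omega) / so)
         + b * (math.sin(t * omega) / so)
end

Dropped into the generate loop above, it would look something like this (opt and noise come from the surrounding script; 'sphere' is a hypothetical mode name):

if opt.noisemode == 'sphere' then
    line = torch.linspace(0, 1, opt.batchSize)
    for i = 1, opt.batchSize do
        noise:select(1, i):copy(slerp(noiseR, noiseL, line[i]))
    end
end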
