autoencoder
paper - https://arxiv.org/abs/2201.03898
- taking an image passing it through an autoencoder to get a low dimensional representation in the latent space and then passing it through the decoder to reconstruct the original image from the latent representation!
- ex- if we have a 28x28 image in input, we have 764dim in the input and maybe we can represent it in 64dim in the latent space and then reconstruct it back into the original image using the 64dim latent representation!
- goal: reduce the loss ie MSE (pixel wise loss between the reconstruction and the original image)
- but when we even take a point in the latent space that is near to the original point we get a messy reconstruction because it might represent something meaningful
Limitations:
- random sampling wonβt work!
Architecture
input image β encoder (784 β 256 β RELU) β Latent dim (64) β RELU β decoder (256 β RELU β 784 β sigmoid) β reconstructed image


uses:
- dim reduction
- denoising
- anomaly detection
- compression
challenges:
- not true gen model
- unstructured latent space
- poor extrapolation
- mem reconstruction
Links:
202606172354