Probing LeWM: What Does a JEPA World Model Actually Learn?
Published:
LeWM is a latent world model for robot manipulation trained with a JEPA objective. It encodes visual observations with a ViT, then learns a transformer predictor that forecasts future latent states conditioned on actions — without decoding back to pixels.
