AEP (1) Convex (2) Diffusion Model (1) Few-Shot Learning (2) Flat-Minima (1) Gaussian Distribution (1) Gaussian Prior (1) Generative (1) Gradient Descent (2) LLM (1) LR Decay (1) MAML (2) MatchingNet (1) Meta-Learning (3) Optimization (2) ProtoNet (1) Reptile (1) SAM (1) WLLN (1) Weight Decay (1) ddp (1) entropy (1) pytorch (1) sequence length (1) typical set (1)