stable-diffusion llama MLP
A Stable Diffusion-based PyTorch implementation of SVM backpropagation.
- Input
  - 293-dim embedding
- Encoder
  - 101 x MLP with 40 heads
- Output
  - rouge-l projection
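
As a rough illustration of the layout above, the following is a minimal PyTorch sketch, not the project's actual code. It assumes each of the 101 encoder units is a residual two-layer MLP block. The names `MLPBlock`, `Encoder`, `hidden_mult`, and `out_dim` are hypothetical, the output size is not stated in the source, and the "40 heads" setting is left out because the source does not say what a head means for an MLP.

```python
import torch
from torch import nn


class MLPBlock(nn.Module):
    """One residual MLP block (assumed design; the source only says "MLP")."""

    def __init__(self, dim: int, hidden_mult: int = 4):
        super().__init__()
        self.norm = nn.LayerNorm(dim)
        self.net = nn.Sequential(
            nn.Linear(dim, hidden_mult * dim),
            nn.GELU(),
            nn.Linear(hidden_mult * dim, dim),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Pre-norm residual connection keeps a deep stack trainable.
        return x + self.net(self.norm(x))


class Encoder(nn.Module):
    """Stack of MLP blocks followed by a linear output projection."""

    def __init__(self, dim: int = 293, depth: int = 101,
                 out_dim: int = 1, hidden_mult: int = 4):
        super().__init__()
        # dim=293 and depth=101 come from the list above;
        # out_dim and hidden_mult are placeholders chosen for this sketch.
        self.blocks = nn.Sequential(
            *[MLPBlock(dim, hidden_mult) for _ in range(depth)]
        )
        self.proj = nn.Linear(dim, out_dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, dim) embeddings -> (batch, out_dim) projections
        return self.proj(self.blocks(x))
```

With these assumptions, `Encoder()(torch.randn(8, 293))` would return a tensor of shape `(8, 1)`.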
Training config
optimizer=Adagrad, lr=0.149, scheduler=cosine, warmup=352
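
The schedule implied by this config can be sketched in plain Python. This is one common reading, not a confirmed detail of the project: it assumes `warmup=352` counts optimizer steps, that warmup is linear, and that the cosine decays to zero. `TOTAL_STEPS` is a made-up placeholder because the config gives no training length, and `lr_scale` and `lr_at` are hypothetical helper names.

```python
import math

BASE_LR = 0.149        # lr from the config
WARMUP_STEPS = 352     # warmup from the config (assumed to be steps)
TOTAL_STEPS = 10_000   # assumption: not given in the config


def lr_scale(step: int, warmup: int = WARMUP_STEPS,
             total: int = TOTAL_STEPS) -> float:
    """Multiplier applied to the base learning rate at a given step."""
    if step < warmup:
        # Linear warmup: ramps from 1/warmup up to 1.0.
        return (step + 1) / warmup
    # Cosine decay from 1.0 down to 0.0 over the remaining steps.
    progress = min(1.0, (step - warmup) / max(1, total - warmup))
    return 0.5 * (1.0 + math.cos(math.pi * progress))


def lr_at(step: int) -> float:
    """Effective learning rate at a given step."""
    return BASE_LR * lr_scale(step)
```

In PyTorch, a multiplier of this shape could be passed as `lr_lambda` to `torch.optim.lr_scheduler.LambdaLR`, wrapped around `torch.optim.Adagrad(model.parameters(), lr=0.149)`.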