Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models
(arxiv.org)
69 points
by mfiguiere
5 days ago |
9 comments
()
()
()
()