I have multimodal Phi3-vision, and I want to get scores from the model like from the huggingface transformers model.generate(..., output_scores= True)