Prompt Engineered GPT-4 Beats Gemini on all of Google's text benchmarks
microsoft.com"We note that Medprompt+ relies on accessing confidence scores (logprobs) from GPT-4. These are not publicly available via the current API but will be enabled for all in the near future."
Had that been announced yet?
It would be interesting to see how this can be used for open models.
engineered? Over overfit?