Does Bing Chat give reliable answers to math and physics questions? If not is it possible to make it more reliable?

Wrong_thought_7@lemmy.ml · 9 months ago

Does Bing Chat give reliable answers to math and physics questions? If not is it possible to make it more reliable?

lily33@lemm.ee · edit-2 9 months ago

I have experience with GPT-4, and in particular I’ve used to for math questions in my work occasionally. I’m not sure how Bing chat compares.

For GTP-4, I’ve noticed the following:

How reliable the answer is depends on how easy or obscure the question is. It hasn’t lied to me on easy or introductory material, but once your questions start becoming more obscure, and it’s less likely to have the answer in the training set, it starts making things up.

I think of it as search to an extent - it needs to have the answer in the training data to find it. Unlike google, it can usually find an answer even if you don’t use the proper terms. But if it doesn’t find an answer, it might make something up.
“Easy or introductory” is relative - I have been able to get good answers for some masters-level math, and some wrong ones for lower-level things. Ultimately it depends on how much resources on the topic have been in the training set.

It’s actually much more reliable in detecting errors than it’s in generating text. So you can open a new chat and ask, “Is the following true: …” and it will catch most of its own errors. Once it starts catching error, you should know you’ve left the reliable “easy questions” territory, and even if it can still be useful, exercise much more care.
The way you phrase a prompt matters a lot. For example, if you ask it to explain its reasoning step by step, it becomes much more accurate.
It is generally good in rephrasing questions to use better terminology.

.

Bing chat might be different in some regards. I know that it automatically searches the web for sources, and when generating an answer, and bases its answer on the contents of the sources it found - but I don’t have experience with it.

That said, asking for additional sources (besides the search results it found) shouldn’t improve the accuracy. It might just give you something you can use to fact-check it.