Inconsistent results #21

AIRobotZhang · 2023-06-25T07:25:56Z

Hello, Table 1-2 is not consistent with results reported in https://github.com/WeOpenML/PandaLM/tree/main#test-data

zhuohaoyu · 2023-06-26T01:15:34Z

Hi, we are aware of the issue and it was fixed in #22. The results are different because we updated our inference methods with PandaLM/ChatGPT as reported in our paper to calculate more robust metrics. Please refer to our paper for latest results.

AIRobotZhang · 2023-06-28T01:41:39Z

Hi, we are aware of the issue and it was fixed in #22. The results are different because we updated our inference methods with PandaLM/ChatGPT as reported in our paper to calculate more robust metrics. Please refer to our paper for latest results.

How calculate the F1 score in Table 2. Is F1 score micro or macro?

arjunbansal · 2023-08-30T05:12:16Z

The results are different because we updated our inference methods with PandaLM/ChatGPT as reported in our paper to calculate more robust metrics. Please refer to our paper for latest results.

Do you plan to share the inference dataset files that accompany the updated methods?

qianlanwyd closed this as completed Jun 26, 2023

arjunbansal mentioned this issue Sep 5, 2023

add metrics #30

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Inconsistent results #21

Inconsistent results #21

AIRobotZhang commented Jun 25, 2023 •

edited

Loading

zhuohaoyu commented Jun 26, 2023

Uh oh!

AIRobotZhang commented Jun 28, 2023

Uh oh!

arjunbansal commented Aug 30, 2023

Uh oh!

Inconsistent results #21

Inconsistent results #21

Comments

AIRobotZhang commented Jun 25, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

zhuohaoyu commented Jun 26, 2023

Uh oh!

AIRobotZhang commented Jun 28, 2023

Uh oh!

arjunbansal commented Aug 30, 2023

Uh oh!

AIRobotZhang commented Jun 25, 2023 •

edited

Loading