The Ai2 Leaderboard will be archived at the end of April 2025. We plan to show past results, but will not accept new submissions. Currently, new accounts cannot be created.
SciFact: Scientific claim verification

Due to the rapid growth in the scientific literature, there is a need for automated systems to assist researchers and the public in assessing the veracity of scientific claims. To facilitate the development of systems for t…
| Rank | Submission | Created | Sent+L (F1) | Sent+L (P) | Sent+L (R) | Sent (F1) | Sent (P) | Sent (R) | Abst+R (F1) | Abst+R (P) | Abst+R (R) | Abst (F1) | Abst (P) | Abst (R) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 1 |  | 10/15/2024 | 0.6767 | 0.6861 | 0.6676 | 0.7479 | 0.7583 | 0.7378 | 0.7146 | 0.7368 | 0.6937 | 0.7285 | 0.7512 | 0.7072 |
| 2 | itsatest / no name | 07/22/2024 | 0.6766 | 0.6775 | 0.6757 | 0.7497 | 0.7507 | 0.7486 | 0.7156 | 0.7290 | 0.7027 | 0.7294 | 0.7430 | 0.7162 |
| 3 | vrl / Sam | 10/14/2024 | 0.6751 | 0.7038 | 0.6486 | 0.7482 | 0.7801 | 0.7189 | 0.7143 | 0.7576 | 0.6757 | 0.7286 | 0.7727 | 0.6892 |
| 4 | test / test | 07/09/2024 | 0.6728 | 0.6597 | 0.6865 | 0.7417 | 0.7273 | 0.7568 | 0.7191 | 0.7175 | 0.7207 | 0.7326 | 0.7309 | 0.7342 |
| 5 |  | 10/15/2024 | 0.6722 | 0.6914 | 0.6541 | 0.7444 | 0.7657 | 0.7243 | 0.7106 | 0.7438 | 0.6802 | 0.7247 | 0.7586 | 0.6937 |
| 5 |  | 10/15/2024 | 0.6722 | 0.6914 | 0.6541 | 0.7444 | 0.7657 | 0.7243 | 0.7106 | 0.7438 | 0.6802 | 0.7247 | 0.7586 | 0.6937 |
| 7 |  | 10/15/2024 | 0.6721 | 0.6739 | 0.6703 | 0.7425 | 0.7446 | 0.7405 | 0.7110 | 0.7243 | 0.6982 | 0.7248 | 0.7383 | 0.7117 |
| 7 | test / test | 05/23/2024 | 0.6721 | 0.6739 | 0.6703 | 0.7425 | 0.7446 | 0.7405 | 0.7110 | 0.7243 | 0.6982 | 0.7248 | 0.7383 | 0.7117 |
| 7 | aka / asd | 07/19/2023 | 0.6721 | 0.6739 | 0.6703 | 0.7425 | 0.7446 | 0.7405 | 0.7110 | 0.7243 | 0.6982 | 0.7248 | 0.7383 | 0.7117 |
| 7 | MultiVerS / Allen Institute for AI and Un… | 06/04/2021 | 0.6721 | 0.6739 | 0.6703 | 0.7425 | 0.7446 | 0.7405 | 0.7110 | 0.7243 | 0.6982 | 0.7248 | 0.7383 | 0.7117 |
| 11 | MRE / mre | 03/05/2022 | 0.6712 | 0.6721 | 0.6703 | 0.7415 | 0.7425 | 0.7405 | 0.7094 | 0.7209 | 0.6982 | 0.7231 | 0.7349 | 0.7117 |
| 12 | test2 / test if performance change | 07/09/2024 | 0.6702 | 0.6571 | 0.6838 | 0.7417 | 0.7273 | 0.7568 | 0.7146 | 0.7130 | 0.7162 | 0.7281 | 0.7265 | 0.7297 |
| 12 |  | 07/05/2024 | 0.6702 | 0.6571 | 0.6838 | 0.7417 | 0.7273 | 0.7568 | 0.6989 | 0.6824 | 0.7162 | 0.7121 | 0.6953 | 0.7297 |
| 14 |  | 10/15/2024 | 0.6694 | 0.6797 | 0.6595 | 0.7407 | 0.7521 | 0.7297 | 0.7086 | 0.7343 | 0.6847 | 0.7226 | 0.7488 | 0.6982 |
| 15 |  | 02/13/2022 | 0.6694 | 0.6712 | 0.6676 | 0.7398 | 0.7418 | 0.7378 | 0.7064 | 0.7196 | 0.6937 | 0.7202 | 0.7336 | 0.7072 |
| 16 |  | 10/15/2024 | 0.6676 | 0.7003 | 0.6378 | 0.7412 | 0.7774 | 0.7081 | 0.7081 | 0.7551 | 0.6667 | 0.7225 | 0.7704 | 0.6802 |
| 17 | qr_test0 / anonymous | 11/25/2022 | 0.6649 | 0.6519 | 0.6784 | 0.7391 | 0.7247 | 0.7541 | 0.7088 | 0.7104 | 0.7072 | 0.7223 | 0.7240 | 0.7207 |
| 18 | testing0 / annoymous | 11/28/2022 | 0.6649 | 0.6545 | 0.6757 | 0.7420 | 0.7304 | 0.7541 | 0.7088 | 0.7104 | 0.7072 | 0.7223 | 0.7240 | 0.7207 |
| 19 |  | 11/25/2022 | 0.6640 | 0.6478 | 0.6811 | 0.7378 | 0.7198 | 0.7568 | 0.7085 | 0.7054 | 0.7117 | 0.7220 | 0.7188 | 0.7252 |
| 20 |  | 07/05/2024 | 0.6580 | 0.6341 | 0.6838 | 0.7334 | 0.7068 | 0.7622 | 0.7004 | 0.6853 | 0.7162 | 0.7137 | 0.6983 | 0.7297 |
| 21 |  | 05/22/2024 | 0.6566 | 0.6676 | 0.6459 | 0.7253 | 0.7374 | 0.7135 | 0.6869 | 0.7136 | 0.6622 | 0.7009 | 0.7282 | 0.6757 |
| 22 |  | 01/27/2022 | 0.6554 | 0.6833 | 0.6297 | 0.7904 | 0.8240 | 0.7595 | 0.6868 | 0.7081 | 0.6667 | 0.7100 | 0.7321 | 0.6892 |
| 22 | test / test | 01/30/2022 | 0.6554 | 0.6833 | 0.6297 | 0.7904 | 0.8240 | 0.7595 | 0.6868 | 0.7081 | 0.6667 | 0.7100 | 0.7321 | 0.6892 |
| 24 | test-prob-ranking / xd sheffield | 06/17/2024 | 0.6549 | 0.6585 | 0.6514 | 0.7283 | 0.7322 | 0.7243 | 0.6866 | 0.7028 | 0.6712 | 0.7005 | 0.7170 | 0.6847 |
| 25 | TEST / TEST | 01/31/2022 | 0.6537 | 0.6705 | 0.6378 | 0.7895 | 0.8097 | 0.7703 | 0.6895 | 0.6991 | 0.6802 | 0.7123 | 0.7222 | 0.7027 |
| 25 |  | 02/04/2022 | 0.6537 | 0.6705 | 0.6378 | 0.7895 | 0.8097 | 0.7703 | 0.6895 | 0.6991 | 0.6802 | 0.7123 | 0.7222 | 0.7027 |
| 27 | VerT5erini (2-stage Neural Re… / University of Waterloo | 02/19/2021 | 0.6340 | 0.6059 | 0.6649 | 0.6598 | 0.6305 | 0.6919 | 0.6695 | 0.6285 | 0.7162 | 0.6821 | 0.6403 | 0.7297 |
| 28 | Test / Elfsong | 01/13/2022 | 0.6339 | 0.6409 | 0.6270 | 0.6557 | 0.6630 | 0.6486 | 0.6608 | 0.6507 | 0.6712 | 0.6829 | 0.6725 | 0.6937 |
| 29 | Test / Elfsong | 01/24/2022 | 0.6309 | 0.6267 | 0.6351 | 0.6523 | 0.6480 | 0.6568 | 0.6667 | 0.6417 | 0.6937 | 0.6926 | 0.6667 | 0.7207 |
| 30 | ARSJoint / Zhiwei Zhang, Jiyi Li, Fumiyo… | 08/13/2021 | 0.6308 | 0.6617 | 0.6027 | 0.7581 | 0.7953 | 0.7243 | 0.6571 | 0.6970 | 0.6216 | 0.6810 | 0.7222 | 0.6441 |
| 31 | RerrFact / Under anonymous review, code … | 11/17/2021 | 0.6209 | 0.7343 | 0.5378 | 0.6771 | 0.8007 | 0.5865 | 0.6631 | 0.8158 | 0.5586 | 0.6738 | 0.8289 | 0.5676 |
| 32 | Rs-Lp JointModel / Rs-Lp JointModel | 12/05/2021 | 0.6113 | 0.7276 | 0.5270 | 0.7022 | 0.8358 | 0.6054 | 0.6480 | 0.7471 | 0.5721 | 0.6684 | 0.7706 | 0.5901 |
| 33 | ParagraphJoint / PLUS Lab: Xiangci Li (UT Dall… | 01/26/2021 | 0.6094 | 0.6894 | 0.5459 | 0.7059 | 0.7986 | 0.6324 | 0.6716 | 0.7366 | 0.6171 | 0.6912 | 0.7581 | 0.6351 |
| 34 | enhanced-paragraph-joint / enhanced-paragraph-joint | 12/04/2021 | 0.6070 | 0.7422 | 0.5135 | 0.6805 | 0.8320 | 0.5757 | 0.6214 | 0.7391 | 0.5360 | 0.6423 | 0.7640 | 0.5541 |
| 35 | Test / Will be released soon | 12/29/2021 | 0.5988 | 0.6978 | 0.5243 | 0.6173 | 0.7194 | 0.5405 | 0.6406 | 0.7005 | 0.5901 | 0.6699 | 0.7326 | 0.6171 |
| 36 | VerT5erini (2-stage Neural Re… / University of Waterloo | 01/26/2021 | 0.5876 | 0.6000 | 0.5757 | 0.6483 | 0.6620 | 0.6351 | 0.6269 | 0.6147 | 0.6396 | 0.6490 | 0.6364 | 0.6622 |
| 37 | regression / Mitchell DeHaven | 02/26/2023 | 0.5831 | 0.5696 | 0.5973 | 0.6174 | 0.6031 | 0.6324 | 0.6834 | 0.6912 | 0.6757 | 0.7380 | 0.7465 | 0.7297 |
| 38 |  | 05/14/2021 | 0.5826 | 0.6554 | 0.5243 | 0.7027 | 0.7905 | 0.6324 | 0.6231 | 0.7045 | 0.5586 | 0.6432 | 0.7273 | 0.5766 |
| 39 | FEVER transfer / Mitchell DeHaven | 09/28/2022 | 0.5810 | 0.5353 | 0.6351 | 0.6156 | 0.5672 | 0.6730 | 0.6785 | 0.6681 | 0.6892 | 0.7317 | 0.7205 | 0.7432 |
| 40 | base_rel_(train+dev) / Will disclose after final sub… | 12/05/2021 | 0.5796 | 0.7054 | 0.4919 | 0.6529 | 0.7946 | 0.5541 | 0.6125 | 0.7687 | 0.5090 | 0.6233 | 0.7823 | 0.5180 |
| 41 | test / n.a. | 09/15/2022 | 0.5792 | 0.5147 | 0.6622 | 0.6099 | 0.5420 | 0.6973 | 0.6780 | 0.6400 | 0.7207 | 0.7161 | 0.6760 | 0.7613 |
| 42 | test / test | 08/22/2022 | 0.5711 | 0.5188 | 0.6351 | 0.6075 | 0.5519 | 0.6757 | 0.6711 | 0.6538 | 0.6892 | 0.7237 | 0.7051 | 0.7432 |
| 43 | test / test | 09/27/2022 | 0.5697 | 0.5276 | 0.6189 | 0.6119 | 0.5668 | 0.6649 | 0.6577 | 0.6533 | 0.6622 | 0.7069 | 0.7022 | 0.7117 |
| 44 | test / test | 05/26/2023 | 0.5660 | 0.6873 | 0.4811 | 0.6550 | 0.7954 | 0.5568 | 0.6349 | 0.7692 | 0.5405 | 0.6667 | 0.8077 | 0.5676 |
| 45 | V-MultiVerS(N20) / Xingyu Deng from NLP research… | 10/07/2024 | 0.5630 | 0.5182 | 0.6162 | 0.6198 | 0.5705 | 0.6784 | 0.6195 | 0.5753 | 0.6712 | 0.6694 | 0.6216 | 0.7252 |
| 46 | base-rel 3 prototypes / Will be released soon | 12/17/2021 | 0.5626 | 0.6570 | 0.4919 | 0.6584 | 0.7690 | 0.5757 | 0.6080 | 0.7451 | 0.5135 | 0.6293 | 0.7712 | 0.5315 |
| 47 | test / test | 05/26/2023 | 0.5609 | 0.6890 | 0.4730 | 0.6474 | 0.7953 | 0.5459 | 0.6330 | 0.7727 | 0.5360 | 0.6649 | 0.8117 | 0.5631 |
| 48 | base-rel prototype submission / Currently the solution in tes… | 11/14/2021 | 0.5604 | 0.6558 | 0.4892 | 0.6533 | 0.7645 | 0.5703 | 0.5860 | 0.7267 | 0.4910 | 0.6075 | 0.7533 | 0.5090 |
| 49 | Law & Econ / fine-tuning the e-FEVER sytem… | 02/11/2021 | 0.5601 | 0.5663 | 0.5541 | 0.6257 | 0.6326 | 0.6189 | 0.6061 | 0.6280 | 0.5856 | 0.6480 | 0.6715 | 0.6261 |
| 50 | ArgJointModel / Zhiyuan Guo from Beihang Univ… | 10/05/2021 | 0.5596 | 0.6149 | 0.5135 | 0.7128 | 0.7832 | 0.6541 | 0.5919 | 0.6294 | 0.5586 | 0.6205 | 0.6599 | 0.5856 |
| 51 | Test / test | 06/01/2023 | 0.5577 | 0.6850 | 0.4703 | 0.6474 | 0.7953 | 0.5459 | 0.6273 | 0.7748 | 0.5270 | 0.6542 | 0.8079 | 0.5495 |
| 52 | test / test | 05/26/2023 | 0.5570 | 0.6718 | 0.4757 | 0.6456 | 0.7786 | 0.5514 | 0.6296 | 0.7628 | 0.5360 | 0.6614 | 0.8013 | 0.5631 |
| 53 | VerT5erini (BM25 Retrieval) / University of Waterloo | 01/27/2021 | 0.5552 | 0.5833 | 0.5297 | 0.6176 | 0.6488 | 0.5892 | 0.5917 | 0.6028 | 0.5811 | 0.6193 | 0.6308 | 0.6081 |
| 54 | QMUL-SDS / Xia Zeng and Arkaitz Zubiaga … | 03/09/2021 | 0.5535 | 0.6617 | 0.4757 | 0.6824 | 0.8158 | 0.5865 | 0.5838 | 0.7297 | 0.4865 | 0.5946 | 0.7432 | 0.4955 |
| 55 | base-rel prototype submission / Currently the solution in tes… | 10/17/2021 | 0.5521 | 0.6383 | 0.4865 | 0.6564 | 0.7589 | 0.5784 | 0.6027 | 0.7386 | 0.5090 | 0.6240 | 0.7647 | 0.5270 |
| 56 | base-rel / Baseline model testing in pro… | 08/22/2021 | 0.5518 | 0.7237 | 0.4459 | 0.6455 | 0.8465 | 0.5216 | 0.6028 | 0.8045 | 0.4820 | 0.6141 | 0.8195 | 0.4910 |
| 57 | test / test | 06/01/2023 | 0.5483 | 0.6471 | 0.4757 | 0.6417 | 0.7574 | 0.5568 | 0.6032 | 0.7308 | 0.5135 | 0.6296 | 0.7628 | 0.5360 |
| 58 | first_1 / Wangchaochao from PingAn of C… | 02/24/2021 | 0.5477 | 0.6506 | 0.4730 | 0.6072 | 0.7212 | 0.5243 | 0.5835 | 0.6536 | 0.5270 | 0.6135 | 0.6872 | 0.5541 |
| 59 | Multiverse: 3.0 / Will release with final submi… | 11/12/2021 | 0.5474 | 0.6303 | 0.4838 | 0.6636 | 0.7641 | 0.5865 | 0.5836 | 0.7097 | 0.4955 | 0.5995 | 0.7290 | 0.5090 |
| 60 | Multiverse 3.1 / Will release with final submi… | 11/14/2021 | 0.5466 | 0.6281 | 0.4838 | 0.6626 | 0.7614 | 0.5865 | 0.5820 | 0.7051 | 0.4955 | 0.5979 | 0.7244 | 0.5090 |
| 61 | test / test | 05/24/2023 | 0.5455 | 0.6207 | 0.4865 | 0.6606 | 0.7517 | 0.5892 | 0.6150 | 0.7212 | 0.5360 | 0.6253 | 0.7333 | 0.5450 |
| 62 | cogat / anonymous | 08/26/2023 | 0.5430 | 0.6020 | 0.4946 | 0.5846 | 0.6480 | 0.5324 | 0.6354 | 0.7531 | 0.5495 | 0.6615 | 0.7840 | 0.5721 |
| 63 | SCIFCHEX / Filip J. Cierkosz from Univer… | 05/18/2024 | 0.5372 | 0.6694 | 0.4486 | 0.6084 | 0.7581 | 0.5081 | 0.6000 | 0.7826 | 0.4865 | 0.6167 | 0.8043 | 0.5000 |
| 64 | test / test | 08/12/2022 | 0.5330 | 0.4866 | 0.5892 | 0.5819 | 0.5313 | 0.6432 | 0.6386 | 0.6288 | 0.6486 | 0.6874 | 0.6769 | 0.6982 |
| 65 | Test #SFC / FJC Sheffield | 05/11/2024 | 0.5304 | 0.6273 | 0.4595 | 0.6022 | 0.7122 | 0.5216 | 0.6133 | 0.7929 | 0.5000 | 0.6188 | 0.8000 | 0.5045 |
| 66 | test / anonymous | 03/22/2024 | 0.5291 | 0.6092 | 0.4676 | 0.5657 | 0.6514 | 0.5000 | 0.6092 | 0.7584 | 0.5090 | 0.6307 | 0.7852 | 0.5270 |
| 67 |  | 11/16/2021 | 0.5285 | 0.6374 | 0.4514 | 0.5759 | 0.6947 | 0.4919 | 0.5778 | 0.7536 | 0.4685 | 0.6000 | 0.7826 | 0.4865 |
| 68 |  | 10/07/2024 | 0.5250 | 0.4788 | 0.5811 | 0.6007 | 0.5479 | 0.6649 | 0.5988 | 0.5376 | 0.6757 | 0.6427 | 0.5771 | 0.7252 |
| 69 | aaaaa / anonymous | 03/20/2024 | 0.5229 | 0.5765 | 0.4784 | 0.5790 | 0.6384 | 0.5297 | 0.6134 | 0.7169 | 0.5360 | 0.6237 | 0.7289 | 0.5450 |
| 70 |  | 10/05/2021 | 0.5204 | 0.5911 | 0.4649 | 0.6324 | 0.7182 | 0.5649 | 0.5473 | 0.6111 | 0.4955 | 0.5920 | 0.6611 | 0.5360 |
| 71 | TEST 2 / test2 | 05/11/2024 | 0.5202 | 0.6466 | 0.4351 | 0.5977 | 0.7430 | 0.5000 | 0.5909 | 0.8000 | 0.4685 | 0.5966 | 0.8077 | 0.4730 |
| 72 | ssss / anonymous | 03/24/2024 | 0.5138 | 0.5964 | 0.4514 | 0.5785 | 0.6714 | 0.5081 | 0.5946 | 0.7432 | 0.4955 | 0.6324 | 0.7905 | 0.5270 |
| 73 | aaaaaa / anonymous | 05/22/2024 | 0.5130 | 0.5531 | 0.4784 | 0.5913 | 0.6375 | 0.5514 | 0.6000 | 0.6964 | 0.5270 | 0.6154 | 0.7143 | 0.5405 |
| 73 | test / anonymous | 03/21/2024 | 0.5130 | 0.5531 | 0.4784 | 0.5913 | 0.6375 | 0.5514 | 0.6000 | 0.6964 | 0.5270 | 0.6154 | 0.7143 | 0.5405 |
| 73 | aaaa / anonymous | 03/23/2024 | 0.5130 | 0.5531 | 0.4784 | 0.5913 | 0.6375 | 0.5514 | 0.6000 | 0.6964 | 0.5270 | 0.6154 | 0.7143 | 0.5405 |
| 76 | tttt / anonymous | 03/24/2024 | 0.5108 | 0.5978 | 0.4459 | 0.5573 | 0.6522 | 0.4865 | 0.5870 | 0.7397 | 0.4865 | 0.6087 | 0.7671 | 0.5045 |
| 76 | aaaa / anonymous | 03/24/2024 | 0.5108 | 0.5978 | 0.4459 | 0.5573 | 0.6522 | 0.4865 | 0.5870 | 0.7397 | 0.4865 | 0.6087 | 0.7671 | 0.5045 |
| 78 | Yonky G / YonkyG from NEU | 06/13/2023 | 0.5102 | 0.6059 | 0.4405 | 0.5790 | 0.6877 | 0.5000 | 0.5928 | 0.7698 | 0.4820 | 0.6094 | 0.7914 | 0.4955 |
| 79 | SciKGAT / Zhenghao Liu from Tsinghua Un… | 01/28/2021 | 0.5048 | 0.6115 | 0.4297 | 0.5524 | 0.6692 | 0.4703 | 0.5833 | 0.7609 | 0.4730 | 0.6000 | 0.7826 | 0.4865 |
| 80 |  | 03/20/2024 | 0.5024 | 0.6100 | 0.4270 | 0.5469 | 0.6641 | 0.4649 | 0.5922 | 0.7794 | 0.4775 | 0.5978 | 0.7868 | 0.4820 |
| 80 | ssss / anonymous | 03/25/2024 | 0.5024 | 0.6100 | 0.4270 | 0.5469 | 0.6641 | 0.4649 | 0.5922 | 0.7794 | 0.4775 | 0.5978 | 0.7868 | 0.4820 |
| 82 | aaa / anonymous | 03/24/2024 | 0.5023 | 0.5842 | 0.4405 | 0.5701 | 0.6631 | 0.5000 | 0.5860 | 0.7267 | 0.4910 | 0.6022 | 0.7467 | 0.5045 |
| 83 | bioBert for sciFact / bioBert for sciFact | 03/16/2021 | 0.4985 | 0.5487 | 0.4568 | 0.6401 | 0.7045 | 0.5865 | 0.5330 | 0.5829 | 0.4910 | 0.5428 | 0.5936 | 0.5000 |
| 84 | aaaa / anonymous | 03/20/2024 | 0.4979 | 0.5191 | 0.4784 | 0.5513 | 0.5748 | 0.5297 | 0.4845 | 0.4483 | 0.5270 | 0.5093 | 0.4713 | 0.5541 |
| 85 | test / anonymous | 03/22/2024 | 0.4970 | 0.5612 | 0.4459 | 0.5512 | 0.6224 | 0.4946 | 0.5744 | 0.6832 | 0.4955 | 0.5953 | 0.7081 | 0.5135 |
| 86 | ttttt / anonymous | 03/23/2024 | 0.4963 | 0.5552 | 0.4486 | 0.5800 | 0.6488 | 0.5243 | 0.5864 | 0.7000 | 0.5045 | 0.6021 | 0.7188 | 0.5180 |
| 86 | test / anonymous | 03/24/2024 | 0.4963 | 0.5552 | 0.4486 | 0.5800 | 0.6488 | 0.5243 | 0.5864 | 0.7000 | 0.5045 | 0.6021 | 0.7188 | 0.5180 |
| 88 | aaaa / anonymous | 03/23/2024 | 0.4948 | 0.5556 | 0.4459 | 0.5787 | 0.6498 | 0.5216 | 0.5842 | 0.7025 | 0.5000 | 0.5947 | 0.7152 | 0.5090 |
| 89 | aaa / anonymous | 03/20/2024 | 0.4805 | 0.5171 | 0.4486 | 0.5847 | 0.6293 | 0.5459 | 0.5831 | 0.6746 | 0.5135 | 0.6036 | 0.6982 | 0.5315 |
| 89 | aaaa / aaaaa | 03/25/2024 | 0.4805 | 0.5171 | 0.4486 | 0.5847 | 0.6293 | 0.5459 | 0.5831 | 0.6746 | 0.5135 | 0.6036 | 0.6982 | 0.5315 |
| 91 | 729 / anonymous | 03/21/2024 | 0.4794 | 0.5808 | 0.4081 | 0.5778 | 0.7000 | 0.4919 | 0.5391 | 0.6711 | 0.4505 | 0.5660 | 0.7047 | 0.4730 |
| 92 | base-rel / Baseline model testing in pro… | 09/01/2021 | 0.4754 | 0.5747 | 0.4054 | 0.6751 | 0.8161 | 0.5757 | 0.5096 | 0.6503 | 0.4189 | 0.5151 | 0.6573 | 0.4234 |
| 93 |  | 10/03/2021 | 0.4753 | 0.5798 | 0.4027 | 0.6635 | 0.8093 | 0.5622 | 0.5179 | 0.6667 | 0.4234 | 0.5289 | 0.6809 | 0.4324 |
| 94 | base-rel / Baseline model testing in pro… | 09/13/2021 | 0.4654 | 0.5564 | 0.4000 | 0.6824 | 0.8158 | 0.5865 | 0.4919 | 0.6149 | 0.4099 | 0.4973 | 0.6216 | 0.4144 |
| 94 | Multiverse / Deepanshu Khanna from Thapar … | 08/22/2021 | 0.4654 | 0.5564 | 0.4000 | 0.6824 | 0.8158 | 0.5865 | 0.5027 | 0.6284 | 0.4189 | 0.5081 | 0.6351 | 0.4234 |
| 96 | RANLI + MultiVerS / Théophile Mandon (LIRMM) | 05/22/2024 | 0.4634 | 0.4647 | 0.4622 | 0.7425 | 0.7446 | 0.7405 | 0.4908 | 0.5000 | 0.4820 | 0.4954 | 0.5047 | 0.4865 |
| 97 | 279 / anonymous | 03/21/2024 | 0.4606 | 0.5530 | 0.3946 | 0.5584 | 0.6705 | 0.4784 | 0.5260 | 0.6713 | 0.4324 | 0.5425 | 0.6923 | 0.4459 |
| 98 | mmmm / anonymous | 03/24/2024 | 0.4482 | 0.5235 | 0.3919 | 0.5719 | 0.6679 | 0.5000 | 0.5161 | 0.6400 | 0.4324 | 0.5538 | 0.6867 | 0.4640 |
| 99 | ASCME / Will publish | 04/29/2024 | 0.4455 | 0.3843 | 0.5297 | 0.5477 | 0.4725 | 0.6514 | 0.5085 | 0.4369 | 0.6081 | 0.5461 | 0.4693 | 0.6532 |
| 100 | Multiverse-base / Will share soon... | 09/07/2021 | 0.4434 | 0.5301 | 0.3811 | 0.6824 | 0.8158 | 0.5865 | 0.4865 | 0.6081 | 0.4054 | 0.4973 | 0.6216 | 0.4144 |
| 101 | aaaa / anonymous | 03/24/2024 | 0.4403 | 0.5164 | 0.3838 | 0.5271 | 0.6182 | 0.4595 | 0.5269 | 0.6533 | 0.4414 | 0.5538 | 0.6867 | 0.4640 |
| 102 | ASCME / Computer Science Student @ Un… | 03/13/2024 | 0.4341 | 0.3745 | 0.5162 | 0.5477 | 0.4725 | 0.6514 | 0.4934 | 0.4239 | 0.5901 | 0.5273 | 0.4531 | 0.6306 |
| 103 | test / anonymous | 03/21/2024 | 0.4312 | 0.5579 | 0.3514 | 0.5605 | 0.7253 | 0.4568 | 0.4929 | 0.6641 | 0.3919 | 0.4986 | 0.6718 | 0.3964 |
| 104 | AscMe / Will publish | 04/29/2024 | 0.4196 | 0.3574 | 0.5081 | 0.5469 | 0.4658 | 0.6622 | 0.4583 | 0.3954 | 0.5450 | 0.4886 | 0.4216 | 0.5811 |
| 105 | pipeline_longformer_test_rera… / test rerank with pipeline xd | 01/09/2024 | 0.4143 | 0.3932 | 0.4378 | 0.5090 | 0.4830 | 0.5378 | 0.4528 | 0.4021 | 0.5180 | 0.5000 | 0.4441 | 0.5721 |
| 106 | V-MultiVerS(N5) / Xingyu Deng from NLP research… | 10/07/2024 | 0.4142 | 0.3079 | 0.6324 | 0.4726 | 0.3513 | 0.7216 | 0.5055 | 0.3880 | 0.7252 | 0.5400 | 0.4145 | 0.7748 |
| 107 | aaa / anonymous | 03/24/2024 | 0.4139 | 0.5342 | 0.3378 | 0.5497 | 0.7094 | 0.4486 | 0.4651 | 0.6557 | 0.3604 | 0.4767 | 0.6721 | 0.3694 |
| 108 | aaa / anonymous | 03/21/2024 | 0.4137 | 0.5628 | 0.3270 | 0.5675 | 0.7721 | 0.4486 | 0.4379 | 0.6379 | 0.3333 | 0.4497 | 0.6552 | 0.3423 |
| 109 | new_rerank / xd sheffield | 01/09/2024 | 0.4121 | 0.3872 | 0.4405 | 0.5082 | 0.4774 | 0.5432 | 0.4501 | 0.3979 | 0.5180 | 0.4971 | 0.4394 | 0.5721 |
| 110 | AdaptedSciCheck / Will publish later | 03/31/2024 | 0.4103 | 0.3491 | 0.4973 | 0.5463 | 0.4649 | 0.6622 | 0.4576 | 0.3875 | 0.5586 | 0.4871 | 0.4125 | 0.5946 |
| 111 | JC_UKP / Jorge Cardona at UKP | 03/02/2021 | 0.4102 | 0.4069 | 0.4135 | 0.4879 | 0.4840 | 0.4919 | 0.4896 | 0.5024 | 0.4775 | 0.4896 | 0.5024 | 0.4775 |
| 112 | test4 / test | 05/11/2024 | 0.4101 | 0.4554 | 0.3730 | 0.6419 | 0.7129 | 0.5838 | 0.4562 | 0.5548 | 0.3874 | 0.4668 | 0.5677 | 0.3964 |
| 113 | AscMe / Will publish | 04/29/2024 | 0.4095 | 0.3805 | 0.4432 | 0.5144 | 0.4780 | 0.5568 | 0.4774 | 0.4394 | 0.5225 | 0.5021 | 0.4621 | 0.5495 |
| 114 | TEST 1 / Test. | 05/11/2024 | 0.4087 | 0.4783 | 0.3568 | 0.6409 | 0.7500 | 0.5595 | 0.4505 | 0.5775 | 0.3694 | 0.4560 | 0.5845 | 0.3739 |
| 115 | Test 0 / test0 | 05/11/2024 | 0.4054 | 0.4605 | 0.3622 | 0.6384 | 0.7251 | 0.5703 | 0.4541 | 0.5676 | 0.3784 | 0.4595 | 0.5743 | 0.3829 |
| 116 |  | 12/10/2022 | 0.4044 | 0.3719 | 0.4432 | 0.4587 | 0.4218 | 0.5027 | 0.4587 | 0.4237 | 0.5000 | 0.4876 | 0.4504 | 0.5315 |
| 116 | Test / Test | 05/04/2023 | 0.4044 | 0.3719 | 0.4432 | 0.4587 | 0.4218 | 0.5027 | 0.4587 | 0.4237 | 0.5000 | 0.4876 | 0.4504 | 0.5315 |
| 118 | sum_rationale / Rationale Selection with Summ… | 03/02/2021 | 0.3986 | 0.3445 | 0.4730 | 0.5080 | 0.4390 | 0.6027 | 0.4644 | 0.4238 | 0.5135 | 0.5051 | 0.4610 | 0.5586 |
| 119 | ssss / anonymous | 03/24/2024 | 0.3963 | 0.4686 | 0.3432 | 0.5273 | 0.6236 | 0.4568 | 0.4573 | 0.5887 | 0.3739 | 0.4738 | 0.6099 | 0.3874 |
| 120 |  | 08/06/2023 | 0.3954 | 0.4207 | 0.3730 | 0.4900 | 0.5213 | 0.4622 | 0.4522 | 0.4686 | 0.4369 | 0.4662 | 0.4831 | 0.4505 |
| 121 | VeriSci / Allen Institute for AI and Un… | 01/26/2021 | 0.3953 | 0.3856 | 0.4054 | 0.4611 | 0.4499 | 0.4730 | 0.4650 | 0.4661 | 0.4640 | 0.4740 | 0.4751 | 0.4730 |
| 122 |  | 08/02/2021 | 0.3953 | 0.3856 | 0.4054 | 0.4611 | 0.4499 | 0.4730 | 0.4650 | 0.4661 | 0.4640 | 0.4740 | 0.4751 | 0.4730 |
| 123 |  | 07/26/2023 | 0.3937 | 0.4169 | 0.3730 | 0.4879 | 0.5166 | 0.4622 | 0.4501 | 0.4641 | 0.4369 | 0.4640 | 0.4785 | 0.4505 |
| 124 |  | 08/15/2023 | 0.3935 | 0.3925 | 0.3946 | 0.4852 | 0.4839 | 0.4865 | 0.4421 | 0.4150 | 0.4730 | 0.4758 | 0.4466 | 0.5090 |
| 125 |  | 10/10/2022 | 0.3920 | 0.3333 | 0.4757 | 0.4432 | 0.3769 | 0.5378 | 0.4536 | 0.4035 | 0.5180 | 0.4773 | 0.4246 | 0.5450 |
| 126 | XYD-v1-test2 / XD from the University of Man… | 07/12/2023 | 0.3890 | 0.4167 | 0.3649 | 0.4870 | 0.5216 | 0.4568 | 0.4408 | 0.4545 | 0.4279 | 0.4548 | 0.4689 | 0.4414 |
| 127 | Longformer-pipeline-3 / XD from the University of Man… | 08/15/2023 | 0.3842 | 0.4199 | 0.3541 | 0.4780 | 0.5224 | 0.4405 | 0.4388 | 0.4502 | 0.4279 | 0.4758 | 0.4882 | 0.4640 |
| 128 |  | 10/10/2022 | 0.3838 | 0.3229 | 0.4730 | 0.4430 | 0.3727 | 0.5459 | 0.4436 | 0.3904 | 0.5135 | 0.4630 | 0.4075 | 0.5360 |
| 129 | XYD / XD from the University of Man… | 07/04/2023 | 0.3795 | 0.4469 | 0.3297 | 0.4479 | 0.5275 | 0.3892 | 0.4570 | 0.5027 | 0.4189 | 0.5061 | 0.5568 | 0.4640 |
| 130 |  | 04/04/2024 | 0.3771 | 0.3566 | 0.4000 | 0.5172 | 0.4892 | 0.5486 | 0.4412 | 0.4068 | 0.4820 | 0.4784 | 0.4411 | 0.5225 |
| 131 | XD-test-v2 / XD from the University of Man… | 07/11/2023 | 0.3763 | 0.3743 | 0.3784 | 0.4328 | 0.4305 | 0.4351 | 0.4340 | 0.4113 | 0.4595 | 0.4638 | 0.4395 | 0.4910 |
| 132 |  | 07/26/2023 | 0.3733 | 0.3684 | 0.3784 | 0.4427 | 0.4368 | 0.4486 | 0.4435 | 0.4286 | 0.4595 | 0.4696 | 0.4538 | 0.4865 |
| 133 |  | 07/30/2023 | 0.3712 | 0.4868 | 0.3000 | 0.4849 | 0.6360 | 0.3919 | 0.4380 | 0.5287 | 0.3739 | 0.4749 | 0.5732 | 0.4054 |
| 134 | sum_rationale / Rationale Selection with Summ… | 03/01/2021 | 0.3702 | 0.2845 | 0.5297 | 0.4608 | 0.3541 | 0.6595 | 0.4316 | 0.3534 | 0.5541 | 0.4737 | 0.3879 | 0.6081 |
| 135 | 2024_test / xd sheffield | 01/08/2024 | 0.3664 | 0.3678 | 0.3649 | 0.4586 | 0.4605 | 0.4568 | 0.3907 | 0.3695 | 0.4144 | 0.4331 | 0.4096 | 0.4595 |
| 136 |  | 08/09/2023 | 0.3627 | 0.3483 | 0.3784 | 0.4352 | 0.4179 | 0.4541 | 0.4235 | 0.3961 | 0.4550 | 0.4528 | 0.4235 | 0.4865 |
| 137 | Multiverse 4.0 / Will disclose after final sub… | 11/19/2021 | 0.3562 | 0.4077 | 0.3162 | 0.6606 | 0.7561 | 0.5865 | 0.4211 | 0.5063 | 0.3604 | 0.4421 | 0.5316 | 0.3784 |
| 138 | test7 / test | 08/09/2023 | 0.3560 | 0.3382 | 0.3757 | 0.4302 | 0.4088 | 0.4541 | 0.4100 | 0.3828 | 0.4414 | 0.4477 | 0.4180 | 0.4820 |
| 139 | mul-rel-prototype / work in progress, will make e… | 11/18/2021 | 0.3498 | 0.4094 | 0.3054 | 0.6533 | 0.7645 | 0.5703 | 0.4086 | 0.5067 | 0.3423 | 0.4301 | 0.5333 | 0.3604 |
| 140 | longformer_v1_test / XD from the University of Man… | 08/04/2023 | 0.3395 | 0.3595 | 0.3216 | 0.4879 | 0.5166 | 0.4622 | 0.3852 | 0.3971 | 0.3739 | 0.4084 | 0.4211 | 0.3964 |
| 141 | decontextualization_scifact_v1 / Mengfei Lan from School of In… | 01/14/2025 | 0.3227 | 0.2657 | 0.4108 | 0.4225 | 0.3479 | 0.5378 | 0.3640 | 0.3075 | 0.4459 | 0.3824 | 0.3230 | 0.4685 |
| 142 | XYD - v2 / XD from the University of Man… | 07/26/2023 | 0.3221 | 0.3100 | 0.3351 | 0.4182 | 0.4025 | 0.4351 | 0.3942 | 0.3654 | 0.4279 | 0.4191 | 0.3885 | 0.4550 |
| 143 |  | 07/26/2023 | 0.3116 | 0.2954 | 0.3297 | 0.4112 | 0.3898 | 0.4351 | 0.3817 | 0.3538 | 0.4144 | 0.4066 | 0.3769 | 0.4414 |
| 144 | GPT Test using ChromaDB / Jan Disselhoff from JGU Mainz | 03/31/2023 | 0.3079 | 0.2953 | 0.3216 | 0.3674 | 0.3524 | 0.3838 | 0.3688 | 0.2921 | 0.5000 | 0.4884 | 0.3868 | 0.6622 |
| 145 | First / Zhelin Chu | 07/31/2024 | 0.3018 | 0.2434 | 0.3973 | 0.3737 | 0.3013 | 0.4919 | 0.4184 | 0.3645 | 0.4910 | 0.4453 | 0.3880 | 0.5225 |
| 146 | test / test | 05/16/2023 | 0.2972 | 0.2944 | 0.3000 | 0.4712 | 0.4668 | 0.4757 | 0.3575 | 0.3591 | 0.3559 | 0.3710 | 0.3727 | 0.3694 |
| 147 | test / test | 08/22/2022 | 0.2941 | 0.2308 | 0.4054 | 0.3843 | 0.3015 | 0.5297 | 0.3652 | 0.3012 | 0.4640 | 0.3759 | 0.3099 | 0.4775 |
| 148 |  | 08/04/2023 | 0.2910 | 0.3082 | 0.2757 | 0.4879 | 0.5166 | 0.4622 | 0.3387 | 0.3493 | 0.3288 | 0.3619 | 0.3732 | 0.3514 |
| 149 | Zero-shot FEVER / Allen Institute for AI and Un… | 01/26/2021 | 0.2690 | 0.2371 | 0.3108 | 0.3251 | 0.2866 | 0.3757 | 0.3641 | 0.4226 | 0.3198 | 0.4821 | 0.5595 | 0.4234 |
| 150 | test / test | 05/15/2023 | 0.2404 | 0.1527 | 0.5649 | 0.2714 | 0.1724 | 0.6378 | 0.2465 | 0.1529 | 0.6351 | 0.2605 | 0.1616 | 0.6712 |
| 150 | test / test | 05/14/2023 | 0.2404 | 0.1527 | 0.5649 | 0.2714 | 0.1724 | 0.6378 | 0.2465 | 0.1529 | 0.6351 | 0.2605 | 0.1616 | 0.6712 |
| 152 | ssss / anonymous | 03/24/2024 | 0.2395 | 0.5377 | 0.1541 | 0.2983 | 0.6698 | 0.1919 | 0.2898 | 0.6721 | 0.1847 | 0.2898 | 0.6721 | 0.1847 |
| 153 | improved reranker test / improved reranker test xd she… | 07/05/2024 | 0.2215 | 0.1354 | 0.6081 | 0.2756 | 0.1685 | 0.7568 | 0.1644 | 0.0937 | 0.6712 | 0.1688 | 0.0962 | 0.6892 |
| 154 | AINLP / AINLP | 08/22/2022 | 0.2024 | 0.4032 | 0.1351 | 0.2551 | 0.5081 | 0.1703 | 0.2804 | 0.4545 | 0.2027 | 0.3115 | 0.5051 | 0.2252 |
| 155 | AINLP / AINLP | 08/07/2022 | 0.2024 | 0.1400 | 0.3649 | 0.2849 | 0.1971 | 0.5135 | 0.2582 | 0.1895 | 0.4054 | 0.2640 | 0.1937 | 0.4144 |
| 156 | test / test | 08/06/2022 | 0.1962 | 0.1369 | 0.3459 | 0.2222 | 0.1551 | 0.3919 | 0.2439 | 0.1672 | 0.4505 | 0.2902 | 0.1990 | 0.5360 |
| 157 | base-rel Prototype / WIll disclose soon | 11/16/2021 | 0.1883 | 0.3106 | 0.1351 | 0.2147 | 0.3540 | 0.1541 | 0.2509 | 0.5538 | 0.1622 | 0.2648 | 0.5846 | 0.1712 |
| 158 | Scalable Scientific Verificat… / Mikkel Bak Bertelsen, Mikkel … | 06/01/2021 | 0.1859 | 0.2283 | 0.1568 | 0.5096 | 0.6260 | 0.4297 | 0.2434 | 0.2949 | 0.2072 | 0.2540 | 0.3077 | 0.2162 |
| 159 |  | 07/24/2023 | 0.1751 | 0.1223 | 0.3081 | 0.2442 | 0.1706 | 0.4297 | 0.2254 | 0.1530 | 0.4279 | 0.2894 | 0.1965 | 0.5495 |
| 160 |  | 07/24/2023 | 0.1373 | 0.1000 | 0.2189 | 0.2169 | 0.1580 | 0.3459 | 0.1957 | 0.1438 | 0.3063 | 0.2388 | 0.1755 | 0.3739 |
| 161 | Test / Test | 05/14/2023 | 0.1230 | 0.0888 | 0.2000 | 0.2494 | 0.1801 | 0.4054 | 0.1141 | 0.0781 | 0.2117 | 0.1238 | 0.0847 | 0.2297 |
| 162 | base_3_4 / claim verification | 03/15/2021 | 0.1063 | 0.0587 | 0.5595 | 0.1366 | 0.0755 | 0.7189 | 0.1605 | 0.0954 | 0.5045 | 0.2135 | 0.1269 | 0.6712 |
| 163 |  | 01/02/2022 | 0.1060 | 0.1531 | 0.0811 | 0.2332 | 0.3367 | 0.1784 | 0.0864 | 0.0795 | 0.0946 | 0.0905 | 0.0833 | 0.0991 |
| 164 |  | 03/15/2021 | 0.1022 | 0.0795 | 0.1432 | 0.2777 | 0.2159 | 0.3892 | 0.1246 | 0.1011 | 0.1622 | 0.1384 | 0.1124 | 0.1802 |
| 165 | ASCME 2 stage / Will provide | 04/19/2024 | 0.0833 | 0.0448 | 0.5919 | 0.1114 | 0.0599 | 0.7919 | 0.0763 | 0.0556 | 0.1216 | 0.3644 | 0.2654 | 0.5811 |
| 166 |  | 08/10/2022 | 0.0730 | 0.0394 | 0.5054 | 0.0984 | 0.0530 | 0.6811 | 0.0890 | 0.0483 | 0.5676 | 0.0925 | 0.0502 | 0.5901 |
| 167 | test / test | 05/05/2023 | 0.0696 | 0.0448 | 0.1568 | 0.0780 | 0.0502 | 0.1757 | 0.0896 | 0.0575 | 0.2027 | 0.3586 | 0.2302 | 0.8108 |
| 168 | test / test | 05/05/2023 | 0.0558 | 0.0433 | 0.0784 | 0.0577 | 0.0448 | 0.0811 | 0.0410 | 0.0243 | 0.1306 | 0.1921 | 0.1139 | 0.6126 |
| 169 | Retrieval Augmented NLI / Théophile Mandon from LIRMM w… | 05/07/2024 | 0.0536 | 0.0433 | 0.0703 | 0.0928 | 0.0750 | 0.1216 | 0.0695 | 0.0512 | 0.1081 | 0.2084 | 0.1535 | 0.3243 |
| 170 | test / test | 05/04/2023 | 0.0399 | 0.2581 | 0.0216 | 0.0399 | 0.2581 | 0.0216 | 0.0488 | 0.2500 | 0.0270 | 0.0569 | 0.2917 | 0.0315 |
| 171 | tttta / AAAA11 | 10/13/2022 | 0.0297 | 0.0155 | 0.3486 | 0.0581 | 0.0303 | 0.6811 | 0.0285 | 0.0178 | 0.0721 | 0.1408 | 0.0878 | 0.3559 |
| 172 | test / Carlos Alvarez, Maxwell Benne… | 04/22/2024 | 0.0251 | 0.0299 | 0.0216 | 0.0251 | 0.0299 | 0.0216 | 0.0327 | 0.0299 | 0.0360 | 0.7224 | 0.6604 | 0.7973 |
| 173 | test_our_icl / Carlos Alvarez, Maxwell Benne… | 05/01/2024 | 0.0246 | 0.0286 | 0.0216 | 0.0246 | 0.0286 | 0.0216 | 0.0319 | 0.0286 | 0.0360 | 0.7171 | 0.6429 | 0.8108 |
| 174 | Multi-shot Prompting with GPT… / Carlos Alvarez, Maxwell Benne… | 04/04/2024 | 0.0238 | 0.0265 | 0.0216 | 0.0238 | 0.0265 | 0.0216 | 0.0305 | 0.0265 | 0.0360 | 0.6756 | 0.5861 | 0.7973 |
| 175 | thamtran / Tran Thi Tham from HCMUT | 12/12/2023 | 0.0217 | 0.0171 | 0.0297 | 0.0847 | 0.0667 | 0.1162 | 0.0130 | 0.0235 | 0.0090 | 0.0717 | 0.1294 | 0.0495 |
| 176 | test_no_icl / Carlos Alvarez, Maxwell Benne… | 05/09/2024 | 0.0213 | 0.0243 | 0.0189 | 0.0213 | 0.0243 | 0.0189 | 0.0275 | 0.0243 | 0.0315 | 0.6863 | 0.6076 | 0.7883 |
| 177 | test_our_icl_3.5 / Contributors: Carlos Alvarez,… | 05/27/2024 | 0.0121 | 0.0137 | 0.0108 | 0.0182 | 0.0206 | 0.0162 | 0.0156 | 0.0137 | 0.0180 | 0.3899 | 0.3436 | 0.4505 |
| 178 | test_no_icl_3.5 / Carlos Alvarez, Maxwell Benne… | 05/23/2024 | 0.0103 | 0.0142 | 0.0081 | 0.0137 | 0.0189 | 0.0108 | 0.0138 | 0.0142 | 0.0135 | 0.3502 | 0.3585 | 0.3423 |
| 179 | test_scifact_icl_3.5 / Carlos Alvarez, Max Bennett, … | 05/27/2024 | 0.0088 | 0.0097 | 0.0081 | 0.0176 | 0.0194 | 0.0162 | 0.0113 | 0.0097 | 0.0135 | 0.3985 | 0.3419 | 0.4775 |
| 180 | test / test | 05/14/2023 | 0.0054 | 0.5000 | 0.0027 | 0.0054 | 0.5000 | 0.0027 | 0.0089 | 0.5000 | 0.0045 | 0.0089 | 0.5000 | 0.0045 |
| 181 | test / test | 10/18/2022 | 0.0000 | 0.0000 | 0.0000 | 0.0041 | 0.0086 | 0.0027 | 0.0000 | 0.0000 | 0.0000 | 0.0741 | 0.1176 | 0.0541 |
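
Each metric on the leaderboard is reported as an F1, P (precision), and R (recall) triple. F1 is conventionally the harmonic mean of precision and recall, and the reported values appear consistent with that. The sketch below is only an illustrative check, not the leaderboard's own evaluation code: the helper name `f1_score` is hypothetical, and the numbers are copied from the MultiVerS entry (06/04/2021). Because P and R are rounded to four decimals, small differences in the last digit are possible.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall; returns 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (P, R, reported F1) for the MultiVerS entry, copied from the leaderboard.
multivers = {
    "Sent+L": (0.6739, 0.6703, 0.6721),
    "Sent": (0.7446, 0.7405, 0.7425),
    "Abst+R": (0.7243, 0.6982, 0.7110),
    "Abst": (0.7383, 0.7117, 0.7248),
}

for metric, (p, r, reported) in multivers.items():
    computed = f1_score(p, r)
    print(f"{metric}: computed F1 {computed:.4f}, reported F1 {reported:.4f}")
```

The zero guard matters for entries such as rank 181, where precision and recall are both 0.0000 for some metrics.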

Sentence Selection+Label (F1) Over Time

[Chart: Sentence Selection+Label (F1), on a 0.00 to 1.00 scale, plotted against submission date from 01/01/2021 to 01/01/2025. Series: Running Best, Submissions.]

Metrics

[Chart: score (0.00 to 1.00) per metric: Sentence Selection+Label (F1), Sentence Selection-Only (F1), Abstract Label+Rationale (F1), Abstract Label-Only (F1). Submissions shown: +VeriRel(N5)…, itsatest, vrl, test, +VeriRel(N5)…, wrong, +VeriRel(N5)…, test, aka.]