sub:assertion {
<
https://arxiv.org/abs/2407.08188> <
https://sense-nets.xyz/hasZoteroItemType> "preprint" .
sub:assertion dcterms:creator <
https://w3id.org/np/RAoSadUw99CeqDlR2400018nqTzR_38fT86OrTzk16Vts> ;
<
http://purl.org/spar/cito/agreesWith> <
https://arxiv.org/abs/2407.08188> ;
rdfs:comment """ best #icmi2024 position:
103 datasets that claim to be more diverse, are not.
Diversity claims are subjective, political and not tested, instead of claiming, let's measure.
But how?
@dorazhao9 @SciOrestis @alicexiang
https://arxiv.org/abs/2407.08188 https://twitter.com/LChoshen/status/1816031646568583532/photo/1
Basically, like we evaluate everything else.
Measure one thing at a time (don't also test a new model)
Have a specific claim (is it language diverse, background,origin) and quantify it
Separate it from other constructs like how much data was collected or whether it is biased https://twitter.com/LChoshen/status/1816031649416556577/photo/1
""" ;
schema:keywords "datasetdiversity" , "datasets" , "diversity" , "icmi2024" , "measurement" , "value-laden" ;
<
https://sense-nets.xyz/summarizes> <
https://arxiv.org/abs/2407.08188> .
}