1 min readfrom Machine Learning

Was looking at a ICLR 2025 Oral paper and I am shocked it got oral [D]

After my last post about score analysis of ICLR, I am looking into the review itself now.

They evaled SQL code generation by LLM using nature language metric and not executation metric, and they tested it and found around 20% false positive rate. This is a major flaw how is it even getting oral?

https://openreview.net/forum?id=GGlpykXDCa

submitted by /u/Striking-Warning9533
[link] [comments]

Want to read more?

Check out the full article on the original site

View original article

Tagged with

#rows.com
#natural language processing for spreadsheets
#AI formula generation techniques
#generative AI for data analysis
#conversational data analysis
#Excel alternatives for data analysis
#no-code spreadsheet solutions
#natural language processing
#data analysis tools
#ICLR
#oral paper
#false positive rate
#SQL code generation
#score analysis
#LLM
#natural language metric
#execution metric
#evaluation
#flaw
#testing