Sayash Kapoor (@sayashk)'s Twitter Profile
Sayash Kapoor

@sayashk

CS PhD candidate @PrincetonCITP. I study the societal impact of AI. Currently writing a book on AI Snake Oil: https://t.co/tb2lXSP2gB

ID:3084274082

Link: http://cs.princeton.edu/~sayashk

Joined: 15-03-2015 09:03:24

644 Tweets

5.3K Followers

1.4K Following

rishi (@RishiBommasani)

Transparency for foundation models is an outstanding challenge.

To make progress the White House and G7 have recommended that foundation model developers prepare *transparency reports*.

We recently put out a paper that articulates what this should mean and its policy impact🧵

Princeton Computer Science (@PrincetonCS)

Growing evidence has revealed deep flaws in how machine learning is used in science, a problem that spans dozens of fields.

New guidelines from an interdisciplinary team, including Arvind Narayanan, Sayash Kapoor and Emily Cantrell, tackle this problem.

bit.ly/49YBnuA

Sayash Kapoor (@sayashk)

Very interesting paper on overreliance in LLMs, led by Sunnie S. Y. Kim.

The results on overreliance are very interesting, but equally fascinating is the evaluation design: they randomly assign users to different LLM behaviors + check against a baseline with internet access.

Sunnie S. Y. Kim (@sunniesuhyoung)

There is a lot of interest in estimating LLMs' uncertainty, but should LLMs express uncertainty to end users? If so, when and how?

In our #FAccT2024 paper, we explore how users perceive and act upon LLMs’ natural language uncertainty expressions.

arxiv.org/abs/2405.00623

1/6

Gilles Vandewiele (@Gillesvdwiele)

Very proud to share that our paper introducing the REFORMS checklist is now published in Science Advances! Within this paper, we propose a checklist of 32 questions across 8 different steps of an ML pipeline that should help avoid common mistakes.

science.org/doi/10.1126/sc…

Michael Lones (@michael_lones)

Great to have been involved in this initiative led by Sayash Kapoor and Arvind Narayanan to (hopefully!) improve the use of machine learning in science. Further thoughts in my Substack post: fetchdecodeexecute.substack.com/p/reforms-a-gu…

rishi (@RishiBommasani)

REFORMS focuses on applying ML in the sciences; good to highlight some folks within ML who have worked on reproducibility for a long time:

Joelle Pineau - cs.mcgill.ca/~jpineau/Repro…

Percy Liang - worksheets.codalab.org

Jesse Dodge - jessedodge.github.io/NLP_Reproducib…

rishi (@RishiBommasani)

REFORMS is an exceptional work by an ensemble cast spanning institutions and disciplines! Check it out!

The approach also directly inspired our work on open foundation models, where we worked towards consensus across folks from different institutions:
crfm.stanford.edu/open-fms/

Jessica Hullman (@JessicaHullman)

Lots of practical advice to help researchers doing ML-based science avoid unintentional irreproducibility and overgeneralization in this new paper led by Sayash Kapoor

Russ Poldrack (@russpoldrack)

I am really excited to be part of this project led by Sayash Kapoor and Arvind Narayanan to help improve practices in machine-learning based science. science.org/doi/10.1126/sc…
