
Paper Digest: NeurIPS-2023 Highlights (Full List) #333

Open
irthomasthomas opened this issue Jan 13, 2024 · 0 comments
Labels

finetuning: Tools for finetuning of LLMs, e.g. SFT or RLHF
llm: Large Language Models
llm-experiments: experiments with large language models
New-Label: Choose this option if the existing labels are insufficient to describe the content accurately
Papers: Research papers

Comments

@irthomasthomas (Owner)

Paper Digest: NeurIPS 2023 Highlights

https://www.paperdigest.org

1, Toolformer: Language Models Can Teach Themselves to Use Tools
Timo Schick; Jane Dwivedi-Yu; Roberto Dessi; Roberta Raileanu; Maria Lomeli; Eric Hambro; Luke Zettlemoyer; Nicola Cancedda; Thomas Scialom;
Highlight: In this paper, we show that LMs can teach themselves to use external tools via simple APIs and achieve the best of both worlds.
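
As a rough illustration of the mechanism, here is a minimal sketch of Toolformer-style inline tool calls in Python. The `generate()` stub and the `[Tool(args)]` call syntax are assumptions for illustration, not the paper's actual model or API; in the paper, the LM is finetuned to emit such calls itself, and only the parse-and-execute step is shown here:

```python
import re

# Hypothetical stand-in for the language model; a real Toolformer-style
# setup finetunes the LM to emit these calls on its own.
def generate(prompt: str) -> str:
    return "400 out of 1400 is [Calculator(400 / 1400)] of the total."

# Simple tools exposed through plain "APIs", mirroring the paper's setup.
# eval() is for illustration only; never use it on untrusted input.
TOOLS = {
    "Calculator": lambda expr: str(eval(expr, {"__builtins__": {}})),
}

CALL = re.compile(r"\[(\w+)\(([^)]*)\)\]")

def execute_tool_calls(text: str) -> str:
    """Replace each inline [Tool(args)] call with the tool's output."""
    def run(m: re.Match) -> str:
        name, args = m.group(1), m.group(2)
        return TOOLS[name](args) if name in TOOLS else m.group(0)
    return CALL.sub(run, text)

print(execute_tool_calls(generate("What fraction of 1400 is 400?")))
# -> "400 out of 1400 is 0.2857142857142857 of the total."
```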

2, Self-Refine: Iterative Refinement with Self-Feedback
Aman Madaan; Niket Tandon; Prakhar Gupta; Skyler Hallinan; Luyu Gao; Sarah Wiegreffe; Uri Alon; Nouha Dziri; Shrimai Prabhumoye; Yiming Yang; Shashank Gupta; Bodhisattwa Prasad Majumder; Katherine Hermann; Sean Welleck; Amir Yazdanbakhsh; Peter Clark;
Highlight: Motivated by how humans refine their written text, we introduce Self-Refine, an approach for improving initial outputs from LLMs through iterative feedback and refinement.
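
The refinement loop itself is simple to sketch. Below is a minimal, hypothetical version assuming a generic `llm()` completion helper (not the authors' code); as in the paper, the same model plays generator, feedback-giver, and refiner:

```python
# Hypothetical llm() helper standing in for any chat-completion call.
def llm(prompt: str) -> str:
    return "STOP"  # placeholder: wire this to a real model

def self_refine(task: str, max_iters: int = 3) -> str:
    """Generate -> feedback -> refine loop, in the spirit of Self-Refine."""
    draft = llm(f"Complete the task:\n{task}")
    for _ in range(max_iters):
        feedback = llm(
            f"Task: {task}\nDraft: {draft}\n"
            "Give concrete, actionable feedback, or reply STOP if the draft is good."
        )
        if "STOP" in feedback:  # model judges its own output acceptable
            break
        draft = llm(
            f"Task: {task}\nDraft: {draft}\nFeedback: {feedback}\n"
            "Rewrite the draft to address the feedback."
        )
    return draft
```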

3, Vicuna Evaluation: Exploring LLM-as-a-Judge and Chatbot Arena
Lianmin Zheng; Wei-Lin Chiang; Ying Sheng; Siyuan Zhuang; Zhanghao Wu; Yonghao Zhuang; Zi Lin; Zhuohan Li; Dacheng Li; Eric Xing; Hao Zhang; Joseph Gonzalez; Ion Stoica;
Highlight: To address the gap between the broad capabilities of chat assistants and what existing benchmarks measure, we explore using strong LLMs as judges to evaluate these models on more open-ended questions. We examine the usage and limitations of LLM-as-a-judge, including position, verbosity, and self-enhancement biases, as well as limited reasoning ability, and propose solutions to mitigate some of them.
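
A pairwise LLM-as-a-judge comparison can be sketched in a few lines. The `llm()` helper and prompt wording below are assumptions, not the paper's exact prompts; the position swap follows the paper's suggested mitigation for position bias:

```python
# Hypothetical llm() helper; prompts are illustrative, not the paper's.
def llm(prompt: str) -> str:
    return "A"  # placeholder: wire this to a strong judge model

JUDGE_PROMPT = (
    "You are an impartial judge.\nQuestion: {q}\n"
    "Assistant A: {a}\nAssistant B: {b}\n"
    "Which answer is better? Reply with exactly 'A', 'B', or 'tie'."
)

def judge_pair(question: str, ans1: str, ans2: str) -> str:
    # Judge twice with the answers swapped to control for position bias.
    first = llm(JUDGE_PROMPT.format(q=question, a=ans1, b=ans2)).strip()
    second = llm(JUDGE_PROMPT.format(q=question, a=ans2, b=ans1)).strip()
    if first == "A" and second == "B":
        return "ans1"
    if first == "B" and second == "A":
        return "ans2"
    return "tie"  # tied or position-inconsistent verdicts
```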

Suggested labels

{ "key": "LLM-Applications", "value": "Topics related to practical applications of Large Language Models in various fields" }

@irthomasthomas added the llm, finetuning, llm-experiments, Papers, and New-Label labels on Jan 13, 2024