Skip to content

Commit 56a432a

Browse files
authored
Merge pull request #2 from sileod/patch-1
added mindgames (tom)
2 parents ae96cf3 + b1d4b99 commit 56a432a

File tree

1 file changed

+3
-1
lines changed

1 file changed

+3
-1
lines changed

README.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -86,7 +86,7 @@ For the **latest version** of the paper, keep an eye on the [releases](https://g
8686

8787
#### Natural language understanding
8888

89-
##### Setiment analysis
89+
##### Sentiment analysis
9090
1. Holistic evaluation of language models. _Percy Liang et al._ arXiv 2022. [[paper](https://arxiv.org/abs/2211.09110)]
9191
2. Can chatgpt forecast stock price movements? return predictability and large language models. _Alejandro Lopez-Lira et al._ SSRN 2023. [[paper](https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4412788)]
9292
3. Is chatgpt a general-purpose natural language processing task solver? _Chengwei Qin et al._ arXiv 2023. [[paper](https://arxiv.org/abs/2302.06476)]
@@ -122,6 +122,8 @@ For the **latest version** of the paper, keep an eye on the [releases](https://g
122122
8. Are large language models really good logical reasoners? a comprehensive evaluation from deductive, inductive and abductive views. _Fangzhi Xu et al._ arXiv 2023. [[paper](https://arxiv.org/abs/2306.09841)]
123123
9. Evaluating the Logical Reasoning Ability of ChatGPT and GPT-4. _Hanmeng Liu et al._ arXiv 2023. [[paper](https://arxiv.org/abs/2304.03439)]
124124
10. A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark Datasets, _Laskar et al._ ACL 2023 (Findings). [[paper](https://arxiv.org/abs/2305.18486)]
125+
11. MindGames: Targeting Theory of Mind in Large Language Models with Dynamic Epistemic Modal Logic _Sileo et Lernould_ arXiv 2023. [[paper](https://arxiv.org/abs/2305.03353)]
126+
125127

126128
#### Natural language generation
127129

0 commit comments

Comments
 (0)