Skip to content
@CUHK-ARISE

CUHK ARISE Lab

Popular repositories Loading

  1. PsychoBench PsychoBench Public

    Benchmarking LLMs' Psychological Portrayal

    Python 114 4

  2. EmotionBench EmotionBench Public

    Benchmarking LLMs' Emotional Alignment with Humans

    Python 100 6

  3. GAMABench GAMABench Public

    Benchmarking LLMs' Gaming Ability in Multi-Agent Environments

    Jupyter Notebook 72 1

  4. ml4code-dataset ml4code-dataset Public

    A collection of datasets for machine learning for big code

    56 5

  5. LLMPersonality LLMPersonality Public

    Code and Results of the Paper: On the Reliability of Psychological Scales on Large Language Models

    Python 30

  6. MAS-Resilience MAS-Resilience Public

    Code and Results of the Paper: On the Resilience of Multi-Agent Systems with Malicious Agents

    Python 19 1

Repositories

Showing 10 of 12 repositories
  • CodeCrash Public

    Official repository for the paper "CodeCrash: Stress Testing LLM Reasoning under Structural and Semantic Perturbations"

    CUHK-ARISE/CodeCrash’s past year of commit activity
    Python 7 GPL-3.0 0 0 0 Updated Apr 22, 2025
  • VisFactor Public

    Benchmarking MLLMs' Basic Visual Abilities

    CUHK-ARISE/VisFactor’s past year of commit activity
    Jupyter Notebook 6 GPL-3.0 0 1 0 Updated Apr 19, 2025
  • GAMABench Public

    Benchmarking LLMs' Gaming Ability in Multi-Agent Environments

    CUHK-ARISE/GAMABench’s past year of commit activity
    Jupyter Notebook 72 GPL-3.0 1 1 0 Updated Feb 9, 2025
  • EmotionBench Public

    Benchmarking LLMs' Emotional Alignment with Humans

    CUHK-ARISE/EmotionBench’s past year of commit activity
    Python 100 GPL-3.0 6 2 1 Updated Feb 9, 2025
  • MAS-Resilience Public

    Code and Results of the Paper: On the Resilience of Multi-Agent Systems with Malicious Agents

    CUHK-ARISE/MAS-Resilience’s past year of commit activity
    Python 19 GPL-3.0 1 0 0 Updated Jan 28, 2025
  • PsychoBench Public

    Benchmarking LLMs' Psychological Portrayal

    CUHK-ARISE/PsychoBench’s past year of commit activity
    Python 114 GPL-3.0 4 0 1 Updated Dec 31, 2024
  • LLMPersonality Public

    Code and Results of the Paper: On the Reliability of Psychological Scales on Large Language Models

    CUHK-ARISE/LLMPersonality’s past year of commit activity
    Python 30 0 0 0 Updated Sep 24, 2024
  • ECHO Public

    Evaluating AI Chatbots’ Role-Play Ability

    CUHK-ARISE/ECHO’s past year of commit activity
    Python 3 GPL-3.0 0 0 0 Updated Apr 30, 2024
  • 3100-PJ-TUT-3 Public
    CUHK-ARISE/3100-PJ-TUT-3’s past year of commit activity
    HTML 1 2 0 0 Updated Feb 13, 2023
  • 3100-PJ-TUT-2 Public
    CUHK-ARISE/3100-PJ-TUT-2’s past year of commit activity
    Python 0 3 0 0 Updated Jan 29, 2023

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…