/

/

google-research

Обзор Центр заботыВойти

google-research

Ветки: 311 Коммиты: 4490 Теги: 0

..

google-research

/

psyborgs

PsyBORGS - Prepare PsyBORGS for compatibility with "Personality traits in LLMs"

год назад

PsyBORGS - Prepare PsyBORGS for compatibility with "Personality traits in LLMs"

год назад

llm_scoring_lib.py

Open-sourcing the code for "CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor". https://arxiv.org/abs/2312.07661

9 месяцев назад

psychometric_utils.R

PsyBORGS - Prepare PsyBORGS for compatibility with "Personality traits in LLMs"

год назад

requirements.txt

PsyBORGS - Prepare PsyBORGS for compatibility with "Personality traits in LLMs"

год назад

score_calculation.py

Open-sourcing the code for "CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor". https://arxiv.org/abs/2312.07661

9 месяцев назад

score_calculation_test.py

Open-sourcing the code for "CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor". https://arxiv.org/abs/2312.07661

9 месяцев назад

survey_bench_lib.py

Open-sourcing the code for "CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor". https://arxiv.org/abs/2312.07661

9 месяцев назад

survey_bench_lib_test.py

Open-sourcing the code for "CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor". https://arxiv.org/abs/2312.07661

9 месяцев назад

README.md

PsyBORGS - Psychometric Benchmark of Racism, Generalization, and Stereotyping

Context

The recent progress in generative language models unlocked new possibilities notably for questions answering tasks. Can we use these new abilities to make the model pass well-established surveys unveiling societal stereotypes in the format of Multiple Choice Question ?

Such questionnaires are already established and calibrated baselines to measure bias and personality of populations, leveraging them to evaluate LLMs could be a credible and objective evaluation metric.

Code

We will release the full code soon.

Papers

Personality Traits in Large Language Models. Greg Serapio-García^$\ast$, Mustafa Safdari^$\ast$, Clément Crepy, Luning Sun, Stephen Fitz, Peter Romero, Marwa Abdulhai, Aleksandra Faust^$\dagger$, and Maja Matarić^$\dagger$.

^$\ast$Contributed equally. ^$\dagger$Jointly supervised.

Использование cookies

Мы используем файлы cookie в соответствии с Политикой конфиденциальности и Политикой использования cookies.

Нажимая кнопку «Принимаю», Вы даете АО «СберТех» согласие на обработку Ваших персональных данных в целях совершенствования нашего веб-сайта и Сервиса GitVerse, а также повышения удобства их использования.

Запретить использование cookies Вы можете самостоятельно в настройках Вашего браузера.