Skip to content
View gsarti's full-sized avatar
๐Ÿ“š
Learning
๐Ÿ“š
Learning

Highlights

  • Pro

Organizations

@sissa @blackboxnlp @interpretingdl @AI-Student-Society @inseq-team @GroNLP @FOR-sight-ai @InCLow-LM

Block or report gsarti

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
gsarti/README.md

Portfolio Huggingface Hub Twitter LinkedIn Google Scholar

I am a postdoc at the BauLab at Northeastern University, and a member of the NSF National Deep Inference Fabric (NDIF) team working on open-source interfaces for interpretability research. Previously, I was a PhD student at the University of Groningen GroNLP Lab and part of the Dutch InDeep consortium, where I wrote a thesis on actionable interpretability for machine translation. Before that, I was also an applied scientist intern at AWS AI Labs NYC, a research scientist at Aindo and a founding member of the AI Student Society in Trieste.

My research aims to bridge the gap between advances in interpretability research on large language models (LLMs) and their downstream applications for improving the transparency and trustworthiness of such models. I am also very passionate about open-source collaboration :octocat:, and I believe that good tools play a fundamental role in scientific discovery. For this reason, I participate in the development of NDIF's nnsight interpretability toolkit, and lead the development of inseq for attributional analyses of generative language models.

Pinned Loading

  1. inseq-team/inseq inseq-team/inseq Public

    Interpretability for sequence generation models ๐Ÿ› ๐Ÿ”

    Python 451 38

  2. pecore pecore Public

    Materials for "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" at ICLR'24 ๐Ÿ‘ ๐Ÿ‘

    Jupyter Notebook 15 1

  3. it5 it5 Public

    Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" ๐Ÿ‡ฎ๐Ÿ‡น

    Jupyter Notebook 30 4

  4. verbalized-rebus verbalized-rebus Public

    Materials for "Non Verbis, Sed Rebus: Large Language Models are Weak Solvers of Italian Rebuses" at CLiC-it'24 ๐Ÿงฉ

    Jupyter Notebook 3 1

  5. covid-papers-browser covid-papers-browser Public

    Browse Covid-19 & SARS-CoV-2 Scientific Papers with Transformers ๐Ÿฆ  ๐Ÿ“–

    CSS 184 27

  6. qe4pe qe4pe Public

    Code for "QE4PE: Word-level Quality Estimation for Human Post-Editing" โœ๏ธ

    Jupyter Notebook 5 1