Daniel Scalena

When I’m old I want to be an astronaut 🚀

Milan, Italy

Hi! I am Daniel, a (double) first-year PhD student at the 🇮🇹 University of Milano-Bicocca and the 🇳🇱 University of Groningen, working on the interpretability, fairness and security of generative (and non-generative) Large Language Models. My supervisors are Elisabetta Fersini and Malvina Nissim.

My research uses interpretability as a tool to make generative models safer, more reliable and less toxic, with the aim of extending and improving their real-world applications.

news

Jun 26, 2024 📚 New work available on arXiv: Multi-property Steering of Large Language Models with Dynamic Activation Composition
Dec 7, 2023 Presenting my poster and paper at the BlackBoxNLP workshop @EMNLP 2023 in Singapore 🇸🇬
Oct 26, 2023 Graduated! Thesis here 🎓

selected publications

  1. Multi-property Steering of Large Language Models with Dynamic Activation Composition
    Daniel Scalena, Gabriele Sarti, and Malvina Nissim
    2024
  2. Let the Models Respond: Interpreting Language Model Detoxification Through the Lens of Prompt Dependence
    Daniel Scalena, Gabriele Sarti, Malvina Nissim, and 1 more author
    2023