Skip to content
View HaritzPuerto's full-sized avatar

Block or report HaritzPuerto

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
HaritzPuerto/README.md

Hi there 👋

I am a postdoctoral researcher at the ELLIS Institute Tübingen (co-affiliated with the Max Planck Institute for Intelligent Systems). I am part of the Cooperative Machine Intelligence for People-Aligned Safe Systems (COMPASS) group, led by Sahar Abdelnabi. My research focuses on enhancing trustworthy, safe, and effective reasoning in language models and agentic systems.

I did my Ph.D. in Machine Learning & Natural Language Processing at UKP Lab in TU Darmstadt, supervised by Prof. Iryna Gurevych. During my Ph.D., I interned at Parameter Lab, where we worked with Naver AI on trustworthy AI.

Before my Ph.D., I worked at the Coleridge Initiative, where we organized the Kaggle Competition Show US the Data. I got my master’s degree from the School of Computing at KAIST, where I was a research assistant at IR&NLP Lab and advised by Professor Sung-Hyon Myaeng.

https://haritzpuerto.github.io

Pinned Loading

  1. parameterlab/c-seo-bench parameterlab/c-seo-bench Public

    Source code of "C-SEO Bench: Does Conversational SEO Work?" NeurIPS D&B 2025

    Jupyter Notebook 17 3

  2. UKPLab/acl2025-diverse-cot UKPLab/acl2025-diverse-cot Public

    Code for the 2025 ACL publication "Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs"

    Python 32 4

  3. parameterlab/mia-scaling parameterlab/mia-scaling Public

    Source code of NAACL 2025 Findings "Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models"

    Python 16 4

  4. UKPLab/emnlp2024-code-prompting UKPLab/emnlp2024-code-prompting Public

    Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs. EMNLP 2024

    Python 27 4

  5. MetaQA MetaQA Public

    Forked from UKPLab/MetaQA

    MetaQA: Combining Expert Agents for Multi-Skill Question Answering

    Python

  6. UKPLab/arxiv2026-controllable-reasoning-models UKPLab/arxiv2026-controllable-reasoning-models Public

    Code to reproduce the experimental results from the arXiv 2026 paper Controllable Reasoning Models Are Private Thinkers

    Python 2 1