I have successfully defended my MSc thesis on the topic βWhy Better Cross-Lingual Alignment Fails for Better Cross-Lingual Transfer?β, advised by Yihong Liu (LMU Munich) and supervised by Prof. Dr. Vera Demberg (Saarland University) and Prof. Dr. Hinrich SchΓΌtze (LMU Munich).
Paper Accepted at NeurIPS 2025 π
Our paper investigates how architectural limitations of Transformers manifest after pretraining. Read it here.
Paper Accepted at COLM 2025 β¨
We mechanistically explore the information flow in ICL tasks in Gemma model family. Read more here.
Paper Accepted at LoResLM workshop at COLING 2025 π°πΏπΊπΏπ°π¬πΉπ²
Our paper is a survey on current state of the art technologies for Turkic Central Asian languages. Read more here.
Paper Accepted at NeurIPS 2024 π
We study expressivity of SSMs in comparison to Transformers and RNNs. Read the full paper here.