Lea's Homepage

Prospective students, please read this first.

About

I am a senior lecturer and DECRA 2023 fellow in natural language processing in the School of Computing and Information Systems, University of Melbourne.

My research focusses on understanding how humans learn about and represent complex and evolving information in the context of large-scale and noisy environments; and on using these insights to develop fairer and more robust automatic systems. I combine methods from natural language processing, machine learning and computational cognitive modelling. Representative projects include scalable models of category and feature learning in children from noisy language data; and modeling the historical change of word meaning over centuries. My current research focusses on improving automatic understanding of narratives, both in fiction (e.g., inducing structured representations of novels; or movie summarization) and in reality, by analyzing framing and narrative strategies in (biased) news stories.

Before joining Melbourne University, I was a postdoc / applied scientist at Amazon Core AI in Berlin, and before that a research associate in the Edinburgh NLP group , ILCC, University of Edinburgh, working with Mirella Lapata and Shay Cohen.

Contact Details

Senior Lecturer
School of Computing and Information Systems
The University of Melbourne

Office: 4107 Melbourne Connect
Phone: +61 3 9035 9888
Email: lea.{my_lastname}@unimelb.edu.au

Mini Bio

2023 -	Senior Lecturer at Melbourne University
2019 - 2023	Lecturer at Melbourne University
2018 - 2019	Postdoc at Amazon Core AI (Berlin)
2017 - 2018	Research associate at ILCC, University of Edinburgh (collaborators Mirella Lapata and Shay Cohen)
2017	Visiting scholar at Language and Cognition Lab, Stanford University (host Michael Frank)
2013 - 2017	PhD at ILCC, University of Edinburgh (supervisors Mirella Lapata and Charles Sutton)
2016	Machine Learning Internship with Amazon Berlin (3 months)
2010 - 2013	MSc in Language Science and Technology from Saarland University (supervisors Ivan Titov and Manfred Pinkal)
2012	Erasmus Mundus Research exchange to NTU Singapore. Research project with Francis Bond.
2007 - 2010	BA in Linguistics, University of Bremen, Germany.

Collaborators, Postdocs, Students, ...

Current

Yulia Otmakhova, Research Fellow, (2023--)

Chaoyi Xiang , Ph.D. student, (2025--; co-supervised with Olya Ohrimenko)

Damian Curran, Ph.D. student, (2024--; co-supervised with Ed Hovy)

Matteo Guida , Ph.D. student, (2024--; co-supervised with Ed Hovy)

Bryan Chen, Ph.D. student, (2023--; co-supervised with Jey Han Lau)

Yilin Geng, Ph.D. student, (2022--; co-supervised with Omri Abend and Ed Hovy)

Gisela Vallejo, Ph.D. student, (2021--; co-supervised with Tim Baldwin)

Uri Berger, Ph.D. student, (2021--; co-supervised with Omri Abend and Gabi Stanovsky)

Katie Warburton, Ph.D. student, (2021--; co-supervised with Charles Kemp and Yang Xu)

John Xu, Ph.D. student, (2021--; co-supervised with Charles Kemp and Yang Xu)

Former

Sheilla Njoto, Ph.D. student, (2020--2024) now postdoc at Melbourne University

Chunhua Liu, Ph.D. student, (2019--2024) now postdoc at Melbourne University

Shima Khanezar, Ph.D. student, (2020--2023) now at CSIRO

Kemal Kurniawan, Ph.D. student, (2019--2023) now postdoc at Melbourne University

Biaoyan Fang, Research Fellow (2022--2023) now at CSIRO

Shiva Subramanian, Research Fellow (2020--2021) now at Amazon

Aili Shen, Research Fellow (2021--2022) now at Amazon

Publications

Uri Berger, Gabriel Stanovsky, Omri Abend and Lea Frermann. (to appear). Surveying the Landscape of Image Captioning Evaluation: A Comprehensive Taxonomy, Trends and Metrics Analysis. Transactions of the Association for Computational Linguistics (TACL) . Preprint

Chaoyi Xiang, Chunhua Liu, Simon DeDeyne and Lea Frermann. (2025). Comparing Moral Values in Western English-speaking societies and LLMs with Word Associations. ACL 2025.

Yulia Otmakhova and Lea Frermann. (2025). Narrative Media Framing in Political Discourse. Findings of ACL 2025.

Rena Gao, Bryan Chen, Lea Frermann and Jey Han Lau. (2025). Moderation Matters: Measuring Conversational Moderation Impact in English as a Second Language Group Discussion. Findings of ACL 2025.

Katie Warburton, Charles Kemp and Lea Frermann. (2025). Scientists & Women Scientists: Exploring Gender Biases in Institutional Category Systems. Proceedings of the 47th Annual Conference of the Cognitive ScienceSociety (CogSci 2025).

John Xu, Charles Kemp, Lea Frermann and Yang Xu. (2025). Modeling compounding across languages with analogy and composition. Proceedings of the 47th Annual Conference of the Cognitive ScienceSociety (CogSci 2025).

Bryan Chen, Jey Han Lau and Lea Frermann. (2025). WHoW: A Cross-domain Approach for Analysing Conversation Moderation. NAACL 2025.

Gisela Vallejo, Christine DeKock, Timothy Baldwin, Lea Frermann. (2025). Human Interest Framing across Cultures: A Case Study on Climate Change . COLING 2025.

Sheilla Njoto, Marc Cheong, Lea Frermann, Leah Ruppanner. (2024). Bias and Discrimination Against Women and Parents in Semi-Automated Hiring Systems . New Technology, Work and Employment.

John Xu, Charles Kemp, Lea Frermann, Yang Xu. (2024). Word reuse and combination support efficient communication of emerging concepts . Proceedings of the National Academy of Sciences (PNAS). [Preprint]

Yulia Otmakhova, Shima Khanehzar, Lea Frermann. (2024). Media Framing: A Typology and Survey of Computational Approaches Across Disciplines. ACL 2024. [Github] 💫 Outstanding Paper Award 💫

Francisco Zanartu, Yulia Otmakhova, John Cook, Lea Frermann. (2024). Generative Debunking of Climate Misinformation, Proceedings of the 1st Workshop on Natural Language Processing Meets Climate Change (ClimateNLP 2024).

Gisela Vallejo, Timothy Baldwin, Lea Frermann. (2024). Connecting the Dots in News Analysis: Bridging the Cross-Disciplinary Disparities in Media Bias and Framing , Proceedings of the Sixth Workshop on Natural Language Processing and Computational Social Science (NLP+CSS 2024).

Simon De Deyne, Chunhua Liu, Lea Frermann. (2024). Can GPT-4 Recover Latent Semantic Relational Information from Word Associations? A Detailed Analysis of Agreement with Human-annotated Semantic Ontologies., Proceedings of the Workshop on Cognitive Aspects of the Lexicon @ LREC-COLING 2024 .

Katie Warburton, Charles Kemp, Yang Xu and Lea Frermann. (2024). Quantifying Bias in Hierarchical Category Systems, Open Mind.

Alfonso Martı́nez Arranz, Thomas Scelsi, Sandra Kentish, and Lea Frermann. (2024). Mechanisation, wages, and royalties: unpacking coal lock-in through 120 years of parliamentary debates in the world’s largest exporter, Energy Research and Social Science.

Leila Ismail, Nada Shahin, Huned Materwala, Alain Hennebelle, Lea Frermann. (2023). ML-NLPEmot: Machine Learning-Natural Language Processing (NLP) Event-based Emotion Detection Proactive Framework Addressing Mental Health, IEEE Access.

Biaoyan Fang, Trevor Cohn, Tim Baldwin and Lea Frermann. (2023). More than Votes? Voting and Language based Partisanship in the US Supreme Court, EMNLP 2023 (Findings).

Biaoyan Fang, Trevor Cohn, Tim Baldwin and Lea Frermann. (2023). Super-SCOTUS: A multi-sourced dataset for the Supreme Court of the US, Workshop on Natural Legal Language Processing 2023. ; [Code & Data]

Ming-Bin Chen, Jeh Han Lau and Lea Frermann. (2023).The uncivil empathy: Investigating the relation between empathy and toxicity in online mental health support forums, ALTA 2023. 💫 Best Paper Award 💫

Biaoyan Fang, Trevor Cohn, Tim Baldwin and Lea Frermann. (2023). It's not only What You Say, It's also Who It's Said to: Counterfactual Analysis of Interactive Behavior in the Courtroom, AACL 2023.

Lea Frermann, Jiatong Li, Shima Khanehzar and Gosia Mikolajczak. (2023). Conflicts, Villains, Resolutions: Towards models of Narrative Media Framing, ACL 2023. BibTeX [Code & Data]

Chunhua Liu, Trevor Cohn and Lea Frermann. (2023). Seeking Clozure: Robust Hypernym Extraction from BERT with Anchored Prompts, *SEM 2023. BibTeX

John Xu, Charles Kemp, Lea Frermann and Yang Xu. (2023). Predicting strategy choice in word formation: A case study of reuse and compounding, Proceedings of the 45th Annual Conference of the Cognitive ScienceSociety (CogSci 2023).

Katie Warburton, Charles Kemp, Yang Xu and Lea Frermann. (2023). Quantifying Bias in Library Classification Systems, Proceedings of the 45th Annual Conference of the Cognitive ScienceSociety (CogSci 2023).

Shima Khanehzar, Trevor Cohn, Gosia Mikolajczak and Lea Frermann. (2023). Probing Power by Prompting: Harnessing Pre-trained Language Models for Power Connotation Framing, EACL 2023. BibTeX

Uri Berger, Lea Frermann, Gabriel Stanovsky and Omri Abend. (2023). A Large-Scale Multilingual Study of Visual Constraints on Linguistic Selection of Descriptions>, EACL 2023 (Findings). BibTeX

Gisela Vallejo, Tim Baldwin and Lea Frermann. (2022). Evaluating the Examiner: The Perils of Pearson Correlation for Automatic Summarisation Evaluation, ALTA 2022. BibTeX

Xudong Hang, Aili Shen, Yitong Li, Lea Frermann, Tim Baldwin and Trevor Cohn. (2022). Towards Fair Supervised Dataset Distillation for Text Classification, SustaiNLP 2022.

Jinrui Yang, Sheilla Njoto, Marc Cheong, Leah Ruppanner and Lea Frermann. (2022). Professional Presentation and Projected Power: A Case Study of Implicit Gender Information in English CVs, 5th workshop on NLP+CSS 2022.

Chunhua Liu, Trevor Cohn, Simon De Deyne and Lea Frermann. (2022). WAX: A New Dataset for Word Association eXplanations, AACL-IJCNLP 2022. BibTeX

Xudong Han, Aili Shen, Trevor Cohn, Timothy Baldwin and Lea Frermann. (2022). Systematic Evaluation of Predictive Fairness, AACL-IJCNLP 2022. BibTeX

Aili Shen, Xudong Han, Trevor Cohn, Timothy Baldwin and Lea Frermann. (2022). Does Representational Fairness Imply Empirical Fairness?, Findings of AACL-IJCNLP 2022. BibTeX

Xudong Han, Aili Shen, Yitong Li, Lea Frermann, Timothy Baldwin and Trevor Cohn. (2022). FairLib: A Unified Framework for Assessing and Improving Fairness, EMNLP 2022 System Demonstrations. GitHub

John Xu, Lea Frermann, Charles Kemp and Yang Xu. (2022). Word formation supports efficient communication: The case of compounds, Proceedings of the 43rd Annual Meeting of the Cognitive Science Society (CogSci 2022).

Aili Chen, Xudong Han, Timothy Baldwin, Trevor Cohn and Lea Frermann. (2022). Optimising Equal Opportunity Fairness in Model Training, NAACL 2022.

Uri Berger, Gabriel Stanovsky, Omri Abend and Lea Frermann. (2022). A Computational Acquisition Model for Multimodal Word Categorization, NAACL 2022.

Kemal Kurniawan, Philip Schulz, Lea Frermann and Trevor Cohn. (2022). Unsupervised Cross-Lingual Transfer of Structured Predictors without Source Data, NAACL 2022.

Thomas Scelsi, Alfonso Martinez Arranz and Lea Frermann. Principled Analysis of Energy Discourse across Domains with Thesaurus-based Automatic Topic Labeling, ALTA 2021. BibTeX

Karun Varghese Mathew, Venkata S Aditya Tarigoppula and Lea Frermann. Multi-modal Intent Classification for Assistive Robots with Large-scale Naturalistic Datasets, ALTA 2021. BibTeX

Chunhua Liu, Trevor Cohn and Lea Frermann (to appear). Commonsense Knowledge in Word Associations and ConceptNet, Conference on Computational Natural Language Learning (CoNLL) 2021. BibTeX

Subramanian, Shivashankar, Afshin Rahimi, Timothy Baldwin, Trevor Cohn and Lea Frermann. Fairness-aware Class Imbalanced Learning, EMNLP 2021. BibTeX

Subramanian, Shivashankar, Xudong Han, Timothy Baldwin, Trevor Cohn and Lea Frermann. Evaluating Debiasing Techniques for Intersectional Biases, EMNLP 2021. BibTeX

Lea Frermann, Mirella Lapata (2021) Categorization in the Wild: Generalizing Cognitive Models to Naturalistic Data across Languages, Proceedings of the 43rd Annual Meeting of the Cognitive Science Society. Data

Shima Khanehzar, Trevor Cohn, Gosia Mikolajczak, Andrew Turpin and Lea Frermann (2021) Framing Unpacked: A Semi-Supervised Interpretable Multi-View Model of Media Frames, NAACL 2021. BibTex Code

Kemal Kurniawan, Lea Frermann, Philip Schulz and Trevor Cohn (2021) PTST-UoM at SemEval-2021 Task 10: Parsimonious Transfer for Sequence Tagging, SemEval 2021. BibTeX Code

Kemal Kurniawan, Lea Frermann, Philip Schulz and Trevor Cohn (2021) PPT: Parsimonious Parser Transfer for Unsupervised Cross-lingual Transfer , EACL 2021. BibTeX Code

Nelly Papalampidi, Frank Keller, Lea Frermann, Mirella Lapata (2020) Screenplay Summarization Using Latent Narrative Structure , ACL 2020 BibTeX Data Code

Lahari Poddar, Gyorgy Szarvas, Lea Frermann (2019) A Probabilistic Framework for Learning Domain Specific Hierarchical Word Embeddings , arXiv cs.CL 1910.07333.

Lea Frermann (2019) Extractive NarrativeQA with Heuristic Pre-Training , 2nd Workshop on Machine Reading for Question Answering (MRQA), Hong Kong. BibTeX Poster

Stefanos Angelidis, Diego Marcheggiani, Lluís Màrquez, Roi Blanco and Lea Frermann (2019) Book QA: Stories of Challenges and Opportunities , 2nd Workshop on Machine Reading for Question Answering (MRQA), Hong Kong. BibTeX

Nikos Papasarantopoulos, Lea Frermann, Mirella Lapata and Shay B. Cohen (2019) Partners in Crime: Multi-view Sequential Inference for Movie Understanding , EMNLP 2019, Hong Kong. BibTeX

Lea Frermann, Alex Klementiev (2019) Inducing Document Structure for Aspect-based Summarization , In proceedings of ACL 2019, Florence, Italy. BibTeX Poster

Maria Barrett, Lea Frermann, Ana Valeria Gonzalez-Garduño and Anders Søgaard , (2018) Unsupervised Induction of Linguistic Categories with Records of Reading, Speaking, and Writing , In Proceedings of NAACL-HLT 2018, New Orleans, Louisiana, USA. BibTeX

Lea Frermann, Shay B. Cohen and Mirella Lapata, (2018) Whodunnit? Crime Drama as a Case for Natural Language Understanding, Transactions of the Association for Computational Linguistics (TACL) . BibTeX Data Slides

Lea Frermann and Michael C. Frank, (2017) Prosodic Features from Large Corpora of Child-Directed Speech as Predictors of the Age of Acquisition of Words, arXiv cs.CL 1709.09443. repo

Lea Frermann and Gyorgy Szarvas, (2017) Inducing Semantic Micro-Clusters from Deep Multi-View Representations of Novels, In Proceedings of the Conference on Empirical Methods on Natural Language Processing (EMNLP), Copenhagen, Denmark. BibTeX Poster

Lea Frermann, (2017) Bayesian Models of Category Acquisition and Meaning Development (Ph.D Thesis Abstract). The IEEE Intelligent Informatics Bulletin, 18, (1), 23.

Lea Frermann, (2017) Bayesian Models of Category Acquisition and Meaning Development , Ph.D Thesis, University of Edinburgh, Scotland, UK.

Lea Frermann and Mirella Lapata, (2016)A Bayesian Model of Diachronic Meaning Change , Transactions of the Association for Computational Linguistics (TACL). BibTeX Slides *Code*

Lea Frermann and Mirella Lapata, (2016) Incremental Bayesian Category Learning from Natural Language, Cognitive Science. BibTeX

Lea Frermann, (2016)A Bayesian Model of Joint Category and Feature Learning 11th Workshop for Women in Machine Learning (WiML) in conj. with NIPS, Barcelona, Spain. Poster

Lea Frermann and Mirella Lapata, (2015) A Bayesian Model for Joint Learning of Categories and their Features, In Proceedings of NAACL-HLT 2015, Denver, Colorado, USA. BibTeX Slides

Lea Frermann, (2015) A Bayesian Model of the Temporal Dynamics of Word Meaning 10th Workshop for Women in Machine Learning (WiML) in conj. with NIPS, Montreal, Canada. Poster

Lea Frermann and Mirella Lapata, (2014) Incremental Bayesian Learning of Semantic Categories, In Proceedings of EACL 2014, Gothenburg, Sweden. BibTeX Data Poster

Lea Frermann, Ivan Titov and Manfred Pinkal, (2014) A Hierarchical Bayesian Model for Unsupervised Induction of Script Knowledge , In Proceedings of EACL 2014, Gothenburg, Sweden. BibTeX Slides

Lea Frermann, (2013) A Hierarchical Bayesian Model for Unsupervised Learning of Script Knowledge , MSc Thesis, Saarland University, Germany.

Lea Frermann and Francis Bond, (2012) Cross-lingual Parse Disambiguation based on Semantic Correspondence, In Proceedings of ACL 2012, Jeju, Republic of Korea. BibTeX

Lea Frermann, (2010) Information Extraction from Written Natural Language Input to Interactive Wayfinding Systems BA Thesis, University of Bremen, Germany.

Teaching

2024 (semester 2)	Introduction to Machine Learning (COMP90049)
2023 (semester 1)	Introduction to Machine Learning (COMP90049)
2022 (semester 2)	Advanced Business Analytics: Text and Web Analytics (BUSA90501)
2022 (semester 1)	Introduction to Machine Learning (COMP90049)
2021 (semester 1)	Introduction to Machine Learning (COMP90049)
2020 (semester 1)	Introduction to Machine Learning (COMP90049)
2019 (semester 2)	Knowledge Technologies (COMP90049)

Invited Talks and Presentations

07 / 2025	B-IT, University of Bonn, Germany
07 / 2024	ML/NLP Seminar, Heidelberg University, Germany
04 / 2024	ML/NLP Seminar, Macquairie University, Sydney, AU
04 / 2024	ML/NLP Seminar, University of Sydney, AU
08 / 2023	ML/NLP Seminar, Potsdam University, Germany
07 / 2023	NLP Seminar, Ludwig Maximilian University, Munic, Germany
06 / 2023	NLP Seminar, UC Louvain, Belgium
05 / 2023	Complex Human Data Hub Seminar series, University of Melbourne
03 / 2023	Department of Economics, University of Melbourne
02 / 2023	NLP seminar, University of Toronto
12 / 2022	The Hebrew University Jerusalem Models of Narratives and Framing
10 / 2021	Victorian GPGPU Research Symposium GPU-powered Natural Language Processing: Opportunities and Challenges in Academic Research
11 / 2020	ML Seminar, Monash University. Improving Narrative Understanding with Inductive Biases
09 / 2020	CIS Seminar Series, Melbourne University, Melbourne, Australia. Improving Narrative Understanding with Inductive Biases
01 / 2020	Monash Neuroscience of Consciousness Lab, Monash University, Melbourne, Australia. Scaling Concept Learning and Story Understanding Through Natural Language Processing
11 / 2019	Complex Human Data Hub, Melbourne University, Melbourne, Australia. Towards Conceptual Story Understanding
04 / 2019	Data Science Seminar, Columbia University, New York, USA. Learning representations of long narratives for summarization and inference
04 / 2019	University of Toronto, Toronto, Canada. Modeling Dynamics in Language, Learning and Inference
04 / 2019	Johns Hopkins University, Baltimore, USA. Learning representations of long narratives for summarization and inference
07 / 2018	University of Melbourne, Melbourne, VIC. From word learners to crime detectives: bridging the gap between human and machine learning
04 / 2018	University of Washington, Seattle, USA. Title: Whodunnit? Crime Drama as a Case for Natural Language Understanding
02 / 2018	Univeristät Stuttgart, IMS, Stuttgart, Germany. Title: Modelling the Dynamics of fine-grained Change in Word Meaning over Centuries
10 / 2017	Saarland University, Saarbruecken, Germany. Title: Whodunnit? Crime Drama as a Case for Natural Language Understanding
10 / 2017	Alan Turing Institute, London, UK. Title: Structured dynamic models of meaning for understanding language change and representing book plots
09 / 2017	CoAStaL Copenhagen Natural Language Processing Group, Copenhagen, Denmark. Title: Structure and Dynamics of Meaning in Humans and in Language
08 / 2017	Stanford NLP Seminar Series, Stanford University, USA. Title: Of Space Piracy and Secret Baby Romances: Deep Multi-View Book Representations and a Scalable Evaluation Framework
11 / 2016	Keynote talk at the Drift-a-LOD workshop (co-located with EKAW), Bologna. Title: Modelling fine-grained Change in Word Meaning over centuries from Large Collections of Unstructured Text
12 / 2015	Heriot-Watt University, Edinburgh, Scotland, UK. Title: Incremental Bayesian Learning of Semantic Categories and their Features
09 / 2015	Google NLP PhD Summit, Zürich.
07 / 2012	Invited Paper at the First Workshop on Multilingual Modeling (in conjunction with ACL 2012), Jeju, Korea. Title: Cross-lingual Parse Disambiguation based on Semantic Correspondence