Research
My research sits at the intersection of information retrieval, natural language processing, and societal impact. I’m particularly interested in how we can use AI to make information more accessible and transparent; whether that’s helping people find jobs, making sense of the news, or holding governments accountable.
I also teach and supervise students, see: Teaching and Students.
AI for Open Government (2025-present)

I lead the ICAI OpenGov Lab, a collaboration between the University of Amsterdam and the Rijksorganisatie voor Informatiehuishouding (RvIHH). Government transparency is central to a democratic society, and increasingly, governments must publish data proactively (and passively, through FOIA requests or “Woo-verzoeken” in Dutch).
Our mission is to improve the interpretation, retrieval, and use of open government data to increase transparency, public trust, and democratic participation. We work on the entire information lifecycle: from data creation and disclosure on the government side, to responsible use and societal impact on the citizen side.
In practice, this means we develop a search engine for government documents (WooGLe), work on making Woo-documents more accessible, and do research how AI can help bridge the information gap between government and citizens.
Job RecSys and HR (2020-2024)

At Randstad, I led research on job recommender systems: algorithms that match candidates to vacancies and vice versa. This is a high-stakes domain: the recommendations we make influence people’s careers and livelihoods.
I co-organized the RecSys in HR Workshop series from 2021–2024, bringing together researchers and practitioners working on HR-related recommendation problems. Key research directions include:
- Fairness and bias: Recommender systems can perpetuate or amplify societal biases. We worked extensively on detecting and mitigating bias in candidate recommenders, including work on fairness gates for end-to-end bias mitigation and using synthetic data for fairness assessments.
- Job-candidate matching: Fine-tuning language models for semantic matching between resumes and job descriptions (see our work on conSultantBERT).
- Career path recommendation: Using reinforcement learning and knowledge graphs to recommend not just the next job, but career trajectories.
- Skills extraction: Automatically extracting skills from job descriptions and resumes to enable better matching.
Selected publications
-
A. Fabris, N. Baranowska, M. J. Dennis, D. Graus, P. Hacker, J. Saldivar, F. Zuiderveen Borgesius, and A. J. Biega, “Fairness and bias in algorithmic hiring: a multidisciplinary survey,” Acm trans. intell. syst. technol., vol. 16, iss. 1, 2025.
[Bibtex]@article{3696457, author = {Fabris, Alessandro and Baranowska, Nina and Dennis, Matthew J. and Graus, David and Hacker, Philipp and Saldivar, Jorge and Zuiderveen Borgesius, Frederik and Biega, Asia J.}, title = {Fairness and Bias in Algorithmic Hiring: A Multidisciplinary Survey}, year = {2025}, issue_date = {February 2025}, publisher = {Association for Computing Machinery}, address = {New York, NY, USA}, volume = {16}, number = {1}, issn = {2157-6904}, url = {https://doi.org/10.1145/3696457}, doi = {10.1145/3696457}, journal = {ACM Trans. Intell. Syst. Technol.}, month = jan, articleno = {16}, numpages = {54}, keywords = {Algorithmic hiring, Online recruitment, Algorithmic fairness, Bias, Anti-discrimination} } -
A. M. Arafan, D. Graus, F. P. Santos, and E. Beauxis-Aussalet, “End-to-end bias mitigation in candidate recommender systems with fairness gates,” in Recsys in hr’22: the 2\textsuperscriptnd workshop on recommender systems for human resources, 2022.
[Bibtex]@inproceedings{arafan2022end, author = {Arafan, Adam Mehdi and Graus, David and Santos, Fernando P. and Beauxis-Aussalet, Emma}, title = {End-to-End Bias Mitigation in Candidate Recommender Systems with Fairness Gates}, year = {2022}, booktitle = {RecSys in HR’22: The 2\textsuperscript{nd} Workshop on Recommender Systems for Human Resources}, numpages = {8}, location = {Seattle, WA, USA and Online}, series = {CEUR Workshop Proceedings}, url = {https://ceur-ws.org/Vol-3218/RecSysHR2022-paper_6.pdf}, month={9} } -
D. Lavi, V. Medentsiy, and D. Graus, “Consultantbert: fine-tuned siamese sentence-bert for matching jobs and job seekers,” in Workshop on recommender systems for human resources (recsys in hr), Amsterdam, Netherlands, 2021.
[Bibtex]@inproceedings{lavi2021consultantbert, author = {Lavi, Dor and Medentsiy, Volodymyr and Graus, David}, title = {conSultantBERT: Fine-tuned Siamese Sentence-BERT for Matching Jobs and Job Seekers}, year = {2021}, numpages = {8}, booktitle = {Workshop on Recommender Systems for Human Resources (RecSys in HR)}, location = {Amsterdam, Netherlands}, address = {Amsterdam, Netherlands}, month={10}, series = {CEUR Workshop Proceedings}, url = {https://ceur-ws.org/Vol-2967/paper_8.pdf}, }
News RecSys and Smart Radio (2019-2020)

At FD Mediagroep, I led the AI team working on news personalization. This included developing recommender systems for Het Financieele Dagblad (a Dutch financial newspaper) and BNR Nieuwsradio (a news radio station).
A key challenge in news recommendation is moving beyond engagement metrics. Clicks and reading time are easy to optimize for, but they don’t necessarily serve readers well. We developed approaches that incorporate editorial values like diversity, relevance, and “usefulness” into recommendation algorithms.
Another project was BNR SMART Radio: a personalized news radio app that automatically segments radio shows and creates personalized playlists. This required solving fun problems like audio-based show segmentation. We won a bunch of awards with SMART Radio!
Selected publications
-
O. Berlage, K. Lux, and D. Graus, “Improving automated segmentation of radio shows with audio embeddings,” in Icassp 2020 – 2020 ieee international conference on acoustics, speech and signal processing (icassp), 2020, pp. 751-755.
[Bibtex]@inproceedings{berlage2020improving, author={O. {Berlage} and K. {Lux} and D. {Graus}}, booktitle={ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)}, title={Improving Automated Segmentation of Radio Shows with Audio Embeddings}, year={2020}, pages={751-755}, doi={10.1109/ICASSP40776.2020.9054315}, url={https://doi.org/10.1109/ICASSP40776.2020.9054315} } -
F. Lu, A. Dumitrache, and D. Graus, “Beyond optimizing for clicks: incorporating editorial values in news recommendation,” in Proceedings of the 28th acm conference on user modeling, adaptation and personalization, New York, NY, USA, 2020, p. 145–153.
[Bibtex]@inproceedings{lu2020beyond, author = {Lu, Feng and Dumitrache, Anca and Graus, David}, title = {Beyond Optimizing for Clicks: Incorporating Editorial Values in News Recommendation}, year = {2020}, isbn = {9781450368612}, publisher = {Association for Computing Machinery}, address = {New York, NY, USA}, url = {https://doi.org/10.1145/3340631.3394864}, doi = {10.1145/3340631.3394864}, booktitle = {Proceedings of the 28th ACM Conference on User Modeling, Adaptation and Personalization}, pages = {145–153}, numpages = {9}, keywords = {usefulness, news recommendation, editorial values}, location = {Genoa, Italy}, series = {UMAP ’20} } -
E. Sullivan, D. Bountouridis, J. Harambam, S. Najafian, F. Loecherbach, M. Makhortykh, D. Kelen, D. Wilkinson, D. Graus, and N. Tintarev, “Reading news with a purpose: explaining user profiles for self-actualization,” in Adjunct publication of the 27th conference on user modeling, adaptation and personalization, 2019, p. 241–245.
[Bibtex]@inproceedings{sullivan2019reading, title={Reading news with a purpose: Explaining user profiles for self-actualization}, author={Sullivan, Emily and Bountouridis, Dimitrios and Harambam, Jaron and Najafian, Shabnam and Loecherbach, Felicia and Makhortykh, Mykola and Kelen, Domokos and Wilkinson, Daricia and Graus, David and Tintarev, Nava}, booktitle={Adjunct Publication of the 27th Conference on User Modeling, Adaptation and Personalization}, pages={241--245}, year={2019}, url={https://doi.org/10.1145/3314183.3323456}, doi={10.1145/3314183.3323456} }
Semantic Search and E-Discovery (2012-2016)

My PhD research, supervised by Maarten de Rijke, focused on semantic search for e-discovery; the legal process of finding relevant documents in large collections (think: emails in corporate litigation).
I developed methods for entity-centric search, where entities (people, organizations, events) serve as the bridge between user queries and document collections. This included work on entity linking, emerging entity detection, and building entity representations from multiple data sources.
During my PhD, I also interned at Microsoft Research with Paul Bennett, Ryen White, and Eric Horvitz, analyzing Cortana user-interaction logs. This led to a best paper award at UMAP 2016 and a patent.
My thesis, Entities of Interest — Discovery in Digital Traces, was defended in June 2017.
Selected publications
-
D. Graus, D. Odijk, and M. de Rijke, “The birth of collective memories: analyzing emerging entities in text streams,” Journal of the association for information science and technology, vol. 69, iss. 6, pp. 773-786, 2018.
[Bibtex]@article{graus2018birth, author = {Graus, David and Odijk, Daan and de Rijke, Maarten}, title = {The birth of collective memories: Analyzing emerging entities in text streams}, journal = {Journal of the Association for Information Science and Technology}, year = {2018}, volume = {69}, number = {6}, pages = {773-786}, doi = {10.1002/asi.24004}, url = {https://asistdl.onlinelibrary.wiley.com/doi/abs/10.1002/asi.24004}, eprint = {https://asistdl.onlinelibrary.wiley.com/doi/pdf/10.1002/asi.24004}, } -
D. Graus, P. N. Bennett, R. W. White, and E. Horvitz, “Analyzing and predicting task reminders,” in Proceedings of the 2016 conference on user modeling adaptation and personalization, New York, NY, USA, 2016, p. 7–15.
[Bibtex]@inproceedings{graus2016analyzing, author = {Graus, David and Bennett, Paul N. and White, Ryen W. and Horvitz, Eric}, title = {Analyzing and Predicting Task Reminders}, year = {2016}, isbn = {9781450343688}, publisher = {Association for Computing Machinery}, address = {New York, NY, USA}, url = {https://doi.org/10.1145/2930238.2930239}, doi = {10.1145/2930238.2930239}, booktitle = {Proceedings of the 2016 Conference on User Modeling Adaptation and Personalization}, pages = {7–15}, numpages = {9}, keywords = {prospective memory, reminders, log studies, intelligent assistant}, location = {Halifax, Nova Scotia, Canada}, series = {UMAP '16} } -
D. Graus, M. Tsagkias, W. Weerkamp, E. Meij, and M. de Rijke, “Dynamic collective entity representations for entity ranking,” in Proceedings of the ninth acm international conference on web search and data mining, New York, NY, USA, 2016, p. 595–604.
[Bibtex]@inproceedings{graus2016dynamic, author = {Graus, David and Tsagkias, Manos and Weerkamp, Wouter and Meij, Edgar and de Rijke, Maarten}, title = {Dynamic Collective Entity Representations for Entity Ranking}, year = {2016}, isbn = {9781450337168}, publisher = {Association for Computing Machinery}, address = {New York, NY, USA}, url = {https://doi.org/10.1145/2835776.2835819}, doi = {10.1145/2835776.2835819}, booktitle = {Proceedings of the Ninth ACM International Conference on Web Search and Data Mining}, pages = {595–604}, numpages = {10}, keywords = {fielded retrieval, entity retrieval, entity ranking, content representation}, location = {San Francisco, California, USA}, series = {WSDM '16} } -
D. Graus, M. Tsagkias, L. Buitinck, and M. de Rijke, “Generating pseudo-ground truth for predicting new concepts in social streams,” in Advances in information retrieval, Cham, 2014, p. 286–298.
[Bibtex]@inproceedings{graus2014generating, author={Graus, David and Tsagkias, Manos and Buitinck, Lars and de Rijke, Maarten}, title={Generating Pseudo-ground Truth for Predicting New Concepts in Social Streams}, booktitle={Advances in Information Retrieval}, year={2014}, publisher={Springer International Publishing}, address={Cham}, pages={286--298}, url={https://doi.org/10.1007/978-3-319-06028-6_24}, doi={10.1007/978-3-319-06028-6_24}, series = {ECIR '14} }
Full publication list
For a complete and up-to-date list of publications, see my Publications page or Google Scholar profile.