“Fairness and Bias in Algorithmic Hiring” Survey Paper published in ACM TIST

The aforementioned survey paper on fairness and bias in algorithmic hiring, in which I contributed a section on bias measurement and mitigation in practice, has been accepted for publication in ACM Transactions on Intelligent Systems and Technology (TIST)!

The paper is published open access, and is currently to be found in the “Just accepted” section of the journal. See the DOI here: http://dx.doi.org/10.1145/3696457

“Fairness and Bias in Algorithmic Hiring: a Multidisciplinary Survey” preprint available

Together with some of the researchers in FINDHR we have authored and submitted an extensive survey on algorithmic hiring. The preprint is available here:

In this multidisciplinary work we bring together different perspectives from computer science, law, and practitioners to extensively survey literature and classify so-called “bias conducive factors,” i.e., factors that contribute to bias in the algorithmic hiring process. These factors span the complete hiring pipeline, and are classified into three main families: institutional biases, individual preferences, and technology blindspots.

In addition, our paper surveys bias measures (n=21) and bias mitigation strategies (n=12) that have been applied and studied specifically in the context of algorithmic hiring, which we present in unified notation.

Finally, our survey lists datasets, summarizes the relevant legal landscape (w.r.t. regulations and non-discrimination provisions concerning algorithmic hiring in the EU and the US), and shows practical considerations and examples for bias mitigation in practice (which was my main contribution to this paper).

One of my personal main positive takeaways from this paper is around the potential of positive effects that algorithmic components can have in an inherently biased and complex hiring process, i.e.:

One upshot of understanding bias as an inherently intersectional process is that it also offers a way to reduce discrimination. Since the factors that create bias are interrelated and mutually reinforcing, by halting or ameliorating one BCF, we may introduce positive feedback loops on other BCFs. By removing the discriminatory effect of any one factor, we can hope to reduce its influence on the other factors that reinforce each other in a discriminatory way.

All in all, I am very happy and proud to be listed in this monumental work, which surveys a highly complex field and leaves both enough pointers to get started as useful recommendations for future work, grounded in (gaps in) extensive literature.

“Aspect-Based Sentiment Analysis for Open-Ended HR Survey Responses” accepted at NLP4HR workshop at EACL2024

Lois Rink, our former MSc data science intern at Randstad Groep Nederland, has published her thesis at the 1st Workshop on Natural Language Processing for Human Resources (NLP4HR), to be held at EACL 2024.

Lois’ thesis is titled Aspect-Based Sentiment Analysis for Open-Ended HR Survey Responses, and was written under supervision of Job Meijdam and myself. Get the preprint here or on arXiv:

  • [PDF] L. Rink, J. Meijdam, and D. Graus, “Aspect-based sentiment analysis for open-ended HR survey responses,” in 1st workshop on natural language processing for human resources, 2024.
    [Bibtex]
    @inproceedings{rink2024aspectbased,
    title={Aspect-Based Sentiment Analysis for Open-Ended {HR} Survey Responses},
    author={Lois Rink and Job Meijdam and David Graus},
    booktitle={1st Workshop on Natural Language Processing for Human Resources},
    year={2024}
    }

Abstract:

Understanding preferences, opinions, and sentiment of the workforce is paramount for effective employee lifecycle management. Open-ended survey responses serve as a valuable source of information. This paper proposes a machine learning approach for aspect-based sentiment analysis (ABSA) of Dutch open-ended responses in employee satisfaction surveys. Our approach aims to overcome the inherent noise and variability in these responses, enabling a comprehensive analysis of sentiments that can support employee lifecycle management. Through response clustering we identify six key aspects (salary, schedule, contact, communication, personal attention, agreements), which we validate by domain experts. We compile a dataset of 1,458 Dutch survey responses, revealing label imbalance in aspects and sentiments. We propose few-shot approaches for ABSA based on Dutch BERT models, and compare them against bag-of-words and zero-shot baselines. Our work significantly contributes to the field of ABSA by demonstrating the first successful application of Dutch pre-trained language models to aspect-based sentiment analysis in the domain of human resources (HR).

Three papers accepted at RecSys in HR 2023

For this year’s edition (the third in a row) of the Recommender Systems in Human Resources workshop, to be held at the ACM RecSys Conference in Singapore, I co-authored three accepted papers:

Enhancing Resume Content Extraction in Question Answering Systems through T5 Model Variant

  • [PDF] Y. Luo, F. Lu, V. Pal, and D. Graus, “Enhancing resume content extraction in question answering systems through t5 model variants,” in Recsys in hr’23: the 3\textsuperscriptrd workshop on recommender systems for human resources, 2023.
    [Bibtex]
    @inproceedings{luo2023enhancing,
    title={Enhancing Resume Content Extraction in Question Answering Systems through T5 Model Variants},
    author={Yuxin Luo and Feng Lu and Vaishali Pal and David Graus},
    year={2023},
    booktitle = {RecSys in HR’23: The 3\textsuperscript{rd} Workshop on Recommender Systems for Human Resources},
    numpages = {10},
    location = {Singapore},
    series = {CEUR Workshop Proceedings},
    month={9}
    }

This paper is based on the MSc Data Science thesis of Yuxin, who was supervised by Feng. Yuxin applied Large Language Models (mT5) to do Question Answering over resumes for information extraction.

Career Path Recommendations for Long-term Income Maximization: A Reinforcement Learning Approach

  • [PDF] S. Avlonitis, D. Lavi, M. Mansoury, and D. Graus, “Career path recommendations for long-term income maximization: a reinforcement learning approach,” in Recsys in hr’23: the 3\textsuperscriptrd workshop on recommender systems for human resources, 2023.
    [Bibtex]
    @inproceedings{avlonitis2023career,
    title={Career Path Recommendations for Long-term Income Maximization: A Reinforcement Learning Approach},
    author={Spyros Avlonitis and Dor Lavi and Masoud Mansoury and David Graus},
    year={2023},
    booktitle = {RecSys in HR’23: The 3\textsuperscript{rd} Workshop on Recommender Systems for Human Resources},
    numpages = {8},
    location = {Singapore},
    series = {CEUR Workshop Proceedings},
    month={9}
    }

This is Spyros’ MSc AI thesis (from 2022), who was jointly supervised by me and Dor. Spyros applied reinforcement learning to career path recommendations, leveraging Randstad’s rich data for simulating an environment in which agents can apply for jobs, be hired, and receive salary.

Enhancing PLM Performance on Labour Market Tasks via Instruction-based Finetuning and Prompt-tuning with Rules

  • [PDF] J. Vrolijk and D. Graus, “Enhancing PLM performance on labour market tasks via instruction-based finetuning and prompt-tuning with rules,” in Recsys in hr’23: the 3\textsuperscriptrd workshop on recommender systems for human resources, 2023.
    [Bibtex]
    @inproceedings{vrolijk2023enhancing,
    title={Enhancing {PLM} Performance on Labour Market Tasks via Instruction-based Finetuning and Prompt-tuning with Rules},
    author={Jarno Vrolijk and David Graus},
    year={2023},
    booktitle = {RecSys in HR’23: The 3\textsuperscript{rd} Workshop on Recommender Systems for Human Resources},
    numpages = {10},
    location = {Singapore},
    series = {CEUR Workshop Proceedings},
    month={9}
    }

This paper is based on work by Jarno on using structured taxonomy data for training and fine-tuning Large Language Models for different downstream tasks (such as relation classification, entity linking, and question answering).

See the full list of accepted papers.

“Transfer learning for multilingual vacancy text generation” preprint available

Anna Lőrincz‘ UvA MSc. data science thesis “Transfer learning for multilingual vacancy text generation” — which was graded a 9/10 💫 — was recently accepted at the The Second Version of Generation, Evaluation & Metrics (GEM) Workshop 2022 which will be held as part of EMNLP, December 7-11, 2022!

Get the pre-print here:

  • [PDF] [DOI] A. Lőrincz, D. Graus, D. Lavi, and J. L. M. Pereira, “Transfer learning for multilingual vacancy text generation,” in Proceedings of the 2nd workshop on natural language generation, evaluation, and metrics (gem), Abu Dhabi, United Arab Emirates (Hybrid), 2022, p. 207–222.
    [Bibtex]
    @inproceedings{lorincz2022transfer,
    author = {L{\H{o}}rincz, Anna and Graus, David and Lavi, Dor and Pereira, Jo{\~a}o L. M.},
    title = {Transfer learning for multilingual vacancy text generation},
    booktitle = "Proceedings of the 2nd Workshop on Natural Language Generation, Evaluation, and Metrics (GEM)",
    month = dec,
    year = "2022",
    address = "Abu Dhabi, United Arab Emirates (Hybrid)",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2022.gem-1.18",
    doi = "10.18653/v1/2022.gem-1.18",
    pages = "207--222",
    abstract = "Writing job vacancies is a repetitive and expensive task for humans. This research focuses on automatically generating the benefit sections of vacancies at redacted from job attributes using mT5, the multilingual version of the state-of-the-art T5 transformer trained on general domains to generate texts in multiple languages. While transformers are accurate at generating coherent text, they are sometimes incorrect at including the structured data (the input) in the generated text. Including the input correctly is crucial for vacancy text generation; otherwise, the candidates may get misled. To evaluate how the model includes the input we developed our own domain-specific metrics (input generation accuracy). This was necessary, because Relation Generation, the pre-existing evaluation metric for data-to-text generation uses only string matching, which was not suitable for our dataset (due to the binary field). With the help of the new evaluation method we were able to measure how well the input is included in the generated text separately for different types of inputs (binary, categorical, numeric), offering another contribution to the field. Additionally, we also evaluated how accurate the mT5 model generates the text in the requested language. The results show that mT5 is very accurate at generating the text in the correct language, at including seen categorical inputs and binary values correctly in the generated text. However, mT5 performed worse when generating text from unseen city names or working with numeric inputs. Furthermore, we found that generating additional synthetic training data for the samples with numeric input can increase the input generation accuracy, however this only works when the numbers are integers and only cover a small range.",
    }

In her work, Anna explores transformer models for data-to-text generation, or more specifically: given structured inputs such as categorical features (e.g., location), real valued features (e.g., salary of hours of work per week), or binary features (e.g., contract type) that represent benefits of vacancy texts, the task is to generate a natural language snippet that expresses said feature.

Anna finds that using transformers greatly increases (vocabulary) variation when compared to template-based models, and needs less human effort. The results were — to me — surprisingly good, another proof that transformers are taking over the world and making traditional NLP methods partly obsolete.

I was very much impressed with this work! But, to show how even transformers are not perfect, yet, I present you with my favorite error from the paper:

input: LOCATION = Zwaag
output: Pal gelegen achter het centraal station Zwaaijdijk!

Hope to catch you sometime in Zwaaijdijk!

Two papers accepted at CompJobs ’22

We have two papers accepted at “The First International Workshop on Computational Jobs Marketplace“, co-located with WSDM 2022. Both papers are based on work done by two of our former thesis interns at Randstad Groep Nederland!

  • [PDF] N. Vermeer, V. Provatorova, D. Graus, T. Rajapakse, and S. Mesbah, “Using robbert and extreme multi-label classification to extract implicit and explicit skills from dutch job descriptions,” in Compjobs ’22: the first international workshop on computational jobs marketplace, 2022.
    [Bibtex]
    @inproceedings{vermeer2022using,
    author = {Vermeer, Ninande and Provatorova, Vera and Graus, David and Rajapakse, Thilina and Mesbah, Sepideh},
    title = {Using RobBERT and eXtreme Multi-Label Classification to Extract Implicit and Explicit Skills From Dutch Job Descriptions},
    year = {2022},
    booktitle = {CompJobs '22: The First International Workshop on Computational Jobs Marketplace},
    numpages = {5},
    location = {Online},
    month={2}
    }

☝️ Ninande Vermeer worked under supervision of Sepideh Mesbah and Vera Provatorova (UvA) on: “Using RobBERT and eXtreme Multi-Label Classification to Extract Implicit and Explicit Skills From Dutch Job Descriptions” in which we study to what extent a RobBERT-XMLC model can be used to extract explicit and implicit skills from Dutch job descriptions.

  • [PDF] S. van Els, D. Graus, and E. Beauxis-Aussalet, “Improving fairness assessments with synthetic data: a practical use case with a recommender system for human resources,” in Compjobs ’22: the first international workshop on computational jobs marketplace, 2022.
    [Bibtex]
    @inproceedings{vanels2022improving,
    author = {van Els, Sarah-Jane and Graus, David and Beauxis-Aussalet, Emma},
    title = {Improving Fairness Assessments with Synthetic Data: a Practical Use Case with a Recommender System for Human Resources},
    year = {2022},
    booktitle = {CompJobs '22: The First International Workshop on Computational Jobs Marketplace},
    numpages = {5},
    location = {Online},
    month={2}
    }

✌️ Sarah-Jane van Els worked under supervision of myself and Emma Beauxis-Aussalet (Civic AI Lab) on “Improving Fairness Assessments with Synthetic Data: a Practical Use Case with a Recommender System for Human Resources” in which we explore approaches and methods for assessing algorithmic bias by using synthetic data to improve the size and representativity of a test set used for training candidate recommender systems.

👏 Proud of our former interns for having published their work! And happy with the collaborations we have had with our co-authors 😁.

“Beyond Optimizing for Clicks: Incorporating Editorial Values in News Recommendation” accepted at UMAP2020!

The paper we wrote with former FD team mates Feng Lu and Anca Dumitrache has been accepted for publication as a long paper at UMAP 2020, the 28th Conference on User Modeling, Adaptation and Personalization! (I fondly remember my last time at UMAP, in 2016 😏)

We have published a preprint of this paper, get it: here, or from arXiv.

  • [PDF] [DOI] F. Lu, A. Dumitrache, and D. Graus, “Beyond optimizing for clicks: incorporating editorial values in news recommendation,” in Proceedings of the 28th acm conference on user modeling, adaptation and personalization, New York, NY, USA, 2020, p. 145–153.
    [Bibtex]
    @inproceedings{lu2020beyond,
    author = {Lu, Feng and Dumitrache, Anca and Graus, David},
    title = {Beyond Optimizing for Clicks: Incorporating Editorial Values in News Recommendation},
    year = {2020},
    isbn = {9781450368612},
    publisher = {Association for Computing Machinery},
    address = {New York, NY, USA},
    url = {https://doi.org/10.1145/3340631.3394864},
    doi = {10.1145/3340631.3394864},
    booktitle = {Proceedings of the 28th ACM Conference on User Modeling, Adaptation and Personalization},
    pages = {145–153},
    numpages = {9},
    keywords = {usefulness, news recommendation, editorial values},
    location = {Genoa, Italy},
    series = {UMAP ’20}
    }

Update 08/05: Cool, @NickKivits mentioned our paper in his Villamedia column: Het idee van de filterbubbel kan in de prullenbak and newsletter (with over 11k subscribers!)

I am particularly happy with this work because:

1️⃣ In our paper we show how you can align algorithm design across stakeholders (in this case: data scientists and journalists), by effectively modeling an editorial value (“dynamicness”) in the news recommender of Het Financieele Dagblad without losing accuracy.

2️⃣ We present (more) empirical proof that #recsys (can) offer(s) users *more* diverse, serendipitous, and dynamic lists of articles, compared to editorially curated lists, and hence (can) help in *avoiding*, not creating filter bubbles!

3️⃣ It is the perfect wrap-up of our Google DNI-funded “SMART Journalism” project at FD Mediagroep (we wrote most of the paper in our spare time after the project ended).

See below the video of the talk at UMAP 2020 below:

Continue reading ““Beyond Optimizing for Clicks: Incorporating Editorial Values in News Recommendation” accepted at UMAP2020!”

Improving automated segmentation of radio shows with audio embeddings published @ IEEE ICASSP2020

Oberon Berlage’s MSc. thesis: “Improving automated segmentation of radio shows with audio embeddings” which he wrote under my supervision during his internship at FD Mediagroep was awarded a 9/10, under condition that the work was publishable.

Turns out it was, as it was recently accepted at IEEE ICASSP2020 (the 45th International Conference on Acoustics, Speech, and Signal Processing) without any additional work/experiments (just a bit of reduction). But you already knew this… Oberon will be presenting this work in Barcelona, thanks to the generous support of UvA’s Information Studies program.

We now published a preprint, read it below:

  • [PDF] [DOI] O. Berlage, K. Lux, and D. Graus, “Improving automated segmentation of radio shows with audio embeddings,” in Icassp 2020 – 2020 ieee international conference on acoustics, speech and signal processing (icassp), 2020, pp. 751-755.
    [Bibtex]
    @inproceedings{berlage2020improving,
    author={O. {Berlage} and K. {Lux} and D. {Graus}},
    booktitle={ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
    title={Improving Automated Segmentation of Radio Shows with Audio Embeddings},
    year={2020},
    pages={751-755},
    doi={10.1109/ICASSP40776.2020.9054315},
    url={https://doi.org/10.1109/ICASSP40776.2020.9054315}
    }

His work revolved around improving BNR SMART Radio‘s text-based segmentation by incorporating audio signals in the form of audio embeddings. This turns out to improve over our text-based baseline by a whopping +32.3% F1-measure!

Even better: an audio-only approach, trained on a smallish openly available dataset, outperforms our text-only baseline by 9.4%. This means the segmentation method can be employed without need for audio transcription, which could be a money-saver.

Reading News with a Purpose: Explaining User Profiles for Self-Actualization

Really excited to have co-authored “Reading News with a Purpose,” which was accepted at the International Workshop on Transparent Personalization Methods based on Heterogeneous Personal Data (ExHUM), at UMAP 2019!

With the largest list of authors (ranging from philosophers via polcomm researchers to computer scientists), from a wide array of institutions; Emily Sullivan, Dimitrios Bountouridis, Jaron Harambam, Shabnam Najafian, Felicia Loecherbach, Mykola Makhortykh, Domokos Kelen, Darcia Wilkinson, and Nava Tintarev!

This is work that came out of our ICT with Industry project “Opening the black box of user profiles in content-based recommender systems” where we (FD Mediagroep) collaborated with Nava Tintarev and our excellent team of academics in a week-long academic hackathon!

Read the pre-print, below:

  • [PDF] [DOI] E. Sullivan, D. Bountouridis, J. Harambam, S. Najafian, F. Loecherbach, M. Makhortykh, D. Kelen, D. Wilkinson, D. Graus, and N. Tintarev, “Reading news with a purpose: explaining user profiles for self-actualization,” in Adjunct publication of the 27th conference on user modeling, adaptation and personalization, 2019, p. 241–245.
    [Bibtex]
    @inproceedings{sullivan2019reading,
    title={Reading news with a purpose: Explaining user profiles for self-actualization},
    author={Sullivan, Emily and Bountouridis, Dimitrios and Harambam, Jaron and Najafian, Shabnam and Loecherbach, Felicia and Makhortykh, Mykola and Kelen, Domokos and Wilkinson, Daricia and Graus, David and Tintarev, Nava},
    booktitle={Adjunct Publication of the 27th Conference on User Modeling, Adaptation and Personalization},
    pages={241--245},
    year={2019},
    url={https://doi.org/10.1145/3314183.3323456},
    doi={10.1145/3314183.3323456}
    }

Read the original idea that sparked the project, presented at the 2nd FATREC Workshop at RecSys 2018, here:

  • [PDF] D. Graus, M. Sappelli, and D. M. Chu, “Let me tell you who you are,” in The 2nd fatrec workshop on responsible recommendation, 2018.
    [Bibtex]
    @inproceedings{graus2018let,
    title={Let me tell you who you are},
    author={Graus, David and Sappelli, Maya and Chu, Dung Manh},
    booktitle={The 2nd FATREC Workshop on Responsible Recommendation},
    year={2018}
    }

Position paper ““Let Me Tell You Who You are” — Explaining Recommender Systems by Opening Black Box User Profiles”

Our position paper ““Let Me Tell You Who You are” — Explaining Recommender Systems by Opening Black Box User Profiles” was accepted at the 2nd FATREC Workshop on Responsible Recommendation, held at RecSys ’18!

In this paper, we detail some our ideas and approaches of providing transparency in recommendations through displaying the user profiles, used ‘internally’ by our recommender system. Read the pre-print below!

  • [PDF] D. Graus, M. Sappelli, and D. M. Chu, “Let me tell you who you are,” in The 2nd fatrec workshop on responsible recommendation, 2018.
    [Bibtex]
    @inproceedings{graus2018let,
    title={Let me tell you who you are},
    author={Graus, David and Sappelli, Maya and Chu, Dung Manh},
    booktitle={The 2nd FATREC Workshop on Responsible Recommendation},
    year={2018}
    }
FATREC Position paper: Explaining recommender systems by opening black box user profiles

Pre-print of position paper “SMART Journalism: Personalizing, Summarizing, and Recommending Financial Economic News”

Our position paper “SMART Journalism: Personalizing, Summarizing, and Recommending Financial Economic News” was accepted at Algorithmic Personalization and News (APEN18) workshop, held at ICWSM ’18!

In this paper, we detail some of the ideas and opportunities of personalization in the domain of financial economic news. Read the pre-print below!

  • [PDF] M. Sappelli, D. M. Chu, B. Cambel, J. Nortier, and D. Graus, “Smart radio: personalized news radio,” in Proceedings of the 17th dutch-belgian information retrieval workshop, 2018, p. 27.
    [Bibtex]
    @inproceedings{sappelli2018smart,
    title={SMART Radio: Personalized News Radio},
    author={Sappelli, Maya and Chu, Dung Manh and Cambel, Bahadir and Nortier, Joeri and Graus, David},
    booktitle={Proceedings of the 17th Dutch-Belgian Information Retrieval Workshop},
    pages={27},
    year={2018}
    }

“The birth of collective memories” published in JASIST!

The journal paper “The birth of collective memories: Analyzing emerging entities in text streams” I wrote with Daan Odijk and Maarten de Rijke is now (finally) published at JASIST! It is published under OpenAccess/CC BY 4.0 and available in “early view” (published before it’s published) in the Wiley Online Library. Click on the image below to access it:

The Birth of Collective Memories: Analyzing Emerging Entities in Text Streams

Our paper “The Birth of Collective Memories: Analyzing Emerging Entities in Text Streams” was accepted for publication at JASIST (the Journal of the Association for Information Science and Technology)! Grab a pre-print here:

  • [PDF] [DOI] D. Graus, D. Odijk, and M. de Rijke, “The birth of collective memories: analyzing emerging entities in text streams,” Journal of the association for information science and technology, vol. 69, iss. 6, pp. 773-786, 2018.
    [Bibtex]
    @article{graus2018birth,
    author = {Graus, David and Odijk, Daan and de Rijke, Maarten},
    title = {The birth of collective memories: Analyzing emerging entities in text streams},
    journal = {Journal of the Association for Information Science and Technology},
    year = {2018},
    volume = {69},
    number = {6},
    pages = {773-786},
    doi = {10.1002/asi.24004},
    url = {https://asistdl.onlinelibrary.wiley.com/doi/abs/10.1002/asi.24004},
    eprint = {https://asistdl.onlinelibrary.wiley.com/doi/pdf/10.1002/asi.24004},
    }

This paper is is:
1. My first journal paper
2. Based on Chapter 3 of my PhD thesis “Entities of Interest — Discovery in Digital Traces
3. The first collabo on a paper (on paper) between the FD Mediagroep, Blendle, and the UvA
4. The tombstone on my academic career! (?)

In this paper we study news and social media streams spanning over 18 months, and comprising over 579 million documents, and analyze ’emergence patterns’ of entities, i.e., how a real-world entity (such as a person, organization, product, etc.) appears in these documents in the timespan between the entity’s first mention in online text streams, and when an article devoted to the entity is subsequently added to Wikipedia.