“Transfer learning for multilingual vacancy text generation” preprint available

📅 November 1, 2022 • 🕐 09:08 • 🏷 Papers and Research

Anna Lőrincz‘ UvA MSc. data science thesis “Transfer learning for multilingual vacancy text generation” — which was graded a 9/10 💫 — was recently accepted at the The Second Version of Generation, Evaluation & Metrics (GEM) Workshop 2022 which will be held as part of EMNLP, December 7-11, 2022!

Get the pre-print here:

  • [PDF] A. Lőrincz, D. Graus, D. Lavi, and J. L. M. Pereira, “Transfer learning for multilingual vacancy text generation,” in Proceedings of the 2nd workshop on natural language generation, evaluation, and metrics (gem 2022), 2022.
    [Bibtex]
    @inproceedings{lorincz2022transfer,
    author = {L{\H{o}}rincz, Anna and Graus, David and Lavi, Dor and Pereira, Jo{\~a}o L. M.},
    title = {Transfer Learning for Multilingual Vacancy Text Generation},
    year = {2022},
    booktitle = {Proceedings of the 2nd Workshop on Natural Language Generation, Evaluation, and Metrics (GEM 2022)},
    location = {Online},
    publisher = {Association for Computational Linguistics},
    month={12}
    }

In her work, Anna explores transformer models for data-to-text generation, or more specifically: given structured inputs such as categorical features (e.g., location), real valued features (e.g., salary of hours of work per week), or binary features (e.g., contract type) that represent benefits of vacancy texts, the task is to generate a natural language snippet that expresses said feature.

Layout of benefit section from Randstad.nl

Anna finds that using transformers greatly increases (vocabulary) variation when compared to template-based models, and needs less human effort. The results were — to me — surprisingly good, another proof that transformers are taking over the world and making traditional NLP methods partly obsolete.

I was very much impressed with this work! But, to show how even transformers are not perfect, yet, I present you with my favorite error from the paper:

input: LOCATION = Zwaag
output: Pal gelegen achter het centraal station Zwaaijdijk!

Hope to catch you sometime in Zwaaijdijk!

RecSys in HR 2022 Workshop Recording available

📅 October 6, 2022 • 🕐 15:02 • 🏷 Research

We have published the full recording of our RecSys in HR 2022 workshop, which we held September 22 in Seattle, WA, USA.

The video is 5h42m43s long, so to guide you, I provide you the following list of highlights (see the video description for timestamps that will allow you to instantly skip to the sections described below):

1️⃣ Our first keynote speaker, Robyn Rap, a data science leader at Indeed.com talks in depth about the importance of collaboration between #UX and Data Scientists in evaluating and developing search and recommendation systems. She provides a great (broad) overview of the challenges and differences of doing recsys in HR, compared to more common scenarios such as e-commerce or media. Great introduction into our deep field!

2️⃣ The panel, which includes Randstad’s Helen HulskerCarlos Castillo (ChaTo)Liangjie Hong (director of AI, engineering at LinkedIn) and the aforementioned Robyn Rap (still Indeed.com). The topics discussed by these experts: the role of HR Tech in the Global Labor Shortage, fair AI in Practice, multi-stakeholder development of HR Tech, and Regulation and Accountability.

3️⃣ Our second keynote speaker, Liangjie Hong, presents some of the foundational engineering work at LinkedIn that aims to serve many downstream AI applications, which revolves around a pipeline with (continuously updating) embedding representation for job seekers, jobs, and everything else, which are fused with LinkedIns (huge) Knowledge Graph.

4️⃣ There’s also a bunch of interesting paper presentations, e.g., a bunch from Indeed.com: Model Threshold Optimization for Segmented Job-Jobseeker Recommendation System (where the authors show a sneakpeek in their overall setup of recommendations at Indeed.com), Flexible Job Classification with Zero-Shot Learning by thomas lake, which shows how to use off-the-shelf transformer models for doing job classification. And Beyond human-in-the-loop: scaling occupation taxonomy at Indeed: where the authors show how they combine human intelligence with automation for scaling taxonomies across languages and markets. Finally, some interesting and very pragmatic/hands-on papers on skill extraction, e.g., Mike Zhang‘s Skill Extraction from Job Postings using Weak Supervision and Jens-Joris Decorte‘s Design of Negative Sampling Strategies for Distantly Supervised Skill Extraction.

Enjoy watching!

Three papers accepted at RecSys in HR 2022 Workshop

📅 September 6, 2022 • 🕐 14:40 • 🏷 Blog and Research

🎉 A little success to share: three of our former data science interns at the Data Science chapter at Randstad Groep Nederland have written and published their master theses at our upcoming RecSys in HR Workshop; an academic workshop that revolves around AI in HR, which is part of an ACM International Conference on Recommender Systems (the AI systems used for matching; whether it is Netflix movies to users, or in our case; jobs to job seekers).

As always, the work of the students is pretty technical, but I will go ahead and try to provide little human-understandable summaries below.

Explainable Career Path Predictions using Neural Models

Roan Schellingerhout worked under supervision of Volodymyr Medentsiy on Explainable Career Path Prediction using Neural Networks, where he trained deep neural networks on our own talent work history data, to create a tool that can help consultants or talents to predict possible career switches, given as input a talent’s work history. The predictions are visually explained, in the sense that the underlying reasons for proposing a certain job are provided. Roan tested these visualizations on consultants, and found consultants generally like them.

End-to-End Bias Mitigation in Candidate Recommender Systems with Fairness Gates

  • [PDF] A. M. Arafan, D. Graus, F. P. Santos, and E. Beauxis-Aussalet, “End-to-end bias mitigation in candidate recommender systems with fairness gates,” in Recsys in hr’22: the 2nd workshop on recommender systems for human resources, 2022.
    [Bibtex]
    @inproceedings{arafan2022end,
    author = {Arafan, Adam Mehdi and Graus, David and Santos, Fernando P. and Beauxis-Aussalet, Emma},
    title = {End-to-End Bias Mitigation in Candidate Recommender Systems with Fairness Gates},
    year = {2022},
    booktitle = {RecSys in HR’22: The 2nd Workshop on Recommender Systems for Human Resources },
    numpages = {8},
    location = {Seattle, WA, USA and Online},
    month={9}
    }

Adam Arafan worked under supervision of myself on “End-to-End Bias Mitigation in Candidate Recommender Systems with Fairness Gates,” in his thesis he experimented with making the SmartMatch Talent Recommender more fair (at the level of gender), either by changing the “input” of the algorithm (for example; by balancing male and female candidates in the training data), or by changing its “output” (for example: for a given list of candidates, go through the list to make sure the top 10 has a 50/50 balance between male and female candidates). His work is novel because these type of “bias mitigation” strategies have been studied in isolation, but never together.

Automated Personnel Scheduling with Reinforcement Learning and Graph Neural Networks

  • [PDF] B. Platten, M. Macfarlane, D. Graus, and S. Mesbah, “Automated personnel scheduling with reinforcement learning and graph neural networks,” in Recsys in hr’22: the 2nd workshop on recommender systems for human resources, 2022.
    [Bibtex]
    @inproceedings{platten2022automated,
    author = {Platten, Benjamin and Macfarlane, Matthew and Graus, David and Mesbah, Sepideh},
    title = {Automated Personnel Scheduling with Reinforcement Learning and Graph Neural Networks},
    year = {2022},
    booktitle = {RecSys in HR’22: The 2nd Workshop on Recommender Systems for Human Resources },
    numpages = {10},
    location = {Seattle, WA, USA and Online},
    month={9}
    }

Ben Platten worked under supervision of Sepideh Mesbah on Automated Personnel Scheduling with Reinforcement Learning and Graph Neural Networks, in which he experimented with “reinforcement learning” (a specific machine learning paradigm) which in theory suits the challenging task of scheduling well. He experimented on a toy problem to assess that, indeed, the method seems to work quite well.

See the full list of accepted papers here: https://recsyshr.aau.dk/accepted-papers/.

And stay tuned for the pre-prints, which I’ll share as soon as they’re available!

RecSys in HR at ACM RecSys 2022 in Seattle!

📅 April 10, 2022 • 🕐 09:37 • 🏷 Blog and Research

Fantastic news! We’ve received word the 2nd edition of our “Recommender Systems for Human Resources” (RecSys in HR) Workshop has been accepted to be included in the ACM RecSys 2022 program, to be held in Seattle!

Last year’s (first) edition of our workshop was co-located with ACM RecSys 2021 in Amsterdam, and featured two keynotes, a panel, breakout sessions and 8 paper presentations. The recording, workshop proceedings, and a workshop report are available through our workshop’s website at: https://recsyshr2021.aau.dk/

Check back there soon for information on the 2022 edition we’re planning with Toine Bogers, Mesut Kaya, Francisco Gutiérrez, and newly joined co-organizers Sepideh Mesbah (Randstad Groep Nederland) and Chris Johnson (Indeed.com)!

Two papers accepted at the RecSys in HR Workshop!

📅 August 23, 2021 • 🕐 12:10 • 🏷 Research

Happy to have learned we have two papers accepted at the first Recommender Systems in Human Resources Workshop, co-located with ACM RecSys 2021! These papers are the first academic publications of the Data Science Chapter at Randstad Groep Nederland!

  • [PDF] M. de Groot, J. Schutte, and D. Graus, “Job posting-enriched knowledge graph for skills-based matching,” in Recsys in hr 2021, Amsterdam, Netherlands, 2021.
    [Bibtex]
    @inproceedings{degroot2021job,
    author = {de Groot, Maurits and Schutte, Jelle and Graus, David},
    title = {Job Posting-Enriched Knowledge Graph for Skills-based Matching},
    year = {2021},
    booktitle = {RecSys in HR 2021},
    numpages = {9},
    location = {Amsterdam, Netherlands},
    address = {Amsterdam, Netherlands},
    month={10}
    }
  • [PDF] D. Lavi, V. Medentsiy, and D. Graus, “Consultantbert: fine-tuned siamese sentence-bert for matching jobs and job seekers,” in Recsys in hr 2021, Amsterdam, Netherlands, 2021.
    [Bibtex]
    @inproceedings{lavi2021consultantbert,
    author = {Lavi, Dor and Medentsiy, Volodymyr and Graus, David},
    title = {conSultantBERT: Fine-tuned Siamese Sentence-BERT for Matching Jobs and Job Seekers},
    year = {2021},
    booktitle = {RecSys in HR 2021},
    numpages = {8},
    location = {Amsterdam, Netherlands},
    address = {Amsterdam, Netherlands},
    month={10}
    }

Curious to know what they’re about? I tweet better than I blog 👇

Stay tuned for pre-prints! See the other accepted papers here.

Disclaimer: yes, I co-organize the workshop, but I was not involved with reviewing/decisions, we have a great (and independent) Program Committee for that!

Co-organizing “RecSys in HR” workshop at RecSys 2021!

📅 March 18, 2021 • 🕐 12:14 • 🏷 Blog and Research

We received news that our workshop proposal “RecSys in HR: Workshop on Recommender Systems for Human Resources” was accepted for inclusion in the 15th ACM Conference on Recommender Systems (RecSys 2021) program! That means we’ll be running a full-day workshop with (research and position) papers, keynotes, and a panel (all TBD) during the conference which will be held in Amsterdam, 27th September-1st October 2021.

We wrote this workshop proposal with Toine Bogers (Aalborg University), Mesut Kaya (Aalborg University), Katrien Verbert (KU Leuven) and Francisco Gutiérrez (KU Leuven), at the initiative/idea of Toine, who virtually approached me in RecSys 2020’s gather.town :-D. Toine and Mesut work on a large research project with Denmark’s largest online recruitment portal, JobIndex.

For now, check out our stunning stub page at https://recsyshr2021.aau.dk/ and stay tuned for updates!

“Beyond Optimizing for Clicks: Incorporating Editorial Values in News Recommendation” accepted at UMAP2020!

📅 April 21, 2020 • 🕐 17:33 • 🏷 Papers and Research

The paper we wrote with former FD team mates Feng Lu and Anca Dumitrache has been accepted for publication as a long paper at UMAP 2020, the 28th Conference on User Modeling, Adaptation and Personalization! (I fondly remember my last time at UMAP, in 2016 😏)

We have published a preprint of this paper, get it: here, or from arXiv.

  • [PDF] [DOI] F. Lu, A. Dumitrache, and D. Graus, “Beyond optimizing for clicks: incorporating editorial values in news recommendation,” in Proceedings of the 28th acm conference on user modeling, adaptation and personalization, New York, NY, USA, 2020, p. 145–153.
    [Bibtex]
    @inproceedings{lu2020beyond,
    author = {Lu, Feng and Dumitrache, Anca and Graus, David},
    title = {Beyond Optimizing for Clicks: Incorporating Editorial Values in News Recommendation},
    year = {2020},
    isbn = {9781450368612},
    publisher = {Association for Computing Machinery},
    address = {New York, NY, USA},
    url = {https://doi.org/10.1145/3340631.3394864},
    doi = {10.1145/3340631.3394864},
    booktitle = {Proceedings of the 28th ACM Conference on User Modeling, Adaptation and Personalization},
    pages = {145–153},
    numpages = {9},
    keywords = {usefulness, news recommendation, editorial values},
    location = {Genoa, Italy},
    series = {UMAP ’20}
    }

Update 08/05: Cool, @NickKivits mentioned our paper in his Villamedia column: Het idee van de filterbubbel kan in de prullenbak and newsletter (with over 11k subscribers!)

I am particularly happy with this work because:

1️⃣ In our paper we show how you can align algorithm design across stakeholders (in this case: data scientists and journalists), by effectively modeling an editorial value (“dynamicness”) in the news recommender of Het Financieele Dagblad without losing accuracy.

2️⃣ We present (more) empirical proof that #recsys (can) offer(s) users *more* diverse, serendipitous, and dynamic lists of articles, compared to editorially curated lists, and hence (can) help in *avoiding*, not creating filter bubbles!

3️⃣ It is the perfect wrap-up of our Google DNI-funded “SMART Journalism” project at FD Mediagroep (we wrote most of the paper in our spare time after the project ended).

See below the video of the talk at UMAP 2020 below:

plop

PodRecs: Workshop on Podcast Recommendations PC

📅 April 4, 2020 • 🕐 12:14 • 🏷 Research

I was invited to join the program committee of (the first) PodRecs: Workshop on Podcast Recommendations (to be held at RecSys’20).

Since our work on BNR SMART Radio, I am really interested in the space of audio, recommender systems, and information retrieval. Curious to see the submissions!

See the PodRecs call for papers, and check out the website, by clicking the image below.

“Improving automated segmentation of radio shows with audio embeddings”

📅 July 5, 2019 • 🕐 15:41 • 🏷 Blog and Research

Update (28/1/2020): Oberon’s thesis was accepted and will be published at the IEEE 45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020), to be held May 4-8 in Barcelona, Spain! The submission is co-authored with Klaus Lux and myself.

Oberon Berlage recently successfully defended his MSc. thesis (title above!) for the Data Science Master at University of Amsterdam, and graduated with a whopping 9!

He’s the first academic offspring of our AI Team @ FD Mediagroep, and worked on BNR SMART Radio‘s segmenter. Oberon improved our text-based segmenter by adding audio embeddings, improving the F1 score with +32%!

His thesis is now online, check it out at: http://scriptiesonline.uba.uva.nl/document/673254