For those who missed it — like myself π — we have now published the full recording of our Recommender Systems in Human Resources (RecSys in HR) Workshop, which was held on October 1st, in conjunction with the ACM RecSys Conference in Amsterdam.
Our workshop included two keynotes, eight paper presentations, breakout sessions, a virtual panel on the topics of the upcoming EU framework on AI, Fair & inclusive HR Tech, and how to “activate hidden workers.” See the full program here.
So, if you have 4h33m to spare, see the full recording below!
(or use the convenient YouTube chapter to jump through the program π )
Many thanks to all co-organizers, contributors, paper authors, and participants, both virtual and in-person! And hopefully we’ll see each other at the second edition βοΈ.
Another academic year, another (short)list of potential projects. Are you a final-year AI or data science student, interested in doing an internship with us? Reach out! First, read below why you would want to join us, and scroll further down for the following project descriptions:
Job Description Generation (NLP)
Conversational/QA approaches for resume information extraction (NLP)
Segmentation of resumes (NLP, CV)
Synthetic data for bias mitigation in recommender systems
Happy to have learned we have two papers accepted at the first Recommender Systems in Human Resources Workshop, co-located with ACM RecSys 2021! These papers are the first academic publications of the Data Science Chapter at Randstad Groep Nederland!
M. de Groot, J. Schutte, and D. Graus, “Job posting-enriched knowledge graph for skills-based matching,” in Workshop on recommender systems for human resources (recsys in hr), Amsterdam, Netherlands, 2021. [Bibtex]
@inproceedings{degroot2021job,
author = {de Groot, Maurits and Schutte, Jelle and Graus, David},
title = {Job Posting-Enriched Knowledge Graph for Skills-based Matching},
year = {2021},
booktitle = {Workshop on Recommender Systems for Human Resources (RecSys in HR)},
numpages = {9},
location = {Amsterdam, Netherlands},
address = {Amsterdam, Netherlands},
month={10},
series = {CEUR Workshop Proceedings},
url = {https://ceur-ws.org/Vol-2967/paper_3.pdf},
}
D. Lavi, V. Medentsiy, and D. Graus, “Consultantbert: fine-tuned siamese sentence-bert for matching jobs and job seekers,” in Workshop on recommender systems for human resources (recsys in hr), Amsterdam, Netherlands, 2021. [Bibtex]
@inproceedings{lavi2021consultantbert,
author = {Lavi, Dor and Medentsiy, Volodymyr and Graus, David},
title = {conSultantBERT: Fine-tuned Siamese Sentence-BERT for Matching Jobs and Job Seekers},
year = {2021},
numpages = {8},
booktitle = {Workshop on Recommender Systems for Human Resources (RecSys in HR)},
location = {Amsterdam, Netherlands},
address = {Amsterdam, Netherlands},
month={10},
series = {CEUR Workshop Proceedings},
url = {https://ceur-ws.org/Vol-2967/paper_8.pdf},
}
Curious to know what they’re about? I tweet better than I blog π
βοΈ In "conSultantBERT: Fine-tuned Siamese Sentence-BERT for Matching Jobs and Job Seekers" we fine-tune a siamese SBERT model for matching resumes to vacancy texts using a high-quality dataset of over 270k resume-vacancy pairs labeled by our staffing consultants. It works β ! pic.twitter.com/iZRzicZE9F
Together with Helen Hulsker I joined of a webinar on governing AI by Dataiku, read the blurb here:
There are many obstacles for Data Scientists in effectively governing the use of AI such as, overcoming the bias inherent in historical data, agreeing on the boundaries of governing AI with the business, and perhaps most importantly, ensuring the adoption of good AI practices across the organisation.
So join Randstad and Dataiku as we set the stage around implementing AI in a high-risk domain from a technical/practical and jurisdictional perspective.
Excited to be giving a keynote at the International Workshop on Fair, Effective And Sustainable Talent management using data science (FEAST workshop), part of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2021).
I’ll give a talk about some of our recent data science projectsΒ Randstad Groep Nederland, spanning from our job/candidate recommender systems, work on bias and bias mitigation, knowledge graphs, and deep embeddings for learning to match candidates to vacancies.Β
The AI doomsday scenarios, ignited by books such as The Filter Bubble (2011) and Weapons of Math Destruction (2016), are slowly being superseded by more pragmatic and nuanced views of AI. Views in which we acknowledge weβre in control of AI and able to design them in ways that reflect values of our choice.
This shift can be seen in the rising involvement of computer scientists, e.g., through books such as The Ethical Algorithm (2019) or Understand, Manage, and Prevent Algorithmic Mitigate Bias (2019), books that describe and acknowledge the challenges and complexities of algorithmic fairness, but at the same time offer concrete methods and tools for more fair and ethical algorithms. This shift can too be seen in that the methods described in these books have already found their ways into the offerings of all major cloud providers, e.g., at the FAccT 2021 Tutorial βResponsible AI in Industry: Lessons Learned in Practiceβ Microsoft, Google, and Amazon demoed their fair AI solutions to the multidisciplinary audience of the FAccT community.
The message is clear: we can (and should!) operationalize algorithmic fairness.
We received news that our workshop proposal “RecSys in HR: Workshop on Recommender Systems for Human Resources” was accepted for inclusion in the 15th ACM Conference on Recommender Systems (RecSys 2021) program! That means we’ll be running a full-day workshop with (research and position) papers, keynotes, and a panel (all TBD) during the conference which will be held in Amsterdam, 27th September-1st October 2021.
That was fun! The Online Anti-Discrimination Hackathon ran last weekend, a hakacthon co-organized by Ministerie van OCW, Inspectie SZW, Ministerie van BZK, and Hackathon Factory which revolved around “gender discrimination in data collection and labeling for automated assessment and selection of candidates.” Having ample experience in designing and developing AI-powered candidate selection systems, and the risks of algorithmic bias, I was happy to contribute to this hackathon with a few colleagues at Randstad Groep Nederland in several ways.
Talk on Algorithmic Bias Mitigation in Automated Recruitment
First, I gave a (virtual, pre-recorded) talk on algorithmic matching, algorithmic bias, and bias mitigation in the domain of automated recruitment. More specifically, I shared how and where we use AI and recommender systems to facilitate job and candidate matching at Randstad Groep Nederland, and more generally about the challenges of bias and the opportunities of bias mitigation. I showcased both examples of misuse of AI, which results in discriminatory systems, and examples of how AI can be used to actively reduce or mitigate bias in the recruitment process. See the recording of my talk below:
In addition, me and a colleague joined live interactive roundtable sessions during the hackathon, and we brought a panel of four subject-matter experts for one-on-one sessions with hackathon teams and participants.
David Graus, Randstad Groep Nederland, Commissie AI: βAI biedt enorme kansen om filterbubbels te doorbreken en bias te reducerenβ
David heeft een achtergrond in zoekmachinetechnologie waarmee hij inmiddels een carriΓ¨re heeft opgebouwd gericht op de ontwikkeling van personalisatie- en aanbevelingssystemen. In zijn huidige rol geeft David leiding aan de data scientists van Randstad Groep Nederland en is hij betrokken bij alle facetten van het bouwen van AI-systemen, van ideation tot aan het daadwerkelijk in productie brengen en monitoren van systemen. βUit ervaring weet ik dat je als AI-ontwerper en -bouwer over de mogelijkheden beschikt om personalisatie- en aanbevelingssystemen in de basis ethisch verantwoord op te zetten. AI biedt daarom grote kansen om filterbubbels te doorbreken en bias te reduceren. Als lid van de Commissie AI hoop ik de sector hierbij te helpen.β
Leuk stuk van Lubach, maar als filterbubbelontkenner en iemand die een flink deel van z’n boterham verdient aan het bouwen aan soortgelijke “algoritmen” voel ik me wel geroepen wat nuance in te brengen π.
After attending the beautiful virtual 14th ACM Conference on Recommender Systems (RecSys2020), I am happy to start looking forward to RecSys2021, which will be held in Amsterdam!
I am super excited to share that I’ve joined the organizing committee of RecSys2021 as local outreach chair, which means I’ll help out assisting the other chairs and linking the (local) industry and companies to the conference.
Work with impact. At Randstad Groep Nederland IT you keep the country moving, enabling people across sectors to do their work, getting pizza on your table and your suitcase on the plane. Your AI solutions mean tomorrow’s recruiter is smarter and faster but still embodies our human forward approach, combining tech with a personal touch and putting people first – including you. Constantly experimenting, working on new NLP use cases and matching systems or expanding our self-service data platform. If you bring the idea we will provide the freedom to explore, so you can help us shape the world of work.
Data Science @ RGN
Randstad IT is organized in a variation of the Spotify Engineering Model with squads, tribes, and chapters. Our data science chapter spans 12 data scientists, data engineers and machine learning engineers over 3 departments (IT, finance, and marketing), across 6 different teams. These teams work on recommender systems for algorithmic job matching, natural language processing and information extraction, forecasting, and more. We are further interested in AI fairness and auditing, explainability, and transparency.
Who are you?
We’re looking for students studying AI, data science, or related programs, for either graduation projects or regular internships. Fluency in python is required, and we expect our interns to work autonomously. However, as an intern you’ll be a fully fledged member of our chapter, which means you get to benefit from the knowledge that is being shared in our chapter.
I have joined the board of SETUP, a Utrecht-based medialab established in 2010. SETUP’s mission is:
to educate a wide audience, providing them with the tools necessary to design this brave new world, and infuse it with human values and new-found creativity.
This mission perfectly fits my personal conviction that knowledge and understanding of technology through media/algorithmic-literacy β not fear and repression β is vital in progressing into our technology-infused future! See, e.g., what I wrote about it on the neutrality of algorithms, or “algorithmic literacy.”
Prior to joining their board, I have been following SETUP for a couple of years, joining some of their meetups, and giving a talk at one of their events in 2018 “leven met algoritmen.” I am very excited to start as a board member and help set up SETUP’s future!
[1] D. Graus, “Entities of interest β discovery in digital traces,” PhD Thesis, 2017. [Bibtex]
@phdthesis{graus2017entities,
title={Entities of Interest β Discovery in Digital Traces},
author={Graus, David},
year={2017},
month={6},
school={Informatics Institute, University of Amsterdam},
isbn={9789461828002},
url={https://hdl.handle.net/11245.1/51be80bb-1cbf-4633-8ff9-e3128e990bfa}
}
[2] D. Graus, M. Tsagkias, L. Buitinck, and M. de Rijke, “Generating pseudo-ground truth for predicting new concepts in social streams,” in Advances in information retrieval, Cham, 2014, p. 286β298. [Bibtex]
@inproceedings{graus2014generating,
author={Graus, David and Tsagkias, Manos and Buitinck, Lars and de Rijke, Maarten},
title={Generating Pseudo-ground Truth for Predicting New Concepts in Social Streams},
booktitle={Advances in Information Retrieval},
year={2014},
publisher={Springer International Publishing},
address={Cham},
pages={286--298},
url={https://doi.org/10.1007/978-3-319-06028-6_24},
doi={10.1007/978-3-319-06028-6_24},
series = {ECIR '14}
}
[3] D. Graus, D. Odijk, and M. de Rijke, “The birth of collective memories: analyzing emerging entities in text streams,” Journal of the association for information science and technology, vol. 69, iss. 6, pp. 773-786, 2018. [Bibtex]
@article{graus2018birth,
author = {Graus, David and Odijk, Daan and de Rijke, Maarten},
title = {The birth of collective memories: Analyzing emerging entities in text streams},
journal = {Journal of the Association for Information Science and Technology},
year = {2018},
volume = {69},
number = {6},
pages = {773-786},
doi = {10.1002/asi.24004},
url = {https://asistdl.onlinelibrary.wiley.com/doi/abs/10.1002/asi.24004},
eprint = {https://asistdl.onlinelibrary.wiley.com/doi/pdf/10.1002/asi.24004},
}
[4] D. Graus, M. Tsagkias, W. Weerkamp, E. Meij, and M. de Rijke, “Dynamic collective entity representations for entity ranking,” in Proceedings of the ninth acm international conference on web search and data mining, New York, NY, USA, 2016, p. 595β604. [Bibtex]
@inproceedings{graus2016dynamic,
author = {Graus, David and Tsagkias, Manos and Weerkamp, Wouter and Meij, Edgar and de Rijke, Maarten},
title = {Dynamic Collective Entity Representations for Entity Ranking},
year = {2016},
isbn = {9781450337168},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
url = {https://doi.org/10.1145/2835776.2835819},
doi = {10.1145/2835776.2835819},
booktitle = {Proceedings of the Ninth ACM International Conference on Web Search and Data Mining},
pages = {595β604},
numpages = {10},
keywords = {fielded retrieval, entity retrieval, entity ranking, content representation},
location = {San Francisco, California, USA},
series = {WSDM '16}
}
We have published a preprint of this paper, get it: here, or from arXiv.
F. Lu, A. Dumitrache, and D. Graus, “Beyond optimizing for clicks: incorporating editorial values in news recommendation,” in Proceedings of the 28th acm conference on user modeling, adaptation and personalization, New York, NY, USA, 2020, p. 145β153. [Bibtex]
@inproceedings{lu2020beyond,
author = {Lu, Feng and Dumitrache, Anca and Graus, David},
title = {Beyond Optimizing for Clicks: Incorporating Editorial Values in News Recommendation},
year = {2020},
isbn = {9781450368612},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
url = {https://doi.org/10.1145/3340631.3394864},
doi = {10.1145/3340631.3394864},
booktitle = {Proceedings of the 28th ACM Conference on User Modeling, Adaptation and Personalization},
pages = {145β153},
numpages = {9},
keywords = {usefulness, news recommendation, editorial values},
location = {Genoa, Italy},
series = {UMAP β20}
}
1οΈβ£ In our paper we show how you can align algorithm design across stakeholders (in this case: data scientists and journalists), by effectively modeling an editorial value (“dynamicness”) in the news recommender of Het Financieele Dagblad without losing accuracy.
2οΈβ£ We present (more) empirical proof that #recsys (can) offer(s) users *more* diverse, serendipitous, and dynamic lists of articles, compared to editorially curated lists, and hence (can) help in *avoiding*, not creating filter bubbles!
3οΈβ£ It is the perfect wrap-up of our Google DNI-funded “SMART Journalism” project at FD Mediagroep (we wrote most of the paper in our spare time after the project ended).
See below the video of the talk at UMAP 2020 below: