David Graus • Page 5 of 24 • AI for OpenGov

RecSys in HR Workshop Recording available

For those who missed it — like myself 😏 — we have now published the full recording of our Recommender Systems in Human Resources (RecSys in HR) Workshop, which was held on October 1st, in conjunction with the ACM RecSys Conference in Amsterdam.

Our workshop included two keynotes, eight paper presentations, breakout sessions, a virtual panel on the topics of the upcoming EU framework on AI, Fair & inclusive HR Tech, and how to “activate hidden workers.” See the full program here.

So, if you have 4h33m to spare, see the full recording below!

(or use the convenient YouTube chapter to jump through the program 😅)

Many thanks to all co-organizers, contributors, paper authors, and participants, both virtual and in-person! And hopefully we’ll see each other at the second edition ✌️.

AI & Data Science MSc thesis internship projects @ Randstad Groep Nederland

Another academic year, another (short)list of potential projects. Are you a final-year AI or data science student, interested in doing an internship with us? Reach out! First, read below why you would want to join us, and scroll further down for the following project descriptions:

Job Description Generation (NLP)
Conversational/QA approaches for resume information extraction (NLP)
Segmentation of resumes (NLP, CV)
Synthetic data for bias mitigation in recommender systems
Career pathing
Algorithmic planning
Open project: IR, RecSys, NLP, or fair AI

Two papers accepted at the RecSys in HR Workshop!

Happy to have learned we have two papers accepted at the first Recommender Systems in Human Resources Workshop, co-located with ACM RecSys 2021! These papers are the first academic publications of the Data Science Chapter at Randstad Groep Nederland!

M. de Groot, J. Schutte, and D. Graus, “Job posting-enriched knowledge graph for skills-based matching,” in Workshop on recommender systems for human resources (recsys in hr), Amsterdam, Netherlands, 2021.
[Bibtex]

@inproceedings{degroot2021job,
author = {de Groot, Maurits and Schutte, Jelle and Graus, David},
title = {Job Posting-Enriched Knowledge Graph for Skills-based Matching},
year = {2021},
booktitle = {Workshop on Recommender Systems for Human Resources (RecSys in HR)},
numpages = {9},
location = {Amsterdam, Netherlands},
address = {Amsterdam, Netherlands},
month={10},
series = {CEUR Workshop Proceedings},
url = {https://ceur-ws.org/Vol-2967/paper_3.pdf},
}

D. Lavi, V. Medentsiy, and D. Graus, “Consultantbert: fine-tuned siamese sentence-bert for matching jobs and job seekers,” in Workshop on recommender systems for human resources (recsys in hr), Amsterdam, Netherlands, 2021.
[Bibtex]

@inproceedings{lavi2021consultantbert,
author = {Lavi, Dor and Medentsiy, Volodymyr and Graus, David},
title = {conSultantBERT: Fine-tuned Siamese Sentence-BERT for Matching Jobs and Job Seekers},
year = {2021},
numpages = {8},
booktitle = {Workshop on Recommender Systems for Human Resources (RecSys in HR)},
location = {Amsterdam, Netherlands},
address = {Amsterdam, Netherlands},
month={10},
series = {CEUR Workshop Proceedings},
url = {https://ceur-ws.org/Vol-2967/paper_8.pdf},
}

Curious to know what they’re about? I tweet better than I blog 👇

✌️ In "conSultantBERT: Fine-tuned Siamese Sentence-BERT for Matching Jobs and Job Seekers" we fine-tune a siamese SBERT model for matching resumes to vacancy texts using a high-quality dataset of over 270k resume-vacancy pairs labeled by our staffing consultants. It works ✅! pic.twitter.com/iZRzicZE9F
— David Graus (@dvdgrs) August 23, 2021

Stay tuned for pre-prints! See the other accepted papers here.

Disclaimer: yes, I co-organize the workshop, but I was not involved with reviewing/decisions, we have a great (and independent) Program Committee for that!

Webinar on governing AI

Together with Helen Hulsker I joined of a webinar on governing AI by Dataiku, read the blurb here:

There are many obstacles for Data Scientists in effectively governing the use of AI such as, overcoming the bias inherent in historical data, agreeing on the boundaries of governing AI with the business, and perhaps most importantly, ensuring the adoption of good AI practices across the organisation.
So join Randstad and Dataiku as we set the stage around implementing AI in a high-risk domain from a technical/practical and jurisdictional perspective.

Check out the recording at BrightTALK!

Keynote on Data Science in HR at FEAST Workshop (@ ECML-PKDD 2021)

Excited to be giving a keynote at the International Workshop on Fair, Effective And Sustainable Talent management using data science (FEAST workshop), part of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2021).

I’ll give a talk about some of our recent data science projects Randstad Groep Nederland, spanning from our job/candidate recommender systems, work on bias and bias mitigation, knowledge graphs, and deep embeddings for learning to match candidates to vacancies.

Forget the Trolley Problem; Pragmatic and Fair AI in the Real World

Update: my article has been published on TowardsDataScience, selected as an editor’s pick, and highlighted in TowardsDataScience’s weekly newsletter “The Variable”!

The AI doomsday scenarios, ignited by books such as The Filter Bubble (2011) and Weapons of Math Destruction (2016), are slowly being superseded by more pragmatic and nuanced views of AI. Views in which we acknowledge we’re in control of AI and able to design them in ways that reflect values of our choice.

This shift can be seen in the rising involvement of computer scientists, e.g., through books such as The Ethical Algorithm (2019) or Understand, Manage, and Prevent Algorithmic Mitigate Bias (2019), books that describe and acknowledge the challenges and complexities of algorithmic fairness, but at the same time offer concrete methods and tools for more fair and ethical algorithms. This shift can too be seen in that the methods described in these books have already found their ways into the offerings of all major cloud providers, e.g., at the FAccT 2021 Tutorial “Responsible AI in Industry: Lessons Learned in Practice” Microsoft, Google, and Amazon demoed their fair AI solutions to the multidisciplinary audience of the FAccT community.

The message is clear: we can (and should!) operationalize algorithmic fairness.

Co-organizing “RecSys in HR” workshop at RecSys 2021!

We received news that our workshop proposal “RecSys in HR: Workshop on Recommender Systems for Human Resources” was accepted for inclusion in the 15th ACM Conference on Recommender Systems (RecSys 2021) program! That means we’ll be running a full-day workshop with (research and position) papers, keynotes, and a panel (all TBD) during the conference which will be held in Amsterdam, 27th September-1st October 2021.

We wrote this workshop proposal with Toine Bogers (Aalborg University), Mesut Kaya (Aalborg University), Katrien Verbert (KU Leuven) and Francisco Gutiérrez (KU Leuven), at the initiative/idea of Toine, who virtually approached me in RecSys 2020’s gather.town :-D. Toine and Mesut work on a large research project with Denmark’s largest online recruitment portal, JobIndex.

For now, check out our stunning stub page at https://recsyshr2021.aau.dk/ and stay tuned for updates!

Algorithmic bias and bias mitigation talk at the anti-discrimination hackathon

That was fun! The Online Anti-Discrimination Hackathon ran last weekend, a hakacthon co-organized by Ministerie van OCW, Inspectie SZW, Ministerie van BZK, and Hackathon Factory which revolved around “gender discrimination in data collection and labeling for automated assessment and selection of candidates.” Having ample experience in designing and developing AI-powered candidate selection systems, and the risks of algorithmic bias, I was happy to contribute to this hackathon with a few colleagues at Randstad Groep Nederland in several ways.

still of me giving the talk in front of a blue wall

Talk on Algorithmic Bias Mitigation in Automated Recruitment

First, I gave a (virtual, pre-recorded) talk on algorithmic matching, algorithmic bias, and bias mitigation in the domain of automated recruitment. More specifically, I shared how and where we use AI and recommender systems to facilitate job and candidate matching at Randstad Groep Nederland, and more generally about the challenges of bias and the opportunities of bias mitigation. I showcased both examples of misuse of AI, which results in discriminatory systems, and examples of how AI can be used to actively reduce or mitigate bias in the recruitment process. See the recording of my talk below:

In addition, me and a colleague joined live interactive roundtable sessions during the hackathon, and we brought a panel of four subject-matter experts for one-on-one sessions with hackathon teams and participants.

Joined the DDMA AI Committee

I recently joined the Artificial Intelligence committee of the Data Driven Marketing Association (DDMA), which aims to promote the use of responsible AI for marketing and other forms of interaction with customers. Read the small introductory post published by the DDMA below (in Dutch)

David Graus, Randstad Groep Nederland, Commissie AI: “AI biedt enorme kansen om filterbubbels te doorbreken en bias te reduceren”

David heeft een achtergrond in zoekmachinetechnologie waarmee hij inmiddels een carrière heeft opgebouwd gericht op de ontwikkeling van personalisatie- en aanbevelingssystemen. In zijn huidige rol geeft David leiding aan de data scientists van Randstad Groep Nederland en is hij betrokken bij alle facetten van het bouwen van AI-systemen, van ideation tot aan het daadwerkelijk in productie brengen en monitoren van systemen. “Uit ervaring weet ik dat je als AI-ontwerper en -bouwer over de mogelijkheden beschikt om personalisatie- en aanbevelingssystemen in de basis ethisch verantwoord op te zetten. AI biedt daarom grote kansen om filterbubbels te doorbreken en bias te reduceren. Als lid van de Commissie AI hoop ik de sector hierbij te helpen.“

source: ddma.nl

De online fabeltjesfuik bestaat (ook) niet

Update: I shared this post on LinkedIn where it received 25k+ views and a bunch of comments!

Leuk stuk van Lubach, maar als filterbubbelontkenner en iemand die een flink deel van z’n boterham verdient aan het bouwen aan soortgelijke “algoritmen” voel ik me wel geroepen wat nuance in te brengen 😉.

still van Zondag met Lubach over de Fabeltjes Fuik

Joined the #RecSys2021 organizing committee

After attending the beautiful virtual 14th ACM Conference on Recommender Systems (RecSys2020), I am happy to start looking forward to RecSys2021, which will be held in Amsterdam!

I am super excited to share that I’ve joined the organizing committee of RecSys2021 as local outreach chair, which means I’ll help out assisting the other chairs and linking the (local) industry and companies to the conference.

I’m looking forward to it! I have quite fond memories of co-organizing last year’s DIR 2019, and helping out the local organization of ECIR 2014 in Amsterdam.

ACM RecSys 2021: September 27 – October 1, in Amsterdam

Internships and MSc. projects at Randstad Groep Nederland

About Randstad

Work with impact. At Randstad Groep Nederland IT you keep the country moving, enabling people across sectors to do their work, getting pizza on your table and your suitcase on the plane. Your AI solutions mean tomorrow’s recruiter is smarter and faster but still embodies our human forward approach, combining tech with a personal touch and putting people first – including you. Constantly experimenting, working on new NLP use cases and matching systems or expanding our self-service data platform. If you bring the idea we will provide the freedom to explore, so you can help us shape the world of work.

Data Science @ RGN

Randstad IT is organized in a variation of the Spotify Engineering Model with squads, tribes, and chapters. Our data science chapter spans 12 data scientists, data engineers and machine learning engineers over 3 departments (IT, finance, and marketing), across 6 different teams. These teams work on recommender systems for algorithmic job matching, natural language processing and information extraction, forecasting, and more. We are further interested in AI fairness and auditing, explainability, and transparency.

Who are you?

We’re looking for students studying AI, data science, or related programs, for either graduation projects or regular internships. Fluency in python is required, and we expect our interns to work autonomously. However, as an intern you’ll be a fully fledged member of our chapter, which means you get to benefit from the knowledge that is being shared in our chapter.

Here’s the overview of our suggested projects:

(Deep) Reinforcement Learning-based Planning & Poolmanagement
Writing style transfer learning
Career pathing MVP
Pairwise learning to rank for SmartMatch
Revenue forecasting using time-series algorithms
Structured information extraction from resumes
Salary parsing from vacancies
Record linkage for company linking
Free text notes and comments for improved job matching

Joined the board of SETUP

I have joined the board of SETUP, a Utrecht-based medialab established in 2010. SETUP’s mission is:

to educate a wide audience, providing them with the tools necessary to design this brave new world, and infuse it with human values and new-found creativity.
~ SETUP

This mission perfectly fits my personal conviction that knowledge and understanding of technology through media/algorithmic-literacy — not fear and repression — is vital in progressing into our technology-infused future! See, e.g., what I wrote about it on the neutrality of algorithms, or “algorithmic literacy.”

photo: Sebastiaan ter Burg (ter-burg.nl) for SETUP

Prior to joining their board, I have been following SETUP for a couple of years, joining some of their meetups, and giving a talk at one of their events in 2018 “leven met algoritmen.” I am very excited to start as a board member and help set up SETUP’s future!

I have emerged…

… as an entity in the Google Knowledge Graph!

Which is funny, because “emerging entities” were the main topic of my PhD Thesis [1]. With my co-authors I’ve published research on:

Learning how to recognize “out-of-knowledge base” entities emerging on social media [2]
How our collective memory is formed through “emerging entities” on Wikipedia [3], and more generally
Entity retrieval and ranking [4] where Google’s so-called “Knowledge Panels” often served as examples…

(FYI: I’m not sure how I ended up there, the metadata seems to be coming from Google Scholar)

#vanitysearch

[1]

D. Graus, “Entities of interest — discovery in digital traces,” PhD Thesis, 2017.
[Bibtex]

@phdthesis{graus2017entities,
title={Entities of Interest — Discovery in Digital Traces},
author={Graus, David},
year={2017},
month={6},
school={Informatics Institute, University of Amsterdam},
isbn={9789461828002},
url={https://hdl.handle.net/11245.1/51be80bb-1cbf-4633-8ff9-e3128e990bfa}
}

[2]

D. Graus, M. Tsagkias, L. Buitinck, and M. de Rijke, “Generating pseudo-ground truth for predicting new concepts in social streams,” in Advances in information retrieval, Cham, 2014, p. 286–298.
[Bibtex]

@inproceedings{graus2014generating,
author={Graus, David and Tsagkias, Manos and Buitinck, Lars and de Rijke, Maarten},
title={Generating Pseudo-ground Truth for Predicting New Concepts in Social Streams},
booktitle={Advances in Information Retrieval},
year={2014},
publisher={Springer International Publishing},
address={Cham},
pages={286--298},
url={https://doi.org/10.1007/978-3-319-06028-6_24},
doi={10.1007/978-3-319-06028-6_24},
series = {ECIR '14}
}

[3]

D. Graus, D. Odijk, and M. de Rijke, “The birth of collective memories: analyzing emerging entities in text streams,” Journal of the association for information science and technology, vol. 69, iss. 6, pp. 773-786, 2018.
[Bibtex]

@article{graus2018birth,
author = {Graus, David and Odijk, Daan and de Rijke, Maarten},
title = {The birth of collective memories: Analyzing emerging entities in text streams},
journal = {Journal of the Association for Information Science and Technology},
year = {2018},
volume = {69},
number = {6},
pages = {773-786},
doi = {10.1002/asi.24004},
url = {https://asistdl.onlinelibrary.wiley.com/doi/abs/10.1002/asi.24004},
eprint = {https://asistdl.onlinelibrary.wiley.com/doi/pdf/10.1002/asi.24004},
}

[4]

D. Graus, M. Tsagkias, W. Weerkamp, E. Meij, and M. de Rijke, “Dynamic collective entity representations for entity ranking,” in Proceedings of the ninth acm international conference on web search and data mining, New York, NY, USA, 2016, p. 595–604.
[Bibtex]

@inproceedings{graus2016dynamic,
author = {Graus, David and Tsagkias, Manos and Weerkamp, Wouter and Meij, Edgar and de Rijke, Maarten},
title = {Dynamic Collective Entity Representations for Entity Ranking},
year = {2016},
isbn = {9781450337168},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
url = {https://doi.org/10.1145/2835776.2835819},
doi = {10.1145/2835776.2835819},
booktitle = {Proceedings of the Ninth ACM International Conference on Web Search and Data Mining},
pages = {595–604},
numpages = {10},
keywords = {fielded retrieval, entity retrieval, entity ranking, content representation},
location = {San Francisco, California, USA},
series = {WSDM '16}
}

“Beyond Optimizing for Clicks: Incorporating Editorial Values in News Recommendation” accepted at UMAP2020!

The paper we wrote with former FD team mates Feng Lu and Anca Dumitrache has been accepted for publication as a long paper at UMAP 2020, the 28th Conference on User Modeling, Adaptation and Personalization! (I fondly remember my last time at UMAP, in 2016 😏)

We have published a preprint of this paper, get it: here, or from arXiv.

F. Lu, A. Dumitrache, and D. Graus, “Beyond optimizing for clicks: incorporating editorial values in news recommendation,” in Proceedings of the 28th acm conference on user modeling, adaptation and personalization, New York, NY, USA, 2020, p. 145–153.
[Bibtex]

@inproceedings{lu2020beyond,
author = {Lu, Feng and Dumitrache, Anca and Graus, David},
title = {Beyond Optimizing for Clicks: Incorporating Editorial Values in News Recommendation},
year = {2020},
isbn = {9781450368612},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
url = {https://doi.org/10.1145/3340631.3394864},
doi = {10.1145/3340631.3394864},
booktitle = {Proceedings of the 28th ACM Conference on User Modeling, Adaptation and Personalization},
pages = {145–153},
numpages = {9},
keywords = {usefulness, news recommendation, editorial values},
location = {Genoa, Italy},
series = {UMAP ’20}
}

Update 08/05: Cool, @NickKivits mentioned our paper in his Villamedia column: “Het idee van de filterbubbel kan in de prullenbak“ and newsletter (with over 11k subscribers!)

I am particularly happy with this work because:

1️⃣ In our paper we show how you can align algorithm design across stakeholders (in this case: data scientists and journalists), by effectively modeling an editorial value (“dynamicness”) in the news recommender of Het Financieele Dagblad without losing accuracy.

2️⃣ We present (more) empirical proof that #recsys (can) offer(s) users *more* diverse, serendipitous, and dynamic lists of articles, compared to editorially curated lists, and hence (can) help in *avoiding*, not creating filter bubbles!

3️⃣ It is the perfect wrap-up of our Google DNI-funded “SMART Journalism” project at FD Mediagroep (we wrote most of the paper in our spare time after the project ended).

See below the video of the talk at UMAP 2020 below: