Internships and MSc. projects at Randstad Groep Nederland

πŸ“… July 6, 2020 β€’ πŸ• 13:28 β€’ 🏷 Blog β€’ πŸ‘ 151

Come join us in Diemen!

About Randstad

Work with impact. At Randstad Groep Nederland IT you keep the country moving, enabling people across sectors to do their work, getting pizza on your table and your suitcase on the plane. Your AI solutions mean tomorrow’s recruiter is smarter and faster but still embodies our human forward approach, combining tech with a personal touch and putting people first – including you. Constantly experimenting, working on new NLP use cases and matching systems or expanding our self-service data platform. If you bring the idea we will provide the freedom to explore, so you can help us shape the world of work. 

Data Science @ RGN

Randstad IT is organized in a variation of the Spotify Engineering Model with squads, tribes, and chapters. Our data science chapter spans 12 data scientists, data engineers and machine learning engineers over 3 departments (IT, finance, and marketing), across 6 different teams. These teams work on recommender systems for algorithmic job matching, natural language processing and information extraction, forecasting, and more. We are further interested in AI fairness and auditing, explainability, and transparency.

Who are you?

We’re looking for students studying AI, data science, or related programs, for either graduation projects or regular internships. Fluency in python is required, and we expect our interns to work autonomously. However, as an intern you’ll be a fully fledged member of our chapter, which means you get to benefit from the knowledge that is being shared in our chapter.

Here’s the overview of our suggested projects:

  • (Deep) Reinforcement Learning-based Planning & Poolmanagement
  • Writing style transfer learning
  • Career pathing MVP
  • Pairwise learning to rank for SmartMatch
  • Revenue forecasting using time-series algorithms
  • Structured information extraction from resumes
  • Salary parsing from vacancies
  • Record linkage for company linking
  • Free text notes and comments for improved job matching
(more…)

Joined the board of SETUP

πŸ“… May 29, 2020 β€’ πŸ• 12:32 β€’ 🏷 Blog β€’ πŸ‘ 4

I have joined the board of SETUP, a Utrecht-based medialab established in 2010. SETUP’s mission is:

to educate a wide audience, providing them with the tools necessary to design this brave new world, and infuse it with human values and new-found creativity.

~ SETUP

This mission perfectly fits my personal conviction that knowledge and understanding of technology through media/algorithmic-literacy β€” not fear and repression β€” is vital in progressing into our technology-infused future! See, e.g., what I wrote about it on the neutrality of algorithms, or “algorithmic literacy.”

photo: Sebastiaan ter Burg (ter-burg.nl) for SETUP

Prior to joining their board, I have been following SETUP for a couple of years, joining some of their meetups, and giving a talk at one of their events in 2018 “leven met algoritmen.” I am very excited to start as a board member and help set up SETUP’s future!

I have emerged…

πŸ“… May 9, 2020 β€’ πŸ• 10:12 β€’ 🏷 Blog β€’ πŸ‘ 160

… as an entity in the Google Knowledge Graph!

Which is funny, because “emerging entities” were the main topic of my PhD Thesis [1]. With my co-authors I’ve published research on:

  1. Learning how to recognize “out-of-knowledge base” entities emerging on social media [2]
  2. How our collective memory is formed through “emerging entities” on Wikipedia [3], and more generally
  3. Entity retrieval and ranking [4] where Google’s so-called “Knowledge Panels” often served as examples…
Google’s AI unleashes the long tail?

(FYI: I’m not sure how I ended up there, the metadata seems to be coming from Google Scholar)

#vanitysearch

[1] [pdf] D. Graus, “Entities of interest β€” discovery in digital traces,” PhD Thesis, 2017.
[Bibtex]
@phdthesis{graus2017entities,
title={Entities of Interest β€” Discovery in Digital Traces},
author={Graus, David},
year={2017},
month={6},
school={Informatics Institute, University of Amsterdam},
isbn={9789461828002},
url={https://hdl.handle.net/11245.1/51be80bb-1cbf-4633-8ff9-e3128e990bfa}
}
[2] [pdf] [doi] D. Graus, M. Tsagkias, L. Buitinck, and M. de Rijke, “Generating pseudo-ground truth for predicting new concepts in social streams,” in Advances in information retrieval, Cham, 2014, p. 286–298.
[Bibtex]
@inproceedings{graus2014generating,
author={Graus, David and Tsagkias, Manos and Buitinck, Lars and de Rijke, Maarten},
title={Generating Pseudo-ground Truth for Predicting New Concepts in Social Streams},
booktitle={Advances in Information Retrieval},
year={2014},
publisher={Springer International Publishing},
address={Cham},
pages={286--298},
url={https://doi.org/10.1007/978-3-319-06028-6_24},
doi={10.1007/978-3-319-06028-6_24},
series = {ECIR '14}
}
[3] [pdf] [doi] D. Graus, D. Odijk, and M. de Rijke, “The birth of collective memories: analyzing emerging entities in text streams,” Journal of the association for information science and technology, vol. 69, iss. 6, pp. 773-786, 2018.
[Bibtex]
@article{graus2018birth,
author = {Graus, David and Odijk, Daan and de Rijke, Maarten},
title = {The birth of collective memories: Analyzing emerging entities in text streams},
journal = {Journal of the Association for Information Science and Technology},
year = {2018},
volume = {69},
number = {6},
pages = {773-786},
doi = {10.1002/asi.24004},
url = {https://asistdl.onlinelibrary.wiley.com/doi/abs/10.1002/asi.24004},
eprint = {https://asistdl.onlinelibrary.wiley.com/doi/pdf/10.1002/asi.24004},
}
[4] [pdf] [doi] D. Graus, M. Tsagkias, W. Weerkamp, E. Meij, and M. de Rijke, “Dynamic collective entity representations for entity ranking,” in Proceedings of the ninth acm international conference on web search and data mining, New York, NY, USA, 2016, p. 595–604.
[Bibtex]
@inproceedings{graus2016dynamic,
author = {Graus, David and Tsagkias, Manos and Weerkamp, Wouter and Meij, Edgar and de Rijke, Maarten},
title = {Dynamic Collective Entity Representations for Entity Ranking},
year = {2016},
isbn = {9781450337168},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
url = {https://doi.org/10.1145/2835776.2835819},
doi = {10.1145/2835776.2835819},
booktitle = {Proceedings of the Ninth ACM International Conference on Web Search and Data Mining},
pages = {595–604},
numpages = {10},
keywords = {fielded retrieval, entity retrieval, entity ranking, content representation},
location = {San Francisco, California, USA},
series = {WSDM '16}
}

Panel @ CPDP2020: "Algorithms and AI-driven technologies in the information society"

πŸ“… February 4, 2020 β€’ πŸ• 10:03 β€’ 🏷 Blog β€’ πŸ‘ 3

I was invited by UvA’s Information, Communication and the Data Society (ICDS) to participate in a panel at the Conference on Privacy and Data Protection, which was focused on AI.

The recording of the panel is now online, watch me telling a room full of (highly) privacy-aware (and cookie-averse) people that Cambridge Analytica nudging people to “politically activate them” with tailored information can be a “democratic good” πŸ˜….

See the recording below:

For more information, see CPDP’s page of the panel.

“Bias in Recommendations” lecture @ SIKS Course on advances in IR

πŸ“… October 8, 2019 β€’ πŸ• 20:04 β€’ 🏷 Blog β€’ πŸ‘ 95
πŸ“Έ by @arjen@idf.social

Enjoyed giving a lecture at the SIKS Course “Advances in Information Retrieval” at the Mitland Hotel in Utrecht. I also pitched DIR 2019 πŸ˜… (as evidenced by the picture above from Arjen). See my slidedeck below!

This talk is loosely based on (part of) the talk I gave at the ACM RecSys Summerschool, but I added a few slides on dealing with implicit feedback (= clicks), and popularity bias.

“RecSys in the Media Industry” Lecture at RecSys Summer School

πŸ“… September 11, 2019 β€’ πŸ• 13:39 β€’ 🏷 Blog β€’ πŸ‘ 98

With Daan Odijk I gave a lecture + hands-on workshop at the ACM Summer School on Recommender Systems in Gothenburg, Sweden on RecSys in the Media Industry: Relevance, Recency, Popularity, and Diversity.

πŸ“Έ by Alan Said

For it, we had a long (90+ min) lecture combining insights, experiences, and projects from our work at RTL and Blendle (Daan), and FD Mediagroep (me).

In addition, we did a small hands-on workshop, implementing a content-based re-ranker for WikiNews.

See our slides and notebooks here: https://github.com/graus/recsys_summer_school/

See a tweet by @alansaid, here:

Finally, see my slidedeck here:

“Improving automated segmentation of radio shows with audio embeddings”

πŸ“… July 5, 2019 β€’ πŸ• 15:41 β€’ 🏷 Blog and Research β€’ πŸ‘ 77

Update (28/1/2020): Oberon’s thesis was accepted and will be published at the IEEE 45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020), to be held May 4-8 in Barcelona, Spain! The submission is co-authored with Klaus Lux and myself.

Oberon Berlage recently successfully defended his MSc. thesis (title above!) for the Data Science Master at University of Amsterdam, and graduated with a whopping 9!

He’s the first academic offspring of our AI Team @ FD Mediagroep, and worked on BNR SMART Radio‘s segmenter. Oberon improved our text-based segmenter by adding audio embeddings, improving the F1 score with +32%!

His thesis is now online, check it out at: http://scriptiesonline.uba.uva.nl/document/673254

Dutch Interactive Awards (2x) for BNR SMART Radio!

πŸ“… June 28, 2019 β€’ πŸ• 10:17 β€’ 🏷 Blog β€’ πŸ‘ 54

Yesterday BNR SMART Radio won two Dutch Interactive Awards (DIA 2019) Awards! We were nominated together with Elastique, the folks who designed the UX/interface of our BNR SMART Radio app which you can download here (or in your app stores, both Android/iOS)!

πŸ₯‡ Content

We won Gold in the category “Content”. Why, you may ask?

β€œFinally a reason to listen to Radio in general, and BNR in particular. Because advertising is absent and topics are tailored to the personal taste of the listener, who can search in a targeted way and express appreciation for a broadcast through a thumbsup, they reach new target audiences online.”

jury report (source: emerce)

πŸ₯ˆ Disruptor

Next to this, we also won the Silver DIA2019 Award in the category “Disruptor”!

“In a world where radio has never deviated from the linear model, the jury finds this very disruptive. SMART Radio provides a personal radio experience and possibly a new revenue model in the long term. The jury members find the potential of the underlying strategy even more impressive. What if BNR joins forces with FD Mediagroep and combines the work of both parties in this? The jury sees all kinds of opportunities to fit in new forms of advertising (or to omit them). The collaboration between the agency and the media company has resulted in something that has never been seen before.”

jury report (source: emerce)

Current standing

For who has trouble keeping track πŸ˜…, these are the awards we won with SMART Radio:

πŸ† Marconi Online Award
πŸ† AMMA Award (with SMART Journalism)
πŸ† DIA2019 Award, category: Content
πŸ₯ˆ DIA2019 Award, category: Disruptor

We won the AMMA Media-innovation award for our News Personalization projects at FD Mediagroep!

πŸ“… May 17, 2019 β€’ πŸ• 13:50 β€’ 🏷 Blog β€’ πŸ‘ 46

After winning the Marconi award for radio innovation, yesterday we picked up the AMMA Award for Media Innovation for our news personalization efforts at FD Mediagroep (both SMART Journalism for Het FD and SMART Radio for BNR Nieuwsradio)!

It’s really great to see that our current investment into AI and innovation seems to resonate with the outside world πŸ€–. And I am really happy to be with a company that sees this development as such an important direction that we are able to work with a big and talented team of data scientists, interns, engineers, and product folks πŸ€“