A key feature of a successful online employment marketplace is the ability to match people with the most relevant job opportunities. Our business uses data about candidates, jobs and hirers to perform this task. One valuable data point in this process is the job title, which we find in semi-structured form in a candidate’s employment history and in a hirer’s job advertisement. For retrieval, recommendation and analysis purposes, it is essential to normalise the various forms in which users enter a given job title on-site to an authoritative form, and to understand the relationships between that job title and others. Our team have improved our job title normalisation process by building NERD (Named Entity Recognition and Discovery), a suite of web services that draw on data represented using Linked Data standards and housed in ontology management software. This role title data is then continuously improved by automated processes that gather insights from our marketplace. This presentation will outline the details of these innovations, the challenges faced along the way, and the key ways we measure success.
This presentation will cover problems and opportunities we faced, solutions implemented, and lessons learned.
Previously, our job title normalisation processes had a number of limitations, including:
Our new normalisation processes include the following developments:
As we developed our new processes, we found many opportunities to learn and adapt our solutions. Understanding and meeting the needs of the people within the business who consume our data and services is a constant priority. At the beginning of our journey these needs were quite straightforward, but as the success of our work became known across the business, we began to encounter more normalisation-related requirements, some of which directly conflicted with others. As I will discuss, it became clear to us that successful job title normalisation in our marketplace is only possible through a combination of improvements to our data, our web services, and the business processes that bring them together.