The current baseline that appears in the leaderboard and achieves
26857.38 points is purely content-based. For a given user u, recommendations are
determined as follows:
- Only consider those items for which `active_during_test = 1`
- `jobrole-title-based` = get top 100 items for which `users.jobroles` and `items.title` overlap and rank by the following score: number of overlapping IDs * 3
- `jobrole-tags-based` = get top 100 items for which `users.jobroles` and `items.tags` overlap and rank by the following score: number of overlapping IDs * 2
- `discipline-and-region-based` = get 100 random items for which `users.discipline_id == items.discipline_id AND users.region == items.region`. Score of each item = 2
- `industry-and-region-based` = get 100 random items for which `users.industry_id == items.industry_id AND users.region == items.region`. Score of each item = 1
- Aggregate scores of `jobrole-title-based`, `jobrole-tags-based`, `discipline-and-region-based` and `industry-and-region-based` and ensure that `users.career_level == items.career_level` (and if `users.career_level` is `NULL` or `0` then simply assume that the career level is 3).