On what basis did you remove some candidates before feature engineering?

Are you asking how candidates were selected from stage 1?

Yes, stage 1. Kazuki mentioned that you ensembled the candidate scores and then eliminated candidates. How was that elimination done? Did you generate a score and pick the top similar products based on it?

Yes. Stage 1 produced over 1000 candidates, and we only wanted about 100 for feature engineering in stage 2, so we had to remove some. Candidates were selected by their scores from the generation process. Because we had multiple candidate-generation models, we designed heuristic rules to select the top candidates from each model.
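
To make the idea concrete, here is a minimal sketch of one way such a rule could look. It is not our actual rule set: the function name `select_candidates`, the per-model quotas, and the tuple layout are all assumptions made up for illustration. It takes the top-k candidates by score from each generation model, skips duplicates, and stops once the overall budget is reached.

```python
from collections import defaultdict


def select_candidates(scored, quotas, limit=100):
    """Pick top-scoring candidates per generation model (illustrative only).

    scored: iterable of (model_name, candidate_id, score) tuples.
    quotas: hypothetical per-model budget, e.g. {"model_a": 60, "model_b": 40}.
    limit:  overall cap on the number of candidates kept.
    """
    # Group (score, candidate) pairs by the model that generated them.
    by_model = defaultdict(list)
    for model, cand, score in scored:
        by_model[model].append((score, cand))

    selected = []
    seen = set()
    for model, k in quotas.items():
        # Highest score first within each model.
        ranked = sorted(by_model.get(model, []), key=lambda sc: sc[0], reverse=True)
        taken = 0
        for _score, cand in ranked:
            if taken == k or len(selected) == limit:
                break
            if cand in seen:
                # Already picked via another model; do not count it twice.
                continue
            seen.add(cand)
            selected.append(cand)
            taken += 1
    return selected
```

Scores are only compared within a model here, so they do not need to be on a common scale across models. How large each model's quota should be is exactly the kind of thing a heuristic would have to decide.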

