I have an extensive list of payments on contracts from the last 6 years, and need to train a model to predict future payments.
In order to do so, I need to answer two questions:
- when will a payment concerning a specific contract take place?
- how much will each payment be?
Through a regression tree I managed to train a model which is quite reliable in answering question 2, but I am now stuck at question 1.
Is this to be treated as a case of PU learning? In fact I have a broad sample of positive cases, which I need to match with another sample of unlabeled mixed cases to build a classifier.
Would you have any suggestion on how to proceed? I am also ready to reconsider the general strategy, if you have concrete cases to bring to my attention.
Many thanks for your help.