Dear Colleagues
my problem seems to be a bit frustrating for me so Im here to ask for suggestions. I have a training set of pairs. each pair is labeled (1,-1). the features value is calculated based on a distance function (edit distance). each pair has a feature value equal to the distance between a pair. the distance is calculated based on sequences of deletion, insertion and substitution. each operation (deletion, insertion and substitution) can have a score value so based on that the distance is calculated. for example for a pair the distance is 5 which is a summation of the score of each operation.
now my problem is how to learn/estimate the best score of each operation without assigning it manually. which algorithm could be more suitable? note that each operation usually happens more than one time in each pair. Thanks in advance for your reply.