10 Jul 2012 08:52
MERT tunning
Hi all, I'm running experiments on subtitle corpora. My training corpus is OpenSub and I have dev and test sets from another source. Before parameter tunning BLEU score was 21%. After running mert BLEU score improves to 24%. Translation model weights after mert were: [weight-t] 3.17108e-06 7.03552e-06 4.10145e-05 7.17818e-07 -0.999893 Small weights for 4 translation model components and large negative score for phrase penalty. But phrase penalty in phrase table is always exp(1). Does it mean that my dev/test set do not have much in common with OpenSub corpus? I trained lm on OpenSub as well. Dev set perplexity was 228 an test set perplexity was 217 (OOV=1%). After mert the weight for language model was 0.01. What do you suggest? Mirjam -- -- ====================================== Mirjam Sepesy Maučec(Continue reading)
RSS Feed