# opusTCv20210807+bt_transformer-big_2022-03-07.zip

* dataset: opusTCv20210807+bt
* model: transformer-big
* source language(s): ces dsb hsb pol
* target language(s): bel bel_Latn orv_Cyrl rus ukr
* raw source language(s): ces dsb hsb pol
* raw target language(s): bel orv rus ukr
* model: transformer-big
* pre-processing: normalization + SentencePiece (spm32k,spm32k)
* a sentence initial language token is required in the form of `>>id<<` (id = valid target language ID)
* valid language labels: 
* download: [opusTCv20210807+bt_transformer-big_2022-03-07.zip](https://object.pouta.csc.fi/Tatoeba-MT-models/zlw-zle/opusTCv20210807+bt_transformer-big_2022-03-07.zip)
* test set translations: [opusTCv20210807+bt_transformer-big_2022-03-07.test.txt](https://object.pouta.csc.fi/Tatoeba-MT-models/zlw-zle/opusTCv20210807+bt_transformer-big_2022-03-07.test.txt)
* test set scores: [opusTCv20210807+bt_transformer-big_2022-03-07.eval.txt](https://object.pouta.csc.fi/Tatoeba-MT-models/zlw-zle/opusTCv20210807+bt_transformer-big_2022-03-07.eval.txt)

## Benchmarks

| testset | BLEU  | chr-F | #sent | #words | BP |
|---------|-------|-------|-------|--------|----|
| newstest2012.ces-rus 	| 20.6 	| 0.49166 	| 3003 	| 64830 	| 0.997 |
| newstest2013.ces-rus 	| 26.8 	| 0.53763 	| 3000 	| 58560 	| 0.973 |
| Tatoeba-test-v2021-08-07.ces-bel 	| 33.8 	| 0.51349 	| 31 	| 181 	| 0.966 |
| Tatoeba-test-v2021-08-07.ces-rus 	| 55.0 	| 0.72626 	| 2934 	| 17743 	| 0.985 |
| Tatoeba-test-v2021-08-07.ces-ukr 	| 51.0 	| 0.68280 	| 1787 	| 8854 	| 0.997 |
| Tatoeba-test-v2021-08-07.dsb-rus 	| 26.9 	| 0.48948 	| 24 	| 124 	| 1.000 |
| Tatoeba-test-v2021-08-07.dsb-ukr 	| 8.8 	| 0.34208 	| 3 	| 13 	| 1.000 |
| Tatoeba-test-v2021-08-07.hsb-rus 	| 19.4 	| 0.40929 	| 38 	| 281 	| 0.859 |
| Tatoeba-test-v2021-08-07.hsb-ukr 	| 3.5 	| 0.16605 	| 8 	| 126 	| 0.585 |
| Tatoeba-test-v2021-08-07.multi-multi 	| 52.8 	| 0.70695 	| 10000 	| 58091 	| 0.987 |
| Tatoeba-test-v2021-08-07.pol-bel 	| 28.7 	| 0.51885 	| 287 	| 1730 	| 1.000 |
| Tatoeba-test-v2021-08-07.pol-bel_Latn 	| 3.8 	| 0.847 	| 2 	| 16 	| 0.794 |
| Tatoeba-test-v2021-08-07.pol-orv 	| 4.5 	| 0.24322 	| 7 	| 31 	| 1.000 |
| Tatoeba-test-v2021-08-07.pol-rus 	| 54.5 	| 0.72518 	| 3543 	| 21971 	| 0.992 |
| Tatoeba-test-v2021-08-07.pol-ukr 	| 48.1 	| 0.67885 	| 2519 	| 13493 	| 0.998 |
