diff --git a/enhanced_direct_s2st_units/audios/en-es/metadata.txt b/enhanced_direct_s2st_units/audios/en-es/metadata.txt
new file mode 100644
index 0000000..d73be2d
--- /dev/null
+++ b/enhanced_direct_s2st_units/audios/en-es/metadata.txt
@@ -0,0 +1,17 @@
+TARGET
+EPST
+1149 /large_experiments/ust/annl/datasets/st/europarl-st/en/es/test/flac_16k/en.20110324.5.4-041-000_8.flac this should also be an important part of our approach to the twenty twelve budget
+476 /large_experiments/ust/annl/datasets/st/europarl-st/en/es/test/flac_16k/en.20090914.23.1-076_1.flac his family who are my constituents are convinced of his innocence
+651 /large_experiments/ust/annl/datasets/st/europarl-st/en/es/test/flac_16k/en.20100210.25.3-177_3.flac of the directive on all taxes including social security contributions the automatic exchange of information and improved cooperation between the member states in matters of taxation
+890 /large_experiments/ust/annl/datasets/st/europarl-st/en/es/test/flac_16k/en.20100907.33.2-567_1.flac information encourages citizens interest in public matters and their participation
+37 /large_experiments/ust/annl/datasets/st/europarl-st/en/es/test/flac_16k/en.20080924.32.3-289_4.flac we want to see energy poverty as a part of this debate
+923 /large_experiments/ust/annl/datasets/st/europarl-st/en/es/test/flac_16k/en.20101018.14.1-115_4.flac in my view one of the most important elements is the follow up of legislative initiative requests from parliament
+970 /large_experiments/ust/annl/datasets/st/europarl-st/en/es/test/flac_16k/en.20101123.37.2-432_15.flac we must find an open and constructive procedure on the next financial framework
+45 /large_experiments/ust/annl/datasets/st/europarl-st/en/es/test/flac_16k/en.20080924.33.3-319_2.flac i agree that we should act and react but we should not overdo it because we need a balanced approach
+651 /large_experiments/ust/annl/datasets/st/europarl-st/en/es/test/flac_16k/en.20100210.25.3-177_3.flac of the directive on all taxes including social security contributions the automatic exchange of information and improved cooperation between the member states in matters of taxation
+
+
+MUSTC
+632 /large_experiments/ust/annl/datasets/st/must-c/must-c/en-es/tst-COMMON/flac_16k/ted_1144_39.flac and apparently it was quite popular
+519 /large_experiments/ust/annl/datasets/st/must-c/must-c/en-es/tst-COMMON/flac_16k/ted_1137_56.flac we can actually do the same thing with much less energy
+1015 /large_experiments/ust/annl/datasets/st/must-c/must-c/en-es/tst-COMMON/flac_16k/ted_1171_1.flac through my work im trying to articulate that humans are not separate from nature and that everything is interconnected
\ No newline at end of file
diff --git a/enhanced_direct_s2st_units/audios/en-es/reference.txt b/enhanced_direct_s2st_units/audios/en-es/reference.txt
new file mode 100644
index 0000000..e2a0f73
--- /dev/null
+++ b/enhanced_direct_s2st_units/audios/en-es/reference.txt
@@ -0,0 +1,12 @@
+1149_epst esto también debería ser una parte importante de nuestro enfoque del presupuesto dos mil doce
+476_epst su familia que son mis electores está convencida de su inocencia
+651_epst de la directiva a todos los impuestos incluidas las contribuciones a la seguridad social el intercambio automático de información y la mejora de la cooperación fiscal entre los estados miembros
+890_epst la información fomenta el interés de los ciudadanos por los asuntos públicos y su participación
+632_mustc y al parecer era muy popular
+519_mustc podemos hacer lo mismo con mucha menos energía
+37_epst queremos ver la pobreza energética como parte de este debate
+923_epst en mi opinión uno de los elementos más importantes es el seguimiento de las solicitudes de iniciativa legislativa del parlamento
+970_epst debemos encontrar un procedimiento abierto y constructivo en el próximo marco financiero
+45_epst estoy de acuerdo en que deberíamos actuar y reaccionar pero no deberíamos excedernos porque necesitamos un enfoque equilibrado
+651_epst de la directiva a todos los impuestos incluidas las contribuciones a la seguridad social el intercambio automático de información y la mejora de la cooperación fiscal entre los estados miembros
+1015_mustc a través de mi trabajo estoy tratando de expresar que los humanos no están separados de la naturaleza y que todo está interconectado
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/asr.txt b/enhanced_direct_s2st_units/audios/en-es/set1/asr.txt
new file mode 100644
index 0000000..764b5a7
--- /dev/null
+++ b/enhanced_direct_s2st_units/audios/en-es/set1/asr.txt
@@ -0,0 +1,47 @@
+LND
+1149_epst esto también debería ser una parte importante de nuestro enfoque al presupuesto dos mil doce
+476_epst su familia que son mis electores está convencida de su inocencia
+651_epst de la directiva a todos los impuestos incluidas las contribuciones a la seguridad social el intercambio automático de información y la mejor cooperación entre los estados miembros en las cuestiones de impuestos
+890_epst la información fomenta el interés de los ciudadanos en asuntos públicos y su participación
+37_epst queremos ver la pobreza energética como parte de este deate
+923_epst en mi opinión uno de los elementos más importantes es el seguimiento de las peticiones de la iniciativa legislativa por parte del pagamento
+970_epst debemos encontrar un procedimiento abierto y constructivo sobre el próximo marco financiero
+45_epst estoy de acuerdo en que debemos actuar y reaccionar pero no debemos hacerlo porque necesitamos un enfoque equilibrado
+632_mustc y al parecer era muy popular
+519_mustc podemos hacer lo mismo con mucha menos energía
+1015_mustc a través de mi trabajo trato de articular que los humanos no somos separados de la naturaleza y que todo está interconectado
+
+
+
+MT
+1149_epst también debería ser una parte importante de nuestro enfoque al presupuesto dos mil doce
+476_epst su familia que son mí circunscripciones están convencidas de estos inocentes
+651_epst de la directiva a todos los impuestos impluyendo las contribuciones de seguridad social el intercambio automático de la información y mejorar la cooperación entre los estados miembros y las cuestiones de impuestos
+890_epst la información y el interés de los ciudadanos alientan los intereses de las cuestiones públicas y su participación
+632_mustc y un líder era bastante popular
+519_mustc podemos hacer lo mismo que mucha menos energía
+
+
+
+
+
+
+S2T_TTS
+1149_epst esto también debería ser una parte importante de nuestro enfoque al presupuesto de dos mildos mil dos mil doce
+476_epst su familia que son mis electores están convencidos de su inocencia
+651_epst de la directiva para todos los impuestos incluidos las contribuciones de seguridad social el intercambio automático de información y la mejor cooperación entre los estados miembros en la cuestión de la fiscalidad
+890_epst la información alienta el interés de los ciudadanos en asuntos públicos y en su participación
+632_mustc y aparentemente era bastante popular
+519_mustc en realidad podemos hacerlo mismo con menos energía
+
+
+
+
+LND
+
+
+
+
+
+
+
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/reference.txt b/enhanced_direct_s2st_units/audios/en-es/set1/reference.txt
new file mode 100644
index 0000000..e42c024
--- /dev/null
+++ b/enhanced_direct_s2st_units/audios/en-es/set1/reference.txt
@@ -0,0 +1,6 @@
+1149_epst esto también debería ser una parte importante de nuestro enfoque del presupuesto dos mil doce
+476_epst su familia que son mis electores está convencida de su inocencia
+651_epst de la directiva a todos los impuestos incluidas las contribuciones a la seguridad social el intercambio automático de información y la mejora de la cooperación fiscal entre los estados miembros
+890_epst la información fomenta el interés de los ciudadanos por los asuntos públicos y su participación
+632_mustc y al parecer era muy popular
+519_mustc podemos hacer lo mismo con mucha menos energía
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/s2t_tts/1149_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set1/s2t_tts/1149_epst.wav
new file mode 100644
index 0000000..770b9a3
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/s2t_tts/1149_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/s2t_tts/476_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set1/s2t_tts/476_epst.wav
new file mode 100644
index 0000000..42d3b3f
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/s2t_tts/476_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/s2t_tts/519_mustc.wav b/enhanced_direct_s2st_units/audios/en-es/set1/s2t_tts/519_mustc.wav
new file mode 100644
index 0000000..c2f5612
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/s2t_tts/519_mustc.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/s2t_tts/632_mustc.wav b/enhanced_direct_s2st_units/audios/en-es/set1/s2t_tts/632_mustc.wav
new file mode 100644
index 0000000..b7e7d55
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/s2t_tts/632_mustc.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/s2t_tts/651_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set1/s2t_tts/651_epst.wav
new file mode 100644
index 0000000..8a8359c
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/s2t_tts/651_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/s2t_tts/890_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set1/s2t_tts/890_epst.wav
new file mode 100644
index 0000000..b11937d
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/s2t_tts/890_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_lnd/1149_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_lnd/1149_epst.wav
new file mode 100644
index 0000000..8d8734c
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_lnd/1149_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_lnd/476_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_lnd/476_epst.wav
new file mode 100644
index 0000000..89f0651
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_lnd/476_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_lnd/519_mustc.wav b/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_lnd/519_mustc.wav
new file mode 100644
index 0000000..8d82533
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_lnd/519_mustc.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_lnd/632_mustc.wav b/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_lnd/632_mustc.wav
new file mode 100644
index 0000000..251276e
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_lnd/632_mustc.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_lnd/651_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_lnd/651_epst.wav
new file mode 100644
index 0000000..465b5c8
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_lnd/651_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_lnd/890_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_lnd/890_epst.wav
new file mode 100644
index 0000000..ad46a7c
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_lnd/890_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_mt/1149_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_mt/1149_epst.wav
new file mode 100644
index 0000000..11d7641
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_mt/1149_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_mt/476_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_mt/476_epst.wav
new file mode 100644
index 0000000..069c912
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_mt/476_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_mt/519_mustc.wav b/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_mt/519_mustc.wav
new file mode 100644
index 0000000..3f2b21c
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_mt/519_mustc.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_mt/632_mustc.wav b/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_mt/632_mustc.wav
new file mode 100644
index 0000000..a889ff4
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_mt/632_mustc.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_mt/651_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_mt/651_epst.wav
new file mode 100644
index 0000000..2c542db
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_mt/651_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_mt/890_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_mt/890_epst.wav
new file mode 100644
index 0000000..889a3f9
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/s2ut_mt/890_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/source/1149_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set1/source/1149_epst.wav
new file mode 100644
index 0000000..c79bff9
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/source/1149_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/source/476_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set1/source/476_epst.wav
new file mode 100644
index 0000000..be1e9e9
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/source/476_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/source/519_mustc.wav b/enhanced_direct_s2st_units/audios/en-es/set1/source/519_mustc.wav
new file mode 100644
index 0000000..ff681b6
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/source/519_mustc.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/source/632_mustc.wav b/enhanced_direct_s2st_units/audios/en-es/set1/source/632_mustc.wav
new file mode 100644
index 0000000..02f5a08
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/source/632_mustc.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/source/651_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set1/source/651_epst.wav
new file mode 100644
index 0000000..2c90daf
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/source/651_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/source/890_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set1/source/890_epst.wav
new file mode 100644
index 0000000..cc93099
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/source/890_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/target/1149_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set1/target/1149_epst.wav
new file mode 100644
index 0000000..f886903
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/target/1149_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/target/476_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set1/target/476_epst.wav
new file mode 100644
index 0000000..31334bf
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/target/476_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/target/519_mustc.wav b/enhanced_direct_s2st_units/audios/en-es/set1/target/519_mustc.wav
new file mode 100644
index 0000000..94d0a84
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/target/519_mustc.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/target/632_mustc.wav b/enhanced_direct_s2st_units/audios/en-es/set1/target/632_mustc.wav
new file mode 100644
index 0000000..333eec1
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/target/632_mustc.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/target/651_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set1/target/651_epst.wav
new file mode 100644
index 0000000..d1f8eec
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/target/651_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set1/target/890_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set1/target/890_epst.wav
new file mode 100644
index 0000000..6825387
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set1/target/890_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/asr.txt b/enhanced_direct_s2st_units/audios/en-es/set2/asr.txt
new file mode 100644
index 0000000..994a4da
--- /dev/null
+++ b/enhanced_direct_s2st_units/audios/en-es/set2/asr.txt
@@ -0,0 +1,31 @@
+LND
+37_epst queremos ver la pobreza energética como parte de este deate
+923_epst en mi opinión uno de los elementos más importantes es el seguimiento de las peticiones de la iniciativa legislativa por parte del pagamento
+970_epst debemos encontrar un procedimiento abierto y constructivo sobre el próximo marco financiero
+45_epst estoy de acuerdo en que debemos actuar y reaccionar pero no debemos hacerlo porque necesitamos un enfoque equilibrado
+651_epst de la directiva a todos los impuestos incluidas las contribuciones a la seguridad social el intercambio automático de información y la mejor cooperación entre los estados miembros en las cuestiones de impuestos
+1015_mustc a través de mi trabajo trato de articular que los humanos no somos separados de la naturaleza y que todo está interconectado
+
+
+
+LR50
+37_epst queremos ver la pobreza energética como parte de este date
+923_epst en mi opinión uno de los elementos más importantes es el seguimiento de las emiendas de iniciativas legislativas de ley
+970_epst debemos encontrar un procedimiento abierto y constructivo en el sistema financiero financiero financiero financiero
+45_epst estoy considerando que actuamos y reaccionamos pero no deberíamos hacerlo porque necesitamos un enfoque realmente valioso
+651_epst la directiva sobre el impuesto de todos los contribuyentes inpluyendo las contribuciones sociales la introducción automática y mejorada de los estados miembros y mejorar la cooperación entre los estados miembros
+1015_mustc a través de mi trabajo estoy tratando de articular que los humanos no están separados de la naturaleza y que todo está interconectado
+
+
+
+
+LND-ASR
+37_epst queremos ver la pobreza energética como parte de este deate
+923_epst en mi opinión uno de los elementos más importantes es el seguimiento de las solicitudes de iniciativa legislativa del pagamento
+970_epst debemos encontrar un procedimiento abierto y constructivo en el próximo marco financiero
+45_epst estoy de acuerdo en que deberíamos actuar y reaccionar pero no deberíamos exagerarlo porque necesitamos un enfoque equilibrado
+651_epst de la directiva a todos los impuestos incluidas las contribuciones a la seguridad social el intercambio automático de información y la mejor cooperación entre los estados miembros en materia de impuestos
+1015_mustc a través de mi trabajo trato de articular que los humanos no estamos separados de la naturaleza y que todo está interconectado
+
+
+
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/reference.txt b/enhanced_direct_s2st_units/audios/en-es/set2/reference.txt
new file mode 100644
index 0000000..5c6c423
--- /dev/null
+++ b/enhanced_direct_s2st_units/audios/en-es/set2/reference.txt
@@ -0,0 +1,6 @@
+37_epst queremos ver la pobreza energética como parte de este debate
+923_epst en mi opinión uno de los elementos más importantes es el seguimiento de las solicitudes de iniciativa legislativa del parlamento
+970_epst debemos encontrar un procedimiento abierto y constructivo en el próximo marco financiero
+45_epst estoy de acuerdo en que deberíamos actuar y reaccionar pero no deberíamos excedernos porque necesitamos un enfoque equilibrado
+651_epst de la directiva a todos los impuestos incluidas las contribuciones a la seguridad social el intercambio automático de información y la mejora de la cooperación fiscal entre los estados miembros
+1015_mustc a través de mi trabajo estoy tratando de expresar que los humanos no están separados de la naturaleza y que todo está interconectado
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd/1015_mustc.wav b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd/1015_mustc.wav
new file mode 100644
index 0000000..6157254
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd/1015_mustc.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd/37_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd/37_epst.wav
new file mode 100644
index 0000000..2175457
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd/37_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd/45_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd/45_epst.wav
new file mode 100644
index 0000000..1708801
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd/45_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd/651_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd/651_epst.wav
new file mode 100644
index 0000000..465b5c8
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd/651_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd/923_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd/923_epst.wav
new file mode 100644
index 0000000..adb0ba8
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd/923_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd/970_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd/970_epst.wav
new file mode 100644
index 0000000..ae3de4e
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd/970_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd_w_asr/1015_mustc.wav b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd_w_asr/1015_mustc.wav
new file mode 100644
index 0000000..006e1bc
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd_w_asr/1015_mustc.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd_w_asr/37_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd_w_asr/37_epst.wav
new file mode 100644
index 0000000..ab9d0ac
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd_w_asr/37_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd_w_asr/45_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd_w_asr/45_epst.wav
new file mode 100644
index 0000000..8553284
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd_w_asr/45_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd_w_asr/651_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd_w_asr/651_epst.wav
new file mode 100644
index 0000000..5a517f5
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd_w_asr/651_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd_w_asr/923_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd_w_asr/923_epst.wav
new file mode 100644
index 0000000..cc9c277
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd_w_asr/923_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd_w_asr/970_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd_w_asr/970_epst.wav
new file mode 100644
index 0000000..7074169
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lnd_w_asr/970_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lr50/1015_mustc.wav b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lr50/1015_mustc.wav
new file mode 100644
index 0000000..d8c7c28
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lr50/1015_mustc.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lr50/37_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lr50/37_epst.wav
new file mode 100644
index 0000000..59dd9c3
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lr50/37_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lr50/45_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lr50/45_epst.wav
new file mode 100644
index 0000000..f5bd251
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lr50/45_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lr50/651_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lr50/651_epst.wav
new file mode 100644
index 0000000..6901a47
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lr50/651_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lr50/923_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lr50/923_epst.wav
new file mode 100644
index 0000000..2a224c8
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lr50/923_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lr50/970_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lr50/970_epst.wav
new file mode 100644
index 0000000..332b71c
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/s2ut_lr50/970_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/source/1015_mustc.wav b/enhanced_direct_s2st_units/audios/en-es/set2/source/1015_mustc.wav
new file mode 100644
index 0000000..dd3de56
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/source/1015_mustc.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/source/37_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set2/source/37_epst.wav
new file mode 100644
index 0000000..5cc39ae
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/source/37_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/source/45_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set2/source/45_epst.wav
new file mode 100644
index 0000000..d21b2fc
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/source/45_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/source/651_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set2/source/651_epst.wav
new file mode 100644
index 0000000..2c90daf
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/source/651_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/source/923_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set2/source/923_epst.wav
new file mode 100644
index 0000000..98b461b
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/source/923_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/source/970_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set2/source/970_epst.wav
new file mode 100644
index 0000000..2e64c8f
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/source/970_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/target/1015_mustc.wav b/enhanced_direct_s2st_units/audios/en-es/set2/target/1015_mustc.wav
new file mode 100644
index 0000000..20e504f
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/target/1015_mustc.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/target/37_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set2/target/37_epst.wav
new file mode 100644
index 0000000..baf2290
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/target/37_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/target/45_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set2/target/45_epst.wav
new file mode 100644
index 0000000..7d5f7cf
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/target/45_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/target/651_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set2/target/651_epst.wav
new file mode 100644
index 0000000..d1f8eec
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/target/651_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/target/923_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set2/target/923_epst.wav
new file mode 100644
index 0000000..4bbe0d0
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/target/923_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/en-es/set2/target/970_epst.wav b/enhanced_direct_s2st_units/audios/en-es/set2/target/970_epst.wav
new file mode 100644
index 0000000..b2e0d54
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/en-es/set2/target/970_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/metadata.txt b/enhanced_direct_s2st_units/audios/es-en/metadata.txt
new file mode 100644
index 0000000..1cc3321
--- /dev/null
+++ b/enhanced_direct_s2st_units/audios/es-en/metadata.txt
@@ -0,0 +1,24 @@
+
+EPST
+1507 otro aspecto más institucional es el equilibrio de fuerzas entre el parlamento y el consejo /large_experiments/ust/annl/datasets/st/europarl-st/es/en/test/flac_16k/en.20120328.21.3-210-000_4.flac
+1700 además su capacidad de regeneración es muy limitada /large_experiments/ust/annl/datasets/st/europarl-st/es/en/test/flac_16k/en.20120911.4.2-035-000_2.flac
+1313 señor presidente señorías nos encontramos ante un tema extremadamente sensible /large_experiments/ust/annl/datasets/st/europarl-st/es/en/test/flac_16k/en.20111114.20.1-233-000_0.flac
+289 desde un punto de vista presupuestario no parece adecuada la propuesta de financiación procedente de la comisión de desarrollo ya que este dinero no existe al /large_experiments/ust/annl/datasets/st/europarl-st/es/en/test/flac_16k/en.20090218.24.3-251_2.flac
+
+
+
+MTEDX
+581 pero por qué te pusiste en esta situación /large_experiments/ust/annl/datasets/st/mtedx/es-en/test/flac_16k/a0G1K1A269Y_0024.flac
+591 en dos mil dieciseis hubo tres mil quinientos sesenta y nueve suicidios en españa según el instituto nacional de estadística /large_experiments/ust/annl/datasets/st/mtedx/es-en/test/flac_16k/a0G1K1A269Y_0034.flac
+100 empezó en asia y de allí pasó a venecia el puerto más cosmopolita de su tiempo /large_experiments/ust/annl/datasets/st/mtedx/es-en/test/flac_16k/9VA26uZPqYA_0008.flac
+592 cuando casi diez personas al día una cada dos horas y media /large_experiments/ust/annl/datasets/st/mtedx/es-en/test/flac_16k/a0G1K1A269Y_0035.flac
+
+
+CV
+11375 /large_experiments/ust/annl/datasets/st/covost2/es/flac_16k/common_voice_es_19717026.flac autobuses adicionales normalmente proporcionados por go south coast van desde bristol al festival
+12411 /large_experiments/ust/annl/datasets/st/covost2/es/flac_16k/common_voice_es_19607691.flac así el principito decidió abandonar su planeta y explorar el resto del universo
+2692 /large_experiments/ust/annl/datasets/st/covost2/es/flac_16k/common_voice_es_19980264.flac encontró un país con dos gobiernos en la capital maximiliano era el emperador
+9756 /large_experiments/ust/annl/datasets/st/covost2/es/flac_16k/common_voice_es_19599961.flac cada uno de ellos es un derecho exclusivo sujeto a ciertas limitaciones y excepciones
+12478 /large_experiments/ust/annl/datasets/st/covost2/es/flac_16k/common_voice_es_19970262.flac esta experiencia representa un paso trascendental en la historia espacial del país
+4109 /large_experiments/ust/annl/datasets/st/covost2/es/flac_16k/common_voice_es_19969961.flac desde la perspectiva del balance físico químico y biológico está en una posición clave
+
diff --git a/enhanced_direct_s2st_units/audios/es-en/reference.txt b/enhanced_direct_s2st_units/audios/es-en/reference.txt
new file mode 100644
index 0000000..c28287a
--- /dev/null
+++ b/enhanced_direct_s2st_units/audios/es-en/reference.txt
@@ -0,0 +1,17 @@
+11375_cv ADDITIONAL BUSES USUALLY PROVIDED BY GO SOUTH COAST GO FROM BRISTOL TO THE FESTIVAL
+
+12411_cv THIS WAY THE LITTLE PRINCE DECIDED TO LEAVE HIS PLANET AND EXPLORE THE REST OF THE UNIVERSE
+2692_cv HE FOUND A COUNTRY WITH TWO GOVERNMENTS IN THE CAPITAL MAXIMILIAN WAS THE EMPEROR
+1313_epst MR PRESIDENT LADIES AND GENTLEMEN WE ARE DEALING WITH AN EXTREMELY SENSITIVE ISSUE
+1507_epst ANOTHER MORE INSTITUTIONAL ASPECT IS THE BALANCE OF POWER BETWEEN PARLIAMENT AND THE COUNCIL
+1700_epst MOREOVER THEIR CAPACITY FOR REGENERATION IS VERY LIMITED
+581_mtedx BUT WHY DID YOU PUT YOURSELF IN THIS SITUATION
+591_mtedx IN TWENTY SIXTEEN THERE WERE THREE THOUSAND FIVE HUNDRED SIXTY NINE SUICIDES IN SPAIN ACCORDING TO THE NATIONAL INSTITUTE OF STATISTICS
+9756_cv EACH ONE OF THEM IS AN EXCLUSIVE RIGHT SUBJECT TO CERTAIN LIMITATIONS AND EXCEPTIONS
+12478_cv THIS EXPERIENCE REPRESENTS A TRANSCENDENTAL STEP IN THE SPATIAL HISTORY OF THE COUNTRY
+4109_cv FROM THE PERSPECTIVE OF PHYSICAL CHEMICAL AND BIOLOGICAL BALANCE IT IS IN A KEY POSITION
+289_epst IN ANY CASE GIVEN THAT THE FINANCING OF THIS NEW COOPERATION INSTRUMENT MUST BE COMPATIBLE WITH THE TWO THOUSAND SEVEN TWENTY THIRTEEN FINANCIAL FRAMEWORK IT IS WORTH
+1528_epst LADIES AND GENTLEMEN THE SITUATION IN THE MARKETS DOES NOT REFLECT SPAIN'S STRENGTH
+100_mtedx IT STARTED IN ASIA AND FROM THERE IT WENT TO VENICE THE MOST COSMOPOLITAN PORT OF ITS TIME
+592_mtedx ALMOST TEN PEOPLE A DAY ONE EVERY TWO AND A HALF HOURS
+
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/asr.txt b/enhanced_direct_s2st_units/audios/es-en/set1/asr.txt
new file mode 100644
index 0000000..8570bbb
--- /dev/null
+++ b/enhanced_direct_s2st_units/audios/es-en/set1/asr.txt
@@ -0,0 +1,51 @@
+
+LND
+11375_cv ADDITIONAL BUSES USUALLY PROVIDED BY GO SOUTH COAST GO FROM BRISTOL TO THE FESTIVAL
+2692_cv HE FOUND A COUNTRY WITH TWO GOVERNMENTS IN THE CAPITAL MAXIMILIAN WAS THE EMPEROR
+9756_cv EACH OF THEM IS AN EXCLUSIVE RIGHT SUBJECT TO CERTAIN LIMITATIONS AND EXCEPTIONS
+12478_cv THIS EXPERIENCE REPRESENTS A TRANSCENDENT STEP IN THE SPACE HISTORY OF THE COUNTRY
+4109_cv FROM A PHYSICAL CHEMICAL AND BIOLOGICAL BALANCE HE IS IN A KEY POSITION
+
+1507_epst ANOTHER MORE INSTITUTIONAL ASPECT IS THE BALANCE OF FORCES BETWEEN PARLIAMENT AND THE COUNCIL
+1700_epst MOREOVER ITS CAPACITY FOR REGENERATION IS VERY LIMITED
+289_epst IN ANY CASE GIVEN THAT THE FUNDING OF THIS NEW CORPORATION INSTRUMENT MUST BE COMPATIBLE WITH THE TWO THOUSAND SEVEN TWENTY THIRTEEN FINANCIAL FRAMEWORK IT IS IMPORTANT
+1528_epst LADIES AND GENTLEMEN THE SITUATION OF THE MARKETS DOES NOT REFLECT THE STRENGTH OF SPAIN
+
+
+581_mtedx BUT WHY DID YOU PUT YOURSELF IN THIS SITUATION
+591_mtedx IN TWENTY SIXTEEN THERE WERE THREE THOUSAND FIVE HUNDRED SIXTY NINE SUICIDES IN SPAIN ACCORDING TO THE NATIONAL INSTITUTE OF STATISTICS
+100_mtedx AND IT STARTED IN ASIA AND FROM THERE IT PASSED TO VENUS THE MOST COSMOPOLITAN PORT IN ITS TIME
+592_mtedx ALMOST TEN PEOPLE A DAY ONE EVERY TWO HOURS AND A HALF
+
+
+
+MT
+11375_cv ADDITIONAL UP TO BORSES NORMALLY PROVIDED BY COAST SO CAST BANDS OF BRISTOL ALL FESTIVAL
+2692_cv HE FOUND A COUNTRY WITH TWO GOVERNMENTS IN THE CAPITAL THE MOST SIMILIAN CAPITAL WAS THE EMPEROR
+
+1507_epst ANOTHER MORE INSTITUTIONAL ASPECT IS THE BALANCE OF FORCES BETWEEN PARLIAMENT AND THE COUNCIL
+1700_epst IN ADDITION HIS REGENERATION CAPACITY IS VERY LIMITED
+
+
+581_mtedx BUT WHY DID THE SITUATION IN THIS SITUATION
+591_mtedx IN TWENTY SIXTEEN THERE WERE THREE THOUSAND FIVE HUNDRED SIXTY NINE SIXTY NINE SUICIDES IN SPAIN ACCORDING TO THE NATIONAL STATISTICS INSTITUTE
+
+
+
+
+
+S2T_TTS
+11375_cv ADDITIONAL BUSES USUALLY PROVIDED BY GO SOUTH COAST GO FROM BRUCE TO THE FESTIVAL
+2692_cv HE FOUND A COUNTRY WITH TWO GOVERNMENTS AND THE CAPITAL MAXIMILIAN WAS AN EMPEROR
+
+1507_epst ANOTHER MORE INSTITUTIONAL ASPECT IS THE BALANCE OF POWER BETWEEN PARLIAMENT AND THE COUNCIL
+1700_epst MOREOVER ITS RECOVERY IS VERY LIMITED
+
+
+581_mtedx WHY DID YOU PUT YOU THIS SITUATION
+591_mtedx IN TWENTY SIXTEEN THERE WERE THREE THOUSAND FIVE HUNDRED SIXTY NINE KILLINGS IN SPAIN ACCORDING TO THE NATIONAL STATISTICS INSTITUTE
+
+
+
+
+
\ No newline at end of file
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/reference.txt b/enhanced_direct_s2st_units/audios/es-en/set1/reference.txt
new file mode 100644
index 0000000..3874b90
--- /dev/null
+++ b/enhanced_direct_s2st_units/audios/es-en/set1/reference.txt
@@ -0,0 +1,8 @@
+11375_cv ADDITIONAL BUSES USUALLY PROVIDED BY GO SOUTH COAST GO FROM BRISTOL TO THE FESTIVAL
+12411_cv THIS WAY THE LITTLE PRINCE DECIDED TO LEAVE HIS PLANET AND EXPLORE THE REST OF THE UNIVERSE
+2962_cv HE FOUND A COUNTRY WITH TWO GOVERNMENTS IN THE CAPITAL MAXIMILIAN WAS THE EMPEROR
+1313_epst MR PRESIDENT LADIES AND GENTLEMEN WE ARE DEALING WITH AN EXTREMELY SENSITIVE ISSUE
+1507_epst ANOTHER MORE INSTITUTIONAL ASPECT IS THE BALANCE OF POWER BETWEEN PARLIAMENT AND THE COUNCIL
+1700_epst MOREOVER THEIR CAPACITY FOR REGENERATION IS VERY LIMITED
+581_mtedx BUT WHY DID YOU PUT YOURSELF IN THIS SITUATION
+591_mtedx IN TWENTY SIXTEEN THERE WERE THREE THOUSAND FIVE HUNDRED SIXTY NINE SUICIDES IN SPAIN ACCORDING TO THE NATIONAL INSTITUTE OF STATISTICS
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/s2t_tts/11375_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set1/s2t_tts/11375_cv.wav
new file mode 100644
index 0000000..3900ffd
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/s2t_tts/11375_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/s2t_tts/12411_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set1/s2t_tts/12411_cv.wav
new file mode 100644
index 0000000..0b64ad1
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/s2t_tts/12411_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/s2t_tts/1313_epst.wav b/enhanced_direct_s2st_units/audios/es-en/set1/s2t_tts/1313_epst.wav
new file mode 100644
index 0000000..28bb3b3
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/s2t_tts/1313_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/s2t_tts/1507_epst.wav b/enhanced_direct_s2st_units/audios/es-en/set1/s2t_tts/1507_epst.wav
new file mode 100644
index 0000000..3efddbe
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/s2t_tts/1507_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/s2t_tts/1700_epst.wav b/enhanced_direct_s2st_units/audios/es-en/set1/s2t_tts/1700_epst.wav
new file mode 100644
index 0000000..b16314e
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/s2t_tts/1700_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/s2t_tts/2692_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set1/s2t_tts/2692_cv.wav
new file mode 100644
index 0000000..8a65a8e
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/s2t_tts/2692_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/s2t_tts/2962_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set1/s2t_tts/2962_cv.wav
new file mode 100644
index 0000000..5d10a84
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/s2t_tts/2962_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/s2t_tts/581_mtedx.wav b/enhanced_direct_s2st_units/audios/es-en/set1/s2t_tts/581_mtedx.wav
new file mode 100644
index 0000000..1d3ce70
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/s2t_tts/581_mtedx.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/s2t_tts/591_mtedx.wav b/enhanced_direct_s2st_units/audios/es-en/set1/s2t_tts/591_mtedx.wav
new file mode 100644
index 0000000..e9a80a7
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/s2t_tts/591_mtedx.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_lnd/11375_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_lnd/11375_cv.wav
new file mode 100644
index 0000000..f5c6e20
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_lnd/11375_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_lnd/12411_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_lnd/12411_cv.wav
new file mode 100644
index 0000000..0ca52df
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_lnd/12411_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_lnd/1313_epst.wav b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_lnd/1313_epst.wav
new file mode 100644
index 0000000..021a596
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_lnd/1313_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_lnd/1507_epst.wav b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_lnd/1507_epst.wav
new file mode 100644
index 0000000..14dd46e
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_lnd/1507_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_lnd/1700_epst.wav b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_lnd/1700_epst.wav
new file mode 100644
index 0000000..c37a68f
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_lnd/1700_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_lnd/2692_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_lnd/2692_cv.wav
new file mode 100644
index 0000000..800275f
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_lnd/2692_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_lnd/2962_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_lnd/2962_cv.wav
new file mode 100644
index 0000000..2291376
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_lnd/2962_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_lnd/581_mtedx.wav b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_lnd/581_mtedx.wav
new file mode 100644
index 0000000..61c3833
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_lnd/581_mtedx.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_lnd/591_mtedx.wav b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_lnd/591_mtedx.wav
new file mode 100644
index 0000000..1823e10
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_lnd/591_mtedx.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_mt/11375_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_mt/11375_cv.wav
new file mode 100644
index 0000000..2c7c5b0
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_mt/11375_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_mt/12411_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_mt/12411_cv.wav
new file mode 100644
index 0000000..0ce1968
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_mt/12411_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_mt/1313_epst.wav b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_mt/1313_epst.wav
new file mode 100644
index 0000000..7fe84c8
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_mt/1313_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_mt/1507_epst.wav b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_mt/1507_epst.wav
new file mode 100644
index 0000000..606986e
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_mt/1507_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_mt/1700_epst.wav b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_mt/1700_epst.wav
new file mode 100644
index 0000000..7fd95ef
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_mt/1700_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_mt/2692_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_mt/2692_cv.wav
new file mode 100644
index 0000000..c319977
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_mt/2692_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_mt/2962_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_mt/2962_cv.wav
new file mode 100644
index 0000000..869c6e3
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_mt/2962_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_mt/581_mtedx.wav b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_mt/581_mtedx.wav
new file mode 100644
index 0000000..ac97ff3
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_mt/581_mtedx.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_mt/591_mtedx.wav b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_mt/591_mtedx.wav
new file mode 100644
index 0000000..cec34b3
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/s2ut_mt/591_mtedx.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/source/11375.flac b/enhanced_direct_s2st_units/audios/es-en/set1/source/11375.flac
new file mode 100644
index 0000000..7c23ba2
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/source/11375.flac differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/source/11375_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set1/source/11375_cv.wav
new file mode 100644
index 0000000..e12cc7b
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/source/11375_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/source/12411_cv.flac b/enhanced_direct_s2st_units/audios/es-en/set1/source/12411_cv.flac
new file mode 100644
index 0000000..6bcf1ae
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/source/12411_cv.flac differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/source/12411_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set1/source/12411_cv.wav
new file mode 100644
index 0000000..e40d748
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/source/12411_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/source/1313_epst.flac b/enhanced_direct_s2st_units/audios/es-en/set1/source/1313_epst.flac
new file mode 100644
index 0000000..29c04a4
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/source/1313_epst.flac differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/source/1507_epst.flac b/enhanced_direct_s2st_units/audios/es-en/set1/source/1507_epst.flac
new file mode 100644
index 0000000..46f94f9
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/source/1507_epst.flac differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/source/1507_epst.wav b/enhanced_direct_s2st_units/audios/es-en/set1/source/1507_epst.wav
new file mode 100644
index 0000000..ba30788
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/source/1507_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/source/1700_epst.flac b/enhanced_direct_s2st_units/audios/es-en/set1/source/1700_epst.flac
new file mode 100644
index 0000000..87fa15b
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/source/1700_epst.flac differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/source/1700_epst.wav b/enhanced_direct_s2st_units/audios/es-en/set1/source/1700_epst.wav
new file mode 100644
index 0000000..4a04f0c
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/source/1700_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/source/2692_cv.flac b/enhanced_direct_s2st_units/audios/es-en/set1/source/2692_cv.flac
new file mode 100644
index 0000000..ba59c61
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/source/2692_cv.flac differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/source/2692_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set1/source/2692_cv.wav
new file mode 100644
index 0000000..3550ffb
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/source/2692_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/source/581_mtedx.flac b/enhanced_direct_s2st_units/audios/es-en/set1/source/581_mtedx.flac
new file mode 100644
index 0000000..2e6bae4
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/source/581_mtedx.flac differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/source/581_mtedx.wav b/enhanced_direct_s2st_units/audios/es-en/set1/source/581_mtedx.wav
new file mode 100644
index 0000000..e1e4f31
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/source/581_mtedx.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/source/591_mtedx.flac b/enhanced_direct_s2st_units/audios/es-en/set1/source/591_mtedx.flac
new file mode 100644
index 0000000..b7528ac
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/source/591_mtedx.flac differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/source/591_mtedx.wav b/enhanced_direct_s2st_units/audios/es-en/set1/source/591_mtedx.wav
new file mode 100644
index 0000000..1d461ee
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/source/591_mtedx.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/target/11375_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set1/target/11375_cv.wav
new file mode 100644
index 0000000..fd99ddb
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/target/11375_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/target/12411_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set1/target/12411_cv.wav
new file mode 100644
index 0000000..6b635a5
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/target/12411_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/target/1313_epst.wav b/enhanced_direct_s2st_units/audios/es-en/set1/target/1313_epst.wav
new file mode 100644
index 0000000..2567cea
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/target/1313_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/target/1507_epst.wav b/enhanced_direct_s2st_units/audios/es-en/set1/target/1507_epst.wav
new file mode 100644
index 0000000..73f15f3
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/target/1507_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/target/1528_epst.wav b/enhanced_direct_s2st_units/audios/es-en/set1/target/1528_epst.wav
new file mode 100644
index 0000000..81dc97a
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/target/1528_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/target/1700_epst.wav b/enhanced_direct_s2st_units/audios/es-en/set1/target/1700_epst.wav
new file mode 100644
index 0000000..315ae7f
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/target/1700_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/target/2692_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set1/target/2692_cv.wav
new file mode 100644
index 0000000..8b6a34e
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/target/2692_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/target/2962_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set1/target/2962_cv.wav
new file mode 100644
index 0000000..8cbcc60
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/target/2962_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/target/581_mtedx.wav b/enhanced_direct_s2st_units/audios/es-en/set1/target/581_mtedx.wav
new file mode 100644
index 0000000..854c78c
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/target/581_mtedx.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set1/target/591_mtedx.wav b/enhanced_direct_s2st_units/audios/es-en/set1/target/591_mtedx.wav
new file mode 100644
index 0000000..67b458e
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set1/target/591_mtedx.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/asr.txt b/enhanced_direct_s2st_units/audios/es-en/set2/asr.txt
new file mode 100644
index 0000000..3833985
--- /dev/null
+++ b/enhanced_direct_s2st_units/audios/es-en/set2/asr.txt
@@ -0,0 +1,58 @@
+LND
+11375_cv ADDITIONAL BUSES USUALLY PROVIDED BY GO SOUTH COAST GO FROM BRISTOL TO THE FESTIVAL
+2692_cv HE FOUND A COUNTRY WITH TWO GOVERNMENTS IN THE CAPITAL MAXIMILIAN WAS THE EMPEROR
+9756_cv EACH OF THEM IS AN EXCLUSIVE RIGHT SUBJECT TO CERTAIN LIMITATIONS AND EXCEPTIONS
+12478_cv THIS EXPERIENCE REPRESENTS A TRANSCENDENT STEP IN THE SPACE HISTORY OF THE COUNTRY
+4109_cv FROM A PHYSICAL CHEMICAL AND BIOLOGICAL BALANCE HE IS IN A KEY POSITION
+
+1507_epst ANOTHER MORE INSTITUTIONAL ASPECT IS THE BALANCE OF FORCES BETWEEN PARLIAMENT AND THE COUNCIL
+1700_epst MOREOVER ITS CAPACITY FOR REGENERATION IS VERY LIMITED
+289_epst IN ANY CASE GIVEN THAT THE FUNDING OF THIS NEW CORPORATION INSTRUMENT MUST BE COMPATIBLE WITH THE TWO THOUSAND SEVEN TWENTY THIRTEEN FINANCIAL FRAMEWORK IT IS IMPORTANT
+1528_epst LADIES AND GENTLEMEN THE SITUATION OF THE MARKETS DOES NOT REFLECT THE STRENGTH OF SPAIN
+
+
+581_mtedx BUT WHY DID YOU PUT YOURSELF IN THIS SITUATION
+591_mtedx IN TWENTY SIXTEEN THERE WERE THREE THOUSAND FIVE HUNDRED SIXTY NINE SUICIDES IN SPAIN ACCORDING TO THE NATIONAL INSTITUTE OF STATISTICS
+100_mtedx AND IT STARTED IN ASIA AND FROM THERE IT PASSED TO VENUS THE MOST COSMOPOLITAN PORT IN ITS TIME
+592_mtedx ALMOST TEN PEOPLE A DAY ONE EVERY TWO HOURS AND A HALF
+
+
+LR50
+9756_cv EACH ONE OF THEM IS AN EXCLUSIVE RIGHT SUBJECT TO CERTAIN LIMITATIONS AND EXCEPTIONS
+12478_cv THIS EXPERIENCE REPRESENTS A TRANSCENDENTAL STEP IN THE SPATIAL HISTORY OF THE COUNTRY
+4109_cv FROM A PHYSICAL PERSPECTIVE OF PHYSICAL CHEMICAL AND BIOLOGICAL POSITION
+289_epst IN ANY CASE SINCE THE FINANCING OF THIS NEW INSTRUMENT OF CORPORATION MUST COMPATIBLE WITH THE FINANCIAL FRAMEWORK FOR TWENTY THIRTEEN
+1528_epst LADIES AND GENTLEMEN THE MARKET SITUATION DOES NOT REFLECT THE STRENGTH OF SPAIN
+100_mtedx HE BECAME IN ASIA AND FROM THAT TIME THE MOST COSMOPOLITAN IN HIS LIFE
+592_mtedx EVEN TEN PEOPLE EVERY DAY ONE EVERY TWO HOURS
+
+
+
+ASR + LND
+9756_cv EACH OF THEM IS AN EXCLUSIVE RIGHT SUBJECT TO CERTAIN LIMITATIONS AND EXCEPTIONS
+12478_cv THIS EXPERIENCE REPRESENTS A MOVEMENT STEP IN THE SPACE HISTORY OF THE COUNTRY
+4109_cv FROM THE PERSPECTIVE OF PHYSICAL CHEMICAL AND BIOLOGICAL BALANCE IT IS IN A KEY POSITION
+289_epst IN ANY CASE GIVEN THAT THE FINANCING OF THIS NEW CORPORATION INSTRUMENT MUST BE COMPATIBLE WITH THE TWO THOUSAND SEVEN TWENTY THIRTEEN FINANCIAL FRAMEWORK
+1528_epst LADIES AND GENTLEMEN THE SITUATION IN THE MARKETS DOES NOT REFLECT SPAIN'S STRENGTH
+100_mtedx IT STARTED IN ASIA AND FROM THERE IT WENT TO VENICE THE MOST COSMOPOLITAN PORT OF ITS TIME
+592_mtedx ALMOST TEN PEOPLE A DAY ONE EVERY TWO AND A HALF HOURS
+
+
+
+
+
+LND
+EACH OF THEM IS AN EXCLUSIVE RIGHT SUBJECT TO CERTAIN LIMITATIONS AND EXCEPTIONS
+EACH ONE OF THEM IS AN EXCLUSIVE RIGHT SUBJECT TO CERTAIN LIMITATIONS AND EXCEPTIONS
+EACH OF THEM IS AN EXCLUSIVE RIGHT SUBJECT TO CERTAIN LIMITATIONS AND EXCEPTIONS
+
+
+
+
+
+
+
+
+
+
+
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/reference.txt b/enhanced_direct_s2st_units/audios/es-en/set2/reference.txt
new file mode 100644
index 0000000..a748a68
--- /dev/null
+++ b/enhanced_direct_s2st_units/audios/es-en/set2/reference.txt
@@ -0,0 +1,7 @@
+9756_cv EACH ONE OF THEM IS AN EXCLUSIVE RIGHT SUBJECT TO CERTAIN LIMITATIONS AND EXCEPTIONS
+12478_cv THIS EXPERIENCE REPRESENTS A TRANSCENDENTAL STEP IN THE SPATIAL HISTORY OF THE COUNTRY
+4109_cv FROM THE PERSPECTIVE OF PHYSICAL CHEMICAL AND BIOLOGICAL BALANCE IT IS IN A KEY POSITION
+289_epst IN ANY CASE GIVEN THAT THE FINANCING OF THIS NEW COOPERATION INSTRUMENT MUST BE COMPATIBLE WITH THE TWO THOUSAND SEVEN TWENTY THIRTEEN FINANCIAL FRAMEWORK IT IS WORTH
+1528_epst LADIES AND GENTLEMEN THE SITUATION IN THE MARKETS DOES NOT REFLECT SPAIN'S STRENGTH
+100_mtedx IT STARTED IN ASIA AND FROM THERE IT WENT TO VENICE THE MOST COSMOPOLITAN PORT OF ITS TIME
+592_mtedx ALMOST TEN PEOPLE A DAY ONE EVERY TWO AND A HALF HOURS
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd/100_mtedx.wav b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd/100_mtedx.wav
new file mode 100644
index 0000000..590d5d3
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd/100_mtedx.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd/12478_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd/12478_cv.wav
new file mode 100644
index 0000000..1275911
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd/12478_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd/1528_epst.wav b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd/1528_epst.wav
new file mode 100644
index 0000000..6f9a668
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd/1528_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd/289_epst.wav b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd/289_epst.wav
new file mode 100644
index 0000000..3e5423b
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd/289_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd/4109_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd/4109_cv.wav
new file mode 100644
index 0000000..6e5a922
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd/4109_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd/592_mtedx.wav b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd/592_mtedx.wav
new file mode 100644
index 0000000..40df15e
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd/592_mtedx.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd/9756_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd/9756_cv.wav
new file mode 100644
index 0000000..82bb8f8
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd/9756_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd_w_asr/100_mtedx.wav b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd_w_asr/100_mtedx.wav
new file mode 100644
index 0000000..7c090e8
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd_w_asr/100_mtedx.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd_w_asr/12478_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd_w_asr/12478_cv.wav
new file mode 100644
index 0000000..bd59d71
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd_w_asr/12478_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd_w_asr/1528_epst.wav b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd_w_asr/1528_epst.wav
new file mode 100644
index 0000000..d95170c
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd_w_asr/1528_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd_w_asr/289_epst.wav b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd_w_asr/289_epst.wav
new file mode 100644
index 0000000..1a3694a
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd_w_asr/289_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd_w_asr/4109_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd_w_asr/4109_cv.wav
new file mode 100644
index 0000000..00f226a
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd_w_asr/4109_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd_w_asr/592_mtedx.wav b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd_w_asr/592_mtedx.wav
new file mode 100644
index 0000000..10b1af8
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd_w_asr/592_mtedx.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd_w_asr/9756_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd_w_asr/9756_cv.wav
new file mode 100644
index 0000000..74ab7c6
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lnd_w_asr/9756_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lr50/100_mtedx.wav b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lr50/100_mtedx.wav
new file mode 100644
index 0000000..eca7cc9
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lr50/100_mtedx.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lr50/12478_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lr50/12478_cv.wav
new file mode 100644
index 0000000..bed24f6
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lr50/12478_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lr50/289_epst.wav b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lr50/289_epst.wav
new file mode 100644
index 0000000..8cdac96
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lr50/289_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lr50/4109_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lr50/4109_cv.wav
new file mode 100644
index 0000000..044a1d5
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lr50/4109_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lr50/592_mtedx.wav b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lr50/592_mtedx.wav
new file mode 100644
index 0000000..1cc8a4e
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lr50/592_mtedx.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lr50/9756_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lr50/9756_cv.wav
new file mode 100644
index 0000000..6406049
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/s2ut_lr50/9756_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/source/100_mtedx.flac b/enhanced_direct_s2st_units/audios/es-en/set2/source/100_mtedx.flac
new file mode 100644
index 0000000..6783922
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/source/100_mtedx.flac differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/source/100_mtedx.wav b/enhanced_direct_s2st_units/audios/es-en/set2/source/100_mtedx.wav
new file mode 100644
index 0000000..777a485
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/source/100_mtedx.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/source/12478_cv.flac b/enhanced_direct_s2st_units/audios/es-en/set2/source/12478_cv.flac
new file mode 100644
index 0000000..efb6d88
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/source/12478_cv.flac differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/source/12478_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set2/source/12478_cv.wav
new file mode 100644
index 0000000..095d13b
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/source/12478_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/source/289_epst.flac b/enhanced_direct_s2st_units/audios/es-en/set2/source/289_epst.flac
new file mode 100644
index 0000000..31e0e87
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/source/289_epst.flac differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/source/289_epst.wav b/enhanced_direct_s2st_units/audios/es-en/set2/source/289_epst.wav
new file mode 100644
index 0000000..0b4202d
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/source/289_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/source/4109_cv.flac b/enhanced_direct_s2st_units/audios/es-en/set2/source/4109_cv.flac
new file mode 100644
index 0000000..3a956cf
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/source/4109_cv.flac differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/source/4109_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set2/source/4109_cv.wav
new file mode 100644
index 0000000..5a7b684
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/source/4109_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/source/592_mtedx.flac b/enhanced_direct_s2st_units/audios/es-en/set2/source/592_mtedx.flac
new file mode 100644
index 0000000..49093ee
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/source/592_mtedx.flac differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/source/592_mtedx.wav b/enhanced_direct_s2st_units/audios/es-en/set2/source/592_mtedx.wav
new file mode 100644
index 0000000..d97facf
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/source/592_mtedx.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/source/9756_cv.flac b/enhanced_direct_s2st_units/audios/es-en/set2/source/9756_cv.flac
new file mode 100644
index 0000000..2d96b5c
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/source/9756_cv.flac differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/source/9756_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set2/source/9756_cv.wav
new file mode 100644
index 0000000..f1eb2e7
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/source/9756_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/target/100_mtedx.wav b/enhanced_direct_s2st_units/audios/es-en/set2/target/100_mtedx.wav
new file mode 100644
index 0000000..46d3262
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/target/100_mtedx.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/target/11375_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set2/target/11375_cv.wav
new file mode 100644
index 0000000..fd99ddb
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/target/11375_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/target/12411_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set2/target/12411_cv.wav
new file mode 100644
index 0000000..6b635a5
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/target/12411_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/target/12478_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set2/target/12478_cv.wav
new file mode 100644
index 0000000..adfa6a3
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/target/12478_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/target/1528_epst.wav b/enhanced_direct_s2st_units/audios/es-en/set2/target/1528_epst.wav
new file mode 100644
index 0000000..81dc97a
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/target/1528_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/target/289_epst.wav b/enhanced_direct_s2st_units/audios/es-en/set2/target/289_epst.wav
new file mode 100644
index 0000000..6861b45
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/target/289_epst.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/target/4109_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set2/target/4109_cv.wav
new file mode 100644
index 0000000..25e0eaf
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/target/4109_cv.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/target/592_mtedx.wav b/enhanced_direct_s2st_units/audios/es-en/set2/target/592_mtedx.wav
new file mode 100644
index 0000000..cb95d9e
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/target/592_mtedx.wav differ
diff --git a/enhanced_direct_s2st_units/audios/es-en/set2/target/9756_cv.wav b/enhanced_direct_s2st_units/audios/es-en/set2/target/9756_cv.wav
new file mode 100644
index 0000000..16384f8
Binary files /dev/null and b/enhanced_direct_s2st_units/audios/es-en/set2/target/9756_cv.wav differ
diff --git a/enhanced_direct_s2st_units/index.html b/enhanced_direct_s2st_units/index.html
new file mode 100644
index 0000000..f9f8414
--- /dev/null
+++ b/enhanced_direct_s2st_units/index.html
@@ -0,0 +1,1913 @@
+
+
+
+
+
+ Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation
+
+
+
+
+
+
+
Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data
+ Augmentation
+
+
+
+ Sravya Popuri☆, Peng-Jen Chen☆, Changhan
+ Wang, Juan Pino, Yossi Adi,
+ Jiatao Gu, Wei-Ning Hsu†, Ann Lee†
+ (☆ = Equal contribution and † = Equal supervision)
+
+
+
+
+
+
+
+
+ We explore self-supervised pre-training with unlabeled speech data and data augmentation to improve direct
+ speech-to-speech translation (S2ST) model training. We take advantage of a recently proposed speech-to-unit translation (S2UT)
+ framework that encodes
+ target
+ speech into discrete representations, and study both speech encoder and discrete unit decoder pre-training
+ as well as
+ efficient partial finetuning methods. We conduct experiments under various data setups and show that
+ self-supervised
+ pre-training consistently improves model performance compared with multitask learning and is complementary
+ to data
+ augmentation techniques that apply ASR and MT models to create weakly supervised training data.
+
+
We provide ground-truth source and target audio with the corresponding reference text,
+ as well as audio samples from three systems:
+ (1) S2UT+LNA-D: the proposed direct speech-to-unit translation
+ system initialized with a wav2vec 2.0 encoder and a unit mBART decoder, and finetuned using the LNA-D strategy.
+ (2) Supervised S2UT: a baseline direct speech-to-unit translation system trained with
+ source and target text as auxiliary task targets.
+
+ (3) S2T+TTS: a baseline cascaded system with a speech-to-text translation model initialized
+ with a wav2vec 2.0 encoder and a randomly initialized decoder, followed by a text-to-speech synthesis model.
+ Both (1) and (2) use an open-source HiFi-GAN vocoder to convert units to waveforms.
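The sample tables on this page pair each system's output with an ASR transcript, and the sample headings judge which system did best. One rough way to make that comparison concrete is to count word-level edits between a transcript and the reference text. The sketch below does this for the Sample 2 transcripts of the Spanish-to-English baseline comparison. It assumes the three ASR cells follow the column order of the table header (S2UT+LNA-D, Supervised S2UT, S2T+TTS). `word_errors` is a hypothetical helper written for illustration, not the authors' evaluation code.

```python
def word_errors(reference: str, hypothesis: str) -> int:
    """Word-level Levenshtein distance between two transcripts."""
    ref, hyp = reference.split(), hypothesis.split()
    # prev[j] is the distance between the reference words seen so far
    # (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1]


# Transcripts copied from Sample 2; the system-to-column mapping is an assumption.
reference = "BUT WHY DID YOU PUT YOURSELF IN THIS SITUATION"
asr_transcripts = {
    "S2UT+LNA-D": "BUT WHY DID YOU PUT YOURSELF IN THIS SITUATION",
    "Supervised S2UT": "BUT WHY DID THE SITUATION IN THIS SITUATION",
    "S2T+TTS": "WHY DID YOU PUT YOU THIS SITUATION",
}
errors = {name: word_errors(reference, text) for name, text in asr_transcripts.items()}
for name, count in errors.items():
    print(f"{name}: {count} word errors out of {len(reference.split())} reference words")
```

A raw edit count treats every word the same, so two systems can tie on this number even when one output reads more naturally. Listening to the audio samples remains the more informative check.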
+
+
+
+
+
Ground truth
+
Predictions
+
+
+
+
Source (Spanish)
+
Target (English)
+
S2UT+LNA-D
+
Supervised S2UT
+
S2T+TTS
+
+
+
Sample 1: S2UT+LNA-D performs best
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
Reference:
+
en dos mil dieciseis hubo tres mil quinientos sesenta y nueve
+ suicidios en españa según el instituto
+ nacional de estadística
+
IN TWENTY SIXTEEN THERE WERE THREE THOUSAND FIVE HUNDRED SIXTY
+ NINE SUICIDES IN SPAIN ACCORDING TO
+ THE NATIONAL INSTITUTE OF STATISTICS
+
+
+
ASR:
+
+
+
IN TWENTY SIXTEEN THERE WERE THREE THOUSAND FIVE HUNDRED SIXTY
+ NINE SUICIDES IN SPAIN ACCORDING TO
+ THE NATIONAL
+ INSTITUTE OF STATISTICS
+
IN TWENTY SIXTEEN THERE WERE THREE THOUSAND FIVE HUNDRED SIXTY
+ NINE SIXTY NINE SUICIDES IN SPAIN
+ ACCORDING TO THE
+ NATIONAL STATISTICS INSTITUTE
+
IN TWENTY SIXTEEN THERE WERE THREE THOUSAND FIVE HUNDRED SIXTY
+ NINE KILLINGS IN SPAIN ACCORDING TO
+ THE NATIONAL
+ STATISTICS INSTITUTE
+
+
+
Sample 2: S2UT+LNA-D performs best
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
Reference:
+
pero por qué te pusiste en esta situación
+
BUT WHY DID YOU PUT YOURSELF IN THIS SITUATION
+
+
+
ASR:
+
+
+
BUT WHY DID YOU PUT YOURSELF IN THIS SITUATION
+
BUT WHY DID THE SITUATION IN THIS SITUATION
+
WHY DID YOU PUT YOU THIS SITUATION
+
+
+
Sample 3: S2UT+LNA-D performs best
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
Reference:
+
autobuses adicionales normalmente proporcionados por go south coast
+ van desde bristol al festival
+
+
ADDITIONAL BUSES USUALLY PROVIDED BY GO SOUTH COAST GO FROM
+ BRISTOL TO THE FESTIVAL
+
+
+
ASR:
+
+
+
ADDITIONAL BUSES USUALLY PROVIDED BY GO SOUTH COAST GO FROM BRISTOL
+ TO THE FESTIVAL
+
ADDITIONAL UP TO BORSES NORMALLY PROVIDED BY COAST SO CAST BANDS OF
+ BRISTOL ALL FESTIVAL
+
ADDITIONAL BUSES USUALLY PROVIDED BY GO SOUTH COAST GO FROM BRUCE
+ TO THE FESTIVAL
+
+
+
Sample 4: S2UT+LNA-D performs best
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
Reference:
+
encontró un país con dos gobiernos en la capital maximiliano era el
+ emperador
+
HE FOUND A COUNTRY WITH TWO GOVERNMENTS IN THE CAPITAL MAXIMILIAN
+ WAS THE EMPEROR
+
+
+
ASR:
+
+
+
HE FOUND A COUNTRY WITH TWO GOVERNMENTS IN THE CAPITAL MAXIMILIAN
+ WAS THE EMPEROR
+
HE FOUND A COUNTRY WITH TWO GOVERNMENTS IN THE CAPITAL THE MOST
+ SIMILIAN CAPITAL WAS THE EMPEROR
+
+
HE FOUND A COUNTRY WITH TWO GOVERNMENTS AND THE CAPITAL MAXIMILIAN
+ WAS AN EMPEROR
+
+
+
+
Sample 5: S2T+TTS performs best
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
Reference:
+
otro aspecto más institucional es el equilibrio de fuerzas entre
+ el parlamento y el consejo
+
ANOTHER MORE INSTITUTIONAL ASPECT IS THE BALANCE OF POWER BETWEEN
+ PARLIAMENT AND THE COUNCIL
+
+
+
ASR:
+
+
+
ANOTHER MORE INSTITUTIONAL ASPECT IS THE BALANCE OF FORCES BETWEEN
+ PARLIAMENT AND THE COUNCIL
+
ANOTHER MORE INSTITUTIONAL ASPECT IS THE BALANCE OF FORCES BETWEEN
+ PARLIAMENT AND THE COUNCIL
+
ANOTHER MORE INSTITUTIONAL ASPECT IS THE BALANCE OF POWER BETWEEN
+ PARLIAMENT AND THE COUNCIL
+
+
+
Sample 6: All systems make errors
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
Reference:
+
además su capacidad de regeneración es muy limitada
+
MOREOVER THEIR CAPACITY FOR REGENERATION IS VERY LIMITED
+
+
+
ASR:
+
+
+
MOREOVER ITS CAPACITY FOR REGENERATION IS VERY LIMITED
+
IN ADDITION HIS REGENERATION CAPACITY IS VERY LIMITED
+
MOREOVER ITS RECOVERY IS VERY LIMITED
+
+
+
+
+
+
+
+
+ Spanish To English
+
+
Different Data Setups
+
+
+
We provide ground-truth source and target audio with the corresponding reference text,
+ as well as audio samples from three systems. All three models are initialized with a wav2vec 2.0 encoder
+ and a unit mBART decoder and are finetuned using the LNA-D strategy, but they use different datasets for finetuning:
+ (1) S2UT_Base: finetuned on the combination of the CoVoST2, Europarl-ST, and mTEDx datasets.
+
+ (2) S2UT_LR: finetuned in a low-resource setup with 50 hours of data sampled from the
+ combination of the CoVoST2, Europarl-ST, and mTEDx datasets.
+
+ (3) S2UT_Aug: finetuned on the combination of the CoVoST2, Europarl-ST, and mTEDx datasets
+ plus the ASR data.
+ All models use an open-source HiFi-GAN vocoder to convert units to waveforms.
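The page does not say how the 50 hours for the low-resource setup were drawn. As a minimal sketch of one plausible approach, the code below shuffles a pool of utterances with a fixed seed and keeps adding utterances while the total duration stays within an hour budget. The function name `sample_subset`, the duration dictionary, and the toy pool are all hypothetical and only illustrate the idea.

```python
import random


def sample_subset(durations_sec, budget_hours, seed=0):
    """Randomly pick utterance ids whose total duration fits within the budget.

    durations_sec maps an utterance id to its length in seconds.
    Returns the chosen ids and their total duration in hours.
    """
    budget_sec = budget_hours * 3600.0
    ids = sorted(durations_sec)  # fixed starting order so the seed is reproducible
    random.Random(seed).shuffle(ids)
    chosen, total = [], 0.0
    for utt_id in ids:
        duration = durations_sec[utt_id]
        if total + duration > budget_sec:
            continue  # skip utterances that would overshoot the budget
        chosen.append(utt_id)
        total += duration
    return chosen, total / 3600.0


# Toy pool: ten 30-minute utterances (5 hours in total), sampled down to 2 hours.
pool = {f"utt{i}": 1800.0 for i in range(10)}
subset, hours = sample_subset(pool, budget_hours=2)
print(len(subset), hours)
```

For the setup described above, the pool would hold the utterances of the combined training sets and the budget would be 50 hours. Sampling per dataset in proportion to its size is another reasonable choice that this sketch does not cover.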
+
+
+
+
+
Ground truth
+
Predictions
+
+
+
+
Source (Spanish)
+
Target (English)
+
S2UT_LR
+
S2UT_Base
+
S2UT_Aug
+
+
+
Sample 1: All systems do well
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
Reference:
+
cada uno de ellos es un derecho exclusivo sujeto a ciertas
+ limitaciones y excepciones
+
+
EACH ONE OF THEM IS AN EXCLUSIVE RIGHT SUBJECT TO CERTAIN
+ LIMITATIONS AND EXCEPTIONS
+
+
+
ASR:
+
+
+
EACH OF THEM IS AN EXCLUSIVE RIGHT SUBJECT TO CERTAIN LIMITATIONS
+ AND EXCEPTIONS
+
EACH ONE OF THEM IS AN EXCLUSIVE RIGHT SUBJECT TO CERTAIN
+ LIMITATIONS AND EXCEPTIONS
+
EACH OF THEM IS AN EXCLUSIVE RIGHT SUBJECT TO CERTAIN LIMITATIONS
+ AND EXCEPTIONS
+
+
+
Sample 2: S2UT_LR performs best
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
Reference:
+
esta experiencia representa un paso trascendental en la historia
+ espacial del país
+
THIS EXPERIENCE REPRESENTS A TRANSCENDENTAL STEP IN THE SPATIAL
+ HISTORY OF THE COUNTRY
+
+
+
ASR:
+
+
+
THIS EXPERIENCE REPRESENTS A TRANSCENDENT STEP IN THE SPACE HISTORY
+ OF THE COUNTRY
+
THIS EXPERIENCE REPRESENTS A TRANSCENDENTAL STEP IN THE SPATIAL
+ HISTORY OF THE COUNTRY
+
THIS EXPERIENCE REPRESENTS A MOVEMENT STEP IN THE SPACE HISTORY OF
+ THE COUNTRY
+
+
+
Sample 3: S2UT_LR has errors but S2UT_Base and S2UT_Aug got it right
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
Reference:
+
cuando casi diez personas al día una cada dos horas y media
+
ALMOST TEN PEOPLE A DAY ONE EVERY TWO AND A HALF HOURS
+
+
+
ASR:
+
+
+
ALMOST TEN PEOPLE A DAY ONE EVERY TWO HOURS AND A HALF
+
EVEN TEN PEOPLE EVERY DAY ONE EVERY TWO HOURS
+
ALMOST TEN PEOPLE A DAY ONE EVERY TWO AND A HALF HOURS
+
+
+
Sample 4: S2UT_Aug performs best
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
Reference:
+
desde la perspectiva del balance físico químico y biológico está
+ en una posición clave
+
THE PERSPECTIVE OF PHYSICAL CHEMICAL AND BIOLOGICAL BALANCE IT IS
+ IN A KEY POSITION
+
+
+
ASR:
+
+
+
FROM A PHYSICAL CHEMICAL AND BIOLOGICAL BALANCE HE IS IN A KEY
+ POSITION
+
FROM A PHYSICAL PERSPECTIVE OF PHYSICAL CHEMICAL AND BIOLOGICAL
+ POSITION
+
FROM THE PERSPECTIVE OF PHYSICAL CHEMICAL AND BIOLOGICAL BALANCE IT
+ IS IN A KEY POSITION
+
+
+
+
Sample 5: S2UT_Aug performs best
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
ASR:
+
+
+
AND IT STARTED IN ASIA AND FROM THERE IT PASSED TO VENUS THE MOST
+ COSMOPOLITAN PORT IN ITS TIME
+
HE BECAME IN ASIA AND FROM THAT TIME THE MOST COSMOPOLITAN IN HIS
+ LIFE
+
IT STARTED IN ASIA AND FROM THERE IT WENT TO VENICE THE MOST
+ COSMOPOLITAN PORT OF ITS TIME
+
+
+
+
Reference:
+
empezó en asia y de allí pasó a venecia el puerto más cosmopolita
+ de su tiempo
+
IT STARTED IN ASIA AND FROM THERE IT WENT TO VENICE THE MOST
+ COSMOPOLITAN PORT OF ITS TIME
+
+
+
Sample 6: S2UT_Aug performs best
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
Reference:
+
desde un punto de vista presupuestario no parece adecuada la
+ propuesta de financiación procedente de la comisión de
+ desarrollo ya que este dinero no existe al
+
IN ANY CASE GIVEN THAT THE FINANCING OF THIS NEW COOPERATION
+ INSTRUMENT MUST BE COMPATIBLE WITH THE
+ TWO
+ THOUSAND SEVEN
+ TWENTY THIRTEEN FINANCIAL FRAMEWORK IT IS WORTH
+
+
+
ASR:
+
+
+
IN ANY CASE GIVEN THAT THE FUNDING OF THIS NEW CORPORATION
+ INSTRUMENT MUST BE COMPATIBLE WITH THE
+ TWO THOUSAND SEVEN
+ TWENTY THIRTEEN FINANCIAL FRAMEWORK IT IS IMPORTANT
+
IN ANY CASE SINCE THE FINANCING OF THIS NEW INSTRUMENT OF
+ CORPORATION MUST COMPATIBLE WITH THE
+ FINANCIAL FRAMEWORK
+ FOR TWENTY THIRTEEN
+
IN ANY CASE GIVEN THAT THE FINANCING OF THIS NEW CORPORATION
+ INSTRUMENT MUST BE COMPATIBLE WITH THE
+ TWO THOUSAND
+ SEVEN TWENTY THIRTEEN FINANCIAL FRAMEWORK
+
+
+
+
+
+
+
+
+ English To Spanish
+
+
Comparison with Baselines
+
+
+
We provide ground-truth source and target audio with the corresponding reference text,
+ as well as audio samples from three systems:
+ (1) S2UT+LNA-D: the proposed direct speech-to-unit translation
+ system initialized with a wav2vec 2.0 encoder and a unit mBART decoder, and finetuned using the LNA-D strategy.
+ (2) Supervised S2UT: a baseline direct speech-to-unit translation system trained with
+ source and target text as auxiliary task targets.
+
+ (3) S2T+TTS: a baseline cascaded system with a speech-to-text translation model
+ initialized
+ with a wav2vec 2.0 encoder and a randomly initialized decoder, followed by a text-to-speech synthesis model.
+ Both (1) and (2) use an open-source HiFi-GAN vocoder to convert units to waveforms.
+
+
+
+
+
Ground truth
+
Predictions
+
+
+
+
Source (English)
+
Target (Spanish)
+
S2UT+LNA-D
+
Supervised S2UT
+
S2T+TTS
+
+
+
Sample 1: S2UT+LNA-D performs the best.
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
Reference:
+
this should also be an important part of our approach to the twenty
+ twelve budget
+
esto también debería ser una parte importante de nuestro enfoque
+ del
+ presupuesto dos mil doce
+
+
+
+
ASR:
+
+
+
esto también debería ser una parte importante de nuestro enfoque al
+ presupuesto dos mil doce
+
también debería ser una parte importante de nuestro enfoque al
+ presupuesto dos mil doce
+
esto también debería ser una parte importante de nuestro enfoque al
+ presupuesto de dos mildos mil
+ dos mil doce
+
+
+
+
Sample 2: S2UT+LNA-D performs the best.
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
Reference:
+
information encourages citizens interest in public matters and
+ their participation
+
la información fomenta el interés de los ciudadanos por los
+ asuntos
+ públicos y su participación
+
+
+
ASR:
+
+
+
la información fomenta el interés de los ciudadanos en asuntos
+ públicos y su participación
+
la información y el interés de los ciudadanos alientan los
+ intereses de las cuestiones públicas y su
+ participación
+
+
la información alienta el interés de los ciudadanos en asuntos
+ públicos y en su participación
+
+
+
+
Sample 3: S2UT+LNA-D performs the best.
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
Reference:
+
and apparently it was quite popular
+
y al parecer era muy popular
+
+
+
ASR:
+
+
+
y al parecer era muy popular
+
y un líder era bastante popular
+
y aparentemente era bastante popular
+
+
+
+
Sample 4: S2UT+LNA-D performs the best.
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
Reference:
+
his family who are my constituents are convinced of his innocence
+
+
su familia que son mis electores está convencida de su inocencia
+
+
+
+
ASR:
+
+
+
su familia que son mis electores está convencida de su inocencia
+
+
su familia que son mí circunscripciones están convencidas de estos
+ inocentes
+
su familia que son mis electores están convencidos de su inocencia
+
+
+
+
+
Sample 5: All systems do reasonably well.
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
Reference:
+
we can actually do the same thing with much less energy
+
podemos hacer lo mismo con mucha menos energía
+
+
+
ASR:
+
+
+
podemos hacer lo mismo con mucha menos energía
+
podemos hacer lo mismo que mucha menos energía
+
en realidad podemos hacerlo mismo con menos energía
+
+
+
+
Sample 6: All systems make errors.
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
Reference:
+
of the directive on all taxes including social security
+ contributions the automatic exchange of information and improved
+ cooperation between the member states in matters of taxation
+
de la directiva a todos los impuestos incluidas las
+ contribuciones a
+ la seguridad social el intercambio automático de
+ información y la mejora de la cooperación fiscal entre los estados miembros
+
+
+
ASR:
+
+
+
de la directiva a todos los impuestos incluidas las contribuciones
+ a la seguridad social el
+ intercambio automático
+ de información y la mejor cooperación entre los estados miembros en las cuestiones de impuestos
+
de la directiva a todos los impuestos impluyendo las contribuciones
+ de seguridad social el
+ intercambio automático de
+ la información y mejorar la cooperación entre los estados miembros y las cuestiones de impuestos
+
+
de la directiva para todos los impuestos incluidos las
+ contribuciones de seguridad social el
+ intercambio automático
+ de información y la mejor cooperación entre los estados miembros en la cuestión de la fiscalidad
+
+
+
+
+
+
+
+
+ English To Spanish
+
+
Different Data Setups
+
+
+
We provide ground-truth source and target audio with the corresponding reference text,
+ as well as audio samples from three systems. All three models are initialized with a wav2vec 2.0 encoder
+ and a unit mBART decoder and are finetuned using the LNA-D strategy, but they use different datasets for finetuning:
+ (1) S2UT_Base: finetuned on the combination of the Europarl-ST and MuST-C datasets.
+
+ (2) S2UT_LR: finetuned in a low-resource setup with 50 hours of data sampled from the combination
+ of the Europarl-ST and MuST-C datasets.
+
+ (3) S2UT_Aug: finetuned on the combination of the Europarl-ST and MuST-C datasets plus the ASR
+ data.
+ All models use an open-source HiFi-GAN vocoder to convert units to waveforms.
+
+
+
+
+
Ground truth
+
Predictions
+
+
+
+
Source (English)
+
Target (Spanish)
+
S2UT_LR
+
S2UT_Base
+
S2UT_Aug
+
+
+
Sample Set 1: All systems do well.
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
Reference:
+
we want to see energy poverty as a part of this debate
+
queremos ver la pobreza energética como parte de este debate
+
+
+
+
ASR:
+
+
+
queremos ver la pobreza energética como parte de este deate
+
queremos ver la pobreza energética como parte de este date
+
queremos ver la pobreza energética como parte de este deate
+
+
+
Sample Set 2: S2UT_LR has errors but S2UT_Base and S2UT_Aug got it right.
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
Reference:
+
in my view one of the most important elements is the follow up of
+ legislative initiative requests from parliament
+
en mi opinión uno de los elementos más importantes es el
+ seguimiento de
+ las solicitudes de iniciativa legislativa del
+ parlamento
+
+
+
ASR:
+
+
+
n mi opinión uno de los elementos más importantes es el
+ seguimiento de las peticiones de la
+ iniciativa legislativa
+ por parte del pagamento
+
en mi opinión uno de los elementos más importantes es el
+ seguimiento de las emiendas de iniciativas
+ legislativas de
+ ley
+
en mi opinión uno de los elementos más importantes es el
+ seguimiento de las solicitudes de
+ iniciativa legislativa
+ del pagamento
+
+
+
+
Sample Set 3: S2UT_Aug performs the best
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
Reference:
+
we must find an open and constructive procedure on the next
+ financial framework
+
debemos encontrar un procedimiento abierto y constructivo en el
+ próximo marco financiero
+
+
+
ASR:
+
+
+
debemos encontrar un procedimiento abierto y constructivo sobre el
+ próximo marco financiero
+
debemos encontrar un procedimiento abierto y constructivo en el
+ sistema financiero financiero
+ financiero financiero
+
+
debemos encontrar un procedimiento abierto y constructivo en el
+ próximo marco financiero
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
Reference:
+
i agree that we should act and react but we should not overdo it
+ because we need a balanced approach
+
estoy de acuerdo en que deberíamos actuar y reaccionar pero no
+ deberíamos excedernos porque necesitamos un enfoque equilibrado
+
+
+
ASR:
+
+
+
estoy de acuerdo en que debemos actuar y reaccionar pero no
+ debemos hacerlo porque necesitamos un
+ enfoque
+ equilibrado
+
estoy considerando que actuamos y reaccionamos pero no deberíamos
+ hacerlo porque necesitamos un
+ enfoque realmente
+ valioso
+
estoy de acuerdo en que deberíamos actuar y reaccionar pero no
+ deberíamos exagerarlo porque
+ necesitamos un enfoque
+ equilibrado
+
+
+
+
Sample Set 4: All systems make errors
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
Reference:
+
of the directive on all taxes including social security
+ contributions the automatic exchange of information and improved
+ cooperation between the member states in matters of taxation
+
de la directiva a todos los impuestos incluidas las contribuciones
+ a la
+ seguridad social el intercambio automático de
+ información y la mejora de la cooperación fiscal entre los estados miembros
+
+
+
ASR:
+
+
+
de la directiva a todos los impuestos incluidas las contribuciones
+ a la seguridad social el
+ intercambio automático
+ de información y la mejor cooperación entre los estados miembros en las cuestiones de impuestos
+
la directiva sobre el impuesto de todos los contribuyentes
+ inpluyendo las contribuciones sociales la
+ introducción
+ automática y mejorada de los estados miembros y mejorar la cooperación entre los estados miembros
+
+
de la directiva a todos los impuestos incluidas las contribuciones
+ a la seguridad social el
+ intercambio automático
+ de información y la mejor cooperación entre los estados miembros en materia de impuestos
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
Reference:
+
through my work im trying to articulate that humans are not
+ separate from nature and that everything is interconnected
+
a través de mi trabajo estoy tratando de expresar que los humanos
+ no
+ están separados de la naturaleza y que todo está
+ interconectado
+
+
+
ASR:
+
+
+
a través de mi trabajo estoy tratando de articular que los humanos
+ no están separados de la
+ naturaleza y que todo
+ está interconectado
+
a través de mi trabajo trato de articular que los humanos no somos
+ separados de la naturaleza y que
+ todo está
+ interconectado
+
a través de mi trabajo trato de articular que los humanos no
+ estamos separados de la naturaleza y
+ que todo está
+ interconectado