Authors
Gino Brunner, Yuyi Wang, Roger Wattenhofer, Michael Weigelt
Publication date
2018/1/18
Journal
arXiv preprint arXiv:1801.06024
Description
We train multi-task autoencoders on linguistic tasks and analyze the learned hidden sentence representations. The representations change significantly when translation and part-of-speech decoders are added. The more decoders a model employs, the better it clusters sentences according to their syntactic similarity, as the representation space becomes less entangled. We explore the structure of the representation space by interpolating between sentences, which yields interesting pseudo-English sentences, many of which have recognizable syntactic structure. Lastly, we point out an interesting property of our models: the difference vector between two sentences can be added to a third sentence with similar features to change it in a meaningful way.
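The two representation-space probes described above, interpolation between sentences and difference-vector arithmetic, can be illustrated with a minimal Python sketch. The encoder, decoder, and 256-dimensional hidden vectors below are illustrative assumptions, not the paper's actual model or code.

import numpy as np

def interpolate(z_a: np.ndarray, z_b: np.ndarray, steps: int = 5):
    """Linearly interpolate between two hidden sentence representations.

    Decoding each point would yield the pseudo-English sentences
    mentioned in the description (decoder not shown; assumed).
    """
    return [(1 - t) * z_a + t * z_b for t in np.linspace(0.0, 1.0, steps)]

def apply_difference(z_a: np.ndarray, z_b: np.ndarray, z_c: np.ndarray):
    """Add the difference vector between two sentences to a third.

    Decoding the result would show the third sentence changed in a
    way analogous to the (z_a -> z_b) change.
    """
    return z_c + (z_b - z_a)

# Toy usage with random stand-in representations (dimension assumed):
rng = np.random.default_rng(0)
z1, z2, z3 = (rng.standard_normal(256) for _ in range(3))
midpoints = interpolate(z1, z2)
shifted = apply_difference(z1, z2, z3)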
Total citations
[Per-year citation chart, 2018–2024; counts not recoverable from chart residue]