Master 2014 2015
Stages de la spécialité SAR
Vocal double


Site :Image & Vision
Lieu : Imaging Science Lab. Rennes
Encadrant : Patrick PĂ©rez, Alexey Ozerov and Ngoc Q. K. Duong
Dates :du 01/03/2015 au 31/08/2015
Rémunération :1200 euros / month (brut)
Mots-clés : Parcours ATIAM : Acoustique, Parcours ATIAM : Informatique musicale, Parcours ATIAM : Traitement du signal

Description

The ability to capture and reproduce the voice timber and the speech prosody of a given person has made impressive progress, e.g., with recent Ircam projects : based on several hours of recording of an actor reading texts, entirely new synthetic speech of incredible resemblance can be uttered in the same language ; also, using much less training data, voice timber can also be transferred to some extent to the natural speech of another speaker. In this work, we would like to investigate ways to push further such systems to create polyglot voice doubles, with various applications. The intern will have to get acquainted with existing voice analysis and reproduction (either by synthesis or by transfer) techniques and will try and alleviate the current mono-lingual limitations.

Technicolor is an industry leader in the production of video content for movies, TV, advertising, games and more. The company provides production, postproduction, and distribution services to content creators, network service providers and broadcasters. Technicolor Research Rennes is the largest Technicolor Research Center conducting research in various domains applying to the creation, management and delivery of digital content. For more information on Technicolor R&I Rennes : https://research.technicolor.com/rennes/

The internship will be hosted in Imaging Science Lab (ISL) within Technicolor R&I Rennes. More specifically, he/she will join the “Image & Vision” team composing of more than twenty researchers and engineers coming from many different countries. One major goal of the team is to conduct the research towards describing and searching for contents from large data collections as well as enhancing and adapting these contents to any devices.