In this project, we are working to dub videos into low-resource languages so that they can be made accessible to a larger number of people.
The development of the Internet and multimedia has significantly changed the way people acquire information. More and more people now consume news through social media, which provides multimedia information about events taking place all over the world. Unfortunately, social media websites have also fostered fake news, which often contains misrepresented or even forged multimedia content designed to mislead readers and spread rapidly. Some malicious actors even use rumors to deliberately sway public opinion and damage the credibility of the government. It is therefore both necessary and urgent to build automatic detectors that limit the harm caused by fake news and help users receive truthful information. Several efforts have tackled this problem using data from a single modality. In this project, we combine text with images and video to address it.
In this project, we are working towards predicting speaker attributes from audio.
In this project, we are working towards a model for automatically translating medical reports in low-resource languages.