USC ISI to Develop Translation and Information-Retrieval System for Uncommon Languages

Posted On Feb 08, 2018


“Since we don’t have a lot of written data in these languages, we have to do more with less. … Ideally, we would use about 300 million words to train a machine translation system, and in this case, we have around 800,000 words. There are about 100,000 words per novel, so we have only eight novels’ worth of words to work from.”
