John D. Lafferty, newly named as the John C. Malone Professor of Statistics and Data Science, conducts research on statistical machine learning, with a focus on computational and statistical aspects ...
This paper presents a novel method to segment/decode DNA sequences based on an n-gram statistical language model. First, we find that the length of most DNA “words” is 12 to 15 bp by analyzing the genomes ...
Statistical language models assign probabilities to sequences of words, and are used in systems that perform text summarization, machine translation, question answering, information extraction, text ...
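To make "assigning probabilities to sequences of words" concrete, here is a minimal sketch of a bigram model with add-one (Laplace) smoothing. It is an illustration only, not the method of any work listed here; the function names `train_bigram` and `sequence_log_prob`, the boundary markers `<s>` and `</s>`, and the toy corpus are all assumptions made for the example.

```python
from collections import Counter
import math


def train_bigram(sentences):
    """Count contexts and bigrams over tokenized sentences.

    Hypothetical helper: each sentence is a list of word strings, padded
    with "<s>" and "</s>" boundary markers before counting.
    """
    contexts, bigrams = Counter(), Counter()
    for tokens in sentences:
        padded = ["<s>"] + tokens + ["</s>"]
        contexts.update(padded[:-1])             # every token that has a successor
        bigrams.update(zip(padded, padded[1:]))  # adjacent (previous, next) pairs
    # Tokens the model can predict: all training words plus the end marker.
    vocab = {w for s in sentences for w in s} | {"</s>"}
    return contexts, bigrams, vocab


def sequence_log_prob(tokens, contexts, bigrams, vocab):
    """Log-probability of a token sequence under add-one smoothed bigrams."""
    padded = ["<s>"] + tokens + ["</s>"]
    v = len(vocab)
    return sum(
        math.log((bigrams[(prev, nxt)] + 1) / (contexts[prev] + v))
        for prev, nxt in zip(padded, padded[1:])
    )


# Toy corpus (an assumption for illustration only).
corpus = [["the", "cat", "sat"], ["the", "dog", "sat"]]
contexts, bigrams, vocab = train_bigram(corpus)
likely = sequence_log_prob(["the", "cat", "sat"], contexts, bigrams, vocab)
unlikely = sequence_log_prob(["sat", "cat", "the"], contexts, bigrams, vocab)
print(likely > unlikely)
```

The sketch works in log space to avoid underflow on long sequences. It does not handle words unseen in training, which a practical model would map to a dedicated unknown-word token.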
This article describes the first public release of GREAT, an open-source statistical machine translation (SMT) software toolkit. GREAT is based on a bilingual language modelling approach for ...
How is it that a program such as OpenAI's GPT-3 neural network can answer multiple-choice questions, or write a poem in a particular style, despite never being programmed for those specific tasks? It ...