Genomic Grammar


A series of neural networks and algorithms for identifying a genomic grammar that can be used to classify regulatory circuits in the mammalian genome. The current model is a convolutional network that uses a dot-product attention mechanism. Its performance is on par with or exceeds that of current state-of-the-art methods such as DeepBind at distinguishing regulatory circuits from random genomic background. Additionally, the model's attention mechanism can be used to extract the architectures of regulatory circuits in the genome through a post-processing step applied to the model's output.
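To make the description above concrete, here is a minimal sketch of a convolutional layer followed by dot-product self-attention over a DNA sequence, written in plain Python. It is an illustration under stated assumptions, not the project's implementation: the toy motif filters, the scaling by the square root of the feature dimension, and the helper names (`one_hot`, `conv1d`, `dot_product_attention`, `top_interactions`) are all hypothetical. In particular, `top_interactions` is only one possible guess at what a post-processing step on the attention weights could look like.

```python
import math

BASES = "ACGT"


def one_hot(seq):
    """Encode a DNA string as a list of 4-element one-hot rows (A, C, G, T)."""
    return [[1.0 if base == b else 0.0 for b in BASES] for base in seq.upper()]


def conv1d(x, filters):
    """Valid 1D convolution followed by ReLU.

    x is an L x 4 one-hot matrix; each filter is a width x 4 kernel.
    Returns a (L - width + 1) x n_filters feature matrix.
    """
    width = len(filters[0])
    out = []
    for start in range(len(x) - width + 1):
        row = []
        for kernel in filters:
            score = sum(kernel[i][c] * x[start + i][c]
                        for i in range(width) for c in range(4))
            row.append(max(0.0, score))
        out.append(row)
    return out


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    m = max(values)
    exps = [math.exp(v - m) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def dot_product_attention(features):
    """Self-attention where queries, keys, and values are all the conv features.

    weights[i][j] = softmax over j of (f_i . f_j) / sqrt(d).
    The sqrt(d) scaling is an assumption of this sketch.
    """
    d = len(features[0])
    scale = math.sqrt(d)
    weights = []
    for q in features:
        scores = [sum(a * b for a, b in zip(q, k)) / scale for k in features]
        weights.append(softmax(scores))
    # Each context row is the attention-weighted average of all feature rows.
    context = [[sum(w[j] * features[j][c] for j in range(len(features)))
                for c in range(d)] for w in weights]
    return weights, context


def top_interactions(weights, k=3):
    """Hypothetical post-processing: rank off-diagonal position pairs by weight."""
    n = len(weights)
    pairs = [(weights[i][j], i, j) for i in range(n) for j in range(n) if i != j]
    pairs.sort(reverse=True)
    return [(i, j, w) for w, i, j in pairs[:k]]


# Toy filters that act as exact-match detectors for two made-up motifs.
filters = [one_hot("TGA"), one_hot("GGA")]
seq = "ACTGACTTGGAATGAC"

features = conv1d(one_hot(seq), filters)
weights, context = dot_product_attention(features)
pairs = top_interactions(weights, k=3)

for i, j, w in pairs:
    print(f"position {i} attends to position {j} with weight {w:.3f}")
```

In a trained network the filters would be learned rather than fixed, and the attention weights would indicate which motif positions the model relates to one another. Ranking the largest weights, as `top_interactions` does, is one simple way such a matrix could be inspected.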

Please see the recent abstract submitted to the International Workshop on Bio-Design Automation for details.

Model Overview

Genomic Grammar is available at github.com/jenhantao/genomic_grammar