DarkQ: Continuous genomic monitoring using message queues

Adrian Viehweger, Christian Brandt, Martin Hölzer

2020

Motivation: Representing text as dense, low-dimensional vectors of numbers ("embeddings") is common practice in natural language processing (NLP). These vectors can be used as direct input to a variety of learning algorithms, such as neural networks, and their "pretrained" nature makes training more efficient.

Results: We developed nanotext, an open-source Python library and command-line interface for training and analyzing protein domain and genome embeddings, analogous to word and document embeddings in NLP.

Availability: nanotext is released under the BSD-3 license at [github.com/phiweger/nanotext](https://github.com/phiweger/nanotext).

Keywords: Message queue, Programming language, License, Artificial neural network, Open source, Computer science, Python (programming language), Command-line interface
