dipwmsearch
Protein binding sites in DNA or RNA sequences are modeled by probabilistic motifs. A Position Weight Matrix (PWM) is a simple, powerful, and widely used representation of such motifs. Because PWMs assume that sequence positions are independent of each other (which is too restrictive for some binding or interaction sites), a generalisation of PWMs, termed di-nucleotidic PWMs (di-PWMs), was introduced by Kulakovskiy et al. (2013) [Kulakovskiy_2013]. For instance, the HOCOMOCO database is a collection of PWMs and di-PWMs for Transcription Factor binding sites [Kulakovskiy_2018].
This package, called dipwmsearch, offers efficient algorithms to search for occurrences of di-PWM motifs in DNA or RNA sequences. The searched sequence may contain positions with non-ATGC symbols encoded with IUPAC codes. The algorithms implemented in this package are described in our article [dipwmsearch_2022].
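To make the idea of a di-PWM search concrete, here is a minimal sketch of a naive sliding-window search. It does not use the dipwmsearch API and is not one of the package's algorithms. It assumes the usual convention that a di-PWM for a motif of length m has m-1 columns, one per pair of adjacent positions, and that the score of a window is the sum of the scores of its overlapping dinucleotides. The names `score_window`, `naive_search`, and the toy matrix are hypothetical, and windows containing IUPAC symbols are simply skipped here, which is a simplification compared with what the package supports.

```python
from itertools import product

# The 16 dinucleotides over the DNA alphabet: AA, AC, ..., TT.
DINUCLEOTIDES = ["".join(p) for p in product("ACGT", repeat=2)]


def score_window(dipwm, window):
    """Sum the scores of the overlapping dinucleotides of `window`.

    `dipwm` is a list of dicts (hypothetical layout): column i maps each
    dinucleotide to its score at positions (i, i + 1) of the motif.
    """
    return sum(dipwm[i][window[i:i + 2]] for i in range(len(dipwm)))


def naive_search(dipwm, sequence, threshold):
    """Yield (position, score) for each window scoring at least `threshold`."""
    m = len(dipwm) + 1  # m - 1 columns model a motif of length m
    for pos in range(len(sequence) - m + 1):
        window = sequence[pos:pos + m]
        # Simplification: skip windows with non-ACGT (e.g. IUPAC) symbols.
        if any(c not in "ACGT" for c in window):
            continue
        score = score_window(dipwm, window)
        if score >= threshold:
            yield pos, score


# Toy di-PWM for a motif of length 3 (two columns), made up for illustration.
toy = [dict.fromkeys(DINUCLEOTIDES, 0.0) for _ in range(2)]
toy[0]["AC"] = 2.0
toy[1]["CG"] = 3.0
toy[1]["CT"] = 1.0

hits = list(naive_search(toy, "TACGACTN", threshold=3.0))
print(hits)
```

This naive scan examines every window, so its cost grows with the sequence length times the motif length. It serves only as a baseline for understanding what an occurrence of a di-PWM is.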
Other tools
WAVES
Summary WAVES is a web application dedicated to bioinformatic tool integration. It provides an efficient way to implement a service for any bioinformatic software. Such services are automatically made available in three ways: web pages, web forms to include in remote websites, and a RESTful web services API to access remotely from applications. In order…
MYST
What is MYST? MYST is the orchestration platform behind ATGC online bioinformatics services. It provides a unified web interface and a public REST API to submit analyses, monitor jobs, and retrieve results across a growing catalog of phylogenetic and sequence-analysis tools. TIDE is a modernized redesign of WAVES, an older tool previously developed by ATGC…
PEWO: a collection of workflows…
Introduction and context In the Bioinformatics team of the LIRMM (CNRS & Univ. Montpellier), we develop a series of tools for metagenomics / metabarcoding analysis. Our tools exploit phylo-k-mers (which are k-mers combined with phylogenetic information) computed for an input set of reference sequences and their phylogeny. The phylo-k-mers are computed and indexed with IPK,…