UniversalMer

UPDATE version 2.0.0

UniversalMer

A CLI k-mer counting tool for multiple sizes of k at once.

OVERVIEW

UniversalMer is a k-mer counting tool for multiple size of k at once.The program counts and summarizes the exact frequency of all k-mers from 1-mer to a user-defined maximum length (kmax). This kmax can be specified as any length or can be automatically determined by the longest repeated patterns found in the input sequence. The available sequence alphabets support are DNA, RNA, and protein. The Input file must be fasta format with .txt or .fasta or .fna.

The program offers several output options:

The efficiency of the program depends on the sequence length (in base pairs) and the number of unique patterns.

Memory Usage:

Running Time:

REQUIREMENT

OS: Windows10, MacOS_ARM64, MacOS_X64
RAM: 8-64 GB, depend on the large size of the input sequence and output patterns. We recommend RAM 32 GB for 200 million bp up.

NAME :

UniversalMer -The k-mer counting tool for multiple sizes of k at once.

SYNOPSIS:

universalmer [OPTIONS][FILE][DUMP][K-SIZE]

DESCRIPTION:

[OPTIONS]: For specifying maximum k and alphabet
    -k<maximum lenght> : Example: -k100 for maximum k length = 100 (counting 1-mers to 100-mers)
    -a<alphabet> : -ad for DNA alphabet = {A,C,G,T}
                   -ar for RNA alphabet = {A,C,G,U}
                   -ap for protein alphabet =  {A,C,D,E,F,G,H,I,K,L,M,N,P,Q,R,S,T,V,W,Y}
    -n : For not saving output file 
    -t : For count and saving spectrum file
    -m<minimum frequency>: Example: -m100 for minimum frequency = 100

[FILE]: To specify an input path, use a file with  .fasta, .fna, or .txt extension that is in FASTA format.

[DUMP]: For specifying whether or not to dump singleton patterns.
        -d<type>:   
              -d  :  Dump repeats and singleton k-mers
              -dx :  Dump only repeats k-mer, excluding singleton k-mers

[K-SIZE]:Here is how to specify the size of k for dumping:
      Range of k's: Use a hyphen to define a range (e.g., 10-100).
      Multiple k's: Separate each size with a space (e.g., 28 38 55 65).
      Single k: Simply enter one value (e.g., 55).

EXAMPLE:

NOTE:

HOW TO USE THE PROGRAM

OUTPUT

Your program will save the output files in the current directory. The output files are:

EXAMPLE

BASIC OF UNIVERSALMER CODING

If you want to study the basic of Universal mer please visit at..
..coming soon..

LICENSE

! YOU may install and use an unlimited number of copies for personal and education use.
! YOU may not sell, decompile, disassemble, reverse engineer or modify this program.

AUTHOR

Jittakorn Pullpothong
Department of computer engineering, Faculty of engineering, Ramkhamhaeng University, Thailand.
Email: jittakorn.p@rumail.ru.ac.th