|
|
|
Clone Library Dereplicator Dereplicate your clone libraries in a matter of moments
Description
When building clone libraries it is often the case that many of the sequenced clones are 100% identical. Therefore, before further DNA sequence analysis, it is necessary to find duplicate DNA sequences and remove them. Our DNA sequence dereplication tool sorts all unique DNA sequences (FASTA) belonging to your clone libraries, by moving/coping them into the specified folder.
Clone Library Dereplicator simplifies the dereplication of all type sequence libraries (16S rRNA, 18S rRNA, 23S rRNA, 28S rRNA, functional and structural proteins) and prepares the raw sequences for subsequent analyses or contig assembly.
Availability
Starting with August 2008, Clone Library Dereplicator was integrated into the DNA Baser Assembler package.
Freeware
With a small limitation related to metadata integration (RNA Baser users only), Clone Library Dereplicator can be used free for unlimited time.
How to use it
The tool is located under the 'Tools->Clone library dereplicatior' menu in DNA/RNA Baser package.
Choose the input folder - the folder that contains the FASTA sequences that need to be de-replicated. Choose output folder - the folder where the unique files will be copied/moved. For your commodity, DNA Sequence Dereplicator will create an output folder for you so you can skip this step. Choose file operation type. This will instruct the program whether to copy or move the unique files from the input folder to the output folder. Choose comparison type: - 'Text match' will make a direct comparison between the sequences. This is very fast but it can't tell you how many differences have been found between sequences. - 'Global alignment' will compare the sequences by using a global alignment algorithm. Slow. Finally, press the 'Dereplicate' button.
|
|
| Last update: October 2008 |
|
|