Posted on 2005-06-29 19:28:48-07 by mayagao
Questions about generating 100 sequences with motif hidden
Hi all! I am doing a project generating a width of 8 letters. For each column,randomly pick a letter (a, c, g, or t) that will be the dominant nucleotide in that column. For example, we are expected to design a sequence as ATGAACGT as following: 1 2 3 4 5 6 7 8 a t g a a c g t The possibility for each letter is: 1 2 3 4 5 6 7 8 A 0.85 0.05 C 0.05 0.05 G 0.05 0.05 T 0.05 0.85 The detailed information about the work is: (1) for first column, pick letter A with 85% probability, otherwise pick letter g, c, or t with 5% probability (2) for second column, pick letter T with 85% probability, otherwise pick letter g, c, or a with 5% probability (3) repeat this for all other 6 columns (4) result is a motif site: A T G A A G C T I am wondering which functions in Matlab is able to finish this work or a part of this work. Any suggestions or help would be highly appreciated. Thank you very much and look forward to your reply! Best Regards, Alex
Direct Responses: Write a response
Perl Weekly newsletter
A free weekly newsletter for people who are busy to read all the blogs. click here to check it out.