Posted on 2007-04-10 08:54:03-07 by gopu
extract nonoverlapping subsequences from a whole genome
Hi, I am one of the newbee venturingout bioperl for my research purposes. I have a whole genome sequence of a pathogen. I am trying to break them into non-overlapping 1000bps subsequences. For example if my whole genome sequence is 400000 bps length, then I should be beak them into 4000 subsequences of each 1000 bps and they should be non-overlapping but at the same time continous. To be precise, my first substring would be from 1 to 1000 bps, second substing would be from 1001 to 2000 etcc.. Could anyone help me. I tried with the following code but it gives me only the first substring and rest are not! I would appreciate very much if someone could help me!
Direct Responses: 5081 | Write a response
Posted on 2007-05-08 10:35:17-07 by mpignatelli in response to 4802
Re: extract nonoverlapping subsequences from a whole genome
Hi, Do you still need help for doing that? I have a script (not in bioperl) that does exactly what you need. If you are interested send me an e-mail (miguel.pignatelli (at) uv.es) M;
Direct Responses: Write a response
Perl Weekly newsletter
A free weekly newsletter for people who are busy to read all the blogs. click here to check it out.