TY - GEN
T1 - Fast phased small RNA cycle counting algorithms
AU - Bao, Forrest Sheng
AU - Xie, Zhixin
AU - Zhang, Yuanlin
PY - 2010
Y1 - 2010
N2 - Counting phased small RNA cycles (PSRC) from mapped small RNA positions is a repeatedly invoked subproblem in the computation of identifying TRANS-ACTING siRNA (TAS) loci and loci of other small RNAs forming through mechanisms similar to that of transacting small interfering RNAs (ta-siRNAs). The efficiency of counting PSRC has a clear impact on the efficiency of the algorithms predicting these loci. There are two closely related variants on counting PSRC in real applications: WPSRC, which counts the number of distinct small RNAs falling onto the phased positions in a sliding window, and MPSRC, which counts the maximum consecutive PSRC from mapped small RNA positions. In this paper, we develop fast algorithms for both WPSRC and MPSRC. Our algorithms have O(max(S)) time complexity, while the existing algorithm and its variant have O(\S\·max(S)) and O(\S\·L) time complexity for MPSRC and WPSRC respectively, where S is a set of mapped small RNA positions and L the length of sliding window for WPSRC. Experimental results on two real-life datasets show that our algorithms are significantly faster than the existing algorithm and its variant. The proposed algorithms are applicable to TAS-like clusters with any PSRC length including 21-nt.
AB - Counting phased small RNA cycles (PSRC) from mapped small RNA positions is a repeatedly invoked subproblem in the computation of identifying TRANS-ACTING siRNA (TAS) loci and loci of other small RNAs forming through mechanisms similar to that of transacting small interfering RNAs (ta-siRNAs). The efficiency of counting PSRC has a clear impact on the efficiency of the algorithms predicting these loci. There are two closely related variants on counting PSRC in real applications: WPSRC, which counts the number of distinct small RNAs falling onto the phased positions in a sliding window, and MPSRC, which counts the maximum consecutive PSRC from mapped small RNA positions. In this paper, we develop fast algorithms for both WPSRC and MPSRC. Our algorithms have O(max(S)) time complexity, while the existing algorithm and its variant have O(\S\·max(S)) and O(\S\·L) time complexity for MPSRC and WPSRC respectively, where S is a set of mapped small RNA positions and L the length of sliding window for WPSRC. Experimental results on two real-life datasets show that our algorithms are significantly faster than the existing algorithm and its variant. The proposed algorithms are applicable to TAS-like clusters with any PSRC length including 21-nt.
UR - http://www.scopus.com/inward/record.url?scp=77956159315&partnerID=8YFLogxK
U2 - 10.1109/BIBE.2010.30
DO - 10.1109/BIBE.2010.30
M3 - Conference contribution
AN - SCOPUS:77956159315
SN - 9780769540832
T3 - 10th IEEE International Conference on Bioinformatics and Bioengineering 2010, BIBE 2010
SP - 130
EP - 135
BT - 10th IEEE International Conference on Bioinformatics and Bioengineering 2010, BIBE 2010
T2 - 10th IEEE International Conference on Bioinformatics and Bioengineering, BIBE-2010
Y2 - 31 May 2010 through 3 June 2010
ER -