Applied probability, sampling theory, data quality control, clustering and coincidence models, matching in DNA sequences

In studying DNA sequences, molecular biologists sometimes search for fragments of two sequences that, when aligned, match at most of the positions. The scientists seek to determine what is an unusually long match. One problem area we work on is where the two long sequences are aligned by an overall criterion, and the scientists then highlight (perfectly or almost) matching subsequences with a common location in the two sequences. A second problem area we are working on is where the two long sequences are compared under all alignments and searched for unusual almost-matching fragments. For various models and scoring schemes, we seek to determine the statistical significance of such "matches".

Selected Publications

Naus, J. and Wallenstein, S. (2006). Temporal surveillance using scan statistics. Stat Med 25(2):311-24.

Naus, J. and Wallenstein, S. (2004). Simultaneously testing for a range of cluster or scanning window sizes. Methodology and Computing in Applied Probability 6:389-400.

Naus, J. and Stefanov, V.T. (2002). Double scan statistics. Methodology and Computing in Applied Probability 4:163-180.

Glaz, J., Naus, J. and Wallenstein, S. (2001). Scan Statistics. Springer-Verlag. 370 pages, over 600 references.

Naus, J. (1999). Scanning multiple sequences, pp. 97-109, in Recent Advances on Scan Statistics, N. Balakrishnan and J. Glaz, Editors. Birkhauser, Boston.

Naus, J.I. and Sheng, K.-N. (1997). Matching among multiple random sequences. Bulletin of Mathematical Biology 59:483-496.

Karwe, V. and Naus, J.I. (1997). New recursive methods for scan statistic probabilities. Computational Statistics and Data Analysis 23:389-402.

Naus, J.I. and Sheng, K.-N. (1996). Screening for unusual matched segments in multiple protein sequences. Communications in Statistics: Simulation and Computation 25:937-952.

Sheng, K.-N. and Naus, J. (1994). Pattern matching between two non-aligned random sequences. Bulletin of Mathematical Biology 56:1143-1162.