TY - GEN
T1 - An intelligent data-centric approach toward identification of conserved motifs in protein sequences
AU - Dempsey, Kathryn
AU - Currall, Benjamin
AU - Hallworth, Richard
AU - Ali, Hesham
PY - 2010
Y1 - 2010
N2 - The continued integration of the computational and biological sciences has revolutionized genomic and proteomic studies. However, efficient collaboration between these fields requires the creation of shared standards. A common problem arises when biological input does not properly fit the expectations of the algorithm, which can result in misinterpretation of the output. This potential confounding of input/output is a drawback especially when regarding motif finding software. Here we propose a method for improving output by selecting input based upon evolutionary distance, domain architecture, and known function. This method improved detection of both known and unknown motifs in two separate case studies. By standardizing input considerations, both biologists and bioinformaticians can better interpret and design the evolving sophistication of bioinformatic software.
AB - The continued integration of the computational and biological sciences has revolutionized genomic and proteomic studies. However, efficient collaboration between these fields requires the creation of shared standards. A common problem arises when biological input does not properly fit the expectations of the algorithm, which can result in misinterpretation of the output. This potential confounding of input/output is a drawback especially when regarding motif finding software. Here we propose a method for improving output by selecting input based upon evolutionary distance, domain architecture, and known function. This method improved detection of both known and unknown motifs in two separate case studies. By standardizing input considerations, both biologists and bioinformaticians can better interpret and design the evolving sophistication of bioinformatic software.
UR - http://www.scopus.com/inward/record.url?scp=77958030578&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77958030578&partnerID=8YFLogxK
U2 - 10.1145/1854776.1854839
DO - 10.1145/1854776.1854839
M3 - Conference contribution
AN - SCOPUS:77958030578
SN - 9781450304382
T3 - 2010 ACM International Conference on Bioinformatics and Computational Biology, ACM-BCB 2010
SP - 398
EP - 401
BT - 2010 ACM International Conference on Bioinformatics and Computational Biology, ACM-BCB 2010
T2 - 2010 ACM International Conference on Bioinformatics and Computational Biology, ACM-BCB 2010
Y2 - 2 August 2010 through 4 August 2010
ER -