Facilitating Programming-by-Demonstration for Bioinformatics using Semantic Web Resources.

atmire.migration.oldid1738
dc.contributor.advisorBarker, Kenneth
dc.contributor.advisorSensen, Christoph
dc.contributor.authorGordon, Paul-Michael
dc.date.accessioned2014-01-07T18:44:25Z
dc.date.available2014-03-15T07:00:18Z
dc.date.issued2014-01-07
dc.date.submitted2013en
dc.description.abstractMany software applications have “macro” capabilities, allowing users to record keystrokes, then replay them verbatim to save time performing repetitive tasks. Programming by demonstration (PbD) is a more sophisticated technique where the application translates the semantics (i.e. meaning) of users’ actions into programming language notation. The PbD engine generalizes to a functional level, rather than simply being rote sets of keystrokes. This can help nonprogrammers robustly automate their computer based tasks. Most extant PbD systems have user actions based on physical phenomena such as maze traversal, but automating gaming tasks is of limited interest to most computer users. One user group that could benefit from task automation without programming is molecular biologists, analyzing large genetics datasets ("bioinformatics") but typically having no programming skills. Visual workflow systems help automate bioinformatics, but using empirical studies I show five significant barriers to their use. I propose here that a domain-savvy PbD system can mitigate these barriers by inferring data analysis workflows from molecular biologists' Web browsing sessions. I call this technique Workflow by Demonstration (WbD). The success of this approach depends on replacing the typical physical phenomenon model of PbD with a biology-specific data model and hypertext interface. Emerging informatics standards ("Semantic Web" technologies) facilitate the use of common data models across different data providers on the Web. Many molecular biology resources are Web based, therefore this work implements Semantic Web technologies to facilitate WbD. Biologists were given pen-and-paper workflow design tasks, revealing the types of data flow they intuitively understood. These defined the types of workflow "code" a WbD system should support, and the corresponding hypertext demonstration actions were modeled. A browser (Seahawk) implements these action to code mappings. User studies evaluating Seahawk show that biologists could 1) demonstrate Web based analysis for realistic tasks, 2) understand the automatically generated workflows, and 3) use them in the workflow environment Taverna. This suggests WbD is a viable technique for bioinformatics. Although the data model used was biology-specific, the underlying semantic technologies used are domain agnostic. Techniques described here may therefore be applicable to novice programmers in other domains.en_US
dc.identifier.citationGordon, P. (2014). Facilitating Programming-by-Demonstration for Bioinformatics using Semantic Web Resources. (Doctoral thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca. doi:10.11575/PRISM/26143en_US
dc.identifier.doihttp://dx.doi.org/10.11575/PRISM/26143
dc.identifier.urihttp://hdl.handle.net/11023/1241
dc.language.isoeng
dc.publisher.facultyGraduate Studies
dc.publisher.facultyScience
dc.publisher.institutionUniversity of Calgaryen
dc.publisher.placeCalgaryen
dc.rightsUniversity of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission.
dc.subjectComputer Science
dc.subject.classificationProgramming-by-Demonstrationen_US
dc.subject.classificationBioinformaticsen_US
dc.subject.classificationSemantic Weben_US
dc.titleFacilitating Programming-by-Demonstration for Bioinformatics using Semantic Web Resources.
dc.typedoctoral thesis
thesis.degree.disciplineComputer Science
thesis.degree.grantorUniversity of Calgary
thesis.degree.nameDoctor of Philosophy (PhD)
ucalgary.item.requestcopytrue
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ucalgary_2013_gordon_paul-michael.pdf
Size:
4.18 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.65 KB
Format:
Item-specific license agreed upon to submission
Description: