Show simple item record

dc.contributor.authorSengoz, Cenker
dc.date.accessioned2015-04-22T21:03:46Z
dc.date.available2015-04-22T21:03:46Z
dc.date.issued2015-04-22
dc.identifier.citationSengoz, Cenker. A Granular-based Approach for Semisupervised Web Information Labeling; A Dissertation Submitted in Partial Fulfillment of the requirements for the degree of Master of Science in the Department of Applied Computer Science [University of Winnipeg]. Winnipeg, Manitoba, Canada: University of Winnipeg, 2014.
dc.identifier.urihttp://hdl.handle.net/10680/821
dc.description.abstractA key issue when mining web information is the labeling problem: data is abundant on the web but is unlabelled. In this thesis, we address this problem by proposing i) a novel theoretical granular model that structures categorical noun phrase instances as well as semantically related noun phrase pairs from a given corpus representing unstructured web pages with a variant of Tolerance Rough Sets Model (TRSM), ii) a semi-supervised learning algorithm called Tolerant Pattern Learner (TPL) that labels categorical instances as well as relations. TRSM has so far been successfully employed for document retrieval and classification, but not for learning categorical and relational phrases. We use the ontological information from the Never Ending Language Learner (Nell) system. We compared the performance of our algorithm with Coupled Bayesian Sets (CBS) and Coupled Pattern Learner (CPL) algorithms for categorical and relational labeling, respectively. Experimental results suggest that TPL can achieve comparable performance with CBS and CPL in terms of precision.en_US
dc.description.sponsorshipNatural Sciences and Engineering Research Council of Canada (NSERC) Discovery grant 194376.en_US
dc.language.isoenen_US
dc.publisherUniversity of Winnipeg
dc.rightsinfo:eu-repo/semantics/openAccess
dc.subjectSemi-supervised learningen_US
dc.subjectWeb information labelingen_US
dc.subjectNever Ending Language Learneren_US
dc.subjectGranular Computingen_US
dc.subjectTolerance Rough Setsen_US
dc.subjectMachine learning.en_US
dc.subjectPattern perception.en_US
dc.subjectData mining.en_US
dc.titleA Granular-based Approach for Semisupervised Web Information Labelingen_US
dc.typeThesisen_US
dc.description.degreeMaster of Science in Applied Computer Science
dc.publisher.grantorUniversity of Winnipeg
thesis.degree.disciplineApplied Computer Science
thesis.degree.levelmasters
thesis.degree.nameMaster of Science in Applied Computer Science
thesis.degree.grantorUniversity of Winnipeg


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record