Vis enkel innførsel

dc.contributor.advisorKvalnes, Åge
dc.contributor.advisorJohansen, Dag
dc.contributor.authorHurley, Joseph
dc.date.accessioned2009-04-27T08:09:55Z
dc.date.available2009-04-27T08:09:55Z
dc.date.issued2009-01-23
dc.description.abstractThis thesis covers the design, implementation, and evaluation of a search engine which can give each user a customized index based on the documents they are authorized to view. A common solution available today for this situation is to filter the results of a query based on the list of documents a user has access to. In this scenario, it is possible for information to leak from the search engine because the filtering takes place after the results are ranked. Ranking algorithms are usually based on information which considers characteristics of the entire corpus when calculating the score a document will receive. This type of information in the index must be cleaned before it is used to judge the relevant documents for a user query, otherwise data leakage is possible. Cleaning this information at query time might have a dramatic effect on query performance which would discourage use. The work presented here takes this sensitive information and calculates it for every user authorized to view the documents at index time. At query time, the search engine uses a filtered global index for selecting relevant results, but ranks the results using the information stored for the individual user’s authorized view of the index. Different designs are compared, but the same concept is present in all implementations. An open source index, Apache’s Lucene, was used as a starting point for this work. All modifications were made to Lucene and then compared to an unmodified Lucene for performance evaluations. The findings are that it is feasible to include access control in a basic search engine without incurring dramatic loss in performance.en
dc.format.extent2063 bytes
dc.format.extent100335610 bytes
dc.format.extent461603 bytes
dc.format.mimetypetext/plain
dc.format.mimetypeapplication/octet-stream
dc.format.mimetypeapplication/pdf
dc.identifier.urihttps://hdl.handle.net/10037/1827
dc.identifier.urnURN:NBN:no-uit_munin_1589
dc.language.isoengen
dc.publisherUniversitetet i Tromsøen
dc.publisherUniversity of Tromsøen
dc.rights.accessRightsopenAccess
dc.rights.holderCopyright 2009 The Author(s)
dc.subject.courseIDINF-3996nor
dc.subjectVDP::Mathematics and natural science: 400::Information and communication science: 420::Security and vulnerability: 424en
dc.subjectVDP::Mathematics and natural science: 400::Information and communication science: 420::System development and system design: 426en
dc.titlePreventing Information Leakage in the Search Engineen
dc.typeMaster thesisen
dc.typeMastergradsoppgaveen


Tilhørende fil(er)

Thumbnail
Thumbnail
Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel