CPE5005

Information retrieval and browsing

(IT)

K O0Yang

6 points + 4 hours per week + Peninsula + Prohibitions: COT3201, COT4300

Synopsis: Information retrieval: record structures and text processing, Boolean, probabilistic and vector-space models, search strategies, thesauri and clustering techniques, relevance evaluation and performance evaluation. Structured documents: markup languages, automatic indexing from markup, query formulation. User interfaces in IR: interface design for bibliographic retrieval and image retrieval systems, visualisation of information in 2-D spaces, information filtering displays. Internet information retrieval: Gopher, WWW, Z39.50, retrieval using WWW, resource discovery and search engines. Large alphabet language retrieval: encoding schemes, segmentation, stemming and thesaurus construction.

Assessment: Examination: 40% + Practical work: 40% + Written (4000 words): 20%