Text and Documents in Databases

Documents are usually called unstructured data because their structure is implicit and highly flexible. As a result, database systems have trouble handling them. Yet most of the information in an enterprise resides in documents (emails, technical reports, Web pages, memorandums, etc.). Current DBMSs try to handle documents using Information Retrieval techniques; however, this is not enough for many tasks.

RESEARCH TOPICS