Academic Thesis

Basic information

Name Sato Yoichiro
Belonging department
Occupation name
researchmap researcher code 1000035611
researchmap agency Okayama University of Science

Title

Complexity and a method of extracting a database schema over semistructured documents

Bibliography Type

 

Author

N Suzuki
Y Sato
M Hayase

Summary

Semistructured data comprises irregular structure and has no a-priori database schema. therefore we encounter several problems such as inefficient data retrieval and wasteful data storage. To cope with such problems. some schema extraction algorithms over semistructured data have been proposed, ill which data is modeled as all unordered, tree. However, the order of elements, is indispensable for document data, therefore we consider extracting an optimal database schema over an ordered free. We consider all optimization problem to extract a smallest database schema such that the density of each class is no less than a given threshold, where the density of a class represents a similarity between the type of the class and those of the objects in the class. We first prove that the corresponding decision problem is strongly NP-complete, and show that another version of the, problem is strongly NP-hard and belongs to Delta(2)P. Then we show that for any r < 3/2; there is no polynomial-time r-approximation algorithm that solves the optimization problem unless P = NP. Finally, we propose a kind of class called bounded class that can be constructed efficiently, then show a polynomial-time algorithm for constructing a database schema by using bounded classes.

Magazine(name)

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS

Publisher

IEICE-INST ELECTRONICS INFORMATION COMMUNICATIONS ENG

Volume

E85D

Number Of Pages

6

StartingPage

940

EndingPage

949

Date of Issue

2002-06

Referee

Not exist

Invited

Not exist

Language

English

Thesis Type

Research papers (academic journals)

ISSN

 

DOI

 

NAID

 

PMID

 

J-GLOBAL ID

 

arXiv ID

 

ORCID Put Code

 

DBLP ID