MISC

基本情報

氏名 佐藤 洋一郎
氏名(カナ) サトウ ヨウイチロウ
氏名(英語) Sato Yoichiro
所属 機構 研究社会連携機構 研究社会連携センター
職名 教授
researchmap研究者コード 1000035611
researchmap機関 岡山理科大学

題名

Complexity and a method of extracting a database schema over semistructured documents

単著・共著の別

 

著者

N Suzuki
Y Sato
M Hayase

概要

Semistructured data comprises irregular structure and has no a-priori database schema. therefore we encounter several problems such as inefficient data retrieval and wasteful data storage. To cope with such problems. some schema extraction algorithms over semistructured data have been proposed, ill which data is modeled as all unordered, tree. However, the order of elements, is indispensable for document data, therefore we consider extracting an optimal database schema over an ordered free. We consider all optimization problem to extract a smallest database schema such that the density of each class is no less than a given threshold, where the density of a class represents a similarity between the type of the class and those of the objects in the class. We first prove that the corresponding decision problem is strongly NP-complete, and show that another version of the, problem is strongly NP-hard and belongs to Delta(2)P. Then we show that for any r < 3/2; there is no polynomial-time r-approximation algorithm that solves the optimization problem unless P = NP. Finally, we propose a kind of class called bounded class that can be constructed efficiently, then show a polynomial-time algorithm for constructing a database schema by using bounded classes.

発表雑誌等の名称

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS

出版者

IEICE-INST ELECTRONICS INFORMATION COMMUNICATIONS ENG

E85D

6

開始ページ

940

終了ページ

949

発行又は発表の年月

2002-06

査読の有無

無し

依頼の有無

無し

記述言語

英語

掲載種別

 

ISSN

 

ID:DOI

 

ID:NAID(CiNiiのID)

 

ID:PMID

 

JGlobalID

 

arXiv ID

 

ORCIDのPut Code

 

DBLP ID