Abstract
|
|
---|---|
In this paper we present a suite of tools to automatically acquire and browse conceptual schemas from large collections of HTML-based biomedical documents. This suite is composed of two tools: the schema acquisition tool (SAT) and the zoomable browser (ZB). The SAT is the implementation of a novel four-phased method to extract conceptual schemas from non-structured sources. First, all documents in the collection are analyzed to extract relevant concepts. Second, the vocabulary discovered during the first phase is organized into a hierarchical structure. Third, the schema is enriched with non-hierarchical ad-hoc relationships. The last phase is an optional refinement activity that must be conducted by experts in the domain covered by the collection. The extracted schemas can be navigated using the ZB. We have used these tools for different purposes in the EC funded biomedical research project Advancing Clinico- Genomic Trials on Cancer (ACGT), obtaining promising results. | |
International
|
Si |
Congress
|
International Conference on Conceptual Modeling, Serie: Conferences in Research and Practice in Information Technology (CRPIT) |
|
960 |
Place
|
Auckland (New Zealand) |
Reviewers
|
Si |
ISBN/ISSN
|
|
|
|
Start Date
|
05/11/2007 |
End Date
|
09/11/2007 |
From page
|
|
To page
|
|
|