OEM-based schema extraction of semi-structured data
2004
Web data is typically semi-structured: it lacks the explicit structure that characterises most data sets. This lack of structure makes querying and integrating web data very inefficient. An approach was developed to identify structure in semi-structured and hierarchical data using the object exchange model (OEM) together with a pruning strategy that quickly extracts simple paths from the OEM graph, so that the data can be integrated and queried. The method can effectively reduce the scale of the target structure and improve the efficiency of structure abstraction.
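To make the idea of extracting label paths from an OEM graph concrete, the following is a minimal sketch, not the paper's algorithm. The abstract does not specify the pruning rule, so the sketch assumes a hypothetical one: a label path is dropped (and never extended) when fewer than `min_support` objects are reachable through it. It also bounds path length with `max_depth` to cope with cycles, rather than checking that each path is simple. The function name `extract_label_paths`, both parameters, and the sample graph are illustrative assumptions.

```python
from collections import defaultdict

def extract_label_paths(edges, root, min_support=1, max_depth=5):
    """Level-wise extraction of label paths from an OEM-style graph.

    edges: dict mapping a complex object's id to a list of
           (label, child_id) pairs; atomic objects have no entry.
    Returns a dict {label_path_tuple: number_of_target_objects}.
    The support-based pruning is an assumed heuristic, not the
    rule used in the paper.
    """
    kept = {}
    frontier = {(): {root}}  # label path -> set of objects it reaches
    for _ in range(max_depth):
        candidates = defaultdict(set)
        for path, targets in frontier.items():
            for oid in targets:
                for label, child in edges.get(oid, []):
                    candidates[path + (label,)].add(child)
        # Prune: discard paths reaching too few objects; they are
        # not extended in later levels either.
        frontier = {p: t for p, t in candidates.items()
                    if len(t) >= min_support}
        if not frontier:
            break
        for path, targets in frontier.items():
            kept[path] = len(targets)
    return kept

# Hypothetical OEM graph; "&4" -> "&2" introduces a cycle.
sample_edges = {
    "&1": [("book", "&2"), ("book", "&3"), ("author", "&4")],
    "&2": [("title", "&5"), ("author", "&4")],
    "&3": [("title", "&6"), ("price", "&7")],
    "&4": [("name", "&8"), ("wrote", "&2")],
}

frequent = extract_label_paths(sample_edges, "&1", min_support=2, max_depth=3)
for path, support in sorted(frequent.items()):
    print(".".join(path), support)
```

With `min_support=2` only the paths shared by both book objects survive, which illustrates how pruning can shrink the extracted structure; with `min_support=1` every label path up to the depth bound is kept. Because one object can have several children under the same label, this pruning is a heuristic and may discard an extension that a full enumeration would keep.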