Jinbeom Kang

Hanyang University

Author Statistics

Papers

Citation

H-Index

i-10 index

Research Trends

Author Order

Document Type

Co-Authors

Joongmin Choi

Hanyang University

Jaeyoung Yang

University at Buffalo, State University of New York

Eunshil Lee

Hanyang University

Kibeom Hong

Swatch Group (Switzerland)

Juyoung Park

Korea Advanced Institute of Science and Technology

Jeahyun Park

Samsung (South Korea)

Dongwook Shin

Ajou University

Joon Lee

University of Calgary

Jea-Hyun Park

Kunsan National University

Cheolhee Choi

Hanyang University

Cooperative Institutions

Hanyang University

Yonsei University

Korea Advanced Institute of Science and Technology

Samsung (South Korea)

Sungkyunkwan University

Swatch Group (Switzerland)

National Yang Ming Chiao Tung University

Naver (South Korea)

University of Arizona

Atlantic University College

Author Statistics

Papers

Citation

H-Index

i-10 index

Research Field

Topic-Specific Web Content Adaptation to Mobile Devices

Eunshil Lee Jinbeom Kang Joongmin Choi Jaeyoung Yang

Content Adaptation

Web content

Content management

10.1109/wi.2006.172

Cite

Citations (22)

Detecting Informative Web Page Blocks for Efficient Information Extraction Using Visual Block Segmentation

Jinbeom Kang Joongmin Choi

As the structure of a Web page is getting more complicated, the construction of wrapper induction rules becomes more difficult and time-consuming. The main problem in most wrapper induction methods is the difficulty in discriminating the meaningful blocks that contain the target information from the noise blocks that contains irrelevant information such as advertisements, menus, or copyright statements. To solve this problem, this paper proposes the RIPB(recognizing informative page blocks) algorithm that detects the informative blocks in a Web page by exploiting the visual block segmentation scheme. RIPB uses the visual page segmentation algorithm to analyze and partition a Web page into a set of logical blocks, and then groups related blocks with similar structures into a block cluster and recognizes the informative block clusters by applying some heuristic rules to the cluster information. The results of a series of experiments indicate that RIPB contributes to improve the accuracy of information extraction by allowing the wrapper induction module to focus only on the informative block information and ignore other noise information in building extraction rules.

10.1109/isitc.2007.6

Cite

Citations (16)

A Focused Crawler with Document Segmentation

Lecture notes in computer science (2005)

Jaeyoung Yang Jinbeom Kang Joongmin Choi

Web crawler

Focused crawler

Hyperlink

tf–idf

Relevance

Document Clustering

10.1007/11508069_13

Cite

Citations (6)

An Ontology-Based Recommendation System Using Long-Term and Short-Term Preferences

International Conference on Information Science and Applications (2011)

Jinbeom Kang Joongmin Choi

Personalized information retrieval and recommendation systems have been proposed to deliver the right information to users with different interests. However, most of previous systems are using keyword frequencies as the main factor for personalization, and as a result, they could not analyze semantic relations between words. Also, previous methods often fail to provide the documents that are related semantically with the query words. To solve these problems, we propose a recommendation system which provides relevant documents to users by identifying semantic relations between an ontology that semantically represents the documents crawled by a Web robot and user behavior history. Recommendation is mainly based on content-based similarity, semantic similarity, and preference weights.

Similarity (geometry)

10.1109/icisa.2011.5772322

Cite

Citations (14)

Topic-Specific Mobile Web Contents Adaptation

Jeongbo gwahaghoe nonmunji. so'peuteuweeo mich eung'yong (2007)

Eun-Shil Lee Jinbeom Kang Joongmin Choi

Mobile content adaptation is a technology of effectively representing the contents originally built for the desktop PC on wireless mobile devices. Previous approaches for Web content adaptation are mostly device-dependent. Also, the content transformation to suit to a smaller device is done manually. Furthermore, the same contents are provided to different users regardless of their individual preferences. As a result, the user has difficulty in selecting relevant information from a heavy volume of contents since the context information related to the content is not provided. To resolve these problems, this paper proposes an enhanced method of Web content adaptation for mobile devices. In our system, the process of Web content adaptation consists of 4 stages including block filtering, block title extraction, block content summarization, and personalization through learning. Learning is initiated when the user selects the full content menu from the content summary page. As a result of learning, personalization is realized by showing the information for the relevant block at the top of the content list. A series of experiments are performed to evaluate the content adaptation for a number of Web sites including online newspapers. The results of evaluation are satisfactory, both in block filtering accuracy and in user satisfaction by personalization.

Content Adaptation

Web content

Source

Cite

Citations (1)

ScalableWeb News Adaptation To Mobile Devices Using Visual Block Segmentation for Ubiquitous Media Services

Eunshil Lee Jinbeom Kang Jea-Hyun Park Joongmin Choi Jaeyoung Yang

This paper describes an enhanced method of Web content adaptation to mobile devices for online News article provision in ubiquitous environments. Our system exploits a scheme of visual block segmentation for Web pages that filters out unnecessary blocks and extracts useful article information from content blocks. This method resolves the problems of previous approaches to Web content adaptation in which the content transformation to suit to a smaller device is device-dependent and manually-driven. Our method also employs a learning module that is initiated when the user selects to view the full content in the content summary page. As a result of learning, personalization is realized by showing the information for the relevant block at the top of the content list. A series of experiments are performed to evaluate our mobile content adaptation method for a number of well-known Web News sites, and the result of evaluation is satisfactory both in block filtering accuracy and in user satisfaction by personalization.

Content Adaptation

Web content

Content management

10.1109/mue.2007.185

Cite

Citations (6)

Detecting Collaborative Fields Using Social Networks

Dongwook Shin Jinbeom Kang Joongmin Choi Jaeyoung Yang

It is generally difficult for researchers to obtain information related to their own fields and novel technologies from huge data residing in the World Wide Web. Furthermore, they often try to apply them to other particular fields which are different from theirs. The main motivation of this phenomenon is to solve existing problems or improve the performance of their systems. Hence, it is important to detect collaborative fields in which technologies of particular fields are applied to another area to find various trends. In this paper, we propose a method to detect collaborative fields by using social networks representing the relations among authors of papers, and describe some experimental results to show the effectiveness of the proposed method when collaborative fields are detected by using social networks.

10.1109/ncm.2008.80

Cite

Citations (5)

An Enhanced Clustering Method Based on Grid-Shaking

Jinbeom Kang Joongmin Choi Jaeyoung Yang

Clustering is an essential way to extract meaningful information from massive data without human intervention in the field of data mining. Clustering algorithms can be divided into four types: partitioning algorithms, hierarchical algorithms, grid-based algorithms, and locality-based algorithms. Each algorithm, however, has problems that are not easily solved. K-means, for example, suffer from setting up an initial centroid problem when distribution of data is not hyper-ellipsoid. Chain effect, outlier, and degree of density in data are problems occurring in other types of algorithms. To solve these problems, various kinds of algorithms were proposed. In this paper, we propose a novel grid-based clustering algorithm through building clusters in each cell and show how to solve the previously mentioned problems.

Hierarchical clustering

Data stream clustering

10.1109/waina.2009.100

Cite

Citations (1)

An enhanced feature selection method for text classification.

Computational intelligence (2006)

Jinbeom Kang Eunshil Lee Kibeom Hong Jeahyun Park T. Kim

Feature (linguistics)

Source

Cite

Citations (0)

An Enhanced Feature Selection Method Based on the Impurity of Words Considering Unbalanced Distribution of Documents

Jeongbo gwahaghoe nonmunji. so'peuteuweeo mich eung'yong (2007)

Jinbeom Kang Jaeyoung Yang Joongmin Choi

Sample training data for machine learning often contain irrelevant information or redundant concept. It is also the case that the original data may include noise. If the information collected for constructing learning model is not reliable, it is difficult to obtain accurate information. So the system attempts to find relations or regulations between features and categories in the teaming phase. The feature selection is to remove irrelevant or redundant information before constructing teaming model. for improving its performance. Existing feature selection methods assume that the distribution of documents is balanced in terms of the number of documents for each class and the length of each document. In practice, however, it is difficult not only to prepare a set of documents with almost equal length, but also to define a number of classes with fixed number of document elements. In this paper, we propose a new feature selection method that considers the impurities among the words and unbalanced distribution of documents in categories. We could obtain feature candidates using the word impurity and eventually select the features through unbalanced distribution of documents. We demonstrate that our method performs better than other existing methods via some experiments.

Feature (linguistics)

Source

Cite

Citations (0)