Thomas Olson

San Francisco State University

Author Statistics

Papers

Citation

H-Index

i-10 index

Co-Authors

David Naddor

Johns Hopkins University

Rahul Singh

Institute of Technology Management

Chi-Bin Chien

Johns Hopkins University

Joseph O’Rourke

Smith College

Arno Puder

San Francisco State University

Hui Yang

University of Electronic Science and Technology of China

Cooperative Institutions

San Francisco State University

Johns Hopkins University

Massachusetts Institute of Technology

University of San Francisco

San Jose State University

International Computer Science Institute

AT&T (United States)

Deutsche Telekom (United Kingdom)

Deutsche Telekom (Germany)

Goethe University Frankfurt

Research Field

Analyzing Computer Science Students' Performance Data to Identify Impactful Curricular Changes

2021 IEEE Frontiers in Education Conference (FIE) (2021)

Hui Yang Thomas Olson Arno Puder

This full research paper presents a systematic analysis of 10 years of student performance data for Computer Science (CS) majors at San Francisco State University, a public 4-year degree-granting university, aiming to address the ongoing challenges of early dropouts and low graduation rates. The main objective is two-fold: (1) gain a comprehensive understanding of how the existing curriculum has been supporting (or hindering) students' progress towards graduation; and (2) suggest data-informed curricular changes. To this end, we utilize both exploratory statistical analysis and data mining/machine learning approaches to first learn how individual courses and the prescribed course sequences influence a student's dropout/graduation status, and then build machine learning models to interpret/validate the observed interdependency among key courses in the current curriculum. Such patterns/models are then used to suggest impactful curricular changes towards reducing early dropouts and improving overall student success as measured by graduation with a CS degree. One main finding of this research is that a successful CS student needs to excel in both critical thinking and core CS skills. To help students gain critical thinking skills, it is essential to strengthen the presence of mathematics and physics courses in the CS curriculum. Furthermore, our results suggest that CS students without a solid math foundation before starting their college career should complete a remedial math course early rather than putting it off. Moreover, before students advance to the second half of their CS study to gain core CS knowledge/skills (e.g., operating systems), they should complete the required physics class. Finally, we observe that it is necessary to introduce new prerequisite requirements among upper-level CS courses, for example, Operating Systems as a prerequisite to an upper-level CS core course on programming theories.

Graduation (instrument)

Remedial education

Dropout (neural networks)

Common core

10.1109/fie49875.2021.9637474

Citations (4)

Computational prediction of ATC codes of drug-like compounds using tiered learning

Thomas Olson Rahul Singh

The Anatomical Therapeutic Chemical (ATC) Code System is a World Health Organization (WHO) proposed classification that assigns codes to compounds based on their therapeutic, pharmacological and chemical characteristics as well as the in-vivo site of activity. The ability to predict the ATC code of an arbitrary compound with high accuracy can go a long way in selecting molecules for lead identification. We propose a computational approach to this problem that utilizes a natural pharmacological constraint, namely, that anatomical-therapeutic biological activity of certain types must preclude activities of many other types. The method proposed here utilizes machine learning in a tiered architecture; prediction of the ATC code at a certain level is constrained by the ATC code at the higher levels. Using this learning architecture, we have built classifiers that incorporate information from a compound's structure, as well as its chemical and protein interactions. The proposed approach has been validated using 2335 drugs from the ChEMBL database in both cross-validation and test settings. The prediction accuracy obtained with this approach is 78.72% and is comparable to or better than that of other state-of-the-art methods.

ChEMBL

Code (set theory)

Identification

Drug target

10.1109/iccabs.2015.7344719

Citations (0)

A new linear algorithm for intersecting convex polygons

Computer Graphics and Image Processing (1982)

Joseph O’Rourke Chi-Bin Chien Thomas Olson David Naddor

Polygon (computer graphics)

Convex polygon

Star-shaped polygon

Point in polygon

10.1016/0146-664x(82)90023-5

Citations (122)

Indians--State Jurisdiction over Real Estate Developments on Tribal Lands

New Mexico Law Review (1972)

Thomas Olson

Citations (0)

A new linear algorithm for intersecting convex polygons

Computer Graphics and Image Processing (1982)

Joseph O’Rourke Chi-Bin Chien Thomas Olson David Naddor

10.1016/0146-664x(82)90156-3

Citations (6)

Predicting anatomic therapeutic chemical classification codes using tiered learning

BMC Bioinformatics (2017)

Thomas Olson Rahul Singh

The low success rate and high cost of drug discovery require the development of new paradigms to identify molecules of therapeutic value. The Anatomical Therapeutic Chemical (ATC) Code System is a World Health Organization (WHO) proposed classification that assigns multi-level codes to compounds based on their therapeutic, pharmacological and chemical characteristics as well as the in-vivo site(s) of activity. The ability to predict ATC codes of compounds can assist in the creation of high-quality chemical libraries for drug screening and in applications such as drug repositioning. We propose a machine learning architecture called tiered learning for prediction of ATC codes that relies on the prediction results of the higher levels of the ATC code to simplify the predictions of the lower levels. The proposed approach was validated using a number of compounds in both cross-validation and test settings. The validation experiments compared chemical descriptors, initialization methods and classification algorithms. The prediction accuracy obtained with tiered learning was found to be either comparable to or better than that of established methods. Additionally, the experiments demonstrated the generalizability of the tiered learning architecture, in that its use was found to improve prediction rates for a majority of machine learning algorithms when compared to their stand-alone application. The basis of our approach lies in the observation that anatomical-therapeutic biological activity of certain types typically precludes activities of many other types. Thus, there exists a characteristic distribution of the ATC codes, which can be leveraged to limit the search space of possible codes that can be ascribed at a particular level once the codes at the preceding levels are known. Tiered learning utilizes this observation to constrain the learning space for ATC codes at a particular level based on the ATC code at higher levels. This simplifies the prediction and allows for improved accuracy.
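
The tiered idea described in this abstract, where the code predicted at one level restricts the candidates considered at the next, can be illustrated with a small sketch. This is not the authors' implementation: the `TieredClassifier` class, the nearest-centroid rule used at each tier, and the two-dimensional feature vectors are all assumptions chosen to keep the example self-contained. Only the two-tier code layout (one letter, then two digits) follows the ATC scheme.

```python
from collections import defaultdict
from math import dist


def centroid(points):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    return [sum(col) / len(points) for col in zip(*points)]


class TieredClassifier:
    """Toy tiered learner (hypothetical): one nearest-centroid model per parent code.

    `levels` holds the prefix length of each tier, e.g. (1, 3) for a
    one-character first level followed by a three-character second level.
    """

    def __init__(self, levels=(1, 3)):
        self.levels = levels
        # models[parent_prefix][child_prefix] -> centroid of training vectors
        self.models = {}

    def fit(self, features, codes):
        groups = defaultdict(lambda: defaultdict(list))
        for x, code in zip(features, codes):
            parent = ""
            for n in self.levels:
                child = code[:n]
                groups[parent][child].append(x)
                parent = child
        self.models = {
            parent: {child: centroid(xs) for child, xs in children.items()}
            for parent, children in groups.items()
        }
        return self

    def predict(self, x):
        prefix = ""
        for _ in self.levels:
            # Only children of the code chosen at the previous tier are considered.
            candidates = self.models[prefix]
            prefix = min(candidates, key=lambda c: dist(x, candidates[c]))
        return prefix


# Made-up feature vectors; the labels are ATC-style two-tier prefixes.
features = [[0.0, 0.0], [0.2, 0.1], [0.0, 2.0], [0.1, 2.2],
            [5.0, 5.0], [5.1, 5.2], [5.0, 7.0], [5.2, 7.1]]
codes = ["A01", "A01", "A02", "A02", "C07", "C07", "C09", "C09"]

model = TieredClassifier(levels=(1, 3)).fit(features, codes)
print(model.predict([0.1, 1.9]))  # A02
print(model.predict([4.8, 5.1]))  # C07
```

In this sketch the second tier never compares a compound against codes outside the first-tier group it was assigned to, which is the search-space restriction the abstract describes. Any per-tier classifier could stand in for the nearest-centroid rule.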

Initialization

Chemical space

Code (set theory)

10.1186/s12859-017-1660-6

Citations (17)