Shape Perception in Chemistry
2013
Organic chemists make extensive use of a diagrammatic language for designing, exchanging and analysing the features of chemicals. In this language, chemicals are represented on a flat (2D) plane following standard stylistic conventions. In the search for novel drugs and therapeutic agents, vast quantities of chemical data are generated and subjected to virtual screening procedures that harness algorithmic features and complex statistical models. However, in silico approaches do not yet compare to the abilities of experienced chemists in detecting more subtle features relevant for evaluating how likely a molecule is to be suitable to a given purpose. Our hypothesis is that one reason for this discrepancy is that human perceptual capabilities, particularly that of ‘gestalt’ shape perception, make additional information available to our reasoning processes that are not available to in silico processes. This contribution investigates this hypothesis. Algorithmic and logic-based approaches to representation and automated reasoning with chemical structures are able to efficiently compute certain features, such as detecting presence of specific functional groups. To investigate the specific differences between human and machine capabilities, we focus here on those tasks and chemicals for which humans reliably outperform computers: the detection of the overall shape and parts with specific diagrammatic features, in molecules that are large and composed of relatively homogeneous part types with many cycles. We conduct a study in which we vary the diagrammatic representation from the canonical diagrammatic standard of the chemicals, and evaluate speed of human determination of chemical class. We find that human performance varies with the quality of the pictorial representation, rather than the size of the molecule. This can be contrasted with the fact that machine performance varies with the size of the molecule, and is of course impervious to the quality of diagrammatic representation. This result has implications for the design of hybrid algorithms that take features of the overall diagrammatic aspects of the molecule as input into the feature detection and automated reasoning over chemical structure. It also has the potential to inform the design of interactive systems at the interface between human experts and machines.
Keywords:
- Correction
- Source
- Cite
- Save
- Machine Reading By IdeaReader
36
References
2
Citations
NaN
KQI